Kun Ding

Orcid: 0000-0002-2256-8815

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, Beijing, China


According to our database1, Kun Ding authored at least 48 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
WikiSeeker: Rethinking the Role of Vision-Language Models in Knowledge-Based Visual Question Answering.
CoRR, April, 2026

SeaVIS: Sound-Enhanced Association for Online Audio-Visual Instance Segmentation.
CoRR, March, 2026

CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering.
CoRR, February, 2026

DSFC-Net: A Dual-Encoder Spatial and Frequency Co-Awareness Network for Rural Road Extraction.
CoRR, February, 2026

USVTrack: A Benchmark for Multi-Object Tracking in Complex Water Surface Scenes.
IEEE Trans. Circuits Syst. Video Technol., January, 2026

AtmosOceanNet: SST forecast method driven by atmospheric-oceanic multimodal data.
Pattern Recognit., 2026

Efficient redundancy reduction for open-vocabulary semantic segmentation.
Neurocomputing, 2026

Beyond Counting: Evaluating Abstract and Emotional Reasoning in Vision-Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

LookFlow: Training-Free and Efficient High-Resolution Image Synthesis via Dynamic Lookahead Guidance Flow.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
SAM-MI: A Mask-Injected Framework for Enhancing Open-Vocabulary Semantic Segmentation with SAM.
CoRR, November, 2025

Taming Modality Entanglement in Continual Audio-Visual Segmentation.
CoRR, October, 2025

Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering.
CoRR, October, 2025

Instructed fine-tuning based on semantic consistency constraint for deep multi-view stereo.
Appl. Intell., April, 2025

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization.
CoRR, February, 2025

CLIP-MoA: Visual-Language Models With Mixture of Adapters for Multitask Remote Sensing Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2025

EvoVLMA: Evolutionary Vision-Language Model Adaptation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Agent Reviewers: Domain-specific Multimodal Agents with Shared Memory for Paper Review.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Re-ranking Reasoning Context with Tree Search Makes Large Vision-Language Models Stronger.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Few-Shot Image Classification of Crop Diseases Based on Vision-Language Models.
Sensors, September, 2024

Multi-task prompt tuning with soft context sharing for vision-language models.
Neurocomputing, 2024

Compositional Kronecker Context Optimization for vision-language models.
Neurocomputing, 2024

Deep convolutional neural network based on self-distillation for tool wear recognition.
Eng. Appl. Artif. Intell., 2024

Rethinking Comprehensive Benchmark for Chart Understanding: A Perspective from Scientific Literature.
CoRR, 2024

Continuous Speculative Decoding for Autoregressive Image Generation.
CoRR, 2024

A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem.
CoRR, 2024

Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation.
CoRR, 2024

Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
PAD: Self-Supervised Pre-Training with Patchwise-Scale Adapter for Infrared Images.
CoRR, 2023

SFMatting-800: A Multi-Scene Smoke and Fire Image Matting Dataset for Fine-Grained Fire Detection.
Proceedings of the 4th International Conference on Artificial Intelligence and Computer Engineering, 2023

2022
Train in Dense and Test in Sparse: A Method for Sparse Object Detection in Aerial Images.
IEEE Geosci. Remote. Sens. Lett., 2022

Prompt Tuning with Soft Context Sharing for Vision-Language Models.
CoRR, 2022

2020
PackDet: Packed Long-Head Object Detector.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Deep Hierarchical Encoder-Decoder Network for Image Captioning.
IEEE Trans. Multim., 2019

Dense semantic embedding network for image captioning.
Pattern Recognit., 2019

Nonlinear Asymmetric Multi-Valued Hashing.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

2018
In Defense of Locality-Sensitive Hashing.
IEEE Trans. Neural Networks Learn. Syst., 2018

BundleNet: Learning with Noisy Label via Sample Correlations.
IEEE Access, 2018

2017
Cross-Modal Hashing via Rank-Order Preserving.
IEEE Trans. Multim., 2017

AMVH: Asymmetric Multi-Valued hashing.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Efficient Multiple Feature Fusion With Hashing for Hyperspectral Imagery Classification: A Comparative Study.
IEEE Trans. Geosci. Remote. Sens., 2016

Learning Relationship for Very High Resolution Image Change Detection.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2016

2015
Discriminant Tensor Spectral-Spatial Feature Extraction for Hyperspectral Image Classification.
IEEE Geosci. Remote. Sens. Lett., 2015

Multicluster Spatial-Spectral Unsupervised Feature Selection for Hyperspectral Image Classification.
IEEE Geosci. Remote. Sens. Lett., 2015

Sparse Hierarchical Clustering for VHR Image Change Detection.
IEEE Geosci. Remote. Sens. Lett., 2015

kNN Hashing with Factorized Neighborhood Representation.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2013
VHR image change detection based on discriminative dictionary learning.
Proceedings of the IEEE International Conference on Acoustics, 2013


  Loading...