Kun Ding

Orcid: 0000-0002-2256-8815

Affiliations:

Chinese Academy of Sciences, Institute of Automation, Beijing, China

According to our database¹, Kun Ding authored at least 49 papers between 2013 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

SAM-MI: A Mask-injected Framework for Enhancing Open-vocabulary Semantic Segmentation with SAM.

Mach. Intell. Res., June, 2026

WikiSeeker: Rethinking the Role of Vision-Language Models in Knowledge-Based Visual Question Answering.

CoRR, April, 2026

SeaVIS: Sound-Enhanced Association for Online Audio-Visual Instance Segmentation.

CoRR, March, 2026

CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering.

CoRR, February, 2026

DSFC-Net: A Dual-Encoder Spatial and Frequency Co-Awareness Network for Rural Road Extraction.

CoRR, February, 2026

USVTrack: A Benchmark for Multi-Object Tracking in Complex Water Surface Scenes.

IEEE Trans. Circuits Syst. Video Technol., January, 2026

AtmosOceanNet: SST forecast method driven by atmospheric-oceanic multimodal data.

Pattern Recognit., 2026

Efficient redundancy reduction for open-vocabulary semantic segmentation.

Neurocomputing, 2026

Beyond Counting: Evaluating Abstract and Emotional Reasoning in Vision-Language Models.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

LookFlow: Training-Free and Efficient High-Resolution Image Synthesis via Dynamic Lookahead Guidance Flow.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Taming Modality Entanglement in Continual Audio-Visual Segmentation.

CoRR, October, 2025

Instructed fine-tuning based on semantic consistency constraint for deep multi-view stereo.

Appl. Intell., April, 2025

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization.

CoRR, February, 2025

CLIP-MoA: Visual-Language Models With Mixture of Adapters for Multitask Remote Sensing Image Classification.

Zhongzheng Fu, Hongping Yan, Kun Ding

IEEE Trans. Geosci. Remote. Sens., 2025

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

EvoVLMA: Evolutionary Vision-Language Model Adaptation.

Kun Ding, Ying Wang, Shiming Xiang

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Agent Reviewers: Domain-specific Multimodal Agents with Shared Memory for Paper Review.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Re-ranking Reasoning Context with Tree Search Makes Large Vision-Language Models Stronger.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Few-Shot Image Classification of Crop Diseases Based on Vision-Language Models.

Sensors, September, 2024

Multi-task prompt tuning with soft context sharing for vision-language models.

Neurocomputing, 2024

Compositional Kronecker Context Optimization for vision-language models.

Neurocomputing, 2024

Deep convolutional neural network based on self-distillation for tool wear recognition.

Eng. Appl. Artif. Intell., 2024

Rethinking Comprehensive Benchmark for Chart Understanding: A Perspective from Scientific Literature.

CoRR, 2024

Continuous Speculative Decoding for Autoregressive Image Generation.

CoRR, 2024

A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem.

CoRR, 2024

Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation.

CoRR, 2024

Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

PAD: Self-Supervised Pre-Training with Patchwise-Scale Adapter for Infrared Images.

CoRR, 2023

SFMatting-800: A Multi-Scene Smoke and Fire Image Matting Dataset for Fine-Grained Fire Detection.

Shihui Ma, Kun Ding, Hongping Yan

Proceedings of the 4th International Conference on Artificial Intelligence and Computer Engineering, 2023

2022

Train in Dense and Test in Sparse: A Method for Sparse Object Detection in Aerial Images.

IEEE Geosci. Remote. Sens. Lett., 2022

Prompt Tuning with Soft Context Sharing for Vision-Language Models.

CoRR, 2022

2020

PackDet: Packed Long-Head Object Detector.

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Deep Hierarchical Encoder-Decoder Network for Image Captioning.

IEEE Trans. Multim., 2019

Dense semantic embedding network for image captioning.

Pattern Recognit., 2019

Nonlinear Asymmetric Multi-Valued Hashing.

IEEE Trans. Pattern Anal. Mach. Intell., 2019

2018

In Defense of Locality-Sensitive Hashing.

IEEE Trans. Neural Networks Learn. Syst., 2018

BundleNet: Learning with Noisy Label via Sample Correlations.

IEEE Access, 2018

2017

Cross-Modal Hashing via Rank-Order Preserving.

IEEE Trans. Multim., 2017

AMVH: Asymmetric Multi-Valued Hashing.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Efficient Multiple Feature Fusion With Hashing for Hyperspectral Imagery Classification: A Comparative Study.

IEEE Trans. Geosci. Remote. Sens., 2016

Learning Relationship for Very High Resolution Image Change Detection.

IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2016

2015

Discriminant Tensor Spectral-Spatial Feature Extraction for Hyperspectral Image Classification.

IEEE Geosci. Remote. Sens. Lett., 2015

Multicluster Spatial-Spectral Unsupervised Feature Selection for Hyperspectral Image Classification.

IEEE Geosci. Remote. Sens. Lett., 2015

Sparse Hierarchical Clustering for VHR Image Change Detection.

IEEE Geosci. Remote. Sens. Lett., 2015

kNN Hashing with Factorized Neighborhood Representation.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2013

VHR image change detection based on discriminative dictionary learning.

Proceedings of the IEEE International Conference on Acoustics, 2013
