Dingkang Liang

Orcid: 0000-0003-3035-1373

According to our database¹, Dingkang Liang authored at least 57 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2025

NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding.

[BibT_eX]

[DOI]

CoRR, October, 2025

More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

SoccerNet 2025 Challenges Results.

[BibT_eX]

[DOI]

Christophe De Vleeschouwer

Sergio Escalera

Bernard Ghanem

Thomas B. Moeslund

Marc Van Droogenbroeck

Tomoki Abe

Saad Ghazai Alotaibi

Faisal Sami Altawijri

Muhammad Amrulloh Robbani

CoRR, August, 2025

Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle.

[BibT_eX]

[DOI]

CoRR, August, 2025

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching.

[BibT_eX]

[DOI]

CoRR, July, 2025

Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, May, 2025

DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment.

[BibT_eX]

[DOI]

CoRR, April, 2025

An Empirical Study of Ground Segmentation for 3-D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., March, 2025

ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation.

[BibT_eX]

[DOI]

CoRR, March, 2025

Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception.

[BibT_eX]

[DOI]

CoRR, March, 2025

The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, February, 2025

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation.

[BibT_eX]

[DOI]

CoRR, January, 2025

Layerlink: Bridging remote sensing object detection and large vision models with efficient fine-tuning.

[BibT_eX]

[DOI]

Pattern Recognit., 2025

AVS-Net: Point sampling with adaptive voxel size for 3D scene understanding.

[BibT_eX]

[DOI]

Neurocomputing, 2025

Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MINIMA: Modality Invariant Image Matching.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

A Unified Image-Dense Annotation Generation Model for Underwater Scenes.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

A Discrepancy Aware Framework for Robust Anomaly Detection.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Informatics, March, 2024

LATFormer: Locality-Aware Point-View Fusion Transformer for 3D shape recognition.

[BibT_eX]

[DOI]

Pattern Recognit., 2024

Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Anomaly Detection by Adapting a pre-trained Vision Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Point Cloud Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

SAM3D: zero-shot 3D object detection via the segment anything model.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2024

Not All Texts Are the Same: Dynamically Querying Texts for Scene Text Detection.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

PointMamba: A Simple State Space Model for Point Cloud Analysis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

A Unified Framework for 3D Scene Understanding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Make Your ViT-Based Multi-view 3D Detectors Faster via Token Compression.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Well Begun is Half Done: The Importance of Initialization in Dataset Distillation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

You Only Look Bottom-Up for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., November, 2023

Focal Inverse Distance Transform Maps for Crowd Localization.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution.

[BibT_eX]

[DOI]

CoRR, 2023

Diffusion-Based 3D Object Detection with Random Boxes.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Query-based Temporal Fusion with Explicit Motion for 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DDS3D: Dense Pseudo-Labels with Dynamic Threshold for Semi-Supervised 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-End Solution.

[BibT_eX]

[DOI]

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Super-Resolution Information Enhancement for Crowd Counting.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SOOD: Towards Semi-Supervised Oriented Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Cell Localization and Counting Using Direction Field Map.

[BibT_eX]

[DOI]

IEEE J. Biomed. Health Informatics, 2022

AutoScale: Learning to Scale for Crowd Counting.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2022

TransCrowd: weakly-supervised crowd counting with transformers.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2022

Comprehensive benchmark datasets for Amharic scene text detection and recognition.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2022

An End-to-End Transformer Model for Crowd Localization.

[BibT_eX]

[DOI]

Dingkang Liang

Wei Xu

Xiang Bai

Proceedings of the Computer Vision - ECCV 2022, 2022

When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Fault Diagnosis of Main Pump in Converter Station Based on Deep Neural Network.

[BibT_eX]

[DOI]

Symmetry, 2021

Dilated-Scale-Aware Category-Attention ConvNet for Multi-Class Object Counting.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2021

TransCrowd: Weakly-Supervised Crowd Counting with Transformer.

[BibT_eX]

[DOI]

CoRR, 2021

Reciprocal Distance Transform Maps for Crowd Counting and People Localization in Dense Crowd.

[BibT_eX]

[DOI]

CoRR, 2021

VisDrone-CC2021: The Vision Meets Drone Crowd Counting Challenge Results.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

2020

Dilated-Scale-Aware Attention ConvNet For Multi-Class Object Counting.

[BibT_eX]

[DOI]

CoRR, 2020

VisDrone-CC2020: The Vision Meets Drone Crowd Counting Challenge Results.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Dingkang Liang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...