Kai Chen

Orcid: 0000-0001-8436-6533

Affiliations:

Huazhong University of Science and Technology, School of Automation, National Key Laboratory of Science and Technology on Multi-spectral Information Processing, Wuhan, China

According to our database¹, Kai Chen authored at least 39 papers between 2015 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, July, 2025

NTIRE 2025 Challenge on UGC Video Enhancement: Methods and Results.

[BibT_eX]

[DOI]

CoRR, May, 2025

Corrupted but Not Broken: Rethinking the Impact of Corrupted Data in Visual Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, February, 2025

Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Automated Evaluation of Large Vision-Language Models on Self-Driving Corner Cases.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

NTIRE 2025 Challenge on UGC Video Enhancement: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Mixture of insighTful Experts (MoTE): The Synergy of Reasoning Chains and Expert Mixtures in Self-Alignment.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control.

[BibT_eX]

[DOI]

CoRR, 2024

PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution.

[BibT_eX]

[DOI]

CoRR, 2024

MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes.

[BibT_eX]

[DOI]

CoRR, 2024

Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MagicDrive: Street View Generation with Diverse 3D Geometry Control.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Implicit Concept Removal of Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Eyes Closed, Safety on: Protecting Multimodal LLMs via Image-to-Text Transformation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, 2023

TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt.

[BibT_eX]

[DOI]

CoRR, 2023

Unfolding Once is Enough: A Deployment-Friendly Transformer Unit for Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Mixed Autoencoder for Self-Supervised Visual Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results.

[BibT_eX]

[DOI]

Pablo Navarrete Michelini

CoRR, 2022

CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results.

[BibT_eX]

[DOI]

Pablo Navarrete Michelini

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

GCFSR: a Generative and Controllable Face Super Resolution Method Without Facial and GAN Priors.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

SODA10M: Towards Large-Scale Object Detection Benchmark for Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, 2021

SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2019

Learning Linear Regression via Single-Convolutional Layer for Visual Object Tracking.

[BibT_eX]

[DOI]

Kai Chen

Wenbing Tao

IEEE Trans. Multim., 2019

2018

Convolutional Regression for Visual Tracking.

[BibT_eX]

[DOI]

Kai Chen

Wenbing Tao

IEEE Trans. Image Process., 2018

Once for All: A Two-Flow Convolutional Neural Network for Visual Tracking.

[BibT_eX]

[DOI]

Kai Chen

Wenbing Tao

IEEE Trans. Circuits Syst. Video Technol., 2018

2017

Visual object tracking via enhanced structural correlation filter.

[BibT_eX]

[DOI]

Kai Chen

Wenbing Tao

Shoudong Han

Inf. Sci., 2017

The Visual Object Tracking VOT2017 Challenge Results.

[BibT_eX]

[DOI]

Abdelrahman Eldesokey

Alireza Memarmoghadam

Gorthi R. K. Sai Subrahmanyam

Goutam Bhat

Guan Huang

Guilherme Sousa Bastos

Kannappan Palaniappan

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2015

2D facial landmark model design by combining key points and inserted points.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2015

Kai Chen

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...