Kai Chen

Orcid: 0000-0001-8436-6533

Affiliations:
  • Huazhong University of Science and Technology, School of Automation, National Key Laboratory of Science and Technology on Multi-spectral Information Processing, Wuhan, China


According to our database1, Kai Chen authored at least 39 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving.
CoRR, July, 2025

NTIRE 2025 Challenge on UGC Video Enhancement: Methods and Results.
CoRR, May, 2025

Corrupted but Not Broken: Rethinking the Impact of Corrupted Data in Visual Instruction Tuning.
CoRR, February, 2025

Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models.
Trans. Mach. Learn. Res., 2025

TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Automated Evaluation of Large Vision-Language Models on Self-Driving Corner Cases.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025


Mixture of insighTful Experts (MoTE): The Synergy of Reasoning Chains and Expert Mixtures in Self-Alignment.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control.
CoRR, 2024

PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution.
CoRR, 2024

MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes.
CoRR, 2024

Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment.
CoRR, 2024

GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MagicDrive: Street View Generation with Diverse 3D Geometry Control.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Implicit Concept Removal of Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Eyes Closed, Safety on: Protecting Multimodal LLMs via Image-to-Text Transformation.
Proceedings of the Computer Vision - ECCV 2024, 2024

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning.
CoRR, 2023

TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models.
CoRR, 2023

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models.
CoRR, 2023

Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt.
CoRR, 2023

Unfolding Once is Enough: A Deployment-Friendly Transformer Unit for Super-Resolution.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Mixed Autoencoder for Self-Supervised Visual Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results.
CoRR, 2022

CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2022, 2022


GCFSR: a Generative and Controllable Face Super Resolution Method Without Facial and GAN Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
SODA10M: Towards Large-Scale Object Detection Benchmark for Autonomous Driving.
CoRR, 2021

SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2019
Learning Linear Regression via Single-Convolutional Layer for Visual Object Tracking.
IEEE Trans. Multim., 2019

2018
Convolutional Regression for Visual Tracking.
IEEE Trans. Image Process., 2018

Once for All: A Two-Flow Convolutional Neural Network for Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2018

2017
Visual object tracking via enhanced structural correlation filter.
Inf. Sci., 2017

The Visual Object Tracking VOT2017 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2015
2D facial landmark model design by combining key points and inserted points.
Expert Syst. Appl., 2015


  Loading...