Kecheng Zheng

Orcid: 0000-0002-3450-400X

According to our database¹, Kecheng Zheng authored at least 66 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Vision-Centric Activation and Coordination for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

AutoStory: Generating Diverse Storytelling Images with Minimal Human Efforts.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., June, 2025

SynMotion: Semantic-Visual Adaptation for Motion Customized Video Generation.

[BibT_eX]

[DOI]

CoRR, June, 2025

VideoMAR: Autoregressive Video Generatio with Continuous Tokens.

[BibT_eX]

[DOI]

CoRR, June, 2025

Unfolding coupled convolutional sparse representation for multi-focus image fusion.

[BibT_eX]

[DOI]

Kecheng Zheng

Juan Cheng

Yu Liu

Inf. Fusion, 2025

Animate-X: Universal Character Image Animation with Enhanced Motion Representation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Aligned Better, Listen Better for Audio-Visual Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Framer: Interactive Frame Interpolation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Contextual AD Narration with Interleaved Multimodal Sequence.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Mimir: Improving Video Diffusion Models for Precise Text Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Learning Visual Generative Priors without Text.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Learning Naturally Aggregated Appearance for Efficient 3D Editing.

[BibT_eX]

[DOI]

Proceedings of the International Conference on 3D Vision, 2025

2024

Exert Diversity and Mitigate Bias: Domain Generalizable Person Re-identification with a Comprehensive Benchmark.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., November, 2024

Unleashing Knowledge Potential of Source Hypothesis for Source-Free Domain Adaptation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks.

[BibT_eX]

[DOI]

CoRR, 2024

MePT: Multi-Representation Guided Prompt Tuning for Vision-Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs.

[BibT_eX]

[DOI]

Shi Liu

Kecheng Zheng

Wei Chen

CoRR, 2024

DreamLIP: Language-Image Pre-training with Long Captions.

[BibT_eX]

[DOI]

CoRR, 2024

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

UKnow: A Unified Knowledge Protocol with Multimodal Knowledge Graph Datasets for Reasoning and Vision-Language Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DreamLIP: Language-Image Pre-training with Long Captions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Exploring Guided Sampling of Conditional GANs.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs.

[BibT_eX]

[DOI]

Shi Liu

Kecheng Zheng

Wei Chen

Proceedings of the Computer Vision - ECCV 2024, 2024

CoReS: Orchestrating the Dance of Reasoning and Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification.

[BibT_eX]

[DOI]

CoRR, 2023

GenDeF: Learning Generative Deformation Field for Video Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection.

[BibT_eX]

[DOI]

CoRR, 2023

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis.

[BibT_eX]

[DOI]

CoRR, 2023

Cones 2: Customizable Image Synthesis with Multiple Subjects.

[BibT_eX]

[DOI]

CoRR, 2023

CCSR-Net: Unfolding Coupled Convolutional Sparse Representation for Multi-focus Image Fusion.

[BibT_eX]

[DOI]

Kecheng Zheng

Juan Cheng

Yu Liu

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Customizable Image Synthesis with Multiple Subjects.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Cones: Concept Neurons in Diffusion Models for Customized Generation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-Organizing Pathway Expansion for Non-Exemplar Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Dependencies Emerging from Learning Massive Categories.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization.

[BibT_eX]

[DOI]

CoRR, 2022

Uncertainty-Aware Hierarchical Refinement for Incremental Implicitly-Refined Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Rank Diminishing in Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Principled Knowledge Extrapolation with GANs.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Unleashing Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Temporal Complementarity-Guided Reinforcement Learning for Image-to-Video Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Debiased Batch Normalization via Gaussian Process for Generalizable Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Calibrated Feature Decomposition for Generalizable Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2021

Semi-Supervised Domain Generalizable Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2021

Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval.

[BibT_eX]

[DOI]

CoRR, 2021

Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2021

Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Group-aware Label Transfer for Domain Adaptive Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Hierarchical Gumbel Attention Network for Text-based Person Search.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Stacked Convolutional Deep Encoding Network For Video-Text Retrieval.

[BibT_eX]

[DOI]

Rui Zhao

Kecheng Zheng

Zhengjun Zha

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

2019

Abstract Reasoning with Distracting Features.

[BibT_eX]

[DOI]

Kecheng Zheng

Zheng-Jun Zha

Wei Wei

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

LA-Net: Layout-Aware Dense Network for Monocular Depth Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Kecheng Zheng

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...