Kecheng Zheng

Orcid: 0000-0002-3450-400X

According to our database1, Kecheng Zheng authored at least 64 papers between 2018 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
AutoStory: Generating Diverse Storytelling Images with Minimal Human Efforts.
Int. J. Comput. Vis., June, 2025

SynMotion: Semantic-Visual Adaptation for Motion Customized Video Generation.
CoRR, June, 2025

VideoMAR: Autoregressive Video Generatio with Continuous Tokens.
CoRR, June, 2025

Unfolding coupled convolutional sparse representation for multi-focus image fusion.
Inf. Fusion, 2025

Animate-X: Universal Character Image Animation with Enhanced Motion Representation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Aligned Better, Listen Better for Audio-Visual Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Framer: Interactive Frame Interpolation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Contextual AD Narration with Interleaved Multimodal Sequence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Mimir: Improving Video Diffusion Models for Precise Text Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Learning Visual Generative Priors without Text.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Learning Naturally Aggregated Appearance for Efficient 3D Editing.
Proceedings of the International Conference on 3D Vision, 2025

2024
Exert Diversity and Mitigate Bias: Domain Generalizable Person Re-identification with a Comprehensive Benchmark.
Int. J. Comput. Vis., November, 2024

Unleashing Knowledge Potential of Source Hypothesis for Source-Free Domain Adaptation.
IEEE Trans. Multim., 2024

MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks.
CoRR, 2024

MePT: Multi-Representation Guided Prompt Tuning for Vision-Language Model.
CoRR, 2024

Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs.
CoRR, 2024

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

UKnow: A Unified Knowledge Protocol with Multimodal Knowledge Graph Datasets for Reasoning and Vision-Language Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DreamLIP: Language-Image Pre-training with Long Captions.
Proceedings of the Computer Vision - ECCV 2024, 2024

Exploring Guided Sampling of Conditional GANs.
Proceedings of the Computer Vision - ECCV 2024, 2024

Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs.
Proceedings of the Computer Vision - ECCV 2024, 2024

CoReS: Orchestrating the Dance of Reasoning and Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification.
CoRR, 2023

GenDeF: Learning Generative Deformation Field for Video Generation.
CoRR, 2023

Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection.
CoRR, 2023

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort.
CoRR, 2023

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis.
CoRR, 2023

Cones 2: Customizable Image Synthesis with Multiple Subjects.
CoRR, 2023

CCSR-Net: Unfolding Coupled Convolutional Sparse Representation for Multi-focus Image Fusion.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Customizable Image Synthesis with Multiple Subjects.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation.
Proceedings of the International Conference on Machine Learning, 2023

Cones: Concept Neurons in Diffusion Models for Customized Generation.
Proceedings of the International Conference on Machine Learning, 2023

Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-Organizing Pathway Expansion for Non-Exemplar Class-Incremental Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Dependencies Emerging from Learning Massive Categories.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization.
CoRR, 2022

Uncertainty-Aware Hierarchical Refinement for Incremental Implicitly-Refined Classification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Rank Diminishing in Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Principled Knowledge Extrapolation with GANs.
Proceedings of the International Conference on Machine Learning, 2022

Unleashing Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Temporal Complementarity-Guided Reinforcement Learning for Image-to-Video Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Debiased Batch Normalization via Gaussian Process for Generalizable Person Re-identification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-identification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Calibrated Feature Decomposition for Generalizable Person Re-Identification.
CoRR, 2021

Semi-Supervised Domain Generalizable Person Re-Identification.
CoRR, 2021

Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval.
CoRR, 2021

Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification.
CoRR, 2021

Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Group-aware Label Transfer for Domain Adaptive Person Re-identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Hierarchical Gumbel Attention Network for Text-based Person Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Stacked Convolutional Deep Encoding Network For Video-Text Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

2019
Abstract Reasoning with Distracting Features.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
LA-Net: Layout-Aware Dense Network for Monocular Depth Estimation.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018


  Loading...