Lechao Cheng

Orcid: 0000-0002-7546-9052

According to our database, Lechao Cheng authored at least 103 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
High-frequency structure transformer for magnetic resonance image super-resolution.
Pattern Recognit., 2026

2025
ODMixer: Fine-Grained Spatial-Temporal MLP for Metro Origin-Destination Prediction.
IEEE Trans. Knowl. Data Eng., September, 2025

SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition.
CoRR, August, 2025

Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation.
CoRR, August, 2025

Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering.
CoRR, August, 2025

StgcDiff: Spatial-Temporal Graph Condition Diffusion for Sign Language Transition Generation.
CoRR, June, 2025

Towards Fine-Grained Emotion Understanding via Skeleton-Based Micro-Gesture Recognition.
CoRR, June, 2025

SignAligner: Harmonizing Complementary Pose Modalities for Coherent Sign Language Generation.
CoRR, June, 2025

SSAM: Self-Supervised Association Modeling for Test-Time Adaption.
CoRR, June, 2025

Self-Classification Enhancement and Correction for Weakly Supervised Object Detection.
CoRR, May, 2025

Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach.
CoRR, April, 2025

Mixed Attention and Channel Shift Transformer for Efficient Action Recognition.
ACM Trans. Multim. Comput. Commun. Appl., March, 2025

Text-Driven Diffusion Model for Sign Language Production.
CoRR, March, 2025

Knowledge Swapping via Learning and Unlearning.
CoRR, February, 2025

Navigating Semantic Drift in Task-Agnostic Class-Incremental Learning.
CoRR, February, 2025

Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching.
IEEE Trans. Image Process., 2025

Temporal multi-modal knowledge graph generation for link prediction.
Neural Networks, 2025

Efficient Vision Language Model Fine-tuning for Text-based Person Anomaly Search.
Companion Proceedings of the ACM on Web Conference 2025, 2025

Beyond General Alignment: Fine-Grained Entity-Centric Image-Text Matching with Multimodal Attentive Experts.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SLRTP2025 Sign Language Production Challenge: Methodology, Results and Future Work.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Separating Noisy Samples From Tail Classes for Long-Tailed Image Classification With Label Noise.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

Progressive Adapting and Pruning: Domain-Incremental Learning for Saliency Prediction.
ACM Trans. Multim. Comput. Commun. Appl., August, 2024

ScrollTimes: Tracing the Provenance of Paintings as a Window Into History.
IEEE Trans. Vis. Comput. Graph., June, 2024

Efficient Unsupervised Video Hashing With Contextual Modeling and Structural Controlling.
IEEE Trans. Multim., 2024

Life regression based patch slimming for vision transformers.
Neural Networks, 2024

Mixed Resolution Network with hierarchical motion modeling for efficient action recognition.
Knowl. Based Syst., 2024

Multimodality-guided Visual-Caption Semantic Enhancement.
Comput. Vis. Image Underst., 2024

Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation.
CoRR, 2024

Modality Alignment Meets Federated Broadcasting.
CoRR, 2024

FoPru: Focal Pruning for Efficient Large Vision-Language Models.
CoRR, 2024

Dataset Distillers Are Good Label Denoisers In the Wild.
CoRR, 2024

Diffusion-based Layer-wise Semantic Reconstruction for Unsupervised Out-of-Distribution Detection.
CoRR, 2024

EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning.
CoRR, 2024

Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing.
CoRR, 2024

PruningBench: A Comprehensive Benchmark of Structural Pruning.
CoRR, 2024

A Large-scale Universal Evaluation Benchmark For Face Forgery Detection.
CoRR, 2024

FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation.
CoRR, 2024

Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation.
CoRR, 2024

Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

JPA: A Joint-Part Attention for Mitigating Overfocusing on 3D Human Pose Estimation.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Benchmarking Multi-Scene Fire and Smoke Detection.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Behavior Capture Based Explainable Engagement Recognition.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Diffusion-based Layer-wise Semantic Reconstruction for Unsupervised Out-of-Distribution Detection.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Fire and Smoke Detection with Burning Intensity Representation.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

LoopGaussian: Creating 3D Cinemagraph with Multi-view Images via Eulerian Motion Field.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Revisiting the Power of Prompt for Visual Tuning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Knowledge Distillation via Regularizing Feature Direction and Norm.
Proceedings of the Computer Vision - ECCV 2024, 2024

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing.
Proceedings of the Computer Vision - ECCV 2024, 2024

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques.
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

Open-Vocabulary Video Relation Extraction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Progressive Feature Self-Reinforcement for Weakly Supervised Semantic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

ViT-Calibrator: Decision Stream Calibration for Vision Transformer.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
KE-RCNN: Unifying Knowledge-Based Reasoning Into Part-Level Attribute Parsing.
IEEE Trans. Cybern., November, 2023

A privacy-aware visual query approach for location-based data.
Comput. Graph., October, 2023

Disassembling Convolutional Segmentation Network.
Int. J. Comput. Vis., July, 2023

Reliable Mutual Distillation for Medical Image Segmentation Under Imperfect Annotations.
IEEE Trans. Medical Imaging, June, 2023

From External to Internal: Structuring Image for Text-to-Image Attributes Manipulation.
IEEE Trans. Multim., 2023

3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing.
CoRR, 2023

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding.
CoRR, 2023

Integrating UMLS Knowledge into Large Language Models for Medical Question Answering.
CoRR, 2023

NLPBench: Evaluating Large Language Models on Solving NLP Problems.
CoRR, 2023

ScrollTimes: Tracing the Provenance of Paintings as a Window into History.
CoRR, 2023

Improving Knowledge Distillation via Regularizing Feature Norm and Direction.
CoRR, 2023

Mitigating Undisciplined Over-Smoothing in Transformer for Weakly Supervised Semantic Segmentation.
CoRR, 2023

Propheter: Prophetic Teacher Guided Long-Tailed Distribution Learning.
CoRR, 2023

Revisiting Long-tailed Image Classification: Survey and Benchmarks with New Evaluation Metrics.
CoRR, 2023

Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt.
CoRR, 2023

A Deep Multi-Task Network to Learn Tumor Pathological Representations for Lymph Node Metastasis Prediction.
Proceedings of the MEDINFO 2023 - The Future Is Accessible, 2023

Propheter: Prophetic Teacher Guided Long-Tailed Distribution Learning.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

SASFormer: Transformers for Sparsely Annotated Semantic Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Text-Guided Mask-Free Local Image Retouching.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Team DETR: Guide Queries as a Professional Team in Detection Transformers.
Proceedings of the IEEE International Conference on Image Processing, 2023

Model Doctor for Diagnosing and Treating Segmentation Error.
Proceedings of the IEEE International Conference on Image Processing, 2023

A Hierarchy-driven Multi-label Network with Label Constraints for Post-operative Complication Prediction of Lung Cancer.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023

Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

De-biased Teacher: Rethinking IoU Matching for Semi-supervised Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Transferability Estimation Based On Principal Gradient Expectation.
CoRR, 2022

A Survey of Neural Trees.
CoRR, 2022

Combating Noisy Labels in Long-Tailed Image Classification.
CoRR, 2022

ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition.
CoRR, 2022

CNN LEGO: Disassembling and Assembling Convolutional Neural Network.
CoRR, 2022

Cell Segmenter: A General Framework for Multi-modality Cell Segmentation.
Proceedings of The Cell Segmentation Challenge in Multi-modality High-Resolution Microscopy Images, 2022

Long-term Leap Attention, Short-term Periodic Shift for Video Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Mix-DANN and Dynamic-Modal-Distillation for Video Domain Adaptation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Cross-Modality High-Frequency Transformer for MR Image Super-Resolution.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dual Domain-Adversarial Learning for Audio-Visual Saliency Prediction.
HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, 2022

Compound Batch Normalization for Long-tailed Image Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Re-Attention Transformer for Weakly Supervised Object Localization.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Weakly Supervised Semantic Segmentation via Alternative Self-Dual Teaching.
CoRR, 2021

Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing.
CoRR, 2021

Boundary Knowledge Translation based Reference Semantic Segmentation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Edge-competing Pathological Liver Vessel Segmentation with Limited Labels.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Visual Boundary Knowledge Translation for Foreground Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2019
A Synthesis-by-Analysis Network with Applications in Image Super-Resolution.
Proceedings of the Advances in Computer Graphics, 2019

2018
Intrinsic Image Transformation via Scale Space Decomposition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2015
audeosynth: music-driven video montage.
ACM Trans. Graph., 2015
