Ying Cheng

Orcid: 0000-0002-8964-3998

Affiliations:

Fudan University, Academy for Engineering and Technology, China

According to our database¹, Ying Cheng authored at least 26 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

MAGA-Bench: Machine-Augment-Generated Text via Alignment Detection Benchmark.

[BibT_eX]

[DOI]

CoRR, January, 2026

EviMMQA: Multimodal question answering for medical evidence extraction in systematic reviews.

[BibT_eX]

[DOI]

Pattern Recognit., 2026

2025

FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training.

[BibT_eX]

[DOI]

CoRR, January, 2025

Semantic-Aware Hard Negative Mining for Medical Vision-Language Contrastive Pretraining.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

ConTrack3D: Contrastive Learning Contributes Concise 3D Multi-Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Uncertainty-Aware Dynamic Fusion for Multimodal Clinical Prediction Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

SplitOcc: Multi-Resolution Sparse Voxel for Efficient LiDAR-Based Semantic Scene Completion.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

RoBGuard: Enhancing LLMs to Assess Risk of Bias in Clinical Trial Documents.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Fine-Grained Knowledge-Guided Alignment for Medical Vision-Language Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Learning Music-Dance Representations Through Explicit-Implicit Rhythm Synchronization.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart.

[BibT_eX]

[DOI]

CoRR, 2024

Mixtures of Experts for Audio-Visual Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

CT<sup>2</sup>C-QA: Multimodal Question Answering over Chinese Text, Table and Chart.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DeepPointMap2: Accurate and Robust LiDAR-Visual SLAM with Neural Descriptors.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ADSNet: Cross-Domain LTV Prediction with an Adaptive Siamese Network in Advertising.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

2022

Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization.

[BibT_eX]

[DOI]

CoRR, 2022

Modality-aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Self-Supervised Video Representation Learning with Motion-Contrastive Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

2021

Exploring Logical Reasoning for Referring Expression Comprehension.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MPN: Multimodal Parallel Network for Audio-Visual Event Localization.

[BibT_eX]

[DOI]

Jiashuo Yu

Ying Cheng

Rui Feng

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Improving Multimodal Speech Enhancement by Incorporating Self-Supervised and Curriculum Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Ying Cheng

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...