Ying Cheng

Orcid: 0000-0002-8964-3998

Affiliations:
  • Fudan University, Academy for Engineering and Technology, China


According to our database1, Ying Cheng authored at least 26 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MAGA-Bench: Machine-Augment-Generated Text via Alignment Detection Benchmark.
CoRR, January, 2026

EviMMQA: Multimodal question answering for medical evidence extraction in systematic reviews.
Pattern Recognit., 2026

2025
FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training.
CoRR, January, 2025

Semantic-Aware Hard Negative Mining for Medical Vision-Language Contrastive Pretraining.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

ConTrack3D: Contrastive Learning Contributes Concise 3D Multi-Object Tracking.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Uncertainty-Aware Dynamic Fusion for Multimodal Clinical Prediction Tasks.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

SplitOcc: Multi-Resolution Sparse Voxel for Efficient LiDAR-Based Semantic Scene Completion.
Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

RoBGuard: Enhancing LLMs to Assess Risk of Bias in Clinical Trial Documents.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Fine-Grained Knowledge-Guided Alignment for Medical Vision-Language Pre-Training.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Learning Music-Dance Representations Through Explicit-Implicit Rhythm Synchronization.
IEEE Trans. Multim., 2024

CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart.
CoRR, 2024

Mixtures of Experts for Audio-Visual Learning.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

CT<sup>2</sup>C-QA: Multimodal Question Answering over Chinese Text, Table and Chart.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DeepPointMap2: Accurate and Robust LiDAR-Visual SLAM with Neural Descriptors.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ADSNet: Cross-Domain LTV Prediction with an Adaptive Siamese Network in Advertising.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

2022
Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization.
CoRR, 2022

Modality-aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Self-Supervised Video Representation Learning with Motion-Contrastive Perception.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

2021
Exploring Logical Reasoning for Referring Expression Comprehension.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MPN: Multimodal Parallel Network for Audio-Visual Event Localization.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Improving Multimodal Speech Enhancement by Incorporating Self-Supervised and Curriculum Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication.
Proceedings of the 28th International Conference on Computational Linguistics, 2020


  Loading...