Ming Cheng

Orcid: 0000-0002-6422-1748

Affiliations:
  • Dartmouth College, NH, Hanover, USA


According to our database1, Ming Cheng authored at least 15 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
ProtoVQA: An Adaptable Prototypical Framework for Explainable Fine-Grained Visual Question Answering.
CoRR, September, 2025

Learning Sparsity for Effective and Efficient Music Performance Question Answering.
CoRR, June, 2025

Music's Multimodal Complexity in AVQA: Why We Need More than General Multimodal LLMs.
CoRR, May, 2025

FT2TF: First-Person Statement Text-to-Talking Face Generation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

2024
Toward Short-Term Glucose Prediction Solely Based on CGM Time Series.
CoRR, 2024

CrossGP: Cross-Day Glucose Prediction Excluding Physiological Information.
CoRR, 2024

VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning.
CoRR, 2024

Learning Musical Representations for Music Performance Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

GluMarker: A Novel Predictive Modeling of Glycemic Control Through Digital Biomarkers.
Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024

Efflex: Efficient and Flexible Pipeline for Spatio-Temporal Trajectory Graph Modeling and Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
SAIC: Integration of Speech Anonymization and Identity Classification.
CoRR, 2023

AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

2021
DASGIL: Domain Adaptation for Semantic and Geometric-Aware Image-Based Localization.
IEEE Trans. Image Process., 2021

2020
DASGIL: Domain Adaptation for Semantic and Geometric-aware Image-based Localization.
CoRR, 2020


  Loading...