Martin Q. Ma

According to our database1, Martin Q. Ma authored at least 16 papers between 2020 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Video Active Perception: Effective Inference-Time Long-Form Video Understanding with Vision-Language Models.
CoRR, May, 2026

Act2See: Emergent Active Visual Perception for Video Reasoning.
CoRR, May, 2026

2025
Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech.
CoRR, December, 2025

Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2023
The Need for Unsupervised Outlier Model Selection: A Review and Evaluation of Internal Evaluation Strategies.
SIGKDD Explor., 2023

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Face-to-Face Contrastive Learning for Social Intelligence Question-Answering.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

Understanding Masked Autoencoders via Hierarchical Latent Variable Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Conditional Contrastive Learning with Kernel.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Conditional Contrastive Learning: Removing Undesirable Information in Self-Supervised Representations.
CoRR, 2021

A Large-scale Study on Unsupervised Outlier Model Selection: Do Internal Strategies Suffice?
CoRR, 2021

Self-supervised Representation Learning with Relative Predictive Coding.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Interpretable Multimodal Routing for Human Multimodal Language.
CoRR, 2020

Complex Transformer: A Framework for Modeling Complex-Valued Sequence.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020


  Loading...