Ali Vosoughi
Orcid: 0000-0003-1014-2937
According to our database1,
Ali Vosoughi authored at least 20 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
IEEE Trans. Circuits Syst. Video Technol., February, 2026
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
PromptReverb: Multimodal Room Impulse Response Generation Through Latent Rectified Flow Matching.
CoRR, October, 2025
CoRR, October, 2025
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models.
CoRR, October, 2025
OPENXRD: A Comprehensive Benchmark and Enhancement Framework for LLM/MLLM XRD Question Answering.
CoRR, July, 2025
I<sup>2</sup>G: Generating Instructional Illustrations via Text-Conditioned Diffusion.
CoRR, May, 2025
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity.
CoRR, March, 2025
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model.
Proceedings of the 33rd European Signal Processing Conference, 2025
2024
Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA.
IEEE Trans. Multim., 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation.
CoRR, 2023
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA.
CoRR, 2023