Ashutosh Chaubey
Orcid: 0000-0002-8463-0012
According to our database1,
Ashutosh Chaubey authored at least 17 papers
between 2019 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox.
CoRR, May, 2026
GDPO-Listener: Expressive Interactive Head Generation via Auto-Regressive Flow Matching and Group reward-Decoupled Policy Optimization.
CoRR, March, 2026
MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization.
CoRR, March, 2026
CoRR, February, 2026
CoRR, January, 2026
Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026
LibreFace 2.0: A Generalizable Facial Expression Analysis Toolkit Leveraging Synthetic Data.
Proceedings of the 20th IEEE International Conference on Automatic Face and Gesture Recognition, 2026
2025
ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
2024
ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising.
CoRR, 2024
2023
Speaker-specific Thresholding for Robust Imposter Identification in Unseen Speaker Recognition.
CoRR, 2023
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
2020
2019
A Generative Adversarial Network Based Ensemble Technique for Automatic Evaluation of Machine Synthesized Speech.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019