Ashutosh Chaubey

Orcid: 0000-0002-8463-0012

According to our database1, Ashutosh Chaubey authored at least 17 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox.
CoRR, May, 2026

GDPO-Listener: Expressive Interactive Head Generation via Auto-Regressive Flow Matching and Group reward-Decoupled Policy Optimization.
CoRR, March, 2026

MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization.
CoRR, March, 2026

AVERE: Improving Audiovisual Emotion Reasoning with Preference Optimization.
CoRR, February, 2026

Sparks of Rationality: Do Reasoning LLMs Align with Human Judgment and Choice?
CoRR, January, 2026

Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

LibreFace 2.0: A Generalizable Facial Expression Analysis Toolkit Leveraging Synthetic Data.
Proceedings of the 20th IEEE International Conference on Automatic Face and Gesture Recognition, 2026

2025
ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Ditailistener: Controllable High Fidelity Listener Video Generation with Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Can VLMs Recall Factual Associations From Visual References?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising.
CoRR, 2024

2023
Speaker-specific Thresholding for Robust Imposter Identification in Unseen Speaker Recognition.
CoRR, 2023

Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Improved Relation Networks for End-to-End Speaker Verification and Identification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2020
Universal Adversarial Perturbations: A Survey.
CoRR, 2020

2019
A Generative Adversarial Network Based Ensemble Technique for Automatic Evaluation of Machine Synthesized Speech.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019


  Loading...