Utkarsh Tyagi

Orcid: 0009-0000-1831-7709

According to our database1, Utkarsh Tyagi authored at least 25 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
MultiVox: Benchmarking Voice Assistants for Multimodal Interactions.
CoRR, July, 2025

ProSE: Diffusion Priors for Speech Enhancement.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities.
CoRR, 2024

ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions.
CoRR, 2024

VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap.
CoRR, 2024

Do Vision-Language Models Understand Compound Nouns?
CoRR, 2024

Do Vision-Language Models Understand Compound Nouns?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
ASPIRE: Language-Guided Augmentation for Robust Image Classification.
CoRR, 2023

BioAug: Conditional Generation based Data Augmentation for Low-Resource Biomedical NER.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

MMER: Multimodal Multi-task Learning for Speech Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

AdVerb: Visually Guided Audio Dereverberation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DALE: Generative Data Augmentation for Low-Resource Legal NLP.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
A novel multimodal dynamic fusion network for disfluency detection in spoken utterances.
CoRR, 2022

2021
mFlameGaze: Mobile-Based Flame Gazing for Improving Sustained Attention.
Proceedings of the 13th International Conference on COMmunication Systems & NETworkS, 2021

2019
Classifying Question Papers with Bloom's Taxonomy Using Machine Learning Techniques.
Proceedings of the Advances in Computing and Data Sciences, 2019


  Loading...