Preethi Jyothi

Darshan Prabhu

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

DeFT-X: Denoised Sparse Fine-Tuning for Zero-Shot Cross-Lingual Transfer.

[BibT_eX]

[DOI]

Sona Elza Simon

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

LASER: An LLM-based ASR Scoring and Evaluation Rubric.

[BibT_eX]

[DOI]

Amruta Parulekar

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

RECAST: Retrieval-Augmented Contextual ASR via Decoder-State Keyword Spotting.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving.

[BibT_eX]

[DOI]

Bhavani Shankar

Proceedings of the 31st International Conference on Computational Linguistics, 2025

LEVOS: Leveraging Vocabulary Overlap with Sanskrit to Generate Technical Lexicons in Indian Languages.

[BibT_eX]

[DOI]

Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications, 2025

LoFTI: Localization and Factuality Transfer to Indian Locales.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

LexGen: Domain-aware Multilingual Lexicon Generation.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR.

[BibT_eX]

[DOI]

CoRR, 2024

Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection.

[BibT_eX]

[DOI]

Ashish Sunil Agrawal

CoRR, 2024

CharSS: Character-Level Transformer Model for Sanskrit Word Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

LexGen: Domain-aware Multilingual Lexicon Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language.

[BibT_eX]

[DOI]

Yash Sharma

Basil Abraham

CoRR, 2024

WikiDO: A New Benchmark Evaluating Cross-Modal Retrieval for Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MULTI-CONVFORMER: Extending Conformer with Multiple Convolution Kernels.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Improving Self-supervised Pre-training using Accent-Specific Codebooks.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

SALSA: Speedy ASR-LLM Synchronous Aggregation.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Emotion Arithmetic: Emotional Speech Synthesis via Weight Space Interpolation.

[BibT_eX]

[DOI]

Pavan Kalyan

Preeti Rao

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DictDis: Dictionary Constrained Disambiguation for Improved NMT.

[BibT_eX]

[DOI]

Ayush Maheshwari

Ganesh Ramakrishnan

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

STORiCo: Storytelling TTS for Hindi with Character Voice Modulation.

[BibT_eX]

[DOI]

Pavan Tankala

Preeti Rao

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning.

[BibT_eX]

[DOI]

Ashish Sunil Agrawal

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

In-context Mixing (ICM): Code-mixed Prompts for Multilingual LLMs.

[BibT_eX]

[DOI]

Bhavani Shankar

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

DIMSIM: Distilled Multilingual Critics for Indic Text Simplification.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Part-of-speech Tagging for Extremely Low-resource Indian Languages.

[BibT_eX]

[DOI]

Sanjeev Kumar

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection.

[BibT_eX]

[DOI]

Ashish Agrawal

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Improving RNN-Transducers with Acoustic LookAhead.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Narrator or Character: Voice Modulation in an Expressive Multi-speaker TTS.

[BibT_eX]

[DOI]

Tankala Pavan Kalyan

Preeti Rao

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Unsupervised Code-switched Text Generation from Parallel Text.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction.

[BibT_eX]

[DOI]

Vineet Bhat

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

In-Situ Text-Only Adaptation of Speech Models with Low-Overhead Speech Imputations.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Zero-Shot Code-Switched Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Accented Speech Recognition With Accent-specific Codebooks.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages.

[BibT_eX]

[DOI]

Vineet Bhat

D. Chandra Sekhara Hetha Havya

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DITTO: Data-efficient and Fair Targeted Subset Selection for ASR Accent Adaptation.

[BibT_eX]

[DOI]

Suraj Kothawade

Anmol Reddy Mekala

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Zero-shot Cross-lingual Transfer With Learned Projections Using Unlabeled Target-Language Data.

[BibT_eX]

[DOI]

Ujan Deb

Ridayesh Parab

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Improving Pretraining Techniques for Code-Switched NLP.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Adversarial Training for Low-Resource Disfluency Correction.

[BibT_eX]

[DOI]

Vineet Bhat

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

DICTDIS: Dictionary Constrained Disambiguation for Improved NMT.

[BibT_eX]

[DOI]

CoRR, 2022

Investigating Modality Bias in Audio Visual Video Parsing.

[BibT_eX]

[DOI]

CoRR, 2022

Error Correction in ASR using Sequence-to-Sequence Models.

[BibT_eX]

[DOI]

CoRR, 2022

Linguistically Informed Post-processing for ASR Error correction in Sanskrit.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

VAgyojaka: An Annotating and Post-Editing Tool for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

SPLICEOUT: A Simple and Efficient Audio Augmentation Method.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Adaptive Discounting of Implicit Language Models in RNN-Transducers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

CoCoa: An Encoder-Decoder Model for Controllable Code-switched Generation.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training.

[BibT_eX]

[DOI]

Durga Sivasubramanian

Rishabh K. Iyer

Ganesh Ramakrishnan

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Zero-shot Disfluency Detection for Indian Languages.

[BibT_eX]

[DOI]

Rohit Kundu

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Aligning Multilingual Embeddings for Improved Code-switched Natural Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding.

[BibT_eX]

[DOI]

Soumya Chatterjee

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Personalizing ASR with limited data using targeted subset selection.

[BibT_eX]

[DOI]

CoRR, 2021

The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding.

[BibT_eX]

[DOI]

CoRR, 2021

Cross-Modal learning for Audio-Visual Video Parsing.

[BibT_eX]

[DOI]

CoRR, 2021

Multilingual and code-switching ASR challenges for low resource Indian languages.

[BibT_eX]

[DOI]

CoRR, 2021

Rudder: A Cross Lingual Video and Text Retrieval Dataset.

[BibT_eX]

[DOI]

CoRR, 2021

Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Cross-Modal Learning for Audio-Visual Video Parsing.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages.

[BibT_eX]

[DOI]

Anuj Diwan

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Perturb, Predict & Paraphrase: Semi-Supervised Learning using Noisy Student for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Cross Lingual Video and Text Retrieval: A New Benchmark Dataset and Algorithm.

[BibT_eX]

[DOI]

Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

An Investigation of End-to-End Models for Robust Speech Recognition.

[BibT_eX]

[DOI]

Archiki Prasad

Rajbabu Velmurugan

Proceedings of the IEEE International Conference on Acoustics, 2021

Collaborative Learning to Generate Audio-Video Jointly.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Error-Driven Fixed-Budget ASR Personalization for Accented Speakers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Meta-Learning for Effective Multi-task and Multilingual Modelling.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Disfluency Correction using Unsupervised and Semi-supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text.

[BibT_eX]

[DOI]

Ishan Tarunesh

Syamantak Kumar

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Reduce and Reconstruct: Improving Low-resource End-to-end ASR Via Reconstruction Using Reduced Vocabularies.

[BibT_eX]

[DOI]

Anuj Diwan

CoRR, 2020

Crowdsourcing Speech Data for Low-Resource Languages from Low-Income Workers.

[BibT_eX]

[DOI]

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Generating Fluent Translations from Disfluent Text Without Access to Fluent References: IIT Bombay@IWSLT2020.

[BibT_eX]

[DOI]

Nikhil Saini

Jyotsana Khatri

Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Improving Low Resource Code-Switched ASR Using Augmented Code-Switched TTS.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Caption Alignment for Low Resource Audio-Visual Data.

[BibT_eX]

[DOI]

Vighnesh Reddy Konda

Mayur Warialani

Rakesh Prasanth Achari

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Black-Box Adaptation of ASR for Accented Speech.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Coupled Training of Sequence-to-Sequence Models for Accented Speech Recognition.

[BibT_eX]

[DOI]

Vinit Unni

Nitish Joshi

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

How Accents Confound: Probing for Accent Information in End-to-End Speech Recognition Systems.

[BibT_eX]

[DOI]

Archiki Prasad

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Improved feature selection and classification for rheumatoid arthritis disease using weighted decision tree approach (REACT).

[BibT_eX]

[DOI]

Siva Shanmugam

Brij Mohan Lal Srivastava

J. Supercomput., 2019

Stem-driven Language Models for Morphologically Rich Languages.

[BibT_eX]

[DOI]

CoRR, 2019

End-to-End ASR for Code-switched Hindi-English Speech.

[BibT_eX]

[DOI]

CoRR, 2019

Exploiting Monolingual Speech Corpora for Code-Mixed Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Tale of Two Modalities for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Cross-Lingual Training for Automatic Question Generation.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Hindi Wordnet for Language Teaching: Experiences and Lessons Learnt.

[BibT_eX]

[DOI]

Hanumant Harichandra Redkar

Proceedings of the 9th Global Wordnet Conference, 2018

Synthesizing Audio for Hindi WordNet.

[BibT_eX]

[DOI]

Diptesh Kanojia

Proceedings of the 9th Global Wordnet Conference, 2018

Time Aggregation Operators for Multi-label Audio Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Improved Accented Speech Recognition Using Accent Embeddings and Multi-task Learning.

[BibT_eX]

[DOI]

Abhinav Jain

Minali Upreti

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Dual Language Models for Code Switched Speech Recognition.

[BibT_eX]

[DOI]

Saurabh Garg

Tanmay Parekh

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Generalizing Across Domains via Cross-Gradient Training.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Revisiting the Importance of Encoding Logic Rules in Sentiment Classification.

[BibT_eX]

[DOI]

Kalpesh Krishna

Mohit Iyyer

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Code-switched Language Models Using Dual RNNs and Same-Source Pretraining.

[BibT_eX]

[DOI]

Saurabh Garg

Tanmay Parekh

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017

ASR for Under-Resourced Languages From Probabilistic Transcription.

[BibT_eX]

[DOI]

Mark A. Hasegawa-Johnson

Daniel McCloy

Majid Mirbagheri

Giovanni M. Di Liberto

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Dual Language Models for Code Mixed Speech Recognition.

[BibT_eX]

[DOI]

Saurabh Garg

Tanmay Parekh

CoRR, 2017

Low-resource grapheme-to-phoneme conversion using recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Leveraging native language speech for accent identification using deep Siamese networks.

[BibT_eX]

[DOI]

Aditya Siddhant

Sriram Ganapathy

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Mismatched crowdsourcing: Mining latent skills to acquire speech transcriptions.

[BibT_eX]

[DOI]

Wenda Chen

Van Hai Do

Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016

Articulatory feature-based pronunciation modeling.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2016

Performance Improvements of Probabilistic Transcript-adapted ASR with Recurrent Neural Network and Language-specific Constraints.

[BibT_eX]

[DOI]

Xiang Kong

CoRR, 2016

Clustering-based Phonetic Projection in Mismatched Crowdsourcing Channels for Low-resourced ASR.

[BibT_eX]

[DOI]

Wenda Chen

Nancy F. Chen

Lav R. Varshney

Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016

Performance Improvement of Probabilistic Transcriptions with Language-specific Constraints.

[BibT_eX]

[DOI]

Xiang Kong

Proceedings of the SLTU-2016, 2016

Language coverage for mismatched crowdsourcing.

[BibT_eX]

[DOI]

Lav R. Varshney

Proceedings of the 2016 Information Theory and Applications Workshop, 2016

Automatic Speech Recognition Using Probabilistic Transcriptions in Swahili, Amharic, and Dinka.

[BibT_eX]

[DOI]

Amit Das

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Adapting ASR for under-resourced languages using mismatched transcriptions.

[BibT_eX]

[DOI]

Sanjeev Khudanpur

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Improved hindi broadcast ASR by adapting the language model and pronunciation model using a priori syntactic and morphophonemic knowledge.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Transcribing continuous speech using mismatched crowdsourcing.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Prosodic and structural correlates of perceived prominence in Russian and Hindi.

[BibT_eX]

[DOI]

Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Acquiring Speech Transcriptions Using Mismatched Crowdsourcing.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Revisiting Word Neighborhoods for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, 2014

2013

Conditional Random Fields in Speech, Audio, and Language Processing.

[BibT_eX]

[DOI]

Proc. IEEE, 2013

Discriminative training of WFST factors with application to pronunciation modeling.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012

Large-scale discriminative language model reranking for voice-search.

[BibT_eX]

[DOI]

Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012

Discriminatively learning factorized finite state pronunciation models from dynamic Bayesian networks.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Distributed discriminative language models for Google voice-search.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Lexical access experiments with context-dependent articulatory feature-based models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Investigations into the Crandem Approach to Word Recognition.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Discriminative language modeling using simulated ASR errors.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

A comparison of audio-free speech recognition error prediction methods.

[BibT_eX]

[DOI]