Preethi Jyothi

According to our database, Preethi Jyothi authored at least 94 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language.
CoRR, 2024

STORiCo: Storytelling TTS for Hindi with Character Voice Modulation.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
Improving RNN-Transducers with Acoustic LookAhead.
CoRR, 2023

DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction.
CoRR, 2023

Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

In-Situ Text-Only Adaptation of Speech Models with Low-Overhead Speech Imputations.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Zero-Shot Code-Switched Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Accented Speech Recognition With Accent-specific Codebooks.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DITTO: Data-efficient and Fair Targeted Subset Selection for ASR Accent Adaptation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Zero-shot Cross-lingual Transfer With Learned Projections Using Unlabeled Target-Language Data.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Improving Pretraining Techniques for Code-Switched NLP.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Adversarial Training for Low-Resource Disfluency Correction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
DICTDIS: Dictionary Constrained Disambiguation for Improved NMT.
CoRR, 2022

Investigating Modality Bias in Audio Visual Video Parsing.
CoRR, 2022

Error Correction in ASR using Sequence-to-Sequence Models.
CoRR, 2022

Linguistically Informed Post-processing for ASR Error correction in Sanskrit.
Proceedings of the Interspeech 2022, 2022

VAgyojaka: An Annotating and Post-Editing Tool for Automatic Speech Recognition.
Proceedings of the Interspeech 2022, 2022

SPLICEOUT: A Simple and Efficient Audio Augmentation Method.
Proceedings of the Interspeech 2022, 2022

Adaptive Discounting of Implicit Language Models in RNN-Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2022

CoCoa: An Encoder-Decoder Model for Controllable Code-switched Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Zero-shot Disfluency Detection for Indian Languages.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Aligning Multilingual Embeddings for Improved Code-switched Natural Language Understanding.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Personalizing ASR with limited data using targeted subset selection.
CoRR, 2021

The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding.
CoRR, 2021

Cross-Modal learning for Audio-Visual Video Parsing.
CoRR, 2021

Multilingual and code-switching ASR challenges for low resource Indian languages.
CoRR, 2021

Rudder: A Cross Lingual Video and Text Retrieval Dataset.
CoRR, 2021

Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Cross-Modal Learning for Audio-Visual Video Parsing.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Perturb, Predict & Paraphrase: Semi-Supervised Learning using Noisy Student for Image Captioning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Cross Lingual Video and Text Retrieval: A New Benchmark Dataset and Algorithm.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

An Investigation of End-to-End Models for Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Collaborative Learning to Generate Audio-Video Jointly.
Proceedings of the IEEE International Conference on Acoustics, 2021

Error-Driven Fixed-Budget ASR Personalization for Accented Speakers.
Proceedings of the IEEE International Conference on Acoustics, 2021

Meta-Learning for Effective Multi-task and Multilingual Modelling.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Disfluency Correction using Unsupervised and Semi-supervised Learning.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Reduce and Reconstruct: Improving Low-resource End-to-end ASR Via Reconstruction Using Reduced Vocabularies.
CoRR, 2020

Crowdsourcing Speech Data for Low-Resource Languages from Low-Income Workers.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Generating Fluent Translations from Disfluent Text Without Access to Fluent References: IIT Bombay@IWSLT2020.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Improving Low Resource Code-Switched ASR Using Augmented Code-Switched TTS.
Proceedings of the Interspeech 2020, 2020

Caption Alignment for Low Resource Audio-Visual Data.
Proceedings of the Interspeech 2020, 2020

Black-Box Adaptation of ASR for Accented Speech.
Proceedings of the Interspeech 2020, 2020

Coupled Training of Sequence-to-Sequence Models for Accented Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

How Accents Confound: Probing for Accent Information in End-to-End Speech Recognition Systems.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Improved feature selection and classification for rheumatoid arthritis disease using weighted decision tree approach (REACT).
J. Supercomput., 2019

Stem-driven Language Models for Morphologically Rich Languages.
CoRR, 2019

End-to-End ASR for Code-switched Hindi-English Speech.
CoRR, 2019

Exploiting Monolingual Speech Corpora for Code-Mixed Speech Recognition.
Proceedings of the Interspeech 2019, 2019

A Tale of Two Modalities for Video Captioning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Cross-Lingual Training for Automatic Question Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Hindi Wordnet for Language Teaching: Experiences and Lessons Learnt.
Proceedings of the 9th Global Wordnet Conference, 2018

Synthesizing Audio for Hindi WordNet.
Proceedings of the 9th Global Wordnet Conference, 2018

Time Aggregation Operators for Multi-label Audio Event Detection.
Proceedings of the Interspeech 2018, 2018

Improved Accented Speech Recognition Using Accent Embeddings and Multi-task Learning.
Proceedings of the Interspeech 2018, 2018

Dual Language Models for Code Switched Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Generalizing Across Domains via Cross-Gradient Training.
Proceedings of the 6th International Conference on Learning Representations, 2018

Revisiting the Importance of Encoding Logic Rules in Sentiment Classification.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Code-switched Language Models Using Dual RNNs and Same-Source Pretraining.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
ASR for Under-Resourced Languages From Probabilistic Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Dual Language Models for Code Mixed Speech Recognition.
CoRR, 2017

Low-resource grapheme-to-phoneme conversion using recurrent neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Leveraging native language speech for accent identification using deep Siamese networks.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Mismatched crowdsourcing: Mining latent skills to acquire speech transcriptions.
Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016
Articulatory feature-based pronunciation modeling.
Comput. Speech Lang., 2016

Performance Improvements of Probabilistic Transcript-adapted ASR with Recurrent Neural Network and Language-specific Constraints.
CoRR, 2016

Clustering-based Phonetic Projection in Mismatched Crowdsourcing Channels for Low-resourced ASR.
Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016

Performance Improvement of Probabilistic Transcriptions with Language-specific Constraints.
Proceedings of the SLTU-2016, 2016

Language coverage for mismatched crowdsourcing.
Proceedings of the 2016 Information Theory and Applications Workshop, 2016

Automatic Speech Recognition Using Probabilistic Transcriptions in Swahili, Amharic, and Dinka.
Proceedings of the Interspeech 2016, 2016

Adapting ASR for under-resourced languages using mismatched transcriptions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Improved Hindi broadcast ASR by adapting the language model and pronunciation model using a priori syntactic and morphophonemic knowledge.
Proceedings of the INTERSPEECH 2015, 2015

Transcribing continuous speech using mismatched crowdsourcing.
Proceedings of the INTERSPEECH 2015, 2015

Prosodic and structural correlates of perceived prominence in Russian and Hindi.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Acquiring Speech Transcriptions Using Mismatched Crowdsourcing.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Revisiting Word Neighborhoods for Speech Recognition.
Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, 2014

2013
Conditional Random Fields in Speech, Audio, and Language Processing.
Proc. IEEE, 2013

Discriminative training of WFST factors with application to pronunciation modeling.
Proceedings of the INTERSPEECH 2013, 2013

2012
Large-scale discriminative language model reranking for voice-search.
Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012

Discriminatively learning factorized finite state pronunciation models from dynamic Bayesian networks.
Proceedings of the INTERSPEECH 2012, 2012

Distributed discriminative language models for Google voice-search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Lexical access experiments with context-dependent articulatory feature-based models.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Investigations into the Crandem Approach to Word Recognition.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association for Computational Linguistics, 2010

Discriminative language modeling using simulated ASR errors.
Proceedings of the INTERSPEECH 2010, 2010

2009
A comparison of audio-free speech recognition error prediction methods.
Proceedings of the INTERSPEECH 2009, 2009

