Sunayana Sitaram

According to our database1, Sunayana Sitaram authored at least 66 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures.
CoRR, 2024

Private Benchmarking to Prevent Contamination and Improve Comparative Evaluation of LLMs.
CoRR, 2024

A Unified Framework and Dataset for Assessing Gender Bias in Vision-Language Models.
CoRR, 2024

CultureLLM: Incorporating Cultural Differences into Large Language Models.
CoRR, 2024

MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models.
CoRR, 2024

MAFIA: Multi-Adapter Fused Inclusive Language Models.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks.
CoRR, 2023

Breaking Language Barriers with a LEAP: Learning Strategies for Polyglot LLMs.
CoRR, 2023

MEGA: Multilingual Evaluation of Generative AI.
CoRR, 2023

Analysing the Masked Predictive Coding Training Criterion for Pre-Training a Speech Representation Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MEGA: Multilingual Evaluation of Generative AI.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Performance and Risk Trade-offs for Multi-word Text Prediction at Scale.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Fairness in Language Models Beyond English: Gaps and Challenges.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-Supervised Setting.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

On Evaluating and Mitigating Gender Biases in Multilingual Settings.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Comparative Study on the Impact of Model Compression Techniques on Fairness in Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages.
CoRR, 2022

Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Survey of Multilingual Models for Automatic Speech Recognition.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Multilingual CheckList: Generation and Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

On the Calibration of Massively Multilingual Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LITMUS Predictor: An AI Assistant for Building Reliable, High-Performing and Fair Multilingual NLP Systems.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Predicting the Performance of Multilingual NLP Models.
CoRR, 2021

Multilingual and code-switching ASR challenges for low resource Indian languages.
CoRR, 2021

MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

On the Universality of Deep Contextual Language Models.
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

GCM: A Toolkit for Generating Synthetic Code-mixed Text.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Cross-lingual and Multilingual Spoken Term Detection for Low-Resource Indian Languages.
CoRR, 2020

Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition.
CoRR, 2020

Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition.
CoRR, 2020

Crowdsourcing Speech Data for Low-Resource Languages from Low-Income Workers.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

GLUECoS: An Evaluation Benchmark for Code-Switched NLP.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A New Dataset for Natural Language Inference from Code-mixed Conversations.
Proceedings of the The 4th Workshop on Computational Approaches to Code Switching, 2020

2019
Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities.
CoRR, 2019

End-to-End ASR for Code-switched Hindi-English Speech.
CoRR, 2019

A Survey of Code-switched Speech and Language Processing.
CoRR, 2019

Using Monolingual Speech Recognition for Spoken Term Detection in Code-switched Hindi-English Speech.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

2018
Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Discovering Canonical Indian English Accents: A Crowdsourcing-based Approach.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Homophone Identification and Merging for Code-switched Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Effect of TTS Generated Audio on OOV Detection and Word Error Rate in ASR for Low-resource Languages.
Proceedings of the Interspeech 2018, 2018

Word Embeddings for Code-Mixed Language Processing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Language Modeling for Code-Mixing: The Role of Linguistic Theory based Synthetic Data.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Phone Merging For Code-Switched Speech Recognition.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Automatic Detection of Code-switching Style from Acoustics.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

2017
Speech Synthesis for Mixed-Language Navigation Instructions.
Proceedings of the Interspeech 2017, 2017

Curriculum Design for Code-switching: Experiments with Language Identification and Language Modeling with Deep Neural Networks.
Proceedings of the 14th International Conference on Natural Language Processing, 2017

2016
Open-Source Consumer-Grade Indic Text To Speech.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning.
Proceedings of the NAACL HLT 2016, 2016

Speech Synthesis of Code-Mixed Text.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
Universal grapheme-based speech synthesis.
Proceedings of the INTERSPEECH 2015, 2015

Using acoustics to improve pronunciation for synthesis of low resource languages.
Proceedings of the INTERSPEECH 2015, 2015

Using articulatory features and inferred phonological segments in zero resource speech processing.
Proceedings of the INTERSPEECH 2015, 2015

2013
Text to speech in new languages without a standardized orthography.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Bootstrapping Text-to-Speech for speech processing in languages without an orthography.
Proceedings of the IEEE International Conference on Acoustics, 2013

A Hindi speech recognizer for an agricultural video search application.
Proceedings of the Annual Symposium on Computing for Development, 2013

2012
Mining Data from Project LISTEN's Reading Tutor to Analyze Development of Children's Oral Reading Prosody.
Proceedings of the Twenty-Fifth International Florida Artificial Intelligence Research Society Conference, 2012

2011
Two methods for assessing oral reading prosody.
ACM Trans. Speech Lang. Process., 2011

What visual feedback should a reading tutor give children on their oral reading prosody?
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011

2009
DA-IICT Cross-lingual and Multilingual Corpora for Speaker Recognition.
Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009


  Loading...