Kalika Bali

Orcid: 0000-0001-9275-742X

According to our database1, Kalika Bali authored at least 81 papers between 2002 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures.
CoRR, 2024

MunTTS: A Text-to-Speech System for Mundari.
CoRR, 2024

Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks.
CoRR, 2023

Breaking Language Barriers with a LEAP: Learning Strategies for Polyglot LLMs.
CoRR, 2023

MEGA: Multilingual Evaluation of Generative AI.
CoRR, 2023

"Fifty Shades of Bias": Normative Ratings of Gender Bias in GPT Generated English Text.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MEGA: Multilingual Evaluation of Generative AI.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Can Large Language Models Support Medical Facilitation Work? A Speculative Analysis.
Proceedings of the 4th African Human Computer Interaction Conference, 2023

Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Learnings from Technological Interventions in a Low Resource Language: Enhancing Information Access in Gondi.
CoRR, 2022

Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models.
CoRR, 2022

Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi.
CoRR, 2022

Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?
CoRR, 2022

Too Brittle to Touch: Comparing the Stability of Quantization and Distillation towards Developing Low-Resource MT Models.
Proceedings of the Seventh Conference on Machine Translation, 2022

Language Patterns and Behaviour of the Peer Supporters in Multilingual Healthcare Conversational Forums.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

"#DisabledOnIndianTwitter" : A Dataset towards Understanding the Expression of People with Disabilities on Indian Twitter.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

The Six Conundrums of Building and Deploying Language Technologies for Social Good.
Proceedings of the COMPASS '22: ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies, Seattle, WA, USA, 29 June 2022, 2022

Global Readiness of Language Technology for Healthcare: What Would It Take to Combat the Next Pandemic?
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Feeling Proud, Feeling Embarrassed: Experiences of Low-income Women with Crowd Work.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

LITMUS Predictor: An AI Assistant for Building Reliable, High-Performing and Fair Multilingual NLP Systems.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Predicting the Performance of Multilingual NLP Models.
CoRR, 2021

Designing Language Technologies for Social Good: The Road not Taken.
CoRR, 2021

Multilingual and code-switching ASR challenges for low resource Indian languages.
CoRR, 2021

MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Language Translation as a Socio-Technical System: Case-Studies of Mixed-Initiative Interactions.
Proceedings of the COMPASS '21: ACM SIGCAS Conference on Computing and Sustainable Societies, Virtual Event, Australia, 28 June 2021, 2021

2020
Topical Focus of Political Campaigns and its Impact: Findings from Politicians' Hashtag Use during the 2019 Indian Elections.
Proc. ACM Hum. Comput. Interact., 2020

Do Multilingual Users Prefer Chat-bots that Code-mix? Let's Nudge and Find Out!
Proc. ACM Hum. Comput. Interact., 2020

Learnings from Technological Interventions in a Low Resource Language: A Case-Study on Gondi.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Crowdsourcing Speech Data for Low-Resource Languages from Low-Income Workers.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

The State and Fate of Linguistic Diversity and Inclusion in the NLP World.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Understanding Script-Mixing: A Case Study of Hindi-English Bilingual Twitter Users.
Proceedings of the The 4th Workshop on Computational Approaches to Code Switching, 2020

2019
Identifying and Analyzing Different Aspects of English-Hindi Code-Switching in Twitter.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019

Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities.
CoRR, 2019

INMT: Interactive Neural Machine Translation Prediction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Discovering Canonical Indian English Accents: A Crowdsourcing-based Approach.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

An Integrated Representation of Linguistic and Social Functions of Code-Switching.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

User Perception of Code-Switching Dialog Systems.
Proceedings of the 15th International Conference on Natural Language Processing, 2018

Language Modeling for Code-Mixing: The Role of Linguistic Theory based Synthetic Data.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Phone Merging For Code-Switched Speech Recognition.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Accommodation of Conversational Code-Choice.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

2017
Curriculum Design for Code-switching: Experiments with Language Identification and Language Modeling with Deep Neural Networks.
Proceedings of the 14th International Conference on Natural Language Processing, 2017

Estimating Code-Switching on Twitter with a Novel Generalized Word-Level Language Detection Technique.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Grammatical Constraints on Intra-sentential Code-Switching: From Theories to Working Models.
CoRR, 2016

Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Understanding Language Preference for Expression of Opinion and Sentiment: What do Hindi-English Speakers do on Twitter?
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
POS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments.
Proceedings of the 12th International Conference on Natural Language Processing, 2015

2014
Query expansion for mixed-script information retrieval.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

"ye word kis lang ka hai bhai?" Testing the Limits of Word level Language Identification.
Proceedings of the 11th International Conference on Natural Language Processing, 2014

POS Tagging of English-Hindi Code-Mixed Social Media Content.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System.
Proceedings of the First Workshop on Computational Approaches to Code Switching@EMNLP 2014, 2014

"I am borrowing ya mixing ?" An Analysis of English-Hindi Code Mixing in Facebook.
Proceedings of the First Workshop on Computational Approaches to Code Switching@EMNLP 2014, 2014

2013
Syllabification and stress assignment in phonetic Sanskrit text.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Automatically Identifying Vocal Expressions for Music Transcription.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

The Use Of Melodic Scales In Bollywood Music: An Empirical Study.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

A Hindi speech recognizer for an agricultural video search application.
Proceedings of the Annual Symposium on Computing for Development, 2013

VideoKheti: making video content accessible to low-literate and novice users.
Proceedings of the 2013 ACM SIGCHI Conference on Human Factors in Computing Systems, 2013

Entailment: An Effective Metric for Comparing and Evaluating Hierarchical and Non-hierarchical Annotation Schemes.
Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, 2013

Crowd Prefers the Middle Path: A New IAA Metric for Crowdsourcing Reveals Turker Biases in Query Segmentation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Can Modern Statistical Parsers Lead to Better Natural Language Understanding for Education?
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2012

2011
Query completion without query logs for song search.
Proceedings of the 20th International Conference on World Wide Web, 2011

Challenges in Designing Input Method Editors for Indian Lan-guages: The Role of Word-Origin and Context.
Proceedings of the Workshop on Advances in Text Input Methods, 2011

A Comparative Phonological Study of the Dialects of Hindi.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

2010
Resource Creation for Training and Testing of Transliteration Systems for Indian Languages.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Prosody cues for classification of the discourse particle "hã" in hindi.
Proceedings of the INTERSPEECH 2010, 2010

Enhancing ESL education in India with a reading tutor that listens.
Proceedings of the First ACM Annual Symposium on Computing for Development, 2010

2009
Real voice and TTS accent effects on intelligibility and comprehension for indian speakers of English as a second language.
Proceedings of the INTERSPEECH 2009, 2009

F0 cues for the discourse functions of "hã" in hindi.
Proceedings of the INTERSPEECH 2009, 2009

Voice key board: multimodal indic text input.
Proceedings of the 11th International Conference on Multimodal Interfaces, 2009

Complex Linguistic Annotation - No Easy Way Out! A Case from Bangla and Hindi POS Labeling Tasks.
Proceedings of the Third Linguistic Annotation Workshop, 2009

2008
Unexplored directions in spoken language technology for development.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

A Common Parts-of-Speech Tagset Framework for Indian Languages.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Designing a Common POS-Tagset Framework for Indian Languages.
Proceedings of the 6th Workshop on Asian Language Resources, 2008

2005
UPX: A New XML Representation for Annotated Datasets of Online Handwriting Data.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005

2004
Tools for the development of a Hindi speech synthesis system.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Automatic Generation of Compound Word Lexicon for Hindi Speech Synthesis.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Duration modeling for hindi text-to-speech synthesis system.
Proceedings of the INTERSPEECH 2004, 2004

2002
Language Technology Solutions in Simputer: an Overview.
Proceedings of the 2002 Language Engineering Conference (LEC 2002), 2002


  Loading...