David R. Mortensen

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure.

Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Morpheme Induction for Emergent Language.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Searching for the Most Human-like Emergent Language.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

ZIPA: A family of efficient models for multilingual phone recognition.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Programming by Example meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models.

Niyati Bafna

Emily Chang

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

A Review of the Applications of Deep Learning-Based Emergent Communication.

Trans. Mach. Learn. Res., 2024

Derivational Morphology Reveals Analogical Generalization in Large Language Models.

Janet B. Pierrehumbert

CoRR, 2024

ELCC: the Emergent Language Corpus Collection.

CoRR, 2024

Carrot and Stick: Inducing Self-Motivation with Positive & Negative Feedback.

CoRR, 2024

Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction.

Atharva Naik

Kexun Zhang

CoRR, 2024

Neural Proto-Language Reconstruction.

CoRR, 2024

Mitigating the Linguistic Gap with Phonemic Representations for Robust Multilingual Language Understanding.

CoRR, 2024

XferBench: a Data-Driven Benchmark for Emergent Language.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Self-supervised Speech Representations Still Struggle with African American Vernacular English.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate.

Mrinmaya Sachan

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Phonotactic Complexity across Dialects.

Ryan Soh-Eun Shim

Kalvin Chang

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs.

Valentina Izrailevitch

Yunze Xiao

Hinrich Schütze

Leonie Weissweiler

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Improved Neural Protoform Reconstruction via Reflex Prediction.

Liang Lu

Jingzhi Wang

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Semisupervised Neural Proto-Language Reconstruction.

Liang Lu

Peirong Xie

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Wav2Gloss: Generating Interlinear Glossed Text from Speech.

Taiqi He

Kwanghee Choi

Lindia Tjuatja

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate.

Mrinmaya Sachan

CoRR, 2023

Construction Grammar Provides Unique Insight into Neural Language Models.

CoRR, 2023

ChatGPT MT: Competitive for High- (but Not Low-) Resource Languages.

Perez Ogayo

Proceedings of the Eighth Conference on Machine Translation, 2023

Multilingual TTS Accent Impressions for Accented ASR.

Georgios Karakasidis

Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Generalized Glossing Guidelines: An Explicit, Human- and Machine-Readable, Item-and-Process Convention for Morphological Annotation.

Ela Gulsen

Taiqi He

Jonathan Amith

Lindia Tjuatja

Lori S. Levin

Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, 2023

SigMoreFun Submission to the SIGMORPHON Shared Task on Interlinear Glossing.

Taiqi He

Lindia Tjuatja

Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, 2023

Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing.

Yanlin Feng

Adithya Pratapa

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

African Substrates Rather Than European Lexifiers to Augment African-diaspora Creole Translation.

Matthew Dean Stutzman

Stephen D. Richardson

Proceedings of the 4th Workshop on African Natural Language Processing, 2023

Transformed Protoform Reconstruction.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Automating Sound Change Prediction for Phylogenetic Inference: A Tukanoan Case Study.

Kalvin Chang

Proceedings of the 4th Workshop on Computational Approaches to Historical Language Change, 2023

2022

Mathematically Modeling the Lexicon Entropy of Emergent Language.

CoRR, 2022

Recommendations for Systematic Research on Emergent Language.

CoRR, 2022

Modeling Emergent Lexicon Formation with a Self-Reinforcing Stochastic Process.

CoRR, 2022

AUTOLEX: An Automatic Framework for Linguistic Exploration.

Aditi Chaudhary

Zaid Sheikh

CoRR, 2022

Learning the Ordering of Coordinate Compounds and Elaborate Expressions in Hmong, Lahu, and Chinese.

Chenxuan Cui

Katherine J. Zhang

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Hmong Corpus with Elaborate Expression Annotations.

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Phone Inventories and Recognition for Every Language.

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Data-adaptive Transfer Learning for Translation: A Case Study in Haitian and Jamaican.

Cameron J. Hogan

Nancy Fulda

Proceedings of the Fifth Workshop on Technologies for Machine Translation of Low-Resource Languages, 2022

When Is TTS Augmentation Through a Pivot Language Useful?

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ASR2K: Speech Recognition for Around 2000 Languages without Audio.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

WikiHan: A New Comparative Dataset for Chinese Languages.

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Zero-shot Learning for Grapheme to Phoneme Conversion with Language Ensemble.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Quantifying Cognitive Factors in Lexical Decline.

Trans. Assoc. Comput. Linguistics, 2021

Differentiable Allophone Graphs for Language-Universal Speech Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Phoneme Recognition Through Fine Tuning of Phonetic Representations: A Case Study on Luhya Language Varieties.

Kathleen Siminyu

Xinjian Li

Michael R. Marlo

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Tusom2021: A Phonetically Transcribed Speech Dataset from an Endangered Language for Universal Phone Recognition Experiments.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multilingual Phonetic Dataset for Low Resource Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2021

Evaluating the Morphosyntactic Well-formedness of Generated Texts.

Adithya Pratapa

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Cross-Cultural Similarity Features for Cross-Lingual Transfer Learning of Pragmatically Motivated Tasks.

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020

Ranking Transfer Languages with Pragmatically-Motivated Features for Multilingual Sentiment Analysis.

CoRR, 2020

Where New Words Are Born: Distributional Semantic Analysis of Neologisms and Their Semantic Neighborhoods.

Maria Ryskina

Ella Rabinovich

Taylor Berg-Kirkpatrick

Yulia Tsvetkov

CoRR, 2020

Characterizing Sociolinguistic Variation in the Competing Vaccination Communities.

Proceedings of the Social, Cultural, and Behavioral Modeling, 2020

AlloVera: A Multilingual Allophone Database.

Alan W. Black

Florian Metze

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Universal Phone Recognition with a Multilingual Allophone System.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Automatic Extraction of Rules Governing Morphological Agreement.

Aditi Chaudhary

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Towards Zero-Shot Learning for Automatic Phonemic Transcription.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Low-Resource Machine Translation using Interlinear Glosses.

CoRR, 2019

CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology.

CoRR, 2019

The ARIEL-CMU Systems for LoReHLT18.

CoRR, 2019

2018

The ARIEL-CMU situation frame detection pipeline for LoReHLT16: a model translation approach.

Mach. Transl., 2018

Epitran: Precision G2P for Many Languages.
