Antonios Anastasopoulos

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Follow the Beaten Path: The Role of Route Patterns on Vision-Language Navigation Agents Generalization Abilities.

[BibT_eX]

[DOI]

Kourosh T. Baghaei

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Script-Agnosticism and its Impact on Language Identification for Dravidian Languages.

[BibT_eX]

[DOI]

Joshua Otten

Irianna Linardaki Vasileiadi

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

VMWE identification with models trained on GUD (a UDv.2 treebank of Standard Modern Greek).

[BibT_eX]

[DOI]

Stella Markantonatou

Vivian Stamou

Stavros Bompolas

Katerina Anastasopoulou

Konstantinos Diamantopoulos

Yannis Kazos

Proceedings of the 21st Workshop on Multiword Expressions, 2025

GMU Systems for the IWSLT 2025 Low-Resource Speech Translation Shared Task.

[BibT_eX]

[DOI]

Chutong Meng

Proceedings of the 22nd International Conference on Spoken Language Translation, 2025

Findings of the IWSLT 2025 Evaluation Campaign.

[BibT_eX]

[DOI]

Victor Agostinelli

Tanel Alumäe

Chandresh Kumar Maurya

Proceedings of the 22nd International Conference on Spoken Language Translation, 2025

The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties.

[BibT_eX]

[DOI]

Antonis Anastasopoulos

Hung-yi Lee

Karen Livescu

Shinji Watanabe

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Graph Enhanced Trajectory Anomaly Detection.

[BibT_eX]

[DOI]

Jonathan Mbuya

Proceedings of the 33rd ACM International Conference on Advances in Geographic Information Systems, 2025

Dialectal Toxicity Detection: Evaluating LLM-as-a-Judge Consistency Across Language Varieties.

[BibT_eX]

[DOI]

Md Mushfiqur Rahman

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Tracing L1 Interference in English Learner Writing: A Longitudinal Corpus with Error Annotations.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Testing the Boundaries of LLMs: Dialectal and Language-Variety Tasks.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Large Language Models as a Normalizer for Transliteration and Dialectal Translation.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Costs and Benefits of AI-Enabled Topic Modeling in P-20 Research: The Case of School Improvement Plans.

[BibT_eX]

[DOI]

Syeda Sabrina Akter

Seth Hunter

David Woo

Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications, 2025

Cross-Lingual Representation Alignment Through Contrastive Image-Caption Tuning.

[BibT_eX]

[DOI]

Nathaniel Krasner

Nicholas Lanuzo

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

Dialect Normalization using Large Language Models and Morphological Rules.

[BibT_eX]

[DOI]

Antonios Dimakis

John Pavlopoulos

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Clinical risk prediction using language models: benefits and considerations.

[BibT_eX]

[DOI]

Sanmay Das

J. Am. Medical Informatics Assoc., 2024

Findings of the IWSLT 2024 Evaluation Campaign.

[BibT_eX]

[DOI]

CoRR, 2024

Birdie: Advancing State Space Models with Reward-Driven Objectives and Curricula.

[BibT_eX]

[DOI]

Sam Blouir

Jimmy T. H. Smith

Amarda Shehu

CoRR, 2024

Script-Agnostic Language Identification.

[BibT_eX]

[DOI]

Joshua Otten

CoRR, 2024

Unlearning Climate Misinformation in Large Language Models.

[BibT_eX]

[DOI]

Dimitrios Stamoulis

CoRR, 2024

EmoMix-3L: A Code-Mixed Dataset for Bangla-English-Hindi Emotion Detection.

[BibT_eX]

[DOI]

Md. Nishat Raihan

Dhiman Goswami

Antara Mahmud

Marcos Zampieri

CoRR, 2024

CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models.

[BibT_eX]

[DOI]

Zaid Sheikh

CoRR, 2024

An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages.

[BibT_eX]

[DOI]

CoRR, 2024

A Case Study on Filtering for End-to-End Speech Translation.

[BibT_eX]

[DOI]

CoRR, 2024

A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages.

[BibT_eX]

[DOI]

CoRR, 2024

Findings of the WMT 2024 Shared Task of the Open Language Data Initiative.

[BibT_eX]

[DOI]

Jean Maillard

Laurie Burchell

Christian Federmann

Philipp Koehn

Skyler Wang

Proceedings of the Ninth Conference on Machine Translation, 2024

Data-Augmentation-Based Dialectal Adaptation for LLMs.

[BibT_eX]

[DOI]

Proceedings of the Eleventh Workshop on NLP for Similar Languages, Varieties, and Dialects, 2024

Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers.

[BibT_eX]

[DOI]

Roy Xie

Orevaoghene Ahia

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Global Gallery: The Fine Art of Painting Culture Portraits through Multilingual Instruction Tuning.

[BibT_eX]

[DOI]

Aylin Caliskan

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

A Study on Scaling Up Multilingual News Framing Analysis.

[BibT_eX]

[DOI]

Syeda Sabrina Akter

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

FINDINGS OF THE IWSLT 2024 EVALUATION CAMPAIGN.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Spoken Language Translation, 2024

Speech Recognition for Greek Dialects: A Challenging Benchmark.

[BibT_eX]

[DOI]

Georgios Paraskevopoulos

Antonios Dimakis

Stella Markantonatou

Angela Ralli

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing.

[BibT_eX]

[DOI]

Brian Yan

Xuankai Chang

Yuya Fujita

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization.

[BibT_eX]

[DOI]

Amir Hussein

Brian Yan

Shinji Watanabe

Sanjeev Khudanpur

Proceedings of the IEEE International Conference on Acoustics, 2024

Trajectory Anomaly Detection with Language Models.

[BibT_eX]

[DOI]

Jonathan Kabala Mbuya

Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, 2024

Urban Mobility Assessment Using LLMs.

[BibT_eX]

[DOI]

Prabin Bhandari

Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, 2024

BiasDora: Exploring Hidden Biased Associations in Vision-Language Models.

[BibT_eX]

[DOI]

Aylin Caliskan

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Back to School: Translation Using Grammar Books.

[BibT_eX]

[DOI]

Jonathan Hus

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Gloss2Text: Sign Language Gloss translation using LLMs and Semantically Aware Label Smoothing.

[BibT_eX]

[DOI]

Pooya Fayyazsanavi

Jana Kosecka

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead?

[BibT_eX]

[DOI]

Alexander S. Choi

Syeda Sabrina Akter

JP Singh

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Birdie: Advancing State Space Language Modeling with Dynamic Mixtures of Training Objectives.

[BibT_eX]

[DOI]

Sam Blouir

Jimmy T. H. Smith

Amarda Shehu

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SALSA: Salience-Based Switching Attack for Adversarial Perturbations in Fake News Detection Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2024

CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Language and Speech Technology for Central Kurdish Varieties.

[BibT_eX]

[DOI]

Daban Q. Jaff

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Breaking Bias, Building Bridges: Evaluation and Mitigation of Social Biases in LLMs via Contact Hypothesis.

[BibT_eX]

[DOI]

Aylin Caliskan

Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024

DIALECTBENCH: An NLP Benchmark for Dialects, Varieties, and Closely-Related Languages.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages.

[BibT_eX]

[DOI]

Antonios Dimakis

Stella Markantonatou

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Offensive Language Identification in Transliterated and Code-Mixed Bangla.

[BibT_eX]

[DOI]

Marcos Zampieri

CoRR, 2023

To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer.

[BibT_eX]

[DOI]

Md Mushfiqur Rahman

Fardin Ahsan Sakib

CoRR, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.

[BibT_eX]

[DOI]

CoRR, 2023

Approaches to Corpus Creation for Low-Resource Language Technology: the Case of Southern Kurdish and Laki.

[BibT_eX]

[DOI]

Zahra Azin

Sara Belelli

CoRR, 2023

User-Centric Evaluation of OCR Systems for Kwak'wala.

[BibT_eX]

[DOI]

Daisy Rosenblum

Michayla King

CoRR, 2023

PALI: A Language Identification Benchmark for Perso-Arabic Scripts.

[BibT_eX]

[DOI]

Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters.

[BibT_eX]

[DOI]

Ruoyu Xie

Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

GMU Systems for the IWSLT 2023 Dialect and Low-resource Speech Translation Tasks.

[BibT_eX]

[DOI]

Jonathan Mbuya

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Findings of the IWSLT 2023 Evaluation Campaign.

[BibT_eX]

[DOI]

Sweta Agrawal

Alexandra Chronopoulou

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards a Universal Python: Translating the Natural Modality of Python into Other Human Languages.

[BibT_eX]

[DOI]

Joshua Otten

Kevin Moran

Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2023

Are Large Language Models Geospatially Knowledgeable?

[BibT_eX]

[DOI]

Prabin Bhandari

Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Global Voices, Local Biases: Socio-Cultural Prejudices across Languages.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning.

[BibT_eX]

[DOI]

Arun Sampath

Ashwin Sheshadri

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Mitigating Societal Harms in Large Language Models.

[BibT_eX]

[DOI]

Sachin Kumar

Vidhisha Balachandran

Lucille Njoo

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Noisy Parallel Data Alignment.

[BibT_eX]

[DOI]

Ruoyu Xie

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey.

[BibT_eX]

[DOI]

Sachin Kumar

Vidhisha Balachandran

Lucille Njoo

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

BIG-C: a Multimodal Multi-Purpose Dataset for Bemba.

[BibT_eX]

[DOI]

Claytone Sikasote

Eunice Mukonde

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Geographic and Geopolitical Biases of Language Models.

[BibT_eX]

[DOI]

CoRR, 2022

Educational Tools for Mapuzugun.

[BibT_eX]

[DOI]

Cristian Ahumada

Claudio Gutierrez

CoRR, 2022

AUTOLEX: An Automatic Framework for Linguistic Exploration.

[BibT_eX]

[DOI]

Zaid Sheikh

David R. Mortensen

CoRR, 2022

Language Adapters for Large-Scale MT: The GMU System for the WMT 2022 Large-Scale Machine Translation Evaluation for African Languages Shared Task.

[BibT_eX]

[DOI]

Proceedings of the Seventh Conference on Machine Translation, 2022

Findings of the WMT'22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages.

[BibT_eX]

[DOI]

David Ifeoluwa Adelani

Proceedings of the Seventh Conference on Machine Translation, 2022

Findings of the VarDial Evaluation Campaign 2022.

[BibT_eX]

[DOI]

Noëmi Aepli

Proceedings of the Ninth Workshop on NLP for Similar Languages, Varieties and Dialects, 2022

Quand être absent de mBERT n'est que le commencement : Gérer de nouvelles langues à l'aide de modèles de langues multilingues (When Being Unseen from mBERT is just the Beginning : Handling New Languages With Multilingual Language Models).

[BibT_eX]

[DOI]

Benjamin Muller

Benoît Sagot

Djamé Seddah

Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

BembaSpeech: A Speech Recognition Corpus for the Bemba Language.

[BibT_eX]

[DOI]

Claytone Sikasote

Jaime Rafael Montoya Samame

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

UniMorph 4.0: Universal Morphology.

[BibT_eX]

[DOI]

Khuyagbaatar Batsuren

Delio Siticonatzi Camaiteri

Gema Celeste Silva Villegas

Lucas Torroba Hennigen

Adam Ek

David Guriel

Peter Dirix

Jean-Philippe Bernardy

Andrey Scherbakov

Aziyana Bayyr-ool

Jonathan North Washington

Natalia Krizhanovskaya

Maria Nepomniashchaya

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Findings of the IWSLT 2022 Evaluation Campaign.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Phylogeny-Inspired Adaptation of Multilingual Models to New Languages.

[BibT_eX]

[DOI]

Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

PROBER: A System for Real-time Propaganda Behavior Analytics on Social Media and Web Data Streams.

[BibT_eX]

[DOI]

Yasas Senarath

Tonya Thornton

Proceedings of the IEEE International Conference on Big Data, 2022

Cross-Lingual Text Classification of Transliterated Hindi and Malayalam.

[BibT_eX]

[DOI]

Jitin Krishnan

Huzefa Rangwala

Proceedings of the IEEE International Conference on Big Data, 2022

Revisiting the Effects of Leakage on Dependency Parsing.

[BibT_eX]

[DOI]

Nathaniel Krasner

Miriam Wanner

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Dataset Geography: Mapping Language Data to Language Users.

[BibT_eX]

[DOI]

Yinkai Wang

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Systematic Inequalities in Language Technology Performance across the World's Languages.

[BibT_eX]

[DOI]

Damián E. Blasi

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Lexically Aware Semi-Supervised Learning for OCR Post-Correction.

[BibT_eX]

[DOI]

Daisy Rosenblum

Trans. Assoc. Comput. Linguistics, 2021

Reducing Confusion in Active Learning for Part-Of-Speech Tagging.

[BibT_eX]

[DOI]

Zaid Sheikh

Antonis Anastasopoulos

Trans. Assoc. Comput. Linguistics, 2021

Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering.

[BibT_eX]

[DOI]

CoRR, 2021

On the Evaluation of Machine Translation for Terminology Consistency.

[BibT_eX]

[DOI]

CoRR, 2021

Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors.

[BibT_eX]

[DOI]

Junayed Mahmud

Raihan Islam Arnob

Kevin Moran

CoRR, 2021

Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling.

[BibT_eX]

[DOI]

Jitin Krishnan

Huzefa Rangwala

CoRR, 2021

Findings of the WMT Shared Task on Machine Translation Using Terminologies.

[BibT_eX]

[DOI]

Ivana Kvapilíková

Proceedings of the Sixth Conference on Machine Translation, 2021

When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models.

[BibT_eX]

[DOI]

Benjamin Muller

Benoît Sagot

Djamé Seddah

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Findings of the IWSLT 2021 Evaluation Campaign.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Phoneme Recognition Through Fine Tuning of Phonetic Representations: A Case Study on Luhya Language Varieties.

[BibT_eX]

[DOI]

Kathleen Siminyu

Xinjian Li

David R. Mortensen

Michael R. Marlo

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Evaluating the Morphosyntactic Well-formedness of Generated Texts.

[BibT_eX]

[DOI]

Adithya Pratapa

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

SD-QA: Spoken Dialectal Question Answering for the Real World.

[BibT_eX]

[DOI]

Sharlina Keshava

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection.

[BibT_eX]

[DOI]

Kayo Yin

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Machine Translation into Low-resource Language Varieties.

[BibT_eX]

[DOI]

Sachin Kumar

Shuly Wintner

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Towards more equitable question answering systems: How much more data do you need?

[BibT_eX]

[DOI]

Arnab Debnath

Navid Rajabi

Fardina Fathmiul Alam

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Comparison of Interactive Knowledge Base Spelling Correction Models for Low-Resource Languages.

[BibT_eX]

[DOI]

Yiyuan Li

CoRR, 2020

Practical Comparable Data Collection for Low-Resource Languages via Images.

[BibT_eX]

[DOI]

Aman Madaan

Yiming Yang

CoRR, 2020

Towards Minimal Supervision BERT-based Grammar Error Correction.

[BibT_eX]

[DOI]

Yiyuan Li

CoRR, 2020

A Summary of the First Workshop on Language Technology for Language Documentation and Revitalization.

[BibT_eX]

[DOI]

Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection.

[BibT_eX]

[DOI]

Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, 2020

Transliteration for Cross-Lingual Morphological Inflection.

[BibT_eX]

[DOI]

Nikitha Murikinati

Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, 2020

The CMU-LTI submission to the SIGMORPHON 2020 Shared Task 0: Language-Specific Cross-Lingual Transfer.

[BibT_eX]

[DOI]

Nikitha Murikinati

Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, 2020

AlloVera: A Multilingual Allophone Database.

[BibT_eX]

[DOI]

Florian Metze

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Resource for Computational Experiments on Mapudungun.

[BibT_eX]

[DOI]

Mingjun Duan

Carlos Fasola

Sai Krishna Rallabandi

Rodolfo Vega

Lori S. Levin

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Resource for Studying Chatino Verbal Morphology.

[BibT_eX]

[DOI]

Hilaria Cruz

Gregory Stump

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Optimizing Data Usage via Differentiable Rewards.

[BibT_eX]

[DOI]

Xinyi Wang

Hieu Pham

Paul Michel

Jaime G. Carbonell

Proceedings of the 37th International Conference on Machine Learning, 2020

Universal Phone Recognition with a Multilingual Allophone System.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

OCR Post Correction for Endangered Language Texts.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models.

[BibT_eX]

[DOI]

Zhengbao Jiang

Jun Araki

Haibo Ding

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

It's not a Non-Issue: Negation as a Source of Error in Machine Translation.

[BibT_eX]

[DOI]

Md Mosharaf Hossain

Eduardo Blanco

Alexis Palmer

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Dynamic Data Selection and Weighting for Iterative Back-Translation.

[BibT_eX]

[DOI]

Zi-Yi Dou

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Automatic Extraction of Rules Governing Morphological Agreement.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TICO-19: the Translation Initiative for COvid-19.

[BibT_eX]

[DOI]

Proceedings of the 1st Workshop on NLP for COVID-19@ EMNLP 2020, Online, December 2020, 2020

Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations.

[BibT_eX]

[DOI]

Xingyuan Zhao

Satoru Ozaki

Lori S. Levin

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Endangered Languages meet Modern NLP.

[BibT_eX]

[DOI]

Christopher Cox

Hilaria Cruz

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Fine-Tuning MT systems for Robustness to Second-Language Speaker Variations.

[BibT_eX]

[DOI]

Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

Predicting Performance for Natural Language Processing Tasks.

[BibT_eX]

[DOI]

Mengzhou Xia

Ruochen Xu

Yiming Yang

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information.

[BibT_eX]

[DOI]

Emanuele Bugliarello

Sabrina J. Mielke

Ryan Cotterell

Naoaki Okazaki

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Should All Cross-Lingual Embeddings Speak English?

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Towards Minimal Supervision BERT-Based Grammar Error Correction (Student Abstract).

[BibT_eX]

[DOI]

Yiyuan Li

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Towards Robust Toxic Content Classification.

[BibT_eX]

[DOI]

Keita Kurita

Anna Belova

CoRR, 2019

Neural Language Modeling with Visual Features.

[BibT_eX]

[DOI]

Shankar Kumar

Hank Liao

CoRR, 2019

Improving Robustness of Neural Machine Translation with Multi-task Learning.

[BibT_eX]

[DOI]

Shuyan Zhou

Xiangkai Zeng

Yingqi Zhou

Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the First Shared Task on Machine Translation Robustness.

[BibT_eX]

[DOI]

Xian Li

Paul Michel

Proceedings of the Fourth Conference on Machine Translation, 2019

Neural Machine Translation of Text from Non-Native Speakers.

[BibT_eX]

[DOI]

Alison Lui

Toan Q. Nguyen

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks.

[BibT_eX]

[DOI]

Zi-Yi Dou

Keyi Yu

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings.

[BibT_eX]

[DOI]

Zi-Yi Dou

Junjie Hu

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Pushing the Limits of Low-Resource Morphological Inflection.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

An Analysis of Source-Side Grammatical Errors in NMT.

[BibT_eX]

[DOI]

Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019

Generalized Data Augmentation for Low-Resource Translation.

[BibT_eX]

[DOI]

Mengzhou Xia

Xiang Kong

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Choosing Transfer Languages for Cross-Lingual Learning.

[BibT_eX]

[DOI]

Patrick Littell

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Neural Machine Translation of Text from Non-Native Speakers.

[BibT_eX]

[DOI]

Alison Lui

CoRR, 2018

Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation.

[BibT_eX]

[DOI]

Brian Thompson

Huda Khayrallah

Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

A Small Griko-Italian Speech Translation Corpus.

[BibT_eX]

[DOI]

Marcely Zanon Boito

Aline Villavicencio

Laurent Besacier

Marika Lekakou

Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Tied Multitask Learning for Neural Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Leveraging Translations for Speech Transcription in Low-resource Settings.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Part-of-Speech Tagging on an Endangered Language: a Parallel Griko-Italian Resource.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017

DyNet: The Dynamic Neural Network Toolkit.

[BibT_eX]

[DOI]

CoRR, 2017

A case study on using speech-to-translation alignments for language documentation.

[BibT_eX]

[DOI]

CoRR, 2017

Spoken Term Discovery for Language Documentation using Translations.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Speech-Centric Natural Language Processing, 2017

2016

An Attentional Model for Speech Translation Without Transcription.

[BibT_eX]

[DOI]

Long Duong

Steven Bird

Trevor Cohn

Proceedings of the NAACL HLT 2016, 2016

An Unsupervised Probability Model for Speech-to-Translation Alignment of Low-Resource Languages.

[BibT_eX]

[DOI]

Long Duong

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2014

Adaptive Quality Estimation for Machine Translation.

[BibT_eX]

[DOI]

Marco Turchi