Sebastian Ruder

Bonaventure F. P. Dossou

Albert Njoroge Kahira

Abraham Toluwase Owodunni

Akintunde Oladipo

Atnafu Lambebo Tonja

Iyanuoluwa Shode

Akari Asai

Tunde Oluwaseyi Ajayi

Andre Niyongabo Rubungo

Daniel A. Ajisafe

Emeka Felix Onwuegbuzia

Chinedu Emmanuel Mbonu

CoRR, 2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages.

[BibT_eX]

[DOI]

CoRR, 2023

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval).

[BibT_eX]

[DOI]

Idris Abdulmumin

Seid Muhie Yimam

Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Language models are multilingual chain-of-thought reasoners.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Romanization-based Large-scale Adaptation of Multilingual Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Cross-lingual Open-Retrieval Question Answering for African Languages.

[BibT_eX]

[DOI]

Abraham Toluwase Owodunni

Akintunde Oladipo

Andre Niyongabo Rubungo

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Evaluating and Modeling Attribution for Cross-Lingual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

TaTA: A Multilingual Table-to-Text Dataset for African Languages.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Evaluating the Diversity, Equity, and Inclusion of NLP Technology: A Case Study for Indian Languages.

[BibT_eX]

[DOI]

Simran Khanuja

Partha Talukdar

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features.

[BibT_eX]

[DOI]

Ester Hlavnova

Muhammad Satrio Wicaksono

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

NusaCrowd: Open Source Initiative for Indonesian NLP Resources.

[BibT_eX]

[DOI]

Ivan Halim Parmonangan

Arie Ardiyanti Suryani

Rifki Afina Putri

Dan Su

Keith Stevens

Made Nindyatama Nityasya

Muhammad Farid Adilazuarda

Haryo Akbarianto Wibowo

Cuk Tho

Ichwanul Muslim Karo Karo

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

NusaCrowd: Open Source Initiative for Indonesian NLP Resources.

[BibT_eX]

[DOI]

Ivan Halim Parmonangan

Ika Alfina

Muhammad Satrio Wicaksono

Arie Ardiyanti Suryani

Rifki Afina Putri

Dan Su

Keith Stevens

Made Nindyatama Nityasya

Muhammad Farid Adilazuarda

Ichwanul Muslim Karo Karo

Tirana Noor Fatyanosa

CoRR, 2022

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.

[BibT_eX]

[DOI]

CoRR, 2022

Evaluating Inclusivity, Equity, and Accessibility of NLP Technology: A Case Study for Indian Languages.

[BibT_eX]

[DOI]

Simran Khanuja

Partha P. Talukdar

CoRR, 2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.

[BibT_eX]

[DOI]

Saheed Abdullahi Salahudeen

Chris Chinenye Emezue

Aremu Anuoluwapo

Alípio Jeorge

Pavel Brazdil

CoRR, 2022

Writing System and Speaker Metadata for 2, 800+ Language Varieties.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

XTREME-S: Evaluating Cross-lingual Speech Representations.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Charformer: Fast Character Transformers via Gradient-based Subword Tokenization.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Modular and Parameter-Efficient Fine-Tuning for NLP Models.

[BibT_eX]

[DOI]

Jonas Pfeiffer

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022, 2022

MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition.

[BibT_eX]

[DOI]

Victoire Memdjokam Koagne

Peter Nabende

Cheikh M. Bamba Dione

Andiswa Bukula

Rooweither Mabuya

Bonaventure F. P. Dossou

Fatoumata Ouoba Kabore

Chris Chinenye Emezue

Allahsera Auguste Tapo

Joyce Nakatumba-Nabende

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation.

[BibT_eX]

[DOI]

Xinyi Wang

Graham Neubig

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Memorisation versus Generalisation in Pre-trained Language Models.

[BibT_eX]

[DOI]

Michael Tänzer

Marek Rei

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

MasakhaNER: Named Entity Recognition for African Languages.

[BibT_eX]

[DOI]

Marco Antonio Sobrevilla Cabezudo

Chris Chinenye Emezue

Joyce Nakatumba-Nabende

Rubungo Andre Niyongabo

Bonaventure F. P. Dossou

Kelechi Ogueji

Thierno Ibrahima Diop

Trans. Assoc. Comput. Linguistics, 2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.

[BibT_eX]

[DOI]

Jascha Sohl-Dickstein

Paulo Henrique Santos Vasconcellos

William Soto Martinez

CoRR, 2021

Balancing Average and Worst-case Accuracy in Multitask Learning.

[BibT_eX]

[DOI]

Paul Michel

Dani Yogatama

CoRR, 2021

FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding.

[BibT_eX]

[DOI]

CoRR, 2021

BERT memorisation and pitfalls in low-resource scenarios.

[BibT_eX]

[DOI]

Michael Tänzer

Cyprien de Masson d'Autume

Marek Rei

CoRR, 2021

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation.

[BibT_eX]

[DOI]

CoRR, 2021

Pitfalls of Static Language Modelling.

[BibT_eX]

[DOI]

CoRR, 2021

Compacter: Efficient Low-Rank Hypercomplex Adapter Layers.

[BibT_eX]

[DOI]

Rabeeh Karimi Mahabadi

James Henderson

Cyprien de Masson d'Autume

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mind the Gap: Assessing Temporal Generalization in Neural Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

LiRo: Benchmark and leaderboard for Romanian language tasks.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Multi-view Subword Regularization.

[BibT_eX]

[DOI]

Xinyi Wang

Graham Neubig

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Long Range Arena : A Benchmark for Efficient Transformers.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Rethinking Embedding Coupling in Pre-trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Efficient Test Time Adapter Ensembling for Low-resource Language Varieties.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Multi-Domain Multilingual Question Answering.

[BibT_eX]

[DOI]

Avi Sil

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: EMNLP 2021, 2021

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

UNKs Everywhere: Adapting Multilingual Language Models to New Scripts.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual Transfer.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks.

[BibT_eX]

[DOI]

Rabeeh Karimi Mahabadi

Mostafa Dehghani

James Henderson

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Analogy Training Multilingual Encoders.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization.

[BibT_eX]

[DOI]

CoRR, 2020

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Are All Good Word Vector Spaces Isomorphic?

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

AdapterHub: A Framework for Adapting Transformers.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

AxCell: Automatic Extraction of Results from Machine Learning Papers.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Morphologically Aware Word-Level Translation.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Computational Linguistics, 2020

A Call for More Rigor in Unsupervised Cross-lingual Learning.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

On the Cross-lingual Transferability of Monolingual Representations.

[BibT_eX]

[DOI]

Mikel Artetxe

Dani Yogatama

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Cross-Lingual Word Embeddings

[BibT_eX]

[DOI]

Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02171-8, 2019

A Survey of Cross-lingual Word Embedding Models.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2019

What do Deep Networks Like to Read?

[BibT_eX]

[DOI]

CoRR, 2019

To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks.

[BibT_eX]

[DOI]

Matthew E. Peters

Cyprien de Masson d'Autume

Noah A. Smith

Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Episodic Memory in Lifelong Language Learning.

[BibT_eX]

[DOI]

Lingpeng Kong

Dani Yogatama

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Transfer Learning in Natural Language Processing.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

MultiFiT: Efficient Multi-lingual Language Model Fine-tuning.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unsupervised Cross-Lingual Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics: Tutorial Abstracts, 2019

How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks.

[BibT_eX]

[DOI]

Victor Sanh

Thomas Wolf

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Latent Multi-Task Architecture Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Off-the-Shelf Unsupervised NMT.

[BibT_eX]

[DOI]

Chris Hokamp

John Glover

CoRR, 2018

Fine-tuned Language Models for Text Classification.

[BibT_eX]

[DOI]

Jeremy Howard

CoRR, 2018

360° Stance Detection.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Multi-Task Learning of Pairwise Sequence Classification Tasks over Disparate Label Spaces.

[BibT_eX]

[DOI]

Isabelle Augenstein

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Discriminative Latent-Variable Model for Bilingual Lexicon Induction.

[BibT_eX]

[DOI]

Ryan Cotterell

Yova Kementchedjhieva

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction.

[BibT_eX]

[DOI]

Yova Kementchedjhieva

Ryan Cotterell

Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

On the Limitations of Unsupervised Bilingual Dictionary Induction.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Universal Language Model Fine-tuning for Text Classification.

[BibT_eX]

[DOI]

Jeremy Howard

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Strong Baselines for Neural Semi-Supervised Learning under Domain Shift.

[BibT_eX]

[DOI]

Barbara Plank

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Data Selection Strategies for Multi-Domain Sentiment Analysis.

[BibT_eX]

[DOI]

CoRR, 2017

Knowledge Adaptation: Teaching to Adapt.

[BibT_eX]

[DOI]

CoRR, 2017

Sluice networks: Learning what to share between loosely related tasks.

[BibT_eX]

[DOI]

CoRR, 2017

An Overview of Multi-Task Learning in Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2017

A survey of cross-lingual embedding models.

[BibT_eX]

[DOI]

CoRR, 2017

Learning to select data for transfer learning with Bayesian Optimization.

[BibT_eX]

[DOI]

Barbara Plank

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016

Towards a continuous modeling of natural language domains.

[BibT_eX]

[DOI]

CoRR, 2016

Character-level and Multi-channel Convolutional Neural Networks for Large-scale Authorship Attribution.

[BibT_eX]

[DOI]

CoRR, 2016

An overview of gradient descent optimization algorithms.

[BibT_eX]

[DOI]

CoRR, 2016

INSIGHT-1 at SemEval-2016 Task 5: Deep Learning for Multilingual Aspect-based Sentiment Analysis.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

INSIGHT-1 at SemEval-2016 Task 4: Convolutional Neural Networks for Sentiment Classification and Quantification.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis.

[BibT_eX]

[DOI]