Chris C. Emezue

According to our database1, Chris C. Emezue authored at least 45 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine Translation.
CoRR, 2024

The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal Enrichment.
CoRR, 2024

Text Categorization Can Enhance Domain-Agnostic Stopword Extraction.
CoRR, 2024

AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal Enrichment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR.
Trans. Assoc. Comput. Linguistics, 2023

Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation.
CoRR, 2023

Adapting Pretrained ASR Models to Low-resource Clinical Speech using Epistemic Uncertainty-based Data Selection.
CoRR, 2023

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages.
CoRR, 2023

The African Stopwords project: curating stopwords for African languages.
CoRR, 2023

MasakhaNEWS: News Topic Classification for African languages.
CoRR, 2023

Adapting to the Low-Resource Double-Bind: Investigating Low-Compute Methods on Low-Resource African Languages.
CoRR, 2023

AfriNames: Most ASR Models "Butcher" African Names.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

LOWRECORP: the Low-Resource NLG Corpus Building Challenge.
Proceedings of the 16th International Natural Language Generation Conference, 2023

MasakhaNEWS: News Topic Classification for African languages.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

GFlowOut: Dropout with Generative Flow Networks.
Proceedings of the International Conference on Machine Learning, 2023


Koya: A Recommender System for Large Language Model Selection.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023

Adapting to the Low-Resource Double-Bind: Investigating Low-Compute Methods on Low-Resource African Languages.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023

AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023



2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.
CoRR, 2022

AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages.
CoRR, 2022

Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages.
CoRR, 2022

Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources.
CoRR, 2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.
CoRR, 2022

Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022

Bayesian structure learning with generative flow networks.
Proceedings of the Uncertainty in Artificial Intelligence, 2022


BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022


MeSH2Matrix: Machine learning-driven biomedical relation classification based on the MeSH keywords of PubMed scholarly publications.
Proceedings of the 12th International Workshop on Bibliometric-enhanced Information Retrieval co-located with 44th European Conference on Information Retrieval (ECIR 2022), 2022

2021
MasakhaNER: Named Entity Recognition for African Languages.
Trans. Assoc. Comput. Linguistics, 2021

Crowdsourced Phrase-Based Tokenization for Low-Resourced Neural Machine Translation: The Case of Fon Language.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

OkwuGbé: End-to-End Speech Recognition for Fon and Igbo.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics.
CoRR, 2021

MMTAfrica: Multilingual Machine Translation for African Languages.
Proceedings of the Sixth Conference on Machine Translation, 2021

A Computational Method for Histological Bone Research using Convolutional Neural Networks.
Proceedings of the 34th Canadian Conference on Artificial Intelligence, 2021

2020
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
CoRR, 2020

Lanfrica: A Participatory Approach to Documenting Machine Translation Research on African Languages.
CoRR, 2020

FFR v1.1: Fon-French Neural Machine Translation.
CoRR, 2020

FFR V1.0: Fon-French Neural Machine Translation.
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020




  Loading...