Salomey Osei
Orcid: 0000-0003-1900-3124Affiliations:
- University of Deusto, Bilbao, Spain
- Kwame Nkrumah University of Science and Technology, Ghana (former)
According to our database1,
Salomey Osei
authored at least 28 papers
between 2020 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages.
CoRR, February, 2025
Understanding the Role of Diversity in Ensemble-Based AutoML Methods for Classification Tasks.
IEEE Access, 2025
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models.
CoRR, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
2023
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR.
Trans. Assoc. Comput. Linguistics, 2023
AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages.
CoRR, 2023
Adapting Pretrained ASR Models to Low-resource Clinical Speech using Epistemic Uncertainty-based Data Selection.
CoRR, 2023
CoRR, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2022
Trans. Assoc. Comput. Linguistics, 2022
AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages.
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Trans. Assoc. Comput. Linguistics, 2021
Reusable Templates and Guides For Documenting Datasets and Models for Natural Language Processing and Generation: A Case Study of the HuggingFace and GEM Data and Model Cards.
CoRR, 2021
CoRR, 2021
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021
CoRR, 2021
2020
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
CoRR, 2020
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020