Mikko Aulamo

Orcid: 0000-0002-3253-2744

According to our database, Mikko Aulamo authored at least 13 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
A New Massive Multilingual Dataset for High-Performance Language Technologies.
CoRR, 2024

2023
OpusCleaner and OpusTrainer, open source toolkits for training Machine Translation and Large language models.
CoRR, 2023

Unsupervised Feature Selection for Effective Parallel Corpus Filtering.
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

HPLT: High Performance Language Technologies.
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

2022
Democratizing Machine Translation with OPUS-MT.
CoRR, 2022

2021
Boosting Neural Machine Translation from Finnish to Northern Sámi with Rule-Based Backtranslation.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

2020
The FISKMÖ Project: Resources and Tools for Finnish-Swedish Machine Translation and Cross-Linguistic Research.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

OpusTools and Parallel Corpus Diagnostics.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

The University of Helsinki Submission to the IWSLT2020 Offline Speech Translation Task.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

OpusFilter: A Configurable Parallel Corpus Filtering Toolbox.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019
The OPUS Resource Repository: An Open Package for Creating Parallel Corpora and Machine Translation Services.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

Annotation of subtitle paraphrases using a new web tool.
Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, 2019

2018
Paraphrase Detection on Noisy Subtitles in Six Languages.
Proceedings of the 4th Workshop on Noisy User-generated Text, 2018

