Jaume Zaragoza-Bernabeu

According to our database1, Jaume Zaragoza-Bernabeu authored at least 6 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A New Massive Multilingual Dataset for High-Performance Language Technologies.
CoRR, 2024

2023
OpusCleaner and OpusTrainer, open source toolkits for training Machine Translation and Large language models.
CoRR, 2023

MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages.
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

2022
Bicleaner AI: Bicleaner Goes Neural.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2020
Bicleaner at WMT 2020: Universitat d'Alacant-Prompsit's submission to the parallel corpus filtering shared task.
Proceedings of the Fifth Conference on Machine Translation, 2020

Bifixer and Bicleaner: two open-source tools to clean your parallel data.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020


  Loading...