Ricardo Rei

Orcid: 0000-0001-8265-1939

According to our database, Ricardo Rei authored at least 65 papers between 2006 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Can Vision Language Models Judge Action Quality? An Empirical Evaluation.
CoRR, April, 2026

Self-Preference Bias in Rubric-Based Evaluation of Large Language Models.
CoRR, April, 2026

EuroLLM-22B: Technical Report.
CoRR, February, 2026

MindGuard: Guardrail Classifiers for Multi-Turn Mental Health Support.
CoRR, February, 2026

TOWER+: Bridging Generality and Translation Specialization in Multilingual LLMs.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
MindEval: Benchmarking Language Models on Multi-turn Mental Health Support.
CoRR, November, 2025

EuroLLM-9B: Technical Report.
CoRR, June, 2025

M-Prometheus: A Suite of Open Multilingual LLM Judges.
CoRR, April, 2025

Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models.
CoRR, April, 2025

XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation.
CoRR, March, 2025

EuroBERT: Scaling Multilingual Encoders for European Languages.
CoRR, March, 2025

CroissantLLM: A Truly Bilingual French-English Language Model.
Trans. Mach. Learn. Res., 2025

Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation.
Trans. Assoc. Comput. Linguistics, 2025

Robust, interpretable and efficient MT evaluation with fine-tuned metrics.
Proceedings of Machine Translation Summit XX: MTSummit 2025, 2025

XL-Suite: Cross-Lingual Synthetic Training and Evaluation Data for Open-Ended Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Translate Smart, not Hard: Cascaded Translation Systems with Quality-Aware Deferral.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection.
Trans. Assoc. Comput. Linguistics, 2024

Assessing the Role of Context in Chat Translation Evaluation: Is Context Helpful and Under What Conditions?
Trans. Assoc. Comput. Linguistics, 2024

EuroLLM: Multilingual Language Models for Europe.
CoRR, 2024

Is Context Helpful for Chat Translation Evaluation?
CoRR, 2024

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks.
CoRR, 2024


Tower v2: Unbabel-IST 2024 Submission for the General MT Shared Task.
Proceedings of the Ninth Conference on Machine Translation, 2024

Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis.
Proceedings of the Ninth Conference on Machine Translation, 2024

Are LLMs Breaking MT Metrics? Results of the WMT24 Metrics Shared Task.
Proceedings of the Ninth Conference on Machine Translation, 2024

QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024


xTower: A Multilingual LLM for Explaining and Correcting Translation Errors.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Can Automatic Metrics Assess High-Quality Translations?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Onception: Active Learning with Expert Advice for Real World Machine Translation.
Comput. Linguistics, June, 2023

AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages.
CoRR, 2023

Scaling up CometKiwi: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

Results of WMT23 Metrics Shared Task: Metrics Might Be Guilty but References Are Not Innocent.
Proceedings of the Eighth Conference on Machine Translation, 2023

Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Better Uncertainty Quantification for Machine Translation Evaluation.
CoRR, 2022

Findings of the WMT 2022 Shared Task on Quality Estimation.
Proceedings of the Seventh Conference on Machine Translation, 2022

CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

COMET-22: Unbabel-IST 2022 Submission for the Metrics Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Results of WMT22 Metrics Shared Task: Stop Using BLEU - Neural Metrics Are Better and More Robust.
Proceedings of the Seventh Conference on Machine Translation, 2022

Robust MT Evaluation with Sentence-level Multilingual Augmentation.
Proceedings of the Seventh Conference on Machine Translation, 2022

Quality-Aware Decoding for Neural Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Towards a sentiment-aware conversational agent.
Proceedings of the IVA '22: ACM International Conference on Intelligent Virtual Agents, Faro, Portugal, September 6, 2022

Disentangling Uncertainty in Machine Translation Evaluation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

QUARTZ: Quality-Aware Machine Translation.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Searching for COMETINHO: The Little Metric That Could.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

2021
Towards better subtitles: A multilingual approach for punctuation restoration of speech transcripts.
Expert Syst. Appl., 2021

IST-Unbabel 2021 Submission for the Quality Estimation Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Are References Really Needed? Unbabel-IST 2021 Submission for the Metrics Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Results of the WMT21 Metrics Shared Task: Evaluating Metrics with Expert-based Human Evaluations on TED and News Domain.
Proceedings of the Sixth Conference on Machine Translation, 2021

Multilingual Simultaneous Sentence End and Punctuation Prediction (short paper).
Proceedings of the Swiss Text Analytics Conference 2021, Winterthur, 2021

IST-Unbabel 2021 Submission for the Explainable Quality Estimation Shared Task.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Uncertainty-Aware Machine Translation Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Multilingual Email Zoning.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, 2021

MT-Telescope: An interactive platform for contrastive evaluation of MT systems.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Online Learning Meets Machine Translation Evaluation: Finding the Best Systems with the Least Human Effort.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Unbabel's Participation in the WMT20 Metrics Shared Task.
CoRR, 2020

A free web service for fast COVID-19 classification of chest X-Ray images.
CoRR, 2020

Unbabel's Participation in the WMT20 Metrics Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

Automatic Truecasing of Video Subtitles Using BERT: A Multilingual Adaptable Approach.
Proceedings of the Information Processing and Management of Uncertainty in Knowledge-Based Systems, 2020

COMET: A Neural Framework for MT Evaluation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

COMET - Deploying a New State-of-the-art MT Evaluation Metric in Production.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

2006
Urban Cellular Planning Optimisation of Multi-service Enhanced UMTS Based in Economic Issues.
Proceedings of the Wired/Wireless Internet Communications, 4th International Conference, 2006

