Thales Sales Almeida

Orcid: 0009-0006-9568-9331

According to our database1, Thales Sales Almeida authored at least 11 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models.
CoRR, January, 2025

The interplay between domain specialization and model size: a case study in the legal domain.
CoRR, January, 2025

2024
Sabiá-3 Technical Report.
CoRR, 2024

Sabiá-2: A New Generation of Portuguese Large Language Models.
CoRR, 2024

Measuring Cross-lingual Transfer in Bytes.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section.
Proceedings of the Intelligent Systems - 34th Brazilian Conference, 2024

2023
Evaluating GPT-4's Vision Capabilities on Brazilian University Admission Exams.
CoRR, 2023

Sabiá: Portuguese Large Language Models.
CoRR, 2023

[inline-graphic not available: see fulltext] Sabiá: Portuguese Large Language Models.
Proceedings of the Intelligent Systems - 12th Brazilian Conference, 2023

BLUEX: A Benchmark Based on Brazilian Leading Universities Entrance eXams.
Proceedings of the Intelligent Systems - 12th Brazilian Conference, 2023

2022
NeuralSearchX: Serving a Multi-billion-parameter Reranker for Multilingual Metasearch at a Low Cost.
Proceedings of the Third International Conference on Design of Experimental Search & Information REtrieval Systems, 2022


  Loading...