Juan Luis Gastaldi

Orcid: 0000-0003-0494-5266

According to our database1, Juan Luis Gastaldi authored at least 8 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Language Models over Canonical Byte-Pair Encodings.
CoRR, June, 2025

The Foundations of Tokenization: Statistical and Computational Concerns.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Computing Cultures: Historical and Philosophical Perspectives.
Minds Mach., February, 2024

From Language Models over Tokens to Language Models over Characters.
CoRR, 2024

On the Proper Treatment of Tokenization in Psycholinguistics.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
A Formal Perspective on Byte-Pair Encoding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Tokenization and the Noiseless Channel.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2015
Frege's Habilitationsschrift: Magnitude, Number and the Problems of Computability.
Proceedings of the History and Philosophy of Computing - Third International Conference, 2015


  Loading...