Thomas Bauwens

According to our database1, Thomas Bauwens authored at least 5 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
ReBPE: Iteratively Improving the Internal Structure of a Structured Tokeniser by Mining its Internal Structure.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025
Confounding Factors in Relating Model Performance to Morphology.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

GRaMPa: Subword Regularisation by Skewing Uniform Segmentation Distributions with an Efficient Path-counting Markov Model.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
BPE-knockout: Pruning Pre-existing BPE Tokenisers with Backwards-compatible Morphological Semi-supervision.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024


  Loading...