Thomas Bauwens
According to our database, Thomas Bauwens authored at least 5 papers between 2024 and 2026.
Bibliography
2026
ReBPE: Iteratively Improving the Internal Structure of a Structured Tokeniser by Mining its Internal Structure.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026
2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
GRaMPa: Subword Regularisation by Skewing Uniform Segmentation Distributions with an Efficient Path-counting Markov Model.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
BPE-knockout: Pruning Pre-existing BPE Tokenisers with Backwards-compatible Morphological Semi-supervision.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024