Nathan Godey

According to our database, Nathan Godey authored at least 10 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content.
CoRR, June, 2025

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression.
CoRR, March, 2025

2024
Improving Representations for Language Modeling. (Amélioration des représentations pour la modélisation du langage).
PhD thesis, 2024

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck.
CoRR, 2024

Headless Language Models: Learning without Predicting with Contrastive Weight Tying.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Anisotropy Is Inherent to Self-Attention in Transformers.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

On the Scaling Laws of Geographical Representation in Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

2023
Is Anisotropy Inherent to Transformers?
CoRR, 2023

2022
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling.
CoRR, 2022

MANTa: Efficient Gradient-Based Tokenization for End-to-End Robust Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

