Jeffrey Quesnelle

According to our database1, Jeffrey Quesnelle authored at least 9 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Long Context Pre-Training with Lighthouse Attention.
CoRR, May, 2026

Efficient Pre-Training with Token Superposition.
CoRR, May, 2026

Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation.
CoRR, April, 2026

2025
CurvaDion: Curvature-Adaptive Distributed Orthonormalization.
CoRR, December, 2025

Hermes 4 Technical Report.
CoRR, August, 2025

2024
DeMo: Decoupled Momentum Optimization.
CoRR, 2024

Hermes 3 Technical Report.
CoRR, 2024

YaRN: Efficient Context Window Extension of Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2017
On the linkability of Zcash transactions.
CoRR, 2017


  Loading...