Zébulon Goriely

According to our database1, Zébulon Goriely authored at least 7 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
ByteSpan: Information-Driven Subword Tokenisation.
CoRR, June, 2025

BabyLM's First Words: Word Segmentation as a Phonological Probing Task.
CoRR, April, 2025

IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling.
CoRR, April, 2025

2024
From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes.
CoRR, 2024

Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies.
CoRR, 2024

Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
CLIMB: Curriculum Learning for Infant-inspired Model Building.
CoRR, 2023


  Loading...