Markus Kliegl
ORCID: 0000-0001-6063-3959
According to our database, Markus Kliegl authored at least 7 papers between 2017 and 2025.
Bibliography
2025
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training.
CoRR, April, 2025
CoRR, April, 2025
Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2017
CoRR, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017