Dirk Groeneveld
Orcid: 0000-0002-8274-768X
According to our database1,
Dirk Groeneveld
authored at least 25 papers
between 2016 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training.
CoRR, May, 2025
CoRR, April, 2025
CoRR, April, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
2020
From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project.
AI Mag., 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
2016
Proceedings of the 5th Workshop on Automated Knowledge Base Construction, 2016