Dirk Groeneveld
According to our database, Dirk Groeneveld authored at least 13 papers between 2016 and 2024.
Bibliography
2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
CoRR, 2024
2023
2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
2020
From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project.
AI Mag., 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
2016
Proceedings of the 5th Workshop on Automated Knowledge Base Construction, 2016