Pierre-Carl Langlais

According to our database, Pierre-Carl Langlais authored at least 8 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training.
CoRR, June, 2025

Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family.
CoRR, April, 2025

What the HellaSwag? On the Validity of Common-Sense Reasoning Benchmarks.
CoRR, April, 2025

Towards Best Practices for Open Datasets for LLM Training.
CoRR, January, 2025

2024
Toxicity of the Commons: Curating Open-Source Pre-Training Data.
CoRR, 2024

2023
Make Love or War? Monitoring the Thematic Evolution of Medieval French Narratives.
Proceedings of the Computational Humanities Research Conference 2023, 2023

2021
Digital interfaces of historical newspapers: opportunities, restrictions and recommendations.
J. Data Min. Digit. Humanit., 2021

2017
Journal Flipping Or Model Flipping? Le Tournant Du Libre Accès Au Prisme Des Humanités Numériques.
Proceedings of the 12th Annual International Conference of the Alliance of Digital Humanities Organizations, 2017
