Niklas Muennighoff

Orcid: 0009-0001-7157-770X

According to our database, Niklas Muennighoff authored at least 73 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2025

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought.

…, Niklas Muennighoff, …

CoRR, November, 2025

ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality.

…, Sneha Kudugunta, Niklas Muennighoff, …, Sercan Ö. Arik, …

CoRR, October, 2025

HUME: Measuring the Human-Model Performance Gap in Text Embedding Tasks.

Adnan El Assadi, …, Roman Solomatin, Niklas Muennighoff, Kenneth C. Enevoldsen

CoRR, October, 2025

Humanline: Online Alignment as Perceptual Loss.

…, Niklas Muennighoff, Kawin Ethayarajh

CoRR, September, 2025

UQ: Assessing Language Models on Unsolved Questions.

…, Niklas Muennighoff

CoRR, August, 2025

FlexOlmo: Open Language Models for Flexible Data Use.

CoRR, July, 2025

OpenThoughts: Data Recipes for Reasoning Models.

CoRR, June, 2025

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability.

CoRR, June, 2025

Crosslingual Reasoning through Test-Time Scaling.

…, Muhammad Farid Adilazuarda, Jonibek Mansurov, …, Niklas Muennighoff, Carsten Eickhoff, Genta Indra Winata, …, Stephen H. Bach, Alham Fikri Aji

CoRR, May, 2025

ReasonIR: Training Retrievers for Reasoning Tasks.

…, Niklas Muennighoff, Xi Victoria Lin, …, Bryan Kian Hsiang Low, …, Luke Zettlemoyer

CoRR, April, 2025

MIEB: Massive Image Embedding Benchmark.

…, Roman Solomatin, Noura Al Moubayed, Kenneth C. Enevoldsen, Niklas Muennighoff

CoRR, April, 2025

Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning.

…, Shrimai Prabhumoye, Niklas Muennighoff, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, …

CoRR, April, 2025

MMTEB: Massive Multilingual Text Embedding Benchmark.

Kenneth C. Enevoldsen, …, Dominik Krzeminski, Genta Indra Winata, …, Mathieu Ciancone, Marion Schaeffer, Gabriel Sequeira, …, Jonathan Rystrøm, Roman Solomatin, …, Martin Bernstorff, …, Akshita Sukhlecha, …, Kranthi Kiran GV, …, Björn Plüster, Jan Philipp Harries, …, Mariya Hendriksen, …, Hippolyte Gisserot-Boukhlef, …, Konrad Wojtasik, …, Andrianos Michail, …, Aleksei Vatolin, …, Pranjal A. Chitale, Simone Tedeschi, …, Michael Günther, …, Gayatri Krishnakumar, …, Maria Tikhonova, …, Aleksandr Abramov, Malte Ostendorff, …, Simon Clematide, Lester James V. Miranda, Alena Fenogenova, …, Ruqiya Bin Safi, …, Alessia Borghini, Federico Cassano, …, Vaibhav Adlakha, …, Niklas Muennighoff

CoRR, February, 2025

s1: Simple test-time scaling.

Niklas Muennighoff, …, Hannaneh Hajishirzi, Luke Zettlemoyer, …, Emmanuel J. Candès, Tatsunori Hashimoto

CoRR, January, 2025

KMMLU: Measuring Massive Multitask Language Understanding in Korean.

…, Niklas Muennighoff, …, Stella Biderman

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

…, Carlos E. Jimenez, …, Niklas Muennighoff, Gabriel Synnaeve, Karthik R. Narasimhan, …

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval.

…, Niklas Muennighoff, …, Zachary S. Siegel, …, Sercan Ö. Arik, …

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OLMoE: Open Mixture-of-Experts Language Models.

Niklas Muennighoff, …, Dirk Groeneveld, …, Evan Pete Walsh, …, Alexander Wettig, …, et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Generative Representational Instruction Tuning.

Niklas Muennighoff, …, Amanpreet Singh, …

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RegMix: Data Mixture as Regression for Language Model Pre-training.

…, Niklas Muennighoff, …

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Scaling Laws for Precision.

…, Benjamin Frederick Spector, …, Niklas Muennighoff, …, Cengiz Pehlevan, Christopher Ré, Aditi Raghunathan

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OpenHands: An Open Platform for AI Software Developers as Generalist Agents.

…, Mingzhang Zheng, …, Niklas Muennighoff, …, et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code.

Proceedings of the 31st International Conference on Computational Linguistics, 2025

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation.

…, Niklas Muennighoff, …

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

A Survey on Data Selection for Language Models.

…, Sang Michael Xie, …, Niklas Muennighoff, …, Tatsunori Hashimoto, William Yang Wang

Trans. Mach. Learn. Res., 2024

A large-scale audit of dataset licensing and attribution in AI.

…, Naana Obeng-Marnu, …, William Brannon, Niklas Muennighoff, …, Kartik Perisetla, …, Enrico Shippole, Kurt D. Bollacker, …

Nat. Mac. Intell., 2024

Bridging the Data Provenance Gap Across Text, Speech and Video.

CoRR, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.

CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.

CoRR, 2024

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents.

…, Mingzhang Zheng, …, Niklas Muennighoff, …

CoRR, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons.

CoRR, 2024

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions.

CoRR, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

…, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, …, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno Pepijn Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Railey Montalan, …, Joanito Agili Lopo, …, Börje F. Karlsson, …, Ryandito Diandaru, …, Patrick Amadeus Irawan, …, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, …, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, …, Muhammad Farid Adilazuarda, …, Muhammad Reza Qorib, Amirbek Djanibekov, …, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, …, Ayu Purwarianti, Sebastian Ruder, William-Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, …, Genta Indra Winata, …, Samuel Cahyawijaya

CoRR, 2024

Lessons from the Trenches on Reproducible Evaluation of Language Models.

CoRR, 2024

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence.

CoRR, 2024

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order.

CoRR, 2024

Language models scale reliably with over-training and on downstream tasks.

CoRR, 2024

StarCoder 2 and The Stack v2: The Next Generation.

CoRR, 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning.

CoRR, 2024

KTO: Model Alignment as Prospect Theoretic Optimization.

Kawin Ethayarajh, …, Niklas Muennighoff, …

CoRR, 2024

OLMo: Accelerating the Science of Language Models.

CoRR, 2024

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models.

…, Nitchakarn Suppattarachai, Leandro von Werra, …, Niklas Muennighoff

CoRR, 2024

C-Pack: Packed Resources For General Chinese Embeddings.

…, Niklas Muennighoff, …

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies.

…, Niklas Muennighoff, …

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DataComp-LM: In search of the next generation of training sets for language models.

…, Georgios Smyrnis, …, Samir Yitzhak Gadre, …, Etash Kumar Guha, Sedrick Scott Keh, …, Niklas Muennighoff, Reinhard Heckel, …, Suchin Gururangan, Mitchell Wortsman, …, Marianna Nezhurina, …, Gabriel Ilharco, …, Kalyani Marathe, …, Khyathi Raghavi Chandu, …, Igor Vasiljevic, …, Luke Zettlemoyer, …, Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, …, Dirk Groeneveld, …, Vaishaal Shankar

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding.

Kenneth C. Enevoldsen, …, Niklas Muennighoff, Kristoffer L. Nielbo

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Model Alignment as Prospect Theoretic Optimization.

Kawin Ethayarajh, …, Niklas Muennighoff, …

Proceedings of the Forty-first International Conference on Machine Learning, 2024

OctoPack: Instruction Tuning Code Large Language Models.

Niklas Muennighoff, …, Armel Randy Zebaze, …, Leandro von Werra, …

Proceedings of the Twelfth International Conference on Learning Representations, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

…, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, …, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, …, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Jann Railey Montalan, Ryan Hadiwijaya, Joanito Agili Lopo, …, Börje Karlsson, …, Ryandito Diandaru, …, Patrick Amadeus Irawan, …, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, …, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, …, Muhammad Farid Adilazuarda, …, Muhammad Reza Qorib, Amirbek Djanibekov, …, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, …, Ayu Purwarianti, Sebastian Ruder, William-Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, …, Genta Indra Winata, …, Samuel Cahyawijaya

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model.

…, Viraat Aryabumi, …, Gbemileke Onilude, …, Shivalika Singh, …, Niklas Muennighoff, …

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

OLMo: Accelerating the Science of Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

Aarohi Srivastava, Abhinav Rastogi, …, Abu Awal Md Shoeb, …, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, …, Alexander W. Kocurek, …, Anantharaman S. Iyer, Anders Andreassen, …, Andrea Santilli, Andreas Stuhlmüller, …, Andrew K. Lampinen, …, Antonio Norelli, …, Arash Gholamidavoodi, …, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, …, B. Ryan Roberts, …, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, …, Bill Yuchen Lin, …, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, …, Charles Rathkopf, …, Chris Callison-Burch, …, Christian Voigt, Christopher D. Manning, Christopher Potts, …, Clara E. Rivera, …, Courtney Ashcraft, Cristina Garbacea, …, Daniel Khashabi, …, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, …, Daphne Ippolito, …, Debajyoti Datta, …, Dimitri Coelho Mollo, …, Ekaterina Shutova, Ekin Dogus Cubuk, …, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, …, Emanuele Rodolà, …, Ethan J. Jerzak, …, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, …, Fernando Martínez-Plumed, Francesca Happé, François Chollet, …, Genta Indra Winata, …, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, …, Gonzalo Jaimovitch-López, …, Hana Galijasevic, …, Hannaneh Hajishirzi, …, Hinrich Schütze, …, Jack Geissinger, Jackson Kernion, …, Jaime Fernández Fisac, …, Janelle Wingfield, …, Jascha Sohl-Dickstein, …, Jekaterina Novikova, …, Jonathan Batchelder, Jonathan Berant, …, José Hernández-Orallo, Joseph Boudeman, …, Joshua B. Tenenbaum, …, Karthik Gopalakrishnan, Katerina Ignatyeva, …, Kaustubh D. Dhole, …, Kory W. Mathewson, Kristen Chiafullo, Ksenia Shkaruta, …, Kyle Richardson, …, Lidia Contreras Ochando, Louis-Philippe Morency, …, Luis Oliveros Colón, …, Lütfi Kerem Senel, …, Maartje ter Hoeve, …, María José Ramírez-Quintana, …, Mario Giulianelli, …, Martin Potthast, Matthew L. Leavitt, …, Mátyás Schubert, Medina Baitemirova, …, Melvin McElrath, …, Michael I. Ivanitskiy, Michael Starritt, …, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, …, Moin Aminnaseri, …, Mukund Varma T., …, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, …, Nicole Martinez, …, Niklas Muennighoff, Nitish Shirish Keskar, …, Omar Elbaghdadi, …, Pablo Antonio Moreno Casares, …, Pegah Alipoormolabashi, …, Peter Eckersley, …, Piotr Milkowski, …, Pouya Pezeshkpour, …, Rachel Etta Rudolph, …, Raphaël Millière, …, Robbe Raymaekers, …, Ruslan Salakhutdinov, …, Saif M. Mohammad, …, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, …, Sarik Ghazarian, …, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, …, Shashank Srivastava, …, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, …, Shyamolima (Shammie) Debnath, …, Simon Thormeyer, …, Sneha Priscilla Makini, …, Sriharsha Hatwar, Stanislas Dehaene, …, Stella Biderman, …, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, …, Tatsu Hashimoto, …, Théo Desbordes, Theodore Rothschild, …, Tiberius Nkinyili, …, Tobias Gerstenberg, …, Trishala Neeraj, …, Victoria Nyamai, …, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, …, William Saunders, …, Yadollah Yaghoobzadeh, …, Yonatan Belinkov, …

Trans. Mach. Learn. Res., 2023

StarCoder: may the source be with you!

…, Loubna Ben Allal, …, Niklas Muennighoff, …, Christopher Akiki, …, Evgenii Zheltonozhskii, …, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, …, Nicolas Gontier, …, Logesh Kumar Umapathi, …, Benjamin Lipkin, Muhtasham Oblokulov, …, Jason T. Stillerman, Siva Sankalp Patel, Dmitry Abulkhanov, …, Urvashi Bhattacharyya, …, Claire Schlesinger, Hailey Schoelkopf, …, Jennifer Robinson, Carolyn Jane Anderson, Brendan Dolan-Gavitt, Danish Contractor, …, Dzmitry Bahdanau, …, Carlos Muñoz Ferrandis, …, Leandro von Werra, …

Trans. Mach. Learn. Res., 2023

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI.

…, Naana Obeng-Marnu, …, William Brannon, Niklas Muennighoff, …, Kartik Perisetla, …, Enrico Shippole, Kurt D. Bollacker, …

CoRR, 2023

C-Pack: Packaged Resources To Advance General Chinese Embedding.

…, Niklas Muennighoff

CoRR, 2023

SantaCoder: don't reach for the stars!

CoRR, 2023

Scaling Data-Constrained Language Models.

Niklas Muennighoff, Alexander M. Rush, …, Aleksandra Piktus, …, Colin A. Raffel

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FinGPT: Large Generative Models for a Small Language.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MTEB: Massive Text Embedding Benchmark.

Niklas Muennighoff, …

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

…, Hailey Schoelkopf, Niklas Muennighoff, Alham Fikri Aji, David Ifeoluwa Adelani, Khalid Almubarak, …, Lintang Sutawika, …, Genta Indra Winata, Stella Biderman, …, Vassilina Nikoulina

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Crosslingual Generalization through Multitask Finetuning.

Niklas Muennighoff, …, Lintang Sutawika, …, Stella Biderman, …, Hailey Schoelkopf, …, Alham Fikri Aji, Khalid Almubarak, …

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

…, Hailey Schoelkopf, Niklas Muennighoff, Alham Fikri Aji, David Ifeoluwa Adelani, Khalid Almubarak, …, Lintang Sutawika, …, Genta Indra Winata, Stella Biderman, …, Vassilina Nikoulina

CoRR, 2022

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.

CoRR, 2022

SGPT: GPT Sentence Embeddings for Semantic Search.

Niklas Muennighoff

CoRR, 2022

What Language Model to Train if You Have One Million GPU Hours?

…, Lucile Saulnier, …, Stella Biderman, …, Niklas Muennighoff, …, Lintang Sutawika, …

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.

Kaustubh D. Dhole, …, Sebastian Gehrmann, …, Abinaya Mahendiran, …, Ashish Srivastava, …, Jascha Sohl-Dickstein, …, Sebastian Ruder, …, Ian Berlot-Attwell, …, Marco Antonio Sobrevilla Cabezudo, Samuel Cahyawijaya, …, Mukund Choudhary, Christian Clauss, …, Thomas Dopierre, Paul-Alexis Dray, …, Tatiana Ekeinhor, Marco Di Giovanni, …, Fabrice Harel-Canada, …, Przemyslaw K. Joniak, …, Venelin Kovatchev, Kalpesh Krishna, …, Seungjae Ryan Lee, Corey James Levinson, …, Andrey Lukyanenko, Vukosi Marivate, …, Nafise Sadat Moosavi, Niklas Muennighoff, Timothy Sum Hon Mun, …, Nivranshu Pasricha, …, Pawan Kumar Rajpoot, …, Nicholas Roberts, Juan Diego Rodriguez, …, Paulo Henrique Santos Vasconcellos, …, Robin M. Schmidt, …, Tshephisho Sefara, …, Taylor Sorensen, William Soto Martinez, Aman Srivastava, KV Aditya Srivatsa, …, Mukund Varma T., …, Fiona Anting Tan, …, Genta Indra Winata, …, Witold Wydmanski, …

CoRR, 2021

Diagnosing the Impact of AI on Radiology in China.

Niklas Muennighoff

CoRR, 2021

2020

Vilio: State-of-the-art Visio-Linguistic Models applied to Hateful Memes.

Niklas Muennighoff

CoRR, 2020

The Hateful Memes Challenge: Competition Report.

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020
