Niklas Muennighoff

According to our database, Niklas Muennighoff authored at least 28 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Language models scale reliably with over-training and on downstream tasks.
CoRR, 2024

StarCoder 2 and The Stack v2: The Next Generation.
CoRR, 2024

A Survey on Data Selection for Language Models.
CoRR, 2024

KMMLU: Measuring Massive Multitask Language Understanding in Korean.
CoRR, 2024

Generative Representational Instruction Tuning.
CoRR, 2024

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model.
CoRR, 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning.
CoRR, 2024

KTO: Model Alignment as Prospect Theoretic Optimization.
CoRR, 2024

OLMo: Accelerating the Science of Language Models.
CoRR, 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
CoRR, 2024

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models.
CoRR, 2024

2023
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI.
CoRR, 2023

OctoPack: Instruction Tuning Code Large Language Models.
CoRR, 2023

StarCoder: may the source be with you!
CoRR, 2023

SantaCoder: don't reach for the stars!
CoRR, 2023

Scaling Data-Constrained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


MTEB: Massive Text Embedding Benchmark.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Crosslingual Generalization through Multitask Finetuning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
CoRR, 2022

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.
CoRR, 2022

What Language Model to Train if You Have One Million GPU Hours?
CoRR, 2022

SGPT: GPT Sentence Embeddings for Semantic Search.
CoRR, 2022

What Language Model to Train if You Have One Million GPU Hours?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Diagnosing the Impact of AI on Radiology in China.
CoRR, 2021

2020
Vilio: State-of-the-art Visio-Linguistic Models applied to Hateful Memes.
CoRR, 2020


