Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

[BibT_eX]

[DOI]

John Yang

Karthik R. Narasimhan

Diyi Yang

Sida Wang

Ofir Press

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OLMoE: Open Mixture-of-Experts Language Models.

[BibT_eX]

[DOI]

et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Generative Representational Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RegMix: Data Mixture as Regression for Language Model Pre-training.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Scaling Laws for Precision.

[BibT_eX]

[DOI]

Tanishq Kumar

Zachary Ankner

Benjamin Frederick Spector

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OpenHands: An Open Platform for AI Software Developers as Generalist Agents.

[BibT_eX]

[DOI]

et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Mieb: Massive Image Embedding Benchmark.

[BibT_eX]

[DOI]

Kenneth C. Enevoldsen

Niklas Muennighoff

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

s1: Simple test-time scaling.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

A Survey on Data Selection for Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

A large-scale audit of dataset licensing and attribution in AI.

[BibT_eX]

[DOI]

Nat. Mac. Intell., 2024

Bridging the Data Provenance Gap Across Text, Speech and Video.

[BibT_eX]

[DOI]

CoRR, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents.

[BibT_eX]

[DOI]

CoRR, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons.

[BibT_eX]

[DOI]

CoRR, 2024

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions.

[BibT_eX]

[DOI]

CoRR, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

[BibT_eX]

[DOI]

Holy Lovenia

Rahmad Mahendra

Salsabil Maulana Akbar

Lester James V. Miranda

Joseph Marvin Imperial

Onno Pepijn Kampman

Joel Ruben Antony Moniz

Muhammad Ravi Shulthan Habibi

Patrick Amadeus Irawan

Bin Wang

Jan Christian Blaise Cruz

Chenxi Whitehouse

Ivan Halim Parmonangan

Sonny Lazuardi Hermawan

Dan John Velasco

Muhammad Dehan Al Kautsar

Willy Fitra Hendria

Yasmin Moslem

Noah Flynn

Muhammad Farid Adilazuarda

Peerat Limkonchotiwat

CoRR, 2024

Lessons from the Trenches on Reproducible Evaluation of Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence.

[BibT_eX]

[DOI]

CoRR, 2024

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order.

[BibT_eX]

[DOI]

CoRR, 2024

Language models scale reliably with over-training and on downstream tasks.

[BibT_eX]

[DOI]

CoRR, 2024

StarCoder 2 and The Stack v2: The Next Generation.

[BibT_eX]

[DOI]

Evgenii Zheltonozhskii

Carolyn Jane Anderson

Nicolas Chapados

et al.

CoRR, 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, 2024

KTO: Model Alignment as Prospect Theoretic Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models.

[BibT_eX]

[DOI]

Terry Yue Zhuo

Armel Zebaze

Nitchakarn Suppattarachai

CoRR, 2024

C-Pack: Packed Resources For General Chinese Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DataComp-LM: In search of the next generation of training sets for language models.

[BibT_eX]

[DOI]

Khyathi Raghavi Chandu

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding.

[BibT_eX]

[DOI]

Kenneth C. Enevoldsen

Márton Kardos

Niklas Muennighoff

Kristoffer L. Nielbo

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Model Alignment as Prospect Theoretic Optimization.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

OctoPack: Instruction Tuning Code Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

[BibT_eX]

[DOI]

Holy Lovenia

Rahmad Mahendra

Salsabil Maulana Akbar

Lester James V. Miranda

Joseph Marvin Imperial

Onno Kampman

Joel Ruben Antony Moniz

Muhammad Ravi Shulthan Habibi

Patrick Amadeus Irawan

Bin Wang

Jan Christian Blaise Cruz

Chenxi Whitehouse

Ivan Halim Parmonangan

Sonny Lazuardi Hermawan

Dan John Velasco

Muhammad Dehan Al Kautsar

Willy Fitra Hendria

Yasmin Moslem

Noah Flynn

Muhammad Farid Adilazuarda

Peerat Limkonchotiwat

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

OLMo: Accelerating the Science of Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giambattista Parascandolo

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

StarCoder: may the source be with you!

[BibT_eX]

[DOI]

Evgenii Zheltonozhskii

Logesh Kumar Umapathi

Urvashi Bhattacharyya

Carolyn Jane Anderson

Carlos Muñoz Ferrandis

Trans. Mach. Learn. Res., 2023

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI.

[BibT_eX]

[DOI]

CoRR, 2023

C-Pack: Packaged Resources To Advance General Chinese Embedding.

[BibT_eX]

[DOI]

CoRR, 2023

SantaCoder: don't reach for the stars!

[BibT_eX]

[DOI]

CoRR, 2023

Scaling Data-Constrained Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FinGPT: Large Generative Models for a Small Language.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MTEB: Massive Text Embedding Benchmark.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

[BibT_eX]

[DOI]

David Ifeoluwa Adelani

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Crosslingual Generalization through Multitask Finetuning.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

[BibT_eX]

[DOI]

David Ifeoluwa Adelani

CoRR, 2022

SGPT: GPT Sentence Embeddings for Semantic Search.

[BibT_eX]

[DOI]

Niklas Muennighoff

CoRR, 2022

What Language Model to Train if You Have One Million GPU Hours?

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.

[BibT_eX]

[DOI]

Jascha Sohl-Dickstein

Marco Antonio Sobrevilla Cabezudo

Paulo Henrique Santos Vasconcellos

William Soto Martinez

CoRR, 2021

Diagnosing the Impact of AI on Radiology in China.

[BibT_eX]

[DOI]

Niklas Muennighoff

CoRR, 2021

2020

Vilio: State-of-the-art Visio-Linguistic Models applied to Hateful Memes.

[BibT_eX]

[DOI]

Niklas Muennighoff

CoRR, 2020

The Hateful Memes Challenge: Competition Report.

[BibT_eX]

[DOI]

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

Niklas Muennighoff

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...