Alexander Wettig

According to our database¹, Alexander Wettig authored at least 23 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Extracting Rule-based Descriptions of Attention Features in Transformers.

[BibT_eX]

[DOI]

CoRR, October, 2025

Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?

[BibT_eX]

[DOI]

CoRR, June, 2025

SWE-smith: Scaling Data for Software Engineering Agents.

[BibT_eX]

[DOI]

CoRR, April, 2025

Lugha-Llama: Adapting Large Language Models for African Languages.

[BibT_eX]

[DOI]

Happy Buzaaba

Alexander Wettig

David Ifeoluwa Adelani

Christiane Fellbaum

CoRR, April, 2025

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Metadata Conditioning Accelerates Language Model Pre-training.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

OLMoE: Open Mixture-of-Experts Language Models.

[BibT_eX]

[DOI]

et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

How to Train Long-Context Language Models (Effectively).

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Establishing Task Scaling Laws via Compute-Efficient Model Ladders.

[BibT_eX]

[DOI]

CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Language Models as Science Tutors.

[BibT_eX]

[DOI]

CoRR, 2024

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Finding Transformer Circuits With Edge Pruning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

QuRating: Selecting High-Quality Data for Training Language Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Models as Science Tutors.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SWE-bench: Can Language Models Resolve Real-world Github Issues?

[BibT_eX]

[DOI]

Karthik R. Narasimhan

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Learning Transformer Programs.

[BibT_eX]

[DOI]

Dan Friedman

Alexander Wettig

Danqi Chen

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Kernel-Based View of Language Model Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Poisoning Retrieval Corpora by Injecting Adversarial Passages.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Adapting Language Models to Compress Contexts.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Should You Mask 15% in Masked Language Modeling?

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022

Finding Dataset Shortcuts with Grammar Induction.

[BibT_eX]

[DOI]

Dan Friedman

Alexander Wettig

Danqi Chen

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

Phrase Retrieval Learns Passage Retrieval, Too.

[BibT_eX]

[DOI]

Jinhyuk Lee

Alexander Wettig

Danqi Chen

Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

Alexander Wettig

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...