Alexander Wettig

According to our database1, Alexander Wettig authored at least 22 papers between 2021 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?
CoRR, June, 2025

SWE-smith: Scaling Data for Software Engineering Agents.
CoRR, April, 2025

Lugha-Llama: Adapting Large Language Models for African Languages.
CoRR, April, 2025

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation.
CoRR, February, 2025

Metadata Conditioning Accelerates Language Model Pre-training.
CoRR, January, 2025

OLMoE: Open Mixture-of-Experts Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

How to Train Long-Context Language Models (Effectively).
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Establishing Task Scaling Laws via Compute-Efficient Model Ladders.
CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.
CoRR, 2024

Language Models as Science Tutors.
CoRR, 2024

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Finding Transformer Circuits With Edge Pruning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

QuRating: Selecting High-Quality Data for Training Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


SWE-bench: Can Language Models Resolve Real-world Github Issues?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Learning Transformer Programs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Kernel-Based View of Language Model Fine-Tuning.
Proceedings of the International Conference on Machine Learning, 2023

Poisoning Retrieval Corpora by Injecting Adversarial Passages.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Adapting Language Models to Compress Contexts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Should You Mask 15% in Masked Language Modeling?
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
Finding Dataset Shortcuts with Grammar Induction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Phrase Retrieval Learns Passage Retrieval, Too.
Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021


  Loading...