Ben Athiwaratkun

Tri Dao

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Improving Model Alignment Through Collective Intelligence of Open-Source Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Mixture-of-Agents Enhances Large Language Model Capabilities.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Training-Free Activation Sparsity in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

Token Alignment via Character Matching for Subword Completion.

[BibT_eX]

[DOI]

CoRR, 2024

RedPajama: an Open Dataset for Training Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Bifurcated Attention for Single-Context Large-Batch Sampling.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Token Alignment via Character Matching for Subword Completion.

[BibT_eX]

[DOI]

Murali Krishna Ramanathan

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Greener yet Powerful: Taming Large Code Generation Models with Quantization.

[BibT_eX]

[DOI]

Parminder Bhatia

CoRR, 2023

Towards Greener Yet Powerful Code Generation via Quantization: An Empirical Study.

[BibT_eX]

[DOI]

Xiaokai Wei

Murali Krishna Ramanathan

Parminder Bhatia

Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Multi-lingual Evaluation of Code Generation Models.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Multi-lingual Evaluation of Code Generation Models.

[BibT_eX]

[DOI]

CoRR, 2022

2021

Joint Text and Label Generation for Spoken Language Understanding.

[BibT_eX]

[DOI]

Yang Li

CoRR, 2021

Structured Prediction as Translation between Augmented Natural Languages.

[BibT_eX]

[DOI]

Stefano Soatto

Proceedings of the 9th International Conference on Learning Representations, 2021

Generative Context Pair Selection for Multi-hop Question Answering.

[BibT_eX]

[DOI]

Dheeru Dua

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Augmented Natural Language for Generative Sequence Labeling.

[BibT_eX]

[DOI]

Jason Krone

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2018

Improving Consistency-Based Semi-Supervised Learning with Weight Averaging.

[BibT_eX]

[DOI]

CoRR, 2018

Hierarchical Density Order Embeddings.

[BibT_eX]

[DOI]

Andrew Gordon Wilson

Proceedings of the 6th International Conference on Learning Representations, 2018

Probabilistic FastText for Multi-Sense Word Embeddings.

[BibT_eX]

[DOI]

Andrew Gordon Wilson

Anima Anandkumar

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Malware classification with LSTM and GRU language models and a character-level CNN.

[BibT_eX]

[DOI]

Jack W. Stokes

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multimodal Word Distributions.

[BibT_eX]

[DOI]

Andrew Gordon Wilson

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2015

Feature Representation in Convolutional Neural Networks.

[BibT_eX]

[DOI]