Ben Athiwaratkun

Orcid: 0000-0002-2009-496X

According to our database1, Ben Athiwaratkun authored at least 34 papers between 2015 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Data Diversification Methods In Alignment Enhance Math Performance In LLMs.
CoRR, July, 2025

Shrinking the Generation-Verification Gap with Weak Verifiers.
CoRR, June, 2025

When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework.
CoRR, June, 2025

Disentangling Reasoning and Knowledge in Medical Large Language Models.
CoRR, May, 2025

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS.
CoRR, May, 2025

How Well Can General Vision-Language Models Learn Medicine By Watching Public Educational Videos?
CoRR, April, 2025

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods.
CoRR, April, 2025

Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation.
CoRR, April, 2025

Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping.
CoRR, January, 2025

Mixture-of-Agents Enhances Large Language Model Capabilities.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Training-Free Activation Sparsity in Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model.
CoRR, 2024

Token Alignment via Character Matching for Subword Completion.
CoRR, 2024

RedPajama: an Open Dataset for Training Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Bifurcated Attention for Single-Context Large-Batch Sampling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Token Alignment via Character Matching for Subword Completion.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Greener yet Powerful: Taming Large Code Generation Models with Quantization.
CoRR, 2023

Towards Greener Yet Powerful Code Generation via Quantization: An Empirical Study.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023


2022
Multi-lingual Evaluation of Code Generation Models.
CoRR, 2022

2021
Joint Text and Label Generation for Spoken Language Understanding.
CoRR, 2021

Structured Prediction as Translation between Augmented Natural Languages.
Proceedings of the 9th International Conference on Learning Representations, 2021

Generative Context Pair Selection for Multi-hop Question Answering.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Augmented Natural Language for Generative Sequence Labeling.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification.
Trans. Assoc. Comput. Linguistics, 2018

Improving Consistency-Based Semi-Supervised Learning with Weight Averaging.
CoRR, 2018

Hierarchical Density Order Embeddings.
Proceedings of the 6th International Conference on Learning Representations, 2018

Probabilistic FastText for Multi-Sense Word Embeddings.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Malware classification with LSTM and GRU language models and a character-level CNN.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multimodal Word Distributions.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2015
Feature Representation in Convolutional Neural Networks.
CoRR, 2015


  Loading...