Ben Athiwaratkun

Orcid: 0000-0002-2009-496X

According to our database1, Ben Athiwaratkun authored at least 48 papers between 2015 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving.
CoRR, April, 2026

Introspective Diffusion Language Models.
CoRR, April, 2026

Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution.
CoRR, April, 2026

CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention.
CoRR, March, 2026

V<sub>1</sub>: Unifying Generation and Self-Verification for Parallel Reasoners.
CoRR, March, 2026

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System.
CoRR, February, 2026

2025
Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time.
CoRR, December, 2025

CDLM: Consistency Diffusion Language Models For Faster Sampling.
CoRR, November, 2025

Kitty: Accurate and Efficient 2-bit KV Cache Quantization with Dynamic Channel-wise Precision Boost.
CoRR, November, 2025

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training.
CoRR, November, 2025

Intelligence per Watt: Measuring Intelligence Efficiency of Local AI.
CoRR, November, 2025

Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining.
CoRR, November, 2025

Staircase Streaming for Low-Latency Multi-Agent Inference.
CoRR, October, 2025

Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient.
CoRR, September, 2025

Data Diversification Methods In Alignment Enhance Math Performance In LLMs.
CoRR, July, 2025

Shrinking the Generation-Verification Gap with Weak Verifiers.
CoRR, June, 2025

When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework.
CoRR, June, 2025

Disentangling Reasoning and Knowledge in Medical Large Language Models.
CoRR, May, 2025

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS.
CoRR, May, 2025

How Well Can General Vision-Language Models Learn Medicine By Watching Public Educational Videos?
CoRR, April, 2025

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods.
CoRR, April, 2025

Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Improving Model Alignment Through Collective Intelligence of Open-Source Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Mixture-of-Agents Enhances Large Language Model Capabilities.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Training-Free Activation Sparsity in Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model.
CoRR, 2024

Token Alignment via Character Matching for Subword Completion.
CoRR, 2024

RedPajama: an Open Dataset for Training Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Bifurcated Attention for Single-Context Large-Batch Sampling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Token Alignment via Character Matching for Subword Completion.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Greener yet Powerful: Taming Large Code Generation Models with Quantization.
CoRR, 2023

Towards Greener Yet Powerful Code Generation via Quantization: An Empirical Study.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023


2022
Multi-lingual Evaluation of Code Generation Models.
CoRR, 2022

2021
Joint Text and Label Generation for Spoken Language Understanding.
CoRR, 2021

Structured Prediction as Translation between Augmented Natural Languages.
Proceedings of the 9th International Conference on Learning Representations, 2021

Generative Context Pair Selection for Multi-hop Question Answering.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Augmented Natural Language for Generative Sequence Labeling.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification.
Trans. Assoc. Comput. Linguistics, 2018

Improving Consistency-Based Semi-Supervised Learning with Weight Averaging.
CoRR, 2018

Hierarchical Density Order Embeddings.
Proceedings of the 6th International Conference on Learning Representations, 2018

Probabilistic FastText for Multi-Sense Word Embeddings.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Malware classification with LSTM and GRU language models and a character-level CNN.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multimodal Word Distributions.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2015
Feature Representation in Convolutional Neural Networks.
CoRR, 2015


  Loading...