Andrei Panferov

According to our database1, Andrei Panferov authored at least 13 papers between 2024 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Apertus LLM Family Expansion via Distillation and Quantization.
CoRR, May, 2026

Grid Games: The Power of Multiple Grids for Quantizing Large Language Models.
CoRR, May, 2026

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation.
CoRR, January, 2026

2025
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization.
CoRR, September, 2025

Quartet: Native FP4 Training Can Be Optimal for Large Language Models.
CoRR, May, 2025

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations.
CoRR, February, 2025

Correlated Quantization for Faster Nonconvex Distributed Optimization.
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025

Unified Scaling Laws for Compressed Representations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

HIGGS: Pushing the Limits of Large Language Model Quantization via the Linearity Theorem.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem.
CoRR, 2024

Panza: A Personalized Text Writing Assistant via Data Playback and Local Fine-Tuning.
CoRR, 2024

Extreme Compression of Large Language Models via Additive Quantization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


  Loading...