Mart van Baalen

According to our database, Mart van Baalen authored at least 22 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Leech Lattice Vector Quantization for Efficient LLM Compression.

Tycho F. A. van der Ouderaa

Mart van Baalen

Paul N. Whatmough

Markus Nagel

CoRR, March, 2026

2025

Mixture of Cache-Conditional Experts for Efficient Mobile Device Inference.

Babak Ehteshami Bejnordi

Trans. Mach. Learn. Res., 2025

Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking.

Proceedings of the Eighth Conference on Machine Learning and Systems, 2025

2024

Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters.

CoRR, 2024

GPTVQ: The Blessing of Dimensionality for LLM Quantization.

CoRR, 2024

Sparse High Rank Adapters.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

The LLM Surgeon.

Tycho F. A. van der Ouderaa

Markus Nagel

Mart van Baalen

Tijmen Blankevoort

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

The LLM Surgeon.

Tycho F. A. van der Ouderaa

CoRR, 2023

FP8 versus INT8 for efficient deep learning inference.

CoRR, 2023

Pruning vs Quantization: Which is Better?

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

QBitOpt: Fast and Accurate Bitwidth Reallocation during Training.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Practical Mixed Precision Algorithm for Post-Training Quantization.

Proceedings of the 34th British Machine Vision Conference Workshop Proceedings, 2023

2022

Quantized Sparse Weight Decomposition for Neural Network Compression.

CoRR, 2022

FP8 Quantization: The Power of the Exponent.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cyclical Pruning for Sparse Neural Networks.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Simulated Quantization, Real Power Savings.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021

A White Paper on Neural Network Quantization.

CoRR, 2021

2020

Gradient $\ell_1$ Regularization for Quantization Robustness.

CoRR, 2020

Bayesian Bits: Unifying Quantization and Pruning.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Up or Down? Adaptive Rounding for Post-Training Quantization.

Proceedings of the 37th International Conference on Machine Learning, 2020

Gradient $\ell_1$ Regularization for Quantization Robustness.

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Data-Free Quantization Through Weight Equalization and Bias Correction.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
