Mart van Baalen

According to our database1, Mart van Baalen authored at least 16 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
GPTVQ: The Blessing of Dimensionality for LLM Quantization.
CoRR, 2024

2023
The LLM Surgeon.
CoRR, 2023

FP8 versus INT8 for efficient deep learning inference.
CoRR, 2023

Pruning vs Quantization: Which is Better?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

QBitOpt: Fast and Accurate Bitwidth Reallocation during Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Practical Mixed Precision Algorithm for Post-Training Quantization.
Proceedings of the 34th British Machine Vision Conference Workshop Proceedings, 2023

2022
Quantized Sparse Weight Decomposition for Neural Network Compression.
CoRR, 2022

FP8 Quantization: The Power of the Exponent.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cyclical Pruning for Sparse Neural Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Simulated Quantization, Real Power Savings.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
A White Paper on Neural Network Quantization.
CoRR, 2021

2020
Gradient 𝓁<sub>1</sub> Regularization for Quantization Robustness.
CoRR, 2020

Bayesian Bits: Unifying Quantization and Pruning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Up or Down? Adaptive Rounding for Post-Training Quantization.
Proceedings of the 37th International Conference on Machine Learning, 2020

Gradient $\ell_1$ Regularization for Quantization Robustness.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Data-Free Quantization Through Weight Equalization and Bias Correction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...