Mart van Baalen

According to our database1, Mart van Baalen authored at least 19 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters.
CoRR, 2024

Sparse High Rank Adapters.
CoRR, 2024

GPTVQ: The Blessing of Dimensionality for LLM Quantization.
CoRR, 2024

The LLM Surgeon.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
The LLM Surgeon.
CoRR, 2023

FP8 versus INT8 for efficient deep learning inference.
CoRR, 2023

Pruning vs Quantization: Which is Better?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

QBitOpt: Fast and Accurate Bitwidth Reallocation during Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Practical Mixed Precision Algorithm for Post-Training Quantization.
Proceedings of the 34th British Machine Vision Conference Workshop Proceedings, 2023

2022
Quantized Sparse Weight Decomposition for Neural Network Compression.
CoRR, 2022

FP8 Quantization: The Power of the Exponent.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cyclical Pruning for Sparse Neural Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Simulated Quantization, Real Power Savings.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
A White Paper on Neural Network Quantization.
CoRR, 2021

2020
Gradient 𝓁<sub>1</sub> Regularization for Quantization Robustness.
CoRR, 2020

Bayesian Bits: Unifying Quantization and Pruning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Up or Down? Adaptive Rounding for Post-Training Quantization.
Proceedings of the 37th International Conference on Machine Learning, 2020

Gradient $\ell_1$ Regularization for Quantization Robustness.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Data-Free Quantization Through Weight Equalization and Bias Correction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...