Tijmen Blankevoort

According to our database1, Tijmen Blankevoort authored at least 36 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning.
CoRR, 2024

Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding.
CoRR, 2024

GPTVQ: The Blessing of Dimensionality for LLM Quantization.
CoRR, 2024

2023
The LLM Surgeon.
CoRR, 2023

VeRA: Vector-based Random Matrix Adaptation.
CoRR, 2023

FP8 versus INT8 for efficient deep learning inference.
CoRR, 2023

Scalarization for Multi-Task and Multi-Domain Learning at Scale.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pruning vs Quantization: Which is Better?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

QBitOpt: Fast and Accurate Bitwidth Reallocation during Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MSViT: Dynamic Mixed-scale Tokenization for Vision Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficient Neural PDE-Solvers using Quantization Aware Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Practical Mixed Precision Algorithm for Post-Training Quantization.
Proceedings of the 34th British Machine Vision Conference Workshop Proceedings, 2023

2022
Neural Network Quantization with AI Model Efficiency Toolkit (AIMET).
CoRR, 2022

FP8 Quantization: The Power of the Exponent.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Overcoming Oscillations in Quantization-Aware Training.
Proceedings of the International Conference on Machine Learning, 2022

Cyclical Pruning for Sparse Neural Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Simple and Efficient Architectures for Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Simulated Quantization, Real Power Savings.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Revisiting single-gated Mixtures of Experts.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
A White Paper on Neural Network Quantization.
CoRR, 2021

Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Understanding and Overcoming the Challenges of Efficient Transformer Quantization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Learned Threshold Pruning.
CoRR, 2020

Gradient 𝓁<sub>1</sub> Regularization for Quantization Robustness.
CoRR, 2020

Bayesian Bits: Unifying Quantization and Pruning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Up or Down? Adaptive Rounding for Post-Training Quantization.
Proceedings of the 37th International Conference on Machine Learning, 2020

Batch-shaping for learning conditional channel gated networks.
Proceedings of the 8th International Conference on Learning Representations, 2020

Gradient $\ell_1$ Regularization for Quantization Robustness.
Proceedings of the 8th International Conference on Learning Representations, 2020

Differentiable Joint Pruning and Quantization for Hardware Efficiency.
Proceedings of the Computer Vision - ECCV 2020, 2020

LSQ+: Improving low-bit quantization through learnable offsets and better initialization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Conditional Channel Gated Networks for Task-Aware Continual Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Taxonomy and Evaluation of Structured Compression of Convolutional Neural Networks.
CoRR, 2019

Batch-Shaped Channel Gated Networks.
CoRR, 2019

Relaxed Quantization for Discretized Neural Networks.
Proceedings of the 7th International Conference on Learning Representations, 2019

Data-Free Quantization Through Weight Equalization and Bias Correction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...