We stand with Ukraine

We stand with Ukraine

Roberto L. Castro

Orcid: 0000-0001-5493-0287

According to our database¹, Roberto L. Castro authored at least 12 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization.

[BibT_eX]

[DOI]

Vage Egiazarian

,

Roberto L. Castro

,

Denis Kuznedelev

,

Andrei Panferov

,

,

,

Alexandre Noll Marques

,

,

,

Torsten Hoefler

,

CoRR, September, 2025

Quartet: Native FP4 Training Can Be Optimal for Large Language Models.

[BibT_eX]

[DOI]

Roberto L. Castro

,

Andrei Panferov

,

,

Oliver Sieberling

,

,

,

,

CoRR, May, 2025

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations.

[BibT_eX]

[DOI]

Andrei Panferov

,

,

,

Roberto L. Castro

,

,

CoRR, February, 2025

HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs.

[BibT_eX]

[DOI]

,

,

,

Roberto L. Castro

,

Torsten Hoefler

,

CoRR, January, 2025

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.

[BibT_eX]

[DOI]

,

Roberto L. Castro

,

,

Torsten Hoefler

,

Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

Adapt-S: Effective DNN Pruning via Unified Accuracy and Performance Tuning.

[BibT_eX]

[DOI]

Roberto L. Castro

,

,

Basilio B. Fraguela

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2025

2024

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.

[BibT_eX]

[DOI]

,

Roberto L. Castro

,

,

Torsten Hoefler

,

Dataset, November, 2024

STuning-DL: Model-Driven Autotuning of Sparse GPU Kernels for Deep Learning.

[BibT_eX]

[DOI]

Roberto L. Castro

,

,

Basilio B. Fraguela

IEEE Access, 2024

2023

VENOM: A Vectorized N: M Format for Unleashing the Power of Sparse Tensor Cores.

[BibT_eX]

[DOI]

Roberto L. Castro

,

,

,

,

Basilio B. Fraguela

,

Torsten Hoefler

Proceedings of the International Conference for High Performance Computing, 2023

2022

Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM Routine on Ampere GPUs.

[BibT_eX]

[DOI]

Roberto L. Castro

,

,

Basilio B. Fraguela

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2020

Reusing Trained Layers of Convolutional Neural Networks to Shorten Hyperparameters Tuning Time.

[BibT_eX]

[DOI]

Roberto L. Castro

,

,

Basilio B. Fraguela

CoRR, 2020

A Hybrid Approach for Tracking Individual Players in Broadcast Match Videos.

[BibT_eX]

[DOI]

Roberto L. Castro

,

,

Basilio B. Fraguela

CoRR, 2020

Loading...