Roberto L. Castro

Orcid: 0000-0001-5493-0287

According to our database1, Roberto L. Castro authored at least 11 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Quartet: Native FP4 Training Can Be Optimal for Large Language Models.
CoRR, May, 2025

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations.
CoRR, February, 2025

HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs.
CoRR, January, 2025

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

Adapt-S: Effective DNN Pruning via Unified Accuracy and Performance Tuning.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2025

2024
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models.
Dataset, November, 2024

STuning-DL: Model-Driven Autotuning of Sparse GPU Kernels for Deep Learning.
IEEE Access, 2024

2023
VENOM: A Vectorized N: M Format for Unleashing the Power of Sparse Tensor Cores.
Proceedings of the International Conference for High Performance Computing, 2023

2022
Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM Routine on Ampere GPUs.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2020
Reusing Trained Layers of Convolutional Neural Networks to Shorten Hyperparameters Tuning Time.
CoRR, 2020

A Hybrid Approach for Tracking Individual Players in Broadcast Match Videos.
CoRR, 2020


  Loading...