Elvis Rojas

Orcid: 0000-0002-4238-0908

According to our database1, Elvis Rojas authored at least 8 papers between 2019 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
Exploring the Effects of Silent Data Corruption in Distributed Deep Learning Training.
Proceedings of the 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2022

Early Experiences of Noise-Sensitivity Performance Analysis of a Distributed Deep Learning Framework.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2021
Understanding failures through the lifetime of a top-level supercomputer.
J. Parallel Distributed Comput., 2021

Understanding Soft Error Sensitivity of Deep Learning Models and Frameworks through Checkpoint Alteration.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

Large-Scale Distributed Deep Learning: A Study of Mechanisms and Trade-Offs with PyTorch.
Proceedings of the High Performance Computing - 8th Latin American Conference, 2021

2020
A Study of Checkpointing in Large Scale Training of Deep Neural Networks.
CoRR, 2020

Towards a Model to Estimate the Reliability of Large-Scale Hybrid Supercomputers.
Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019
Analyzing a Five-Year Failure Record of a Leadership-Class Supercomputer.
Proceedings of the 31st International Symposium on Computer Architecture and High Performance Computing, 2019


  Loading...