Andrew Gu

ORCID: 0009-0001-1236-9484

According to our database, Andrew Gu authored at least 8 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection.
CoRR, April, 2025

Scaling Llama 3 Training with Efficient Parallelism Strategies.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

TorchTitan: One-stop PyTorch native solution for production ready LLM pretraining.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile.
CoRR, 2024

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training.
CoRR, 2024

2023
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel.
Proc. VLDB Endow., 2023

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel.
CoRR, 2023

2021
Deep Transfer Learning for Infectious Disease Case Detection Using Electronic Medical Records.
CoRR, 2021

