Andrew Gu

ORCID: 0009-0001-1236-9484

According to our database, Andrew Gu authored at least 10 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026

2025
Meta Lattice: Model Space Redesign for Cost-Effective Industry-Scale Ads Recommendations.
CoRR, December, 2025

GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection.
CoRR, April, 2025

Scaling Llama 3 Training with Efficient Parallelism Strategies.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

TorchTitan: One-stop PyTorch native solution for production ready LLM pretraining.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile.
CoRR, 2024

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training.
CoRR, 2024

2023
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel.
Proc. VLDB Endow., 2023

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel.
CoRR, 2023

2021
Deep Transfer Learning for Infectious Disease Case Detection Using Electronic Medical Records.
CoRR, 2021
