Zach DeVito

According to our database, Zach DeVito authored at least 26 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Revisiting Reliability in Large-Scale Machine Learning Research Clusters.
CoRR, 2024

Is Flash Attention Stable?
CoRR, 2024

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2024

MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024


2023
A Theory on Adam Instability in Large-Scale Machine Learning.
CoRR, 2023


2022
torch.fx: Practical Program Capture and Transformation for Deep Learning in Python.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

2021
Using Python for Model Inference in Deep Learning.
CoRR, 2021

2020
The Next 700 Accelerated Layers: From Mathematical Expressions of Network Computation Graphs to Accelerated GPU Kernels, Automatically.
ACM Trans. Archit. Code Optim., 2020

2019
PyTorch: An Imperative Style, High-Performance Deep Learning Library.
Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions.
CoRR, 2018

2017
Opt: A Domain Specific Language for Non-Linear Least Squares Optimization in Graphics and Imaging.
ACM Trans. Graph., 2017

2016
Rigel: flexible multi-rate image processing hardware.
ACM Trans. Graph., 2016

Ebb: A DSL for Physical Simulation on CPUs and GPUs.
ACM Trans. Graph., 2016

2015
Ebb: A DSL for Physical Simulation on CPUs and GPUs.
CoRR, 2015

The Design of Terra: Harnessing the Best Features of High-Level and Low-Level Languages.
Proceedings of the 1st Summit on Advances in Programming Languages, 2015

2014
Terra: simplifying high-performance programming using multi-stage programming.
PhD thesis, 2014

Darkroom: compiling high-level image processing code into hardware pipelines.
ACM Trans. Graph., 2014

Just-in-time Length Specialization of Dynamic Vector Code.
ARRAY'14: Proceedings of the 2014 ACM SIGPLAN International Workshop on Libraries, 2014

First-class runtime generation of high-performance types using exotypes.
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2014

2013
Terra: a multi-stage language for high-performance computing.
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2013

Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

2012
Riposte: a trace-driven compiler and parallel VM for vector code in R.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011
Liszt: a domain specific language for building portable mesh-based PDE solvers.
Proceedings of the Conference on High Performance Computing Networking, 2011

2010
Language virtualization for heterogeneous parallel computing.
Proceedings of the 25th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2010

