Suchita Pati

Orcid: 0009-0008-1083-1146

According to our database1, Suchita Pati authored at least 14 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
GOLDYLOC: Global Optimizations & Lightweight Dynamic Logic for Concurrency.
ACM Trans. Archit. Code Optim., June, 2025

ConCCL: Optimizing ML Concurrent Computation and Communication with GPU DMA Engines.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2025

2024
Optimizing ML Concurrent Computation and Communication with GPU DMA Engines.
CoRR, 2024

Global Optimizations & Lightweight Dynamic Logic for Concurrency.
CoRR, 2024

JIT-Q: Just-in-time Quantization with Processing-In-Memory for Efficient ML Training.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute & Collectives.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Just-in-time Quantization with Processing-In-Memory for Efficient ML Training.
CoRR, 2023

Computation vs. Communication Scaling for Future Transformers on Future Hardware.
CoRR, 2023

Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware.
Proceedings of the IEEE International Symposium on Workload Characterization, 2023

2022
Demystifying BERT: System Design Implications.
Proceedings of the IEEE International Symposium on Workload Characterization, 2022

2021
Demystifying BERT: Implications for Accelerator Design.
CoRR, 2021

2020
SeqPoint: Identifying Representative Iterations of Sequence-Based Neural Networks.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2020

2019
Analyzing Machine Learning Workloads Using a Detailed GPU Simulator.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2019

2017
DARTS: Performance-counter driven sampling using binary translators.
Proceedings of the 2017 IEEE International Symposium on Performance Analysis of Systems and Software, 2017


  Loading...