Sara S. Baghsorkhi

According to our database1, Sara S. Baghsorkhi authored at least 15 papers between 2007 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2020
SAVE: Sparsity-Aware Vector Engine for Accelerating DNN Training and Inference on CPUs.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

2019
C3-Flow: Compute Compression Co-Design Flow for Deep Neural Networks.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019

2018
DeepThin: A Self-Compressing Library for Deep Neural Networks.
CoRR, 2018

Automating efficient variable-grained resiliency for low-power IoT systems.
Proceedings of the 2018 International Symposium on Code Generation and Optimization, 2018

2016
FlexVec: auto-vectorization for irregular loops.
Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2016

2012
Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU)
Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers, ISBN: 978-3-031-01737-7, 2012

Efficient performance evaluation of memory hierarchy for highly multithreaded graphics processors.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

2011
Efficient Performance Evaluation for Highly Multi-threaded Graphics Processors
PhD thesis, 2011

Auto-tuning of fast fourier transform on graphics processors.
Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011

2010
An adaptive performance modeling tool for GPU architectures.
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010

2008
Program optimization carving for GPU computing.
J. Parallel Distributed Comput., 2008

Optimization principles and application performance evaluation of a multithreaded GPU using CUDA.
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008

CUDA-Lite: Reducing GPU Programming Complexity.
Proceedings of the Languages and Compilers for Parallel Computing, 2008

Program optimization space pruning for a multithreaded gpu.
Proceedings of the Sixth International Symposium on Code Generation and Optimization (CGO 2008), 2008

2007
Implicitly Parallel Programming Models for Thousand-Core Microprocessors.
Proceedings of the 44th Design Automation Conference, 2007


  Loading...