Perhaad Mistry

According to our database, Perhaad Mistry authored at least 16 papers between 2009 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
Profiling OpenCL Kernels Using Wavefront Occupancy with Radeon GPU Profiler.
Proceedings of the International Workshop on OpenCL, 2019

2015
NUPAR: A Benchmark Suite for Modern GPU Architectures.
Proceedings of the 6th ACM/SPEC International Conference on Performance Engineering, Austin, TX, USA, January 31, 2015

2014
Analyzing power efficiency of optimization techniques and algorithm design methods for applications on heterogeneous platforms.
Int. J. High Perform. Comput. Appl., 2014

Runtime Support for Adaptive Spatial Partitioning and Inter-Kernel Communication on GPUs.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

A parallel clustering algorithm for placement.
Proceedings of the Fifteenth International Symposium on Quality Electronic Design, 2014

Exploring the Heterogeneous Design Space for both Performance and Reliability.
Proceedings of the 51st Annual Design Automation Conference 2014, 2014

2013
Quantifying the energy efficiency of FFT on heterogeneous platforms.
Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2013

Valar: a benchmark suite to study the dynamic behavior of heterogeneous systems.
Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units, 2013

Heterogeneous Computing with OpenCL - Revised OpenCL 1.2 Edition.
Morgan Kaufmann, ISBN: 978-0-12-405894-1, 2013

2012
Multi2Sim: a simulation framework for CPU-GPU computing.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011
Exploiting Memory Access Patterns to Improve Memory Performance in Data-Parallel Architectures.
IEEE Trans. Parallel Distributed Syst., 2011

Analyzing program flow within a many-kernel OpenCL application.
Proceedings of 4th Workshop on General Purpose Processing on Graphics Processing Units, 2011

2010
Data Structures and Transformations for Physically Based Simulation on a GPU.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

Data transformations enabling loop vectorization on multithreaded data parallel architectures.
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010

2009
Profile-Guided Optimization of Critical Medical Imaging Algorithms.
Proceedings of the 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, Boston, MA, USA, June 28, 2009

Accelerating phase unwrapping and affine transformations for optical quadrature microscopy using CUDA.
Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, 2009

