Bin Ren

Roberto Gioiosa

Jacques Pienaar

Gokcen Kestor

Proceedings of the Languages and Compilers for Parallel Computing, 2020

Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Parallelizing pruned landmark labeling: dealing with dependencies in graph algorithms.

[BibT_eX]

[DOI]

Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

On Efficient Constructions of Checkpoints.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

A Privacy-Preserving-Oriented DNN Pruning and Mobile Acceleration Framework.

[BibT_eX]

[DOI]

Proceedings of the GLSVLSI '20: Great Lakes Symposium on VLSI 2020, 2020

An Image Enhancing Pattern-Based Sparsity for Real-Time Inference on Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

ATMem: adaptive data placement in graph applications on heterogeneous memories.

[BibT_eX]

[DOI]

Proceedings of the CGO '20: 18th ACM/IEEE International Symposium on Code Generation and Optimization, 2020

PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning.

[BibT_eX]

[DOI]

Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Extracting SIMD Parallelism from Recursive Task-Parallel Programs.

[BibT_eX]

[DOI]

Shruthi Balakrishna

Youngjoon Jo

Kunal Agrawal

Milind Kulkarni

ACM Trans. Parallel Comput., 2019

Pruned Landmark Labeling Meets Vertex Centric Computation: A Surprisingly Happy Marriage!

[BibT_eX]

[DOI]

CoRR, 2019

26ms Inference Time for ResNet-50: Towards Real-Time Execution of all DNNs on Smartphone.

[BibT_eX]

[DOI]

CoRR, 2019

MemXCT: memory-centric X-ray CT reconstruction with massive parallelization.

[BibT_eX]

[DOI]

Mert Hidayetoglu

Tekin Biçer

Simon Garcia De Gonzalo

Proceedings of the International Conference for High Performance Computing, 2019

Transforming Query Sequences for High-Throughput B+ Tree Processing on Many-Core Processors.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

2018

Graphphi: efficient parallel graph processing on emerging throughput-oriented architectures.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018

2017

Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs.

[BibT_eX]

[DOI]

Kunal Agrawal

Milind Kulkarni

Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Real-Time Data Analysis and Autonomous Steering of Synchrotron Light Source Experiments.

[BibT_eX]

[DOI]

Proceedings of the 13th IEEE International Conference on e-Science, 2017

2016

User-Assisted Store Recycling for Dynamic Task Graph Schedulers.

[BibT_eX]

[DOI]

Mehmet Can Kurt

ACM Trans. Archit. Code Optim., 2016

User-assisted storage reuse determination for dynamic task graphs.

[BibT_eX]

[DOI]

Mehmet Can Kurt

Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

On the Impact of Widening Vector Registers on Sequence Alignment.

[BibT_eX]

[DOI]

Jeffrey Daily

Ananth Kalyanaraman

Proceedings of the 45th International Conference on Parallel Processing, 2016

MicroSpec: Speculation-Centric Fine-Grained Parallelization for FSM Computations.

[BibT_eX]

[DOI]

Junqiao Qiu

Zhijia Zhao

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015

Efficient execution of recursive programs on commodity vector hardware.

[BibT_eX]

[DOI]

Youngjoon Jo

Kunal Agrawal

Milind Kulkarni

Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015

Automatic and Efficient Data Host-Device Communication for Many-Core Coprocessors.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2015

Low-Overhead Fault-Tolerance Support Using DISC Programming Model.

[BibT_eX]

[DOI]

Mehmet Can Kurt

Proceedings of the Languages and Compilers for Parallel Computing, 2015

Efficient and Simplified Parallel Graph Processing over CPU and MIC.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014

A Portable Optimization Engine for Accelerating Irregular Data-Traversal Applications on SIMD Architectures.

[BibT_eX]

[DOI]

Todd Mytkowicz

ACM Trans. Archit. Code Optim., 2014

Automating and optimizing data transfers for many-core coprocessors.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Supercomputing, 2014

A programming system for xeon phis with runtime SIMD parallelization.

[BibT_eX]

[DOI]

Xin Huo

Proceedings of the 2014 International Conference on Supercomputing, 2014

2013

SIMD parallelization of applications that traverse irregular data structures.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, 2013

2012

Fine-grained parallel traversals of irregular data structures.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011

Translating Chapel to Use FREERIDE: A Case Study in Using an HPC Language for Data-Intensive Computing.

[BibT_eX]

[DOI]

Bradford L. Chamberlain

Steven J. Deitz

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Compiling Dynamic Data Structures in Python to Enable the Use of Multi-core and Many-core Libraries.

[BibT_eX]

[DOI]