Proceedings of the GPGPU@PPoPP '20: 13th Annual Workshop on General Purpose Processing using Graphics Processing Unit colocated with 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

COMET: A Domain-Specific Compilation of High-Performance Computational Chemistry.

[BibT_eX]

[DOI]

Erdal Mutlu

Ruiqin Tian

Bin Ren

Sriram Krishnamoorthy

Roberto Gioiosa

Jacques A. Pienaar

Gokcen Kestor

Proceedings of the Languages and Compilers for Parallel Computing, 2020

2019

Advert: An Asynchronous Runtime for Fine-Grained Network Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM Third Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware, 2019

Argo.

[BibT_eX]

[DOI]

Proceedings of the Operating Systems for Supercomputers and High Performance Computing, 2019

2018

Characterizing the performance benefit of hybrid memory system for HPC applications.

[BibT_eX]

[DOI]

Parallel Comput., 2018

Understanding scale-Dependent soft-Error Behavior of Scientific Applications.

[BibT_eX]

[DOI]

Gokcen Kestor

Ivy Bo Peng

Roberto Gioiosa

Sriram Krishnamoorthy

Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018

2017

MPI Streams for HPC Applications.

[BibT_eX]

[DOI]

CoRR, 2017

Exploring the Performance Benefit of Hybrid Memory System on HPC Environments.

[BibT_eX]

[DOI]

CoRR, 2017

MPI windows on storage for HPC applications.

[BibT_eX]

[DOI]

Proceedings of the 24th European MPI Users' Group Meeting, 2017

RTHMS: a tool for data placement on hybrid memory system.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM SIGPLAN International Symposium on Memory Management, 2017

Argo NodeOS: Toward Unified Resource Management for Exascale.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Exploring the Performance Benefit of Hybrid Memory System on HPC Environments.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Exploring DataVortex Systems for Irregular Applications.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Exploring the Effect of Compiler Optimizations on the Reliability of HPC Applications.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Preparing HPC Applications for the Exascale Era: A Decoupling Strategy.

[BibT_eX]

[DOI]

Proceedings of the 46th International Conference on Parallel Processing, 2017

Pushing the Limits of Irregular Access Patterns on Emerging Network Architecture: A Case Study.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

Extending Message Passing Interface Windows to Storage.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

2016

Assessing Advanced Technology in CENATE.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Networking, 2016

A Performance Characterization of Streaming Computing on Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computational Science 2016, 2016

Idle Period Propagation in Message-Passing Applications.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

Exploring Application Performance on Emerging Hybrid-Memory Supercomputers.

[BibT_eX]

[DOI]

Exploring Data Vortex Network Architectures.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Annual Symposium on High-Performance Interconnects, 2016

2015

Understanding the propagation of transient errors in HPC applications.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Prometheus: scalable and accurate emulation of task-based applications on many-core systems.

[BibT_eX]

[DOI]

Gokcen Kestor

Roberto Gioiosa

Daniel G. Chavarría-Miranda

Proceedings of the 2015 IEEE International Symposium on Performance Analysis of Systems and Software, 2015

A Container-Based Approach to OS Specialization for Exascale Computing.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Cloud Engineering, 2015

Analyzing System Calls in Multi-OS Hierarchical Environments.

[BibT_eX]

[DOI]

Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers, 2015

On the Application Task Granularity and the Interplay with the Scheduling Overhead in Many-Core Shared Memory Systems.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014

Adaptive Prefetching on POWER7: Improving Performance and Power Consumption.

[BibT_eX]

[DOI]

ACM Trans. Parallel Comput., 2014

Online Monitoring Systems for Performance Fault Detection.

[BibT_eX]

[DOI]

Roberto Gioiosa

Gokcen Kestor

Darren J. Kerbyson

Parallel Process. Lett., 2014

Evaluating performance and power efficiency of scientific applications on multi-threaded systems.

[BibT_eX]

[DOI]

Roberto Gioiosa

Darren J. Kerbyson

Adolfy Hoisie

Proceedings of the 2nd International Workshop on Energy Efficient Supercomputing, 2014

Cross-Layer Self-Adaptive/Self-Aware System Software for Exascale Systems.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Online Monitoring System for Performance Fault Detection.

[BibT_eX]

[DOI]

Roberto Gioiosa

Gokcen Kestor

Darren J. Kerbyson

Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013

SMT Malleability in IBM POWER5 and POWER6 Processors.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2013

Quantifying the energy cost of data movement in scientific applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Workload Characterization, 2013

Enabling accurate power profiling of HPC applications on exascale systems.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, 2013

2012

CPU Accounting for Multicore Processors.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2012

HPPAC Introduction.

[BibT_eX]

[DOI]

Bronis R. de Supinski

Roberto Gioiosa

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Evaluating the Impact of TLB Misses on Future HPC Systems.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Characterizing thread placement in the IBM POWER7 processor.

[BibT_eX]

[DOI]

Stelios Manousopoulos

Proceedings of the 2012 IEEE International Symposium on Workload Characterization, 2012

Enhancing the performance of assisted execution runtime systems through hardware/software techniques.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Supercomputing, 2012

Assessing the suitability of the NGMP multi-core processor in the space domain.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on Embedded Software, 2012

Making data prefetch smarter: adaptive prefetching on POWER7.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011

Energy-Aware Accounting and Billing in Large-Scale Computing Facilities.

[BibT_eX]

[DOI]

IEEE Micro, 2011

Characterizing Power and Temperature Behavior of POWER6-Based System.

[BibT_eX]

[DOI]

IEEE J. Emerg. Sel. Topics Circuits Syst., 2011

A Quantitative Analysis of OS Noise.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

HPPAC Introduction.

[BibT_eX]

[DOI]

Rong Ge

Roberto Gioiosa

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

STM2: A Parallel STM for High Performance Simultaneous Multithreading Systems.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010

Trends and techniques for energy efficient architectures.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE/IFIP VLSI-SoC 2010, 2010

Towards sustainable exascale computing.

[BibT_eX]

[DOI]

Roberto Gioiosa

Proceedings of the 18th IEEE/IFIP VLSI-SoC 2010, 2010

Power and performance aware reconfigurable cache for CMPs.

[BibT_eX]

[DOI]

Proceedings of the Second International Forum on Next-Generation Multicore/Manycore Technologies, 2010

Portable, scalable, per-core power estimation for intelligent resource management.

[BibT_eX]

[DOI]

Proceedings of the International Green Computing Conference 2010, 2010

Designing OS for HPC Applications: Scheduling.

[BibT_eX]

[DOI]

Roberto Gioiosa

Sally A. McKee

Mateo Valero

Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010

Power and thermal characterization of POWER6 system.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009

CPU Accounting in CMP Processors.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2009

A global operating system for HPC clusters.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

ITCA: Inter-task Conflict-Aware CPU Accounting for CMPs.

[BibT_eX]

[DOI]

Proceedings of the PACT 2009, 2009

2008

Hard Real-Time Performances in Multiprocessor-Embedded Systems Using ASMP-Linux.

[BibT_eX]

[DOI]

EURASIP J. Embed. Syst., 2008

A dynamic scheduler for balancing HPC applications.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Measuring Operating System Overhead on CMT Processors.

[BibT_eX]

[DOI]

Proceedings of the 20th International Symposium on Computer Architecture and High Performance Computing, 2008

Software-Controlled Priority Characterization of POWER5 Processor.

[BibT_eX]

[DOI]

Proceedings of the 35th International Symposium on Computer Architecture (ISCA 2008), 2008

Balancing HPC applications through smart allocation of resources in MT processors.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Soft Real-Time Scheduling on SMT Processors with Explicit Resource Allocation.

[BibT_eX]

[DOI]

Proceedings of the Architecture of Computing Systems, 2008

2005

Transparent, Incremental Checkpointing at Kernel Level: a Foundation for Fault Tolerance for Parallel Computers.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance.

[BibT_eX]

[DOI]

Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Roberto Gioiosa

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...