Michael Lang

Proceedings of the Workshop on Memory Centric Programming for HPC, 2017

Modeling UGAL on the Dragonfly Topology.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2017

Optimized scatter/gather data operations for parallel storage.

[BibT_eX]

[DOI]

Carlos Maltzahn

Proceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems, 2017

A comparative study of SDN and adaptive routing on dragonfly networks.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2017

UNITY: Unified Memory and File Space.

[BibT_eX]

[DOI]

Thaleia Dimitra Doudali

Pradeep Fernando

Proceedings of the 7th International Workshop on Runtime and Operating Systems for Supercomputers, 2017

Throughput Models of Interconnection Networks: The Good, the Bad, and the Ugly.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE Annual Symposium on High-Performance Interconnects, 2017

RSVP: Soft Error Resilient Power Savings at Near-Threshold Voltage Using Register Vulnerability.

[BibT_eX]

[DOI]

Proceedings of the 47th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2017

2016

Exploring the Design Tradeoffs for Extreme-Scale High-Performance Computing System Software.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2016

TracSim: Simulating and scheduling trapped power capacity to maximize machine room throughput.

[BibT_eX]

[DOI]

Parallel Comput., 2016

Load-balanced and locality-aware scheduling for data-intensive workloads at extreme scales.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2016

Power usage of production supercomputers and production workloads.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2016

Enhancing infiniband with openflow-style SDN capability.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2016

Active Burst-Buffer: In-Transit Processing Integrated into Hierarchical Storage.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Networking, 2016

Random Regular Graph and Generalized De Bruijn Graph with k-Shortest Path Routing.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

A Cross-Enclave Composition Mechanism for Exascale System Software.

[BibT_eX]

[DOI]

Proceedings of the 6th International Workshop on Runtime and Operating Systems for Supercomputers, 2016

Traffic Pattern-Based Adaptive Routing for Intra-Group Communication in Dragonfly Networks.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Annual Symposium on High-Performance Interconnects, 2016

Characterizing power and energy efficiency of legion runtime and applications: An early experience.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Green and Sustainable Computing Conference, 2016

2015

Hop: Elastic Consistency for Exascale Data Stores.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 30th International Conference, 2015

Measurement and characterization of Haswell power and energy consumption.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Workshop on Energy Efficient Supercomputing, 2015

Towards Scalable Distributed Workload Manager with Monitoring-Based Weakly Consistent Resource Stealing.

[BibT_eX]

[DOI]

Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

What is a Lightweight Kernel?

[BibT_eX]

[DOI]

Rolf Riesen

Arthur Barney Maccabe

Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers, 2015

System-Level Support for Composition of Applications.

[BibT_eX]

[DOI]

Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers, 2015

Dynamic Adaptation for Elastic System Services Using Virtual Servers.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015

Overcoming Hadoop Scaling Limitations through Distributed Task Execution.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Fast Calculation of Max-Min Fair Rates for Multi-commodity Flows in Fat-Tree Networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014

Static load-balanced routing for slimmed fat-trees.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2014

Trapped capacity: scheduling under a power cap to maximize machine-room throughput.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Workshop on Energy Efficient Supercomputing, 2014

LFTI: A New Performance Metric for Assessing Interconnect Designs for Extreme-Scale HPC Systems.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Next generation job management systems for extreme-scale ensemble computing.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Optimizing load balancing and data-locality with data-aware scheduling.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

Enabling composite applications through an asynchronous shared memory interface.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

2013

Optimizing process creation and execution on multi-core architectures.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2013

Understanding and isolating the noise in the Linux kernel.

[BibT_eX]

[DOI]

Lorie M. Liebrock

Int. J. High Perform. Comput. Appl., 2013

A new routing scheme for Jellyfish and its performance with HPC workloads.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2013

Using simulation to explore distributed key-value stores for extreme-scale system services.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2013

DRepl: Optimizing access to application data for analysis and visualization.

[BibT_eX]

[DOI]

Carlos Maltzahn

Proceedings of the IEEE 29th Symposium on Mass Storage Systems and Technologies, 2013

RRR: A Load Balanced Routing Scheme for Slimmed Fat-Trees.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Understanding the Performance of Two Production Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

A gossip-based approach to exascale system services.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, 2013

Transparently consistent asynchronous shared memory.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, 2013

Energy modeling of supercomputers and large-scale scientific applications.

[BibT_eX]

[DOI]

Proceedings of the International Green Computing Conference, 2013

HPC runtime support for fast and power efficient locking and synchronization.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

Multilevel Active Storage for big data applications in high performance computing.

[BibT_eX]

[DOI]

Chao Chen

Yong Chen

Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012

Optimizing latency and throughput for spawning processes on massively multicore processors.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Workshop on Runtime and Operating Systems for Supercomputers, 2012

Stepping towards noiseless Linux environment.

[BibT_eX]

[DOI]

Lorie M. Liebrock

Proceedings of the 2nd International Workshop on Runtime and Operating Systems for Supercomputers, 2012

The design and implementation of a multi-level content-addressable checkpoint file system.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on High Performance Computing, 2012

2011

Adapting wave-front algorithms to efficiently utilize systems with deep communication hierarchies.

[BibT_eX]

[DOI]

Parallel Comput., 2011

2010

On the Performance and Technological Impact of Adding Memory Controllers in Multi-Core Processors.

[BibT_eX]

[DOI]

José Carlos Sancho

Parallel Process. Lett., 2010

Optimized InfiniBand<sup>TM</sup> fat-tree routing for shift all-to-all communication patterns.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2010

Analyzing the trade-off between multiple memory controllers and memory channels on multi-core processor performance.

[BibT_eX]

[DOI]

José Carlos Sancho

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Characterizing the Impact of Using Spare-Cores on Application Performance.

[BibT_eX]

[DOI]

José Carlos Sancho

Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

2009

Implementation and performance modeling of deterministic particle transport (Sweep3D) on the IBM Cell/B.E.

[BibT_eX]

[DOI]

Sci. Program., 2009

The reverse-acceleration model for programming petascale hybrid systems.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2009

Using Performance Modeling to Design Large-Scale Systems.

[BibT_eX]

[DOI]

Computer, 2009

2008

Infiniband Routing Table Optimizations for Scientific Applications.

[BibT_eX]

[DOI]

Greg Johnson

Parallel Process. Lett., 2008

A Performance Evaluation of the Nehalem Quad-Core Processor for Scientific Computing.

[BibT_eX]

[DOI]

Parallel Process. Lett., 2008

Entering the petaflop era: the architecture and performance of Roadrunner.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Optimization of infiniband for scientific applications.

[BibT_eX]

[DOI]

Greg Johnson