Hao Yu

Affiliations:
  • IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
  • Texas A&M University, College Station, TX, USA (PhD 2004)


According to our database1, Hao Yu authored at least 29 papers between 1999 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Chic-sched: a HPC Placement-Group Scheduler on Hierarchical Topologies with Constraints.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

2018
Graph Programming Interface (GPI): A Linear Algebra Programming Model for Large Scale Graph Computations.
Int. J. Parallel Program., 2018

2017
Automation of Cloud Node Installation for Testing and Scalable Provisioning.
Proceedings of the Companion Proceedings of the 10th International Conference on Utility and Cloud Computing, 2017

A Linear Algebra-Based Programming Interface for Graph Computations in Scala and Spark.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016
Graph programming interface (GPI): a linear algebra programming model for large scale graph computations.
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2014
Author retrospective for adaptive reduction parallelization techniques.
Proceedings of the ACM International Conference on Supercomputing 25th Anniversary Volume, 2014

2013
Platform and applications for massive-scale streaming network analytics.
IBM J. Res. Dev., 2013

2011
Optimization of stateful hardware acceleration in hybrid architectures.
Proceedings of the Design, Automation and Test in Europe, 2011

2010
Exploiting heterogeneous multicore-processor systems for high-performance network processing.
IBM J. Res. Dev., 2010

2009
Application level I/O caching on Blue Gene/P systems.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

2008
Early experiences in application level I/O tracing on blue gene systems.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Stateful hardware decompression in networking environment.
Proceedings of the 2008 ACM/IEEE Symposium on Architecture for Networking and Communications Systems, 2008

2007
Performance Evaluation of a Commercial Application, Trade, in Scale-out Environments.
Proceedings of the 15th International Symposium on Modeling, 2007

Performance Studies of a WebSphere Application, Trade, in Scale-out and Scale-up Environments.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Performance Studies of Commercial Workloads on a Multi-core System.
Proceedings of the IEEE 10th International Symposium on Workload Characterization, 2007

2006
An Adaptive Algorithm Selection Framework for Reduction Parallelization.
IEEE Trans. Parallel Distributed Syst., 2006

Blue Gene system software - Topology mapping for Blue Gene/L supercomputer.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

MPI tools and performance studies - MPI performance analysis tools on Blue Gene/L.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

A study of MPI performance analysis tools on Blue Gene/L.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

High performance file I/O for the Blue Gene/L supercomputer.
Proceedings of the 12th International Symposium on High-Performance Computer Architecture, 2006

2004
Run-time optimization of adaptive irregular applications.
PhD thesis, 2004

An Adaptive Algorithm Selection Framework.
Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques (PACT 2004), 29 September, 2004

2002
Parallel Reductions: An Application of Adaptive Algorithm Selection.
Proceedings of the Languages and Compilers for Parallel Computing, 15th Workshop, 2002

The R-LRPD Test: Speculative Parallelization of Partially Parallel Loops.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

SmartApps: An Application Centric Approach to High Performance Computing: Compiler-Assisted Software and Hardware Support for Reduction Operations.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

2001
Architectural Support for Parallel Reductions in Scalable Shared-Memory Multiprocessors.
Proceedings of the 2001 International Conference on Parallel Architectures and Compilation Techniques (PACT 2001), 2001

2000
Adaptive reduction parallelization techniques.
Proceedings of the 14th international conference on Supercomputing, 2000

Techniques for Reducing the Overhead of Run-Time Parallelization.
Proceedings of the Compiler Construction, 9th International Conference, 2000

1999
Run-Time Parallelization Optimization Techniques.
Proceedings of the Languages and Compilers for Parallel Computing, 1999


  Loading...