Jarek Nieplocha

According to our database, Jarek Nieplocha authored at least 84 papers between 1994 and 2011.

Bibliography

2011
Global Arrays Parallel Programming Toolkit.
Encyclopedia of Parallel Computing, 2011

2010
Implementation and evaluation of active storage in modern parallel file systems.
Parallel Comput., 2010

NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations.
Comput. Phys. Commun., 2010

2009
Design and implementation of a high-performance CCA event service.
Concurr. Comput. Pract. Exp., 2009

Scalable work stealing.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

Scalable transparent checkpoint-restart of global address space applications on virtual machines over infiniband.
Proceedings of the 6th Conference on Computing Frontiers, 2009

2008
A framework for characterizing overlap of communication and computation in parallel applications.
Clust. Comput., 2008

Early experience with out-of-core applications on the Cray XMT.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Evaluation of Remote Memory Access Communication on the IBM Blue Gene/P Supercomputer.
Proceedings of the 37th International Conference on Parallel Processing, 2008

Scioto: A Framework for Global-View Task Parallelism.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

Integrated Data and Task Management for Scientific Applications.
Proceedings of the Computational Science, 2008

Efficient Management of Complex Striped Files in Active Storage.
Proceedings of the Euro-Par 2008, 2008

2007
Using the GA and TAO toolkits for solving large-scale optimization problems on parallel computers.
ACM Trans. Math. Softw., 2007

Evaluation of active storage strategies for the lustre parallel file system.
Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

Towards Fault Resilient Global Arrays.
Proceedings of the Parallel Computing: Architectures, 2007

Evaluation of Remote Memory Access Communication on the Cray XT3.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Probability Convergence in a Multithreaded Counting Application.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Scalable Visual Analytics of Massive Textual Datasets.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

A global address space framework for locality aware scheduling of block-sparse computations.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Transparent system-level migration of PGAS applications using Xen on InfiniBand.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Non-collective parallel I/O for global address space programming models.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Evaluating the potential of multithreaded platforms for irregular scientific computations.
Proceedings of the 4th Conference on Computing Frontiers, 2007

2006
ScalaBLAST: A Scalable Implementation of BLAST for High-Performance Data-Intensive Bioinformatics Analysis.
IEEE Trans. Parallel Distributed Syst., 2006

Layout transformation support for the disk resident arrays framework.
J. Supercomput., 2006

SFT: scalable fault tolerance.
ACM SIGOPS Oper. Syst. Rev., 2006

High Performance Remote Memory Access Communication: The ARMCI Approach.
Int. J. High Perform. Comput. Appl., 2006

Advances, Applications and Performance of the Global Arrays Shared Memory Programming Toolkit.
Int. J. High Perform. Comput. Appl., 2006

A Component Architecture for High-Performance Scientific Computing.
Int. J. High Perform. Comput. Appl., 2006

M12 - Overview of the global arrays parallel software development toolkit.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Data management and query - Hypergraph partitioning for automatic memory hierarchy management.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Blue Gene system software - Design and implementation of a one-sided communication interface for the IBM eServer Blue Gene® supercomputer.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

An approach to locality-conscious load balancing and transparent memory hierarchy management with a global-address-space parallel programming model.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

An extensible global address space framework with decoupled task and data abstractions.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

A Performance Instrumentation Framework to Characterize Computation-Communication Overlap in Message-Passing Systems.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

Memory efficient parallel matrix multiplication operation for irregular problems.
Proceedings of the Third Conference on Computing Frontiers, 2006

Topology-aware tile mapping for clusters of SMPs.
Proceedings of the Third Conference on Computing Frontiers, 2006

2005
QsNetII: Defining High-Performance Network Design.
IEEE Micro, 2005

Optimizing All-to-All Collective Communication by Exploiting Concurrency in Modern Networks.
Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Multilevel Parallelism in Computational Chemistry using Common Component Architecture and Global Arrays.
Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Parallelization of the NAS Conjugate Gradient Benchmark Using the Global Arrays Shared Memory Programming Model.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

An Evaluation of Two Implementation Strategies for Optimizing One-Sided Atomic Reduction.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Data and Computation Abstractions for Dynamic and Irregular Computations.
Proceedings of the High Performance Computing, 2005

Symmetric Data Objects and Remote Memory Access Communication for Fortran-95 Applications.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

Exploiting processor groups to extend scalability of the GA shared memory programming model.
Proceedings of the Second Conference on Computing Frontiers, 2005

2004
Component-based integration of chemistry and optimization software.
J. Comput. Chem., 2004

Optimisation and performance evaluation of mechanisms for latency tolerance in remote memory access communication on clusters.
Int. J. High Perform. Comput. Netw., 2004

Host-Assisted Zero-Copy Remote Memory Access Communication on InfiniBand.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

SRUMMA: A Matrix Multiplication Algorithm Suitable for Clusters and Scalable Shared Memory Systems.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Processor-Group Aware Runtime Support for Shared- and Global-Address Space Models.
Proceedings of the 33rd International Conference on Parallel Processing Workshops (ICPP 2004 Workshops), 2004

Optimizing Parallel Multiplication Operation for Rectangular and Transposed Matrices.
Proceedings of the 10th International Conference on Parallel and Distributed Systems, 2004

What are the future trends in high-performance interconnects for parallel computers? [Panel 1].
Proceedings of the 12th Annual IEEE Symposium on High Performance Interconnects, 2004

Toward Efficient Compilation of User-Defined Extensible Fortran Directives.
Proceedings of the 9th International Workshop on High-Level Programming Models and Supportive Environments (HIPS 2004), 2004

Efficient Layout Transformation for Disk-Based Multidimensional Arrays.
Proceedings of the High Performance Computing, 2004

Co-array Python: A Parallel Extension to the Python Language.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

2003
One-Sided Communication on Clusters with Myrinet.
Clust. Comput., 2003

Fast Collective Operations Using Shared and Remote Memory Access Protocols on Clusters.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Efficient Collective Operations Using Remote Memory Operations on VIA-Based Clusters.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Optimizing Synchronization Operations for Remote Memory Communication Systems.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Exploiting Non-blocking Remote Memory Access Communication in Scientific Benchmarks.
Proceedings of the High Performance Computing - HiPC 2003, 10th International Conference, 2003

Shared Memory Mirroring for Reducing Communication Overhead on Commodity Networks.
Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

Optimizing Mechanisms for Latency Tolerance in Remote Memory Access Communication on Clusters.
Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

2002
Efficient Algorithms for Ghost Cell Updates on Two Classes of MPP Architectures.
Proceedings of the International Conference on Parallel and Distributed Computing Systems, 2002

Protocols and Strategies for Optimizing Performance of Remote Memory Operations on Clusters.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Efficient Barrier Using Remote Memory Operations on VIA-Based Clusters.
Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002

2001
Building an Application Domain Specific Programming Framework for Computational Fluid Dynamics Calculations on Parallel Computers.
Proceedings of the Tenth SIAM Conference on Parallel Processing for Scientific Computing, 2001

One-sided Communication on the Myrinet-based SMP Clusters using the GM Message-Passing Library.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

2000
Computational chemistry on Fujitsu vector-parallel processors: Hardware and programming environment.
Parallel Comput., 2000

A Multiprotocol Communication Support for the Global Address Space Programming Model on the IBM SP.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999
Implementing noncollective parallel I/O in cluster environments using Active Message communication.
Clust. Comput., 1999

ARMCI: A Portable Remote Memory Copy Library for Distributed Array Libraries and Compiler Run-Time Systems.
Proceedings of the Parallel and Distributed Processing, 1999

1998
ChemIO: High Performance Parallel I/O for Computational Chemistry Applications.
Int. J. High Perform. Comput. Appl., 1998

An out-of-core implementation of the COLUMBUS massively-parallel multireference configuration interaction program.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1998

Performance and Experience with LAPI - a New High-Performance Communication Library for the IBM RS/6000 SP.
Proceedings of the 12th International Parallel Processing Symposium / 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP '98), March 30, 1998

Distant I/O: One-Sided Access to Secondary Storage on Remote Processors.
Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing, 1998

1997
Shared Memory Programming in Metacomputing Environments: The Global Array Approach.
J. Supercomput., 1997

A massively parallel multireference configuration interaction program: The parallel COLUMBUS program.
J. Comput. Chem., 1997

Optimizing Collective I/O Performance on Parallel Computers: A Multisystem Study.
Proceedings of the 11th international conference on Supercomputing, 1997

1996
Global arrays: A nonuniform memory access programming model for high-performance computers.
J. Supercomput., 1996

Toward high-performance computational chemistry: II. A scalable self-consistent field program.
J. Comput. Chem., 1996

High-performance computing in chemistry: NWChem.
Future Gener. Comput. Syst., 1996

Shared Memory NUMA Programming on I-WAY.
Proceedings of the 5th International Symposium on High Performance Distributed Computing (HPDC '96), 1996

1995
Performance Improvement of Asynchronous Iterations by Non-Uniform Load Distribution.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995

High Performance Computational Chemistry: NWChem and Fully Distributed Parallel Applications.
Proceedings of the Applied Parallel Computing, 1995

1994
Global arrays: a portable "shared-memory" programming model for distributed memory computers.
Proceedings of Supercomputing '94, 1994

