Costin Iancu

According to our database, Costin Iancu authored at least 43 papers between 2001 and 2018.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2018
Improving Network Throughput with Global Communication Reordering.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Maximizing Communication Overlap with Dynamic Program Analysis.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2018

2017
Reaching bandwidth saturation using transparent injection parallelization.
IJHPCA, 2017

Report of the HPC Correctness Summit, Jan 25-26, 2017, Washington, DC.
CoRR, 2017

Application Level Reordering of Remote Direct Memory Access Operations.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016
Scaling Spark on Lustre.
Proceedings of the High Performance Computing, 2016

OPR: deterministic group replay for one-sided communication.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Floating-point precision tuning using blame analysis.
Proceedings of the 38th International Conference on Software Engineering, 2016

SReplay: Deterministic Sub-Group Replay for One-Sided Communication.
Proceedings of the 2016 International Conference on Supercomputing, 2016

Scaling Spark on HPC Systems.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

Time-Sharing Redux for Large-Scale HPC Systems.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

Exploiting variability for energy optimization of parallel programs.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

2015
Exploiting communication concurrency on high performance computing systems.
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

Barrier elision for production parallel programs.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

2014
The Case for Partitioning Virtual Machines on Multicore Architectures.
IEEE Trans. Parallel Distrib. Syst., 2014

An Evaluation of One-Sided and Two-Sided Communication Paradigms on Relaxed-Ordering Interconnect.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

2013
Juggle: addressing extrinsic load imbalances in SPMD applications on multicore computers.
Cluster Computing, 2013

Precimonious: tuning assistant for floating-point precision.
Proceedings of the International Conference for High Performance Computing, 2013

Scalable data race detection for partitioned global address space programs.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2013

Scaling data race detection for partitioned global address space programs.
Proceedings of the International Conference on Supercomputing, 2013

2012
Congestion avoidance on manycore high performance computing systems.
Proceedings of the International Conference on Supercomputing, 2012

2011
Efficient data race detection for distributed memory parallel programs.
Proceedings of the Conference on High Performance Computing Networking, 2011

Optimized pre-copy live migration for memory intensive applications.
Proceedings of the Conference on High Performance Computing Networking, 2011

Juggle: proactive load balancing on multicore computers.
Proceedings of the 20th ACM International Symposium on High Performance Distributed Computing, 2011

Characterizing the Performance of Parallel Applications on Multi-socket Virtual Machines.
Proceedings of the 11th IEEE/ACM International Symposium on Cluster, 2011

2010
Load balancing on speed.
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010

Hybrid PGAS runtime support for multicore nodes.
Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, 2010

Oversubscription on multicore processors.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009
Scheduling dynamic parallelism on accelerators.
Proceedings of the 6th Conference on Computing Frontiers, 2009

2008
Performance portable optimizations for loops containing communication operations.
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Runtime optimization of vector operations on large scale SMP clusters.
Proceedings of the 17th International Conference on Parallel Architecture and Compilation Techniques, 2008

2007
Optimizing communication overlap for high-speed networks.
Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2007

Productivity and performance using partitioned global address space languages.
Proceedings of the Parallel Symbolic Computation, 2007

Scientific Application Performance on Candidate PetaScale Platforms.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Automatic nonblocking communication for partitioned global address space programs.
Proceedings of the 21st Annual International Conference on Supercomputing, 2007

Performance Portable Optimizations for Loops Containing Communication Operations.
Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques (PACT 2007), 2007

2005
HUNTing the Overlap.
Proceedings of the 14th International Conference on Parallel Architecture and Compilation Techniques (PACT 2005), 2005

Communication Optimizations for Fine-Grained UPC Applications.
Proceedings of the 14th International Conference on Parallel Architecture and Compilation Techniques (PACT 2005), 2005

2004
Message Strip-Mining Heuristics for High Speed Networks.
Proceedings of the High Performance Computing for Computational Science, 2004

2003
An Evaluation of Current High-Performance Networks.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

A performance analysis of the Berkeley UPC compiler.
Proceedings of the 17th Annual International Conference on Supercomputing, 2003

2001
An evaluation of search tree techniques in the presence of caches.
Proceedings of the 2001 IEEE International Symposium on Performance Analysis of Systems and Software, 2001

A Comparison of Feedback Based and Fair Queuing Mechanisms for Handling Unresponsive Traffic.
Proceedings of the Sixth IEEE Symposium on Computers and Communications (ISCC 2001), 2001
