Jiayuan Meng

According to our database1, Jiayuan Meng
  • authored at least 36 papers between 2006 and 2016.
  • has a "Dijkstra number"2 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2016
Workflow performance improvement using model-based scheduling over multiple clusters and clouds.
Future Generation Comp. Syst., 2016

Abnormal EEG Complexity and Functional Connectivity of Brain in Patients with Acute Thalamic Ischemic Stroke.
Comp. Math. Methods in Medicine, 2016

Compiler-Assisted Overlapping of Communication and Computation in MPI Applications.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

2015
Modeling Cooperative Threads to Project GPU Performance for Adaptive Parallelism.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

AsHES Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

2014
Special Issue on Applications for the Heterogeneous Computing Era.
IJHPCA, 2014

Analytically Modeling Application Execution for Software-Hardware Co-design.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Dymaxion++: A Directive-Based API to Optimize Data Layout and Memory Mapping for Heterogeneous Systems.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Improving Multisite Workflow Performance Using Model-Based Scheduling.
Proceedings of the 43rd International Conference on Parallel Processing, 2014

SKOPE: a framework for modeling and exploring workload behavior.
Proceedings of the Computing Frontiers Conference, CF'14, 2014

2013
SESH Framework: A Space Exploration Framework for GPU Application and Hardware Codesign.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

Characterization and Understanding Machine-Specific Interconnects.
Proceedings of the Parallel Computing Technologies - 12th International Conference, 2013

Early Experience on the Blue Gene/Q Supercomputing System.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

ASHES Introduction.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Improving GPU Performance Prediction with Data Transfer Modeling.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Online Performance Projection for Clusters with Heterogeneous GPUs.
Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems, 2013

A multiple SIMD, multiple data (MSMD) architecture: Parallel execution of dynamic and static SIMD fragments.
Proceedings of the 19th IEEE International Symposium on High Performance Computer Architecture, 2013

Model-driven multisite workflow scheduling.
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

2012
Applications for the Heterogeneous Computing Era.
IJHPCA, 2012

Dataflow-driven GPU performance projection for multi-kernel transformations.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Robust SIMD: Dynamically Adapted SIMD Width and Multi-Threading Depth.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

ALCF MPI Benchmarks: Understanding Machine-Specific Communication Behavior.
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

2011
A Performance Study for Iterative Stencil Loops on GPUs with Ghost Zone Optimizations.
International Journal of Parallel Programming, 2011

GROPHECY: GPU performance projection from CPU code skeletons.
Proceedings of the Conference on High Performance Computing Networking, 2011

A reconfigurable simulator for large-scale heterogeneous multicore architectures.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2011

2010
Dynamic warp subdivision for integrated branch and memory divergence tolerance.
Proceedings of the 37th International Symposium on Computer Architecture (ISCA 2010), 2010

Exploiting inter-thread temporal locality for chip multithreading.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Exploiting the forgiving nature of applications for scalable parallel execution.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Best-effort semantic document search on GPUs.
Proceedings of 3rd Workshop on General Purpose Processing on Graphics Processing Units, 2010

2009
Increasing memory miss tolerance for SIMD cores.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

Best-effort parallel execution framework for Recognition and mining applications.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Rodinia: A benchmark suite for heterogeneous computing.
Proceedings of the 2009 IEEE International Symposium on Workload Characterization, 2009

Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs.
Proceedings of the 23rd international conference on Supercomputing, 2009

Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling.
Proceedings of the 27th International Conference on Computer Design, 2009

2008
A performance study of general-purpose applications on graphics processors using CUDA.
J. Parallel Distrib. Comput., 2008

2006
High-resolution image viewing on projection-based tiled display wall.
Proceedings of the Color Imaging XI: Processing, Hardcopy, and Applications, San Jose, 2006


  Loading...