Kalyan Kumaran

Orcid: 0000-0002-6447-3195

According to our database1, Kalyan Kumaran authored at least 34 papers between 2010 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Implementation of Dataflow Software Pipelining for Codelet Model.
Proceedings of the 2023 ACM/SPEC International Conference on Performance Engineering, 2023

Demonstration of Portable Performance of Scientific Machine Learning on High Performance Computing Systems.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Towards Maximum Throughput of Dataflow Software Pipeline under Resource Constraints.
Proceedings of the 14th International Workshop on Programming Models and Applications for Multicores and Manycores, 2023

Codelet Pipe: Realization of Dataflow Software Pipelining for Extended Codelet Model.
Proceedings of the 52nd International Conference on Parallel Processing Workshops, 2023

2022
The SuperCodelet architecture.
Proceedings of the ExHET@PPoPP 2022: Proceedings of the 1st International Workshop on Extreme Heterogeneity Solutions, 2022

2019
Scalable Reactive Molecular Dynamics Simulations for Computational Synthesis.
Comput. Sci. Eng., 2019

Evaluating Quality of Service Traffic Classes on the Megafly Network.
Proceedings of the High Performance Computing - 34th International Conference, 2019

Sequential Codelet Model of Program Execution. A Super-Codelet model based on the Hierarchical Turing Machine.
Proceedings of the IEEE/ACM Third Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware, 2019

GPCNeT: designing a benchmark suite for inducing and measuring contention in HPC networks.
Proceedings of the International Conference for High Performance Computing, 2019

Position Paper: Extending Codelet Model for Dataflow Software Pipelining using Software-Hardware Co-Design.
Proceedings of the 43rd IEEE Annual Computer Software and Applications Conference, 2019

2018
Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications.
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

Characterization of MPI usage on a production supercomputer.
Proceedings of the International Conference for High Performance Computing, 2018

2017
HACC: extreme scaling and performance across diverse architectures.
Commun. ACM, 2017

Run-to-run variability on Xeon Phi based cray XC systems.
Proceedings of the International Conference for High Performance Computing, 2017

Analytical Performance Modeling and Validation of Intel's Xeon Phi Architecture.
Proceedings of the Computing Frontiers Conference, 2017

2016

Improving Data Transfer Throughput with Direct Search Optimization.
Proceedings of the 45th International Conference on Parallel Processing, 2016

2015
Modeling Cooperative Threads to Project GPU Performance for Adaptive Parallelism.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

2014
SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

Analytically Modeling Application Execution for Software-Hardware Co-design.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

SKOPE: a framework for modeling and exploring workload behavior.
Proceedings of the Computing Frontiers Conference, CF'14, 2014

2013
Argonne applications for the IBM Blue Gene/Q, Mira.
IBM J. Res. Dev., 2013

Characterization and Understanding Machine-Specific Interconnects.
Proceedings of the Parallel Computing Technologies - 12th International Conference, 2013

Early Experience on the Blue Gene/Q Supercomputing System.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Improving GPU Performance Prediction with Data Transfer Modeling.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

2012
Dataflow-driven GPU performance projection for multi-kernel transformations.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

The universe at extreme scale: multi-petaflop sky simulation on the BG/Q.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

SPEC OMP2012 - An Application Benchmark Suite for Parallel Systems Using OpenMP.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

ALCF MPI Benchmarks: Understanding Machine-Specific Communication Behavior.
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

2011
SPEC Benchmarks.
Proceedings of the Encyclopedia of Parallel Computing, 2011

Electronic poster: co-visualization of full data and in situ data extracts from unstructured grid cfd at 160k cores.
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

GROPHECY: GPU performance projection from CPU code skeletons.
Proceedings of the Conference on High Performance Computing Networking, 2011

A new computational paradigm in multiscale simulations: application to brain blood flow.
Proceedings of the Conference on High Performance Computing Networking, 2011

2010
SPEC MPI2007 - an application benchmark suite for parallel systems using MPI.
Concurr. Comput. Pract. Exp., 2010


  Loading...