Aparna Chandramowlishwaran

Orcid: 0000-0003-0840-4192

According to our database, Aparna Chandramowlishwaran authored at least 43 papers between 2008 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
BubbleML: A Multi-Physics Dataset and Benchmarks for Machine Learning.
CoRR, 2023

Breaking Boundaries: Distributed Domain Decomposition with Scalable Physics-Informed Neural PDE Solvers.
Proceedings of the International Conference for High Performance Computing, 2023

BubbleML: A Multiphase Multiphysics Dataset and Benchmarks for Machine Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ADARNet: Deep Learning Predicts Adaptive Mesh Refinement.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

2022
NUNet: Deep Learning for Non-Uniform Super-Resolution of Turbulent Flows.
CoRR, 2022

Lessons Learned on MPI+Threads Communication.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

2021
Logically Parallel Communication for Fast MPI+Threads Applications.
IEEE Trans. Parallel Distributed Syst., 2021

adPerf: Characterizing the Performance of Third-party Ads.
Proc. ACM Meas. Anal. Comput. Syst., 2021

Optimizing the hypre solver for manycore and GPU architectures.
J. Comput. Sci., 2021

Demystifying asynchronous I/O Interference in HPC applications.
Int. J. High Perform. Comput. Appl., 2021

Train Once and Use Forever: Solving Boundary Value Problems in Unseen Domains with Pre-trained Deep Learning Models.
CoRR, 2021

SURFNet: Super-Resolution of Turbulent Flows with Transfer Learning using Small Datasets.
Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2020
Only Relative Speed Matters: Virtual Causal Profiling.
SIGMETRICS Perform. Evaluation Rev., 2020

Brief Announcement: On the Limits of Parallelizing Convolutional Neural Networks on GPUs.
CoRR, 2020

On the Limits of Parallelizing Convolutional Neural Networks on GPUs.
Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020

Pencil: a pipelined algorithm for distributed stencils.
Proceedings of the International Conference for High Performance Computing, 2020

How I learned to stop worrying about user-visible endpoints and love MPI.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

CFDNet: a deep learning-based accelerator for fluid simulations.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

2019
What-If Analysis of Page Load Time in Web Browsers Using Causal Profiling.
Proc. ACM Meas. Anal. Comput. Syst., 2019

Portal: A High-Performance Language and Compiler for Parallel N-Body Problems.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Multi-criteria partitioning of multi-block structured grids.
Proceedings of the ACM International Conference on Supercomputing, 2019

Breaking Band: A Breakdown of High-performance Communication.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Towards Portable Online Prediction of Network Utilization Using MPI-Level Monitoring.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019

2018
Roofline Guided Design and Analysis of a Multi-stencil CFD Solver for Multicore Performance.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Scalable Communication Endpoints for MPI+Threads Applications.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

Sugar: Secure GPU Acceleration in Web Browsers.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017
PASCAL: A Parallel Algorithmic SCALable Framework for N-body Problems.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

cudaCR: An In-Kernel Application-Level Checkpoint/Restart Scheme for CUDA-Enabled GPUs.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016
Parallel Performance-Energy Predictive Modeling of Browsers: Case Study of Servo.
Proceedings of the 23rd IEEE International Conference on High Performance Computing, 2016

2014
The fast multipole method at exascale.
PhD thesis, 2014

A CPU-GPU Hybrid Implementation and Model-Driven Scheduling of the Fast Multipole Method.
Proceedings of the Seventh Workshop on General Purpose Processing Using GPUs, 2014

2012
A massively parallel adaptive fast multipole method on heterogeneous architectures.
Commun. ACM, 2012

Brief announcement: towards a communication optimal fast multipole method and its implications at exascale.
Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, 2012

Courses in High-performance Computing for Scientists and Engineers.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Communication-Optimal Parallel N-body Solvers.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

2011
Balance Principles for Algorithm-Architecture Co-Design.
Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism, 2011

2010
Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures.
Proceedings of the Conference on High Performance Computing Networking, 2010

Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method.
Proceedings of the Conference on High Performance Computing Networking, 2010

Applying the concurrent collections programming model to asynchronous parallel dense linear algebra.
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010

Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Performance evaluation of concurrent collections on high-performance multicore computing systems.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009
Declarative aspects of memory management in the concurrent collections parallel programming model.
Proceedings of the POPL 2009 Workshop on Declarative Aspects of Multicore Programming, 2009

2008
On the Design of Fast Pseudo-Random Number Generators for the Cell Broadband Engine and an Application to Risk Analysis.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

