Muthu Manikandan Baskaran

According to our database1, Muthu Manikandan Baskaran authored at least 44 papers between 2007 and 2021.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2021
An All-at-Once CP Decomposition Method for Count Tensors.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

Filtered Tensor Construction and Decomposition for Drug Repositioning.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

2020
Large-scale Sparse Tensor Decomposition Using a Damped Gauss-Newton Method.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

Multiscale Data Analysis Using Binning, Tensor Decompositions, and Backtracking.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

Automatic Mapping and Optimization to Kokkos with Polyhedral Compilation.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

2019
Enhancing Network Visibility and Security through Tensor Analysis.
Future Gener. Comput. Syst., 2019

Automatic Parallelization to Asynchronous Task-Based Runtimes Through a Generic Runtime Layer.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

Combining Tensor Decompositions and Graph Analytics to Provide Cyber Situational Awareness at HPC Scale.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

Fast and Scalable Distributed Tensor Decompositions.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

POSTER: Automatic Parallelization Targeting Asynchronous Task-Based Runtimes.
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018
Analysis of Explicit vs. Implicit Tasking in OpenMP Using Kripke.
Proceedings of the 4th International Workshop on Extreme Scale Programming Models and Middleware, 2018

Computationally Efficient CP Tensor Decomposition Update Framework for Emerging Component Discovery in Streaming Data.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

All-at-once Decomposition of Coupled Billion-scale Tensors in Apache Spark.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

2017
Polyhedral Optimization of TensorFlow Computation Graphs.
Proceedings of the Programming and Performance Visualization Tools, 2017

A quantitative and qualitative analysis of tensor decompositions on spatiotemporal data.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

Memory-efficient parallel tensor decompositions.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

2016
Efficient Compilation to Event-Driven Task Programs.
CoRR, 2016

Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime.
Proceedings of the 5th Workshop on Extreme-Scale Programming Tools, 2016

Scalable Hierarchical Polyhedral Compilation.
Proceedings of the 45th International Conference on Parallel Processing, 2016

Polyhedral compilation for energy efficiency.
Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

Accelerated low-rank updates to tensor decompositions.
Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

2015
Polyhedral user mapping and assistant visualizer tool for the r-stream auto-parallelizing compiler.
Proceedings of the 3rd IEEE Working Conference on Software Visualization, 2015

Automatic cluster parallelization and minimizing communication via selective data replication.
Proceedings of the 2015 IEEE High Performance Extreme Computing Conference, 2015

Optimization of symmetric tensor computations.
Proceedings of the 2015 IEEE High Performance Extreme Computing Conference, 2015

2014
A Tale of Three Runtimes.
CoRR, 2014

Parallelizing and optimizing sparse tensor computations.
Proceedings of the 2014 International Conference on Supercomputing, 2014

Low-overhead load-balanced scheduling for sparse tensor computations.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2014

2013
Re-Introduction of communication-avoiding FMM-accelerated FFTs with GPU acceleration.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2013

Memory reuse optimizations in the R-Stream compiler.
Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units, 2013

2012
Automatic communication optimizations through memory reuse strategies.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

Efficient and scalable computations with sparse tensors.
Proceedings of the IEEE Conference on High Performance Extreme Computing, 2012

2011
R-Stream Compiler.
Proceedings of the Encyclopedia of Parallel Computing, 2011

2010
Optimal loop unrolling for GPGPU programs.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

DynTile: Parametric tiled loop generation for parallel execution on multicore processors.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Parameterized tiling revisited.
Proceedings of the CGO 2010, 2010

Automatic C-to-CUDA Code Generation for Affine Programs.
Proceedings of the Compiler Construction, 19th International Conference, 2010

A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction.
Proceedings of 3rd Workshop on General Purpose Processing on Graphics Processing Units, 2010

2009
Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors.
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

Parametric multi-level tiling of imperfectly nested loops.
Proceedings of the 23rd international conference on Supercomputing, 2009

2008
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories.
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008

Towards effective automatic parallelization for multicore systems.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

A compiler framework for optimization of affine loop nests for gpgpus.
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model.
Proceedings of the Compiler Construction, 17th International Conference, 2008

2007
Effective automatic parallelization of stencil computations.
Proceedings of the ACM SIGPLAN 2007 Conference on Programming Language Design and Implementation, 2007


  Loading...