Sasikanth Avancha

According to our database, Sasikanth Avancha authored at least 42 papers between 2001 and 2022.

Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning and HPC Workloads.
Frontiers Appl. Math. Stat., 2022

DistGNN-MB: Distributed Large-Scale Graph Neural Network Training on x86 via Minibatch Sampling.
CoRR, 2022

PolyDL: Polyhedral Optimizations for Creation of High-performance DL Primitives.
ACM Trans. Archit. Code Optim., 2021

Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights.
Proc. IEEE, 2021

Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning Workloads.
CoRR, 2021

AI Powered Compiler Techniques for DL Code Optimization.
CoRR, 2021

DistGNN: scalable distributed training for large-scale graph neural networks.
Proceedings of the International Conference for High Performance Computing, 2021

Tensor processing primitives: a programming abstraction for efficiency and portability in deep learning workloads.
Proceedings of the International Conference for High Performance Computing, 2021

GNNerator: A Hardware/Software Framework for Accelerating Graph Neural Networks.
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

SEERL: Sample Efficient Ensemble Reinforcement Learning.
Proceedings of AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Deep Graph Library Optimizations for Intel(R) x86 Architecture.
CoRR, 2020

PolyScientist: Automatic Loop Transformations Combined with Microkernels for Optimization of Deep Learning Primitives.
CoRR, 2020

Harnessing Deep Learning via a Single Building Block.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

dMazeRunner: Optimizing Convolutions on Dataflow Accelerators.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Intel Nervana Neural Network Processor-T (NNP-T) Fused Floating Point Many-Term Dot Product.
Proceedings of the 27th IEEE Symposium on Computer Arithmetic, 2020

ERLP: Ensembles of Reinforcement Learning Policies (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

dMazeRunner: Executing Perfectly Nested Loops on Dataflow Accelerators.
ACM Trans. Embed. Comput. Syst., 2019

High Performance Scalable FPGA Accelerator for Deep Neural Networks.
CoRR, 2019

High-Performance Deep Learning via a Single Building Block.
CoRR, 2019

A Study of BFLOAT16 for Deep Learning Training.
CoRR, 2019

Hierarchical Block Sparse Neural Networks.
CoRR, 2018

On Scale-out Deep Learning Training for Cloud and HPC.
CoRR, 2018

Anatomy of high-performance deep learning convolutions on SIMD architectures.
Proceedings of the International Conference for High Performance Computing, 2018

Mixed Precision Training of Convolutional Neural Networks using Integer Operations.
Proceedings of the 6th International Conference on Learning Representations, 2018

RAIL: Risk-Averse Imitation Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

ScaleDeep: A Scalable Compute Architecture for Learning and Evaluating Deep Networks.
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

Distributed Deep Learning Using Synchronous Stochastic Gradient Descent.
CoRR, 2016

Privacy in mobile technology for personal healthcare.
ACM Comput. Surv., 2012

Exascale Computing & Beyond: Meeting the Challenges.
Proceedings of the Transition of HPC Towards Exascale Computing, 2012

A Framework for Trustworthy Service-Oriented Computing (Short Paper).
Proceedings of the Information Systems Security, 4th International Conference, 2008

Using Ontologies in the Semantic Web: A Survey.
Ontologies: A Handbook of Principles, 2007

Data and Services for Mobile Computing.
The Practical Handbook of Internet Computing, 2004

Ontology-Driven Adaptive Sensor Networks.
Proceedings of the 1st Annual International Conference on Mobile and Ubiquitous Systems (MobiQuitous 2004), 2004

P2P M-commerce in pervasive environments.
SIGecom Exch., 2003

Secure sensor networks for perimeter protection.
Comput. Networks, 2003

Centaurus: An Infrastructure for Service Management in Ubiquitous Computing Environments.
Wirel. Networks, 2002

Intelligent Agents for Mobile and Embedded Devices.
Int. J. Cooperative Inf. Syst., 2002

Enhanced Service Discovery in Bluetooth.
Computer, 2002

On experiments with a transport protocol for pervasive computing environments.
Comput. Networks, 2002

Simulation of a Common Access Point for Bluetooth, 802.11 and Wired LANs.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002

Profile Driven Data Management for Pervasive Environments.
Proceedings of the Database and Expert Systems Applications, 13th International Conference, 2002

Transport protocols in wireless networks.
Proceedings of the 10th International Conference on Computer Communications and Networks, 2001