We stand with Ukraine

Charith Mendis

Orcid: 0000-0002-8140-2321

Affiliations:

University of Illinois at Urbana-Champaign, IL, USA

According to our database¹, Charith Mendis authored at least 54 papers between 2015 and 2026.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Links

Online presence:

on orcid.org
on charithmendis.com

Bibliography

2026

VTC: DNN Compilation with Virtual Tensors for Data Movement Elimination.

…, Janardhan Kulkarni, …

CoRR, April, 2026

RuleFlow : Generating Reusable Program Optimizations with LLMs.

…, Dushyant Bharadwaj, Stefanos Baziotis, Kaushik Varadharajan, …

CoRR, February, 2026

MINISA: Minimal Instruction Set Architecture for Next-gen Reconfigurable Inference Accelerator.

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2026

SAS: Sparse Attention Synthesizer for Efficient Language Model Inference.

Proceedings of the 21st European Conference on Computer Systems, 2026

GRANII: Selection and Ordering of Primitives in GRAph Neural Networks using Input Inspection.

Damitha Lenadora, …, Gerasimos Gerogiannis, …, Josep Torrellas, …

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2026

2025

ACT: Automatically Generating Compiler Backends from Tensor Accelerator ISA Descriptions.

…, Kaustubh Khulbe, …

CoRR, October, 2025

A Tensor-Based Compiler and a Runtime for Neuron-Level DNN Certifier Specifications.

…, Yamin Chandini Sarita, …, Gagandeep Singh, …

CoRR, July, 2025

PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees.

…, Stefanos Baziotis, Chengsong Zhang, …

Proc. ACM Manag. Data, June, 2025

PandasBench: A Benchmark for the Pandas API.

…, Stefanos Baziotis, …

CoRR, June, 2025

PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees (Technical Report).

…, Stefanos Baziotis, Chengsong Zhang, …

CoRR, March, 2025

Automated Verification of Soundness of DNN Certifiers.

…, Gagandeep Singh

Proc. ACM Program. Lang., 2025

MISAAL: Synthesis-Based Automatic Generation of Efficient and Retargetable Semantics-Driven Optimizations.

Abdul Rafae Noor, …

Proc. ACM Program. Lang., 2025

GALA: A High Performance Graph Neural Network Acceleration LAnguage and Compiler.

Damitha Lenadora, Nikhil Jayakumar, Chamika Sudusinghe, …

Proc. ACM Program. Lang., 2025

SPLAT: A Framework for Optimised GPU Code-Generation for SParse reguLar ATtention.

Proc. ACM Program. Lang., 2025

TensorRight: Automated Verification of Tensor Graph Rewrites.

…, Farzin Houshmand, Phitchaya Mangpo Phothilimthana, …, Praveen Narayanan, Karthik Srinivasa Murthy, Rastislav Bodík, …

Proc. ACM Program. Lang., 2025

TAIDL: Tensor Accelerator ISA Definition Language with Auto-generation of Scalable Test Oracles.

Proceedings of the 58th IEEE/ACM International Symposium on Microarchitecture, 2025

COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning.

Chamika Sudusinghe, Gerasimos Gerogiannis, Damitha Lenadora, …, Josep Torrellas, …

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

Dias: Dynamic Rewriting of Pandas Code.

Stefanos Baziotis, …

Proc. ACM Manag. Data, February, 2024

Transforming the Hybrid Cloud for Emerging AI Workloads.

CoRR, 2024

ConstraintFlow: A DSL for Specification and Verification of Neural Network Analyses.

…, Gagandeep Singh

CoRR, 2024

TGOnline: Enhancing Temporal Graph Learning with Adaptive Online Meta-Learning.

…, Tarek F. Abdelzaher

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

ConstraintFlow: A Declarative DSL for Easy Development of DNN Certifiers.

…, Gagandeep Singh

Proceedings of the Static Analysis - 31st International Symposium, 2024

COMET: Neural Cost Model Explanation Framework.

…, Gagandeep Singh

Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

Towards Efficient Temporal Graph Learning: Algorithms, Frameworks, and Tools.

…, Tarek F. Abdelzaher

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

TGLite: A Lightweight Programming Framework for Continuous-Time Temporal Graph Neural Networks.

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Hydride: A Retargetable and Extensible Synthesis-based Compiler for Modern Hardware Architectures.

…, Abdul Rafae Noor, …, Stefanos Baziotis, …, Sudipta Sengupta

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Two-Face: Combining Collective and One-Sided Communication for Efficient Distributed SpMM.

…, Gerasimos Gerogiannis, …, Josep Torrellas

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs.

Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, …

CoRR, 2023

FLuRKA: Fast fused Low-Rank & Kernel Attention.

CoRR, 2023

Input-sensitive dense-sparse primitive compositions for GNN acceleration.

Damitha Lenadora, …, Gerasimos Gerogiannis, …, Josep Torrellas, …

CoRR, 2023

CoMEt: x86 Cost Model Explanation Framework.

…, Gagandeep Singh

CoRR, 2023

TGOpt: Redundancy-Aware Optimizations for Temporal Graph Attention Networks.

Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs.

Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, …, Michael Burrows, …

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Large Graph Property Prediction via Graph Segment Training.

…, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, …

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Unified Convolution Framework: A compiler-based approach to support sparse convolutions.

…, Saman P. Amarasinghe

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Challenges in Metaverse Research: An Internet of Things Perspective.

Tarek F. Abdelzaher, …, Klara Nahrstedt, Mani Srivastava, …

Proceedings of the IEEE International Conference on Metaverse Computing, 2023

Generating Memory Allocators from the Ground Up.

Pavlo Pastaryev, …, Lawrence Rauchwerger

Proceedings of the Languages and Compilers for Parallel Computing, 2023

SPADE: A Flexible and Scalable Accelerator for SpMM and SDDMM.

Gerasimos Gerogiannis, …, Damitha Lenadora, …, Josep Torrellas

Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

WACO: Learning Workload-Aware Co-optimization of the Format and Schedule of a Sparse Tensor Program.

…, Saman P. Amarasinghe

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022

All you need is superword-level parallelism: systematic control-flow vectorization with SLP.

…, Saman P. Amarasinghe

Proceedings of the PLDI '22: 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation, San Diego, CA, USA, June 13, 2022

GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation.

…, Phitchaya Mangpo Phothilimthana, …, Amir Yazdanbakhsh

Proceedings of the IEEE International Symposium on Workload Characterization, 2022

2021

A Learned Performance Model for Tensor Processing Units.

Samuel J. Kaufman, Phitchaya Mangpo Phothilimthana, …

Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

VeGen: a vectorizer generator for SIMD and beyond.

…, Saman P. Amarasinghe

Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

2020

DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates.

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

2019

Compiler Auto-Vectorization with Imitation Learning.

…, Saman P. Amarasinghe, …

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

BHive: A Benchmark Suite and Measurement Framework for Validating x86-64 Basic Block Performance Models.

…, Ajay Brahmakshatriya, …, Saman P. Amarasinghe, …

Proceedings of the IEEE International Symposium on Workload Characterization, 2019

Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks.

…, Saman P. Amarasinghe, …

Proceedings of the 36th International Conference on Machine Learning, 2019

Revec: program rejuvenation through revectorization.

…, Saman P. Amarasinghe

Proceedings of the 28th International Conference on Compiler Construction, 2019

2018

goSLP: globally optimized superword level parallelism framework.

…, Saman P. Amarasinghe

Proc. ACM Program. Lang., 2018

Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks.

…, Saman P. Amarasinghe, …

CoRR, 2018

2017

Making caches work for graph analytics.

…, Vladimir Kiriansky, …, Saman P. Amarasinghe, …

Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

2016

Optimizing Cache Performance for Graph Analytics.

…, Vladimir Kiriansky, …, Saman P. Amarasinghe

CoRR, 2016

Parallelizing WFST speech decoders.

…, Madanlal Musuvathi, …

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Helium: lifting high-performance stencil kernels from stripped x86 binaries to halide DSL code.

…, Jeffrey Bosboom, …, Jonathan Ragan-Kelley, …, Saman P. Amarasinghe

Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015
