Milind Chabbi

Orcid: 0000-0003-1021-7644

According to our database, Milind Chabbi authored at least 46 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
EasyView: Bringing Performance Profiles into Integrated Development Environments.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2024

Unveiling and Vanquishing Goroutine Leaks in Enterprise Microservices: A Dynamic Analysis Approach.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2024

2023
Precise Event Sampling on AMD Versus Intel: Quantitative and Qualitative Comparison.
IEEE Trans. Parallel Distributed Syst., May, 2023

Precise event sampling-based data locality tools for AMD multicore architectures.
Concurr. Comput. Pract. Exp., 2023

Protecting Locks Against Unbalanced Unlock().
Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, 2023

DJXPerf: Identifying Memory Inefficiencies via Object-Centric Profiling for Java.
Proceedings of the 21st ACM/IEEE International Symposium on Code Generation and Optimization, 2023

2022
ReuseTracker: Fast Yet Accurate Multicore Reuse Distance Analyzer.
ACM Trans. Archit. Code Optim., 2022

CRISP: Critical Path Analysis of Large-Scale Microservice Architectures.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

A study of real-world data races in Golang.
Proceedings of the PLDI '22: 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation, San Diego, CA, USA, June 13, 2022

OJXPERF: Featherlight Object Replica Detection for Java Programs.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering, 2022

2021
Optimistic Concurrency Control for Real-world Go Programs (Extended Version with Appendix).
CoRR, 2021

Optimistic Concurrency Control for Real-world Go Programs.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

Low-Overhead Reuse Distance Profiling Tool for Multicore.
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021

An Experience with Code-Size Optimization for Production iOS Mobile Applications.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021

2020
Efficient Abortable-locking Protocol for Multi-level NUMA Systems: Design and Correctness.
ACM Trans. Parallel Comput., 2020

DrCCTProf: a fine-grained call path profiler for ARM-based clusters.
Proceedings of the International Conference for High Performance Computing, 2020

What every scientific programmer should know about compiler optimizations?
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

2019
Optimization of Swift protocols.
Proc. ACM Program. Lang., 2019

Pinpointing performance inefficiencies in Java.
Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2019

Pinpointing performance inefficiencies via lightweight variance profiling.
Proceedings of the International Conference for High Performance Computing, 2019

ComDetective: a lightweight communication detection tool for threads.
Proceedings of the International Conference for High Performance Computing, 2019

Lightweight hardware transactional memory profiling.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

Language Modeling at Scale.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Redundant loads: a software inefficiency indicator.
Proceedings of the 41st International Conference on Software Engineering, 2019

Featherlight Reuse-Distance Measurement.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

Accelerated Genomics Data Processing using Memory-Driven Computing.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

2018
Lock Contention Management in Multithreaded MPI.
ACM Trans. Parallel Comput., 2018

An Evaluation of Vectorization and Cache Reuse Tradeoffs on Modern CPUs.
Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, 2018

Featherlight on-the-fly false-sharing detection.
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018

Memory-Oriented Distributed Computing at Rack Scale.
Proceedings of the ACM Symposium on Cloud Computing, 2018

Watching for Software Inefficiencies with Witch.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017
Path-Synchronous Performance Monitoring in HPC Interconnection Networks with Source-Code Attribution.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2017

An Efficient Abortable-locking Protocol for Multi-level NUMA Systems.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

REDSPY: Exploring Value Locality in Software.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

2016
MPI-ACC: Accelerator-Aware MPI for Scientific Applications.
IEEE Trans. Parallel Distributed Syst., 2016

Correctness of Hierarchical MCS Locks with Timeout.
CoRR, 2016

Be my guest: MCS lock now welcomes guests.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Contention-conscious, locality-preserving locks.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

2015
Barrier elision for production parallel programs.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

High performance locks for multi-level NUMA systems.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

Runtime Value Numbering: A Profiling Technique to Pinpoint Redundant Computations.
Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014
Call Paths for Pin Tools.
Proceedings of the 12th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2014

2013
Effective sampling-driven performance tools for GPU-accelerated supercomputers.
Proceedings of the International Conference for High Performance Computing, 2013

Integrating Asynchronous Task Parallelism with MPI.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

On the efficacy of GPU-integrated MPI for scientific applications.
Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013

2012
DeadSpy: a tool to pinpoint program inefficiencies.
Proceedings of the 10th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2012
