# Kunal Agrawal

According to our database

^{1}, Kunal Agrawal## Timeline

#### Legend:

Book In proceedings Article PhD thesis Other## Links

#### On csauthors.net:

## Bibliography

2018

Blocking Analysis for Spin Locks in Real-Time Parallel Tasks.

IEEE Trans. Parallel Distrib. Syst., 2018

Analysis of classic algorithms on highly-threaded many-core architectures.

Future Generation Comp. Syst., 2018

Race Detection and Reachability in Nearly Series-Parallel DAGs.

Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, 2018

Efficient parallel determinacy race detection for two-dimensional dags.

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018

Scheduling Parallelizable Jobs Online to Maximize Throughput.

Proceedings of the LATIN 2018: Theoretical Informatics, 2018

2017

Mixed-criticality federated scheduling for parallel real-time tasks.

Real-Time Systems, 2017

Reservation-Based Federated Scheduling for Parallel Real-Time Tasks.

CoRR, 2017

Brief Announcement: Scheduling Parallelizable Jobs Online to Maximize Throughput.

Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and Architectures, 2017

Processor-Oblivious Record and Replay.

Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs.

Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Locality-Aware Dynamic Task Graph Scheduling.

Proceedings of the 46th International Conference on Parallel Processing, 2017

2016

Provably Good and Practically Efficient Parallel Race Detection for Fork-Join Programs.

Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures, 2016

Scheduling Parallelizable Jobs Online to Minimize the Maximum Flow Time.

Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures, 2016

Scheduling Parallel DAG Jobs Online to Minimize Average Flow Time.

Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, 2016

Randomized Work Stealing for Large Scale Soft Real-Time Systems.

Proceedings of the 2016 IEEE Real-Time Systems Symposium, 2016

Mixed-Criticality Federated Scheduling for Parallel Real-Time Tasks.

Proceedings of the 2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS), 2016

Work stealing for interactive services to meet target latency.

Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

2015

Global EDF scheduling for parallel real-time tasks.

Real-Time Systems, 2015

Efficient execution of recursive programs on commodity vector hardware.

Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015

Elastic Tasks: Unifying Task Parallelism and SPMD Parallelism with an Adaptive Runtime.

Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014

Parallel Real-Time Scheduling of DAGs.

IEEE Trans. Parallel Distrib. Syst., 2014

A memory access model for highly-threaded many-core architectures.

Future Generation Comp. Syst., 2014

Parallel Support Vector Machines in Practice.

CoRR, 2014

Provably good scheduling for parallel programs that use data structures through implicit batching.

Proceedings of the 26th ACM Symposium on Parallelism in Algorithms and Architectures, 2014

Brief announcement: cache-oblivious scheduling of streaming pipelines.

Proceedings of the 26th ACM Symposium on Parallelism in Algorithms and Architectures, 2014

Fault-Tolerant Dynamic Task Graph Scheduling.

Proceedings of the International Conference for High Performance Computing, 2014

Federated scheduling for stochastic parallel real-time tasks.

Proceedings of the 2014 IEEE 20th International Conference on Embedded and Real-Time Computing Systems and Applications, 2014

Theoretical analysis of classic algorithms on highly-threaded many-core GPUs.

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Provably good scheduling for parallel programs that use data structures through implicit batching.

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Orchestrating safe streaming computations with precise control.

Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Stochastic Neighbor Compression.

Proceedings of the 31th International Conference on Machine Learning, 2014

Analysis of classic algorithms on GPUs.

Proceedings of the International Conference on High Performance Computing & Simulation, 2014

Cache-conscious scheduling of streaming pipelines on parallel machines with private caches.

Proceedings of the 21st International Conference on High Performance Computing, 2014

Real-time system support for hybrid structural simulation.

Proceedings of the 2014 International Conference on Embedded Software, 2014

Analysis of Federated and Global Scheduling for Parallel Real-Time Tasks.

Proceedings of the 26th Euromicro Conference on Real-Time Systems, 2014

Performance modeling for highly-threaded many-core GPUs.

Proceedings of the IEEE 25th International Conference on Application-Specific Systems, 2014

2013

Multi-core real-time scheduling for generalized parallel task models.

Real-Time Systems, 2013

A real-time scheduling service for parallel tasks.

Proceedings of the 19th IEEE Real-Time and Embedded Technology and Applications Symposium, 2013

Adding data parallelism to streaming pipelines for throughput optimization.

Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

Outstanding Paper Award: Analysis of Global EDF for Parallel Tasks.

Proceedings of the 25th Euromicro Conference on Real-Time Systems, 2013

2012

Mapping Filtering Streaming Applications.

Algorithmica, 2012

Cache-conscious scheduling of streaming applications.

Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, 2012

Efficient deadlock avoidance for streaming computation with filtering.

Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

A Memory Access Model for Highly-threaded Many-core Architectures.

Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012

2011

Parallel boosted regression trees for web search ranking.

Proceedings of the 20th International Conference on World Wide Web, 2011

Multi-core Real-Time Scheduling for Generalized Parallel Task Models.

Proceedings of the 32nd IEEE Real-Time Systems Symposium, 2011

2010

Deadlock avoidance for streaming computations with filtering.

Proceedings of the SPAA 2010: Proceedings of the 22nd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2010

Brief announcement: serial-parallel reciprocity in dynamic multithreaded languages.

Proceedings of the SPAA 2010: Proceedings of the 22nd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2010

Helper locks for fork-join parallel programming.

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010

Executing task graphs using work-stealing.

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Scheduling algorithms for linear workflow optimization.

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Deadlock-avoidance for streaming applications with split-join structure: Two case studies.

Proceedings of the 21st IEEE International Conference on Application-specific Systems Architectures and Processors, 2010

2009

Scheduling and synchronization for multicore concurrency platforms.

PhD thesis, 2009

The Worst Page-Replacement Policy.

Theory Comput. Syst., 2009

Mapping filtering streaming applications with communication costs.

Proceedings of the SPAA 2009: Proceedings of the 21st Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2009

Safe open-nested transactions through ownership.

Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

2008

Adaptive work-stealing with parallelism feedback.

ACM Trans. Comput. Syst., 2008

Safe open-nested transactions through ownership.

Proceedings of the SPAA 2008: Proceedings of the 20th Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2008

Safer open-nested transactions through ownership.

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008

Nested parallelism in transactional memory.

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008

Mapping Linear Workflows with Computation/Communication Overlap.

Proceedings of the 14th International Conference on Parallel and Distributed Systems, 2008

2007

Adaptive work stealing with parallelism feedback.

Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2007

Adaptive Scheduling with Parallelism Feedback.

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

The Worst Page-Replacement Policy.

Proceedings of the Fun with Algorithms, 4th International Conference, 2007

2006

Adaptive scheduling with parallelism feedback.

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

An Empirical Evaluation ofWork Stealing with Parallelism Feedback.

Proceedings of the 26th IEEE International Conference on Distributed Computing Systems (ICDCS 2006), 2006

Memory models for open-nested transactions.

Proceedings of the 2006 workshop on Memory System Performance and Correctness, 2006