We stand with Ukraine

Shivaram Kalyanakrishnan

Orcid: 0009-0006-7707-6056

According to our database¹, Shivaram Kalyanakrishnan authored at least 48 papers between 2006 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

On-line Learning in Tree MDPs by Treating Policies as Bandit Arms.

[DOI]

,

Ramsundar Anandanarayanan

,

Sharayu Moharir

,

Shivaram Kalyanakrishnan

CoRR, May, 2026

Using Common Random Numbers for Simulation-based Planning with Rollouts.

[DOI]

,

Frederic J. Maliakkal

,

Harshad Khadilkar

,

Shivaram Kalyanakrishnan

CoRR, May, 2026

Upper Bounds for All and Max-Gain Policy Iteration Algorithms on Deterministic MDPs.

[DOI]

,

,

,

Shivaram Kalyanakrishnan

Math. Oper. Res., 2026

2025

Hybrids of Reinforcement Learning and Evolutionary Computation in Finance: A Survey.

[DOI]

,

,

Shivaram Kalyanakrishnan

ACM Comput. Surv., October, 2025

Howard's Policy Iteration is Subexponential for Deterministic Markov Decision Problems with Rewards of Fixed Bit-size and Arbitrary Discount Factor.

[DOI]

Dibyangshu Mukherjee

,

Shivaram Kalyanakrishnan

CoRR, May, 2025

A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model.

[DOI]

Shivaram Kalyanakrishnan

,

,

Santhosh Kumar Guguloth

CoRR, January, 2025

A View of the Certainty-Equivalence Method for PAC RL as an Application of the Trajectory Tree Method.

[DOI]

Shivaram Kalyanakrishnan

,

,

Santhosh Kumar Guguloth

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Efficient Computation of Blackwell Optimal Policies Using Rational Functions.

[DOI]

Dibyangshu Mukherjee

,

Shivaram Kalyanakrishnan

Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

On the Efficiency of Algebraic Simplex Algorithms for Solving MDPs.

[DOI]

Dibyangshu Mukherjee

,

Shivaram Kalyanakrishnan

Proceedings of the Integration of Constraint Programming, Artificial Intelligence, and Operations Research, 2025

2024

Optimal Stopping Rules for Best Arm Identification in Stochastic Bandits under Uniform Sampling.

[DOI]

,

,

Shivaram Kalyanakrishnan

,

Nikhil Karamchandani

Proceedings of the IEEE International Symposium on Information Theory, 2024

Linear-Time Optimal Deadlock Detection for Efficient Scheduling in Multi-Track Railway Networks.

[DOI]

,

,

,

Harshad Khadilkar

,

Shivaram Kalyanakrishnan

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

2022

Some Upper Bounds on the Running Time of Policy Iteration on Deterministic MDPs.

[DOI]

,

,

,

Pratyush Agarwal

,

Mulinti Shaik Wajid

,

Shivaram Kalyanakrishnan

CoRR, 2022

Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence.

[DOI]

,

,

Erik Brynjolfsson

,

,

,

,

Julia Hirschberg

,

Shivaram Kalyanakrishnan

,

,

,

Kevin Leyton-Brown

,

David C. Parkes

,

William H. Press

,

AnnaLee Saxenian

,

,

,

CoRR, 2022

PAC Mode Estimation using PPR Martingale Confidence Sequences.

[DOI]

Shubham Anand Jain

,

,

,

,

Inderjeet J. Nair

,

,

,

,

Vinay J. Ribeiro

,

Shivaram Kalyanakrishnan

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

An Analysis of Frame-skipping in Reinforcement Learning.

[DOI]

Shivaram Kalyanakrishnan

,

Siddharth Aravindan

,

Vishwajeet Bagdawat

,

,

,

,

Kalpesh Krishna

,

CoRR, 2021

The Second NeurIPS Tournament of Reconnaissance Blind Chess.

[DOI]

,

Ryan W. Gardner

,

,

Mohammad Taufeeque

,

,

Shivaram Kalyanakrishnan

,

,

,

,

Brady P. Garrison

,

Prithviraj Dasgupta

,

,

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

The Machine Reconnaissance Blind Chess Tournament of NeurIPS 2022.

[DOI]

Ryan W. Gardner

,

,

,

Shivaram Kalyanakrishnan

,

,

,

,

Johannes Fürnkranz

,

,

Brady P. Garrison

,

Prithviraj Dasgupta

,

Proceedings of the NeurIPS 2022 Competition Track, 2021

Intelligent and Learning Agents: Four Investigations.

[DOI]

Shivaram Kalyanakrishnan

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020

Lower Bounds for Policy Iteration on Multi-action MDPs.

[DOI]

,

,

,

Parthasarathi Khirwadkar

,

,

Shivaram Kalyanakrishnan

Proceedings of the 59th IEEE Conference on Decision and Control, 2020

Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory.

[DOI]

Arghya Roy Chaudhuri

,

Shivaram Kalyanakrishnan

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

A Tighter Analysis of Randomised Policy Iteration.

[DOI]

,

Shivaram Kalyanakrishnan

Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits.

[DOI]

Arghya Roy Chaudhuri

,

Shivaram Kalyanakrishnan

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Quantile-Regret Minimisation in Infinitely Many-Armed Bandits.

[DOI]

Arghya Roy Chaudhuri

,

Shivaram Kalyanakrishnan

Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Opportunities and Challenges for Artificial Intelligence in India.

[DOI]

Shivaram Kalyanakrishnan

,

Rahul Alex Panicker

,

Sarayu Natarajan

,

Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 2018

2017

RLWS: A Reinforcement Learning based GPU Warp Scheduler.

[DOI]

Jayvant Anantpur

,

Nagendra Dwarakanath Gulur

,

Shivaram Kalyanakrishnan

,

Shalabh Bhatnagar

,

R. Govindarajan

CoRR, 2017

Improved Strong Worst-case Upper Bounds for MDP Planning.

[DOI]

,

Shivaram Kalyanakrishnan

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

PAC Identification of a Bandit Arm Relative to a Reward Quantile.

[DOI]

Arghya Roy Chaudhuri

,

Shivaram Kalyanakrishnan

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

AI's 10 to Watch.

[DOI]

,

Elias Bareinboim

,

,

,

Shivaram Kalyanakrishnan

,

,

,

Gerardo I. Simari

,

,

IEEE Intell. Syst., 2016

Batch-Switching Policy Iteration.

[DOI]

Shivaram Kalyanakrishnan

,

,

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Randomised Procedures for Initialising and Switching Actions in Policy Iteration.

[DOI]

Shivaram Kalyanakrishnan

,

Neeldhara Misra

,

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2014

Direction-changing fall control of humanoid robots: theory and experiments.

[DOI]

Ambarish Goswami

,

,

Umashankar Nagarajan

,

,

,

Shivaram Kalyanakrishnan

Auton. Robots, 2014

GEV-Canonical Regression for Accurate Binary Class Probability Estimation when One Class is Rare.

[DOI]

,

Harikrishna Narasimhan

,

Shivaram Kalyanakrishnan

,

Shivani Agarwal

Proceedings of the 31st International Conference on Machine Learning, 2014

On Building Decision Trees from Large-scale Data in Applications of On-line Advertising.

[DOI]

Shivaram Kalyanakrishnan

,

,

Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

2013

Information Complexity in Bandit Subset Selection.

[DOI]

Emilie Kaufmann

,

Shivaram Kalyanakrishnan

Proceedings of the COLT 2013, 2013

2012

PAC Subset Selection in Stochastic Multi-armed Bandits.

[DOI]

Shivaram Kalyanakrishnan

,

,

,

Proceedings of the 29th International Conference on Machine Learning, 2012

UT Austin Villa 2011: a champion agent in the RoboCup 3D soccer simulation competition.

[DOI]

Patrick MacAlpine

,

,

,

Shivaram Kalyanakrishnan

,

Francisco Barrera

,

Adrian Lopez-Mobilia

,

Nicolae Stiurca

,

,

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

2011

Characterizing reinforcement learning methods through parameterized learning problems.

[DOI]

Shivaram Kalyanakrishnan

,

Mach. Learn., 2011

Learning to Predict Humanoid Fall.

[DOI]

Shivaram Kalyanakrishnan

,

Ambarish Goswami

Int. J. Humanoid Robotics, 2011

On optimizing interdependent skills: a case study in simulated 3D humanoid robot soccer.

[DOI]

,

Patrick MacAlpine

,

Shivaram Kalyanakrishnan

,

,

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

On learning with imperfect representations.

[DOI]

Shivaram Kalyanakrishnan

,

Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010

Efficient Selection of Multiple Bandit Arms: Theory and Practice.

[DOI]

Shivaram Kalyanakrishnan

,

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Predicting Falls of a Humanoid Robot through Machine Learning.

[DOI]

Shivaram Kalyanakrishnan

,

Ambarish Goswami

Proceedings of the Twenty-Second Conference on Innovative Applications of Artificial Intelligence, 2010

2009

Learning Complementary Multiagent Behaviors: A Case Study.

[DOI]

Shivaram Kalyanakrishnan

,

Proceedings of the RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29, 2009

Three Humanoid Soccer Platforms: Comparison and Synthesis.

[DOI]

Shivaram Kalyanakrishnan

,

,

Michael J. Quinlan

,

,

Proceedings of the RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29, 2009

An empirical analysis of value function-based and policy search reinforcement learning.

[DOI]

Shivaram Kalyanakrishnan

,

Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

2007

Model-Based Reinforcement Learning in a Complex Domain.

[DOI]

Shivaram Kalyanakrishnan

,

,

Proceedings of the RoboCup 2007: Robot Soccer World Cup XI, 2007

Batch reinforcement learning in a complex domain.

[DOI]

Shivaram Kalyanakrishnan

,

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

2006

Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study.

[DOI]

Shivaram Kalyanakrishnan

,

,

Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006
