Shivaram Kalyanakrishnan

According to our database1, Shivaram Kalyanakrishnan authored at least 37 papers between 2006 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
Some Upper Bounds on the Running Time of Policy Iteration on Deterministic MDPs.
CoRR, 2022

Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence.
CoRR, 2022

PAC Mode Estimation using PPR Martingale Confidence Sequences.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
An Analysis of Frame-skipping in Reinforcement Learning.
CoRR, 2021

The Second NeurIPS Tournament of Reconnaissance Blind Chess.
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

The Machine Reconnaissance Blind Chess Tournament of NeurIPS 2022.
Proceedings of the NeurIPS 2022 Competition Track, 2021

Intelligent and Learning Agents: Four Investigations.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020
Lower Bounds for Policy Iteration on Multi-action MDPs.
Proceedings of the 59th IEEE Conference on Decision and Control, 2020

Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Tighter Analysis of Randomised Policy Iteration.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Quantile-Regret Minimisation in Infinitely Many-Armed Bandits.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Opportunities and Challenges for Artificial Intelligence in India.
Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 2018

2017
RLWS: A Reinforcement Learning based GPU Warp Scheduler.
CoRR, 2017

Improved Strong Worst-case Upper Bounds for MDP Planning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

PAC Identification of a Bandit Arm Relative to a Reward Quantile.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
AI's 10 to Watch.
IEEE Intell. Syst., 2016

Batch-Switching Policy Iteration.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Randomised Procedures for Initialising and Switching Actions in Policy Iteration.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2014
Direction-changing fall control of humanoid robots: theory and experiments.
Auton. Robots, 2014

GEV-Canonical Regression for Accurate Binary Class Probability Estimation when One Class is Rare.
Proceedings of the 31th International Conference on Machine Learning, 2014

On Building Decision Trees from Large-scale Data in Applications of On-line Advertising.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

2013
Information Complexity in Bandit Subset Selection.
Proceedings of the COLT 2013, 2013

2012
PAC Subset Selection in Stochastic Multi-armed Bandits.
Proceedings of the 29th International Conference on Machine Learning, 2012

UT Austin Villa 2011: a champion agent in the RoboCup 3D soccer simulation competition.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

2011
Characterizing reinforcement learning methods through parameterized learning problems.
Mach. Learn., 2011

Learning to Predict Humanoid Fall.
Int. J. Humanoid Robotics, 2011

On optimizing interdependent skills: a case study in simulated 3D humanoid robot soccer.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

On learning with imperfect representations.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010
Efficient Selection of Multiple Bandit Arms: Theory and Practice.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Predicting Falls of a Humanoid Robot through Machine Learning.
Proceedings of the Twenty-Second Conference on Innovative Applications of Artificial Intelligence, 2010

2009
Learning Complementary Multiagent Behaviors: A Case Study.
Proceedings of the RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29, 2009

Three Humanoid Soccer Platforms: Comparison and Synthesis.
Proceedings of the RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29, 2009

An empirical analysis of value function-based and policy search reinforcement learning.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

2007
Model-Based Reinforcement Learning in a Complex Domain.
Proceedings of the RoboCup 2007: Robot Soccer World Cup XI, 2007

Batch reinforcement learning in a complex domain.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

2006
Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study.
Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006


  Loading...