Peter Stone

According to our database1, Peter Stone authored at least 451 papers between 1990 and 2019.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2018, "For contributions to reinforcement learning, multiagent systems, and robotics".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2019
Task planning in robotics: an empirical comparison of PDDL- and ASP-based systems.
Frontiers of IT & EE, 2019

Leveraging Human Guidance for Deep Reinforcement Learning Tasks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Recent Advances in Imitation Learning from Observation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Imitation Learning from Video by Leveraging Proprioception.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Ad Hoc Teamwork With Behavior Switching Agents.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Improving Grounded Natural Language Understanding through Human-Robot Dialog.
Proceedings of the International Conference on Robotics and Automation, 2019

Importance Sampling Policy Evaluation with an Estimated Behavior Policy.
Proceedings of the 36th International Conference on Machine Learning, 2019

Adversarial Imitation Learning from State-only Demonstrations.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Marginal Cost Pricing with a Fixed Error Factor in Traffic Networks.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Learning Curriculum Policies for Reinforcement Learning.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Escape Room: A Configurable Testbed for Hierarchical Reinforcement Learning.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Reducing Sampling Error in Policy Gradient Learning.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork - The STAR Framework: Extended Abstract.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Open-World Reasoning for Service Robots.
Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, 2019

Robust Motion Planning and Safety Benchmarking in Human Workspaces.
Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the Thirty-Third AAAI Conference on Artificial Intelligence 2019 (AAAI-19), 2019

Selecting Compliant Agents for Opt-in Micro-Tolling.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Variety Wins: Soccer-Playing Robots and Infant Walking.
Front. Neurorobot., 2018

A century-long commitment to assessing artificial intelligence and its impact on society.
Commun. ACM, 2018

Overlapping layered learning.
Artif. Intell., 2018

Autonomous agents modelling other agents: A comprehensive survey and open problems.
Artif. Intell., 2018

UT Austin Villa: RoboCup 2018 3D Simulation League Champions.
Proceedings of the RoboCup 2018: Robot World Cup XXII [Montreal, 2018

A Study of Human-Robot Copilot Systems for En-route Destination Changing.
Proceedings of the 27th IEEE International Symposium on Robot and Human Interactive Communication, 2018

Passive Demonstrations of Light-Based Robot Signals for Improved Human Interpretability.
Proceedings of the 27th IEEE International Symposium on Robot and Human Interactive Communication, 2018

Enhanced Delta-tolling: Traffic Optimization via Policy Gradient Reinforcement Learning.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

On the Impact of Music on Decision Making in Cooperative Tasks.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Traffic Optimization For a Mixture of Self-interested and Compliant Agents.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2018

Enhanced Delta-tolling: Traffic Optimization via Policy Gradient Reinforcement Learning.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2018

PRISM: Pose Registration for Integrated Semantic Mapping.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Behavioral Cloning from Observation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Multi-modal Predicate Identification using Dynamically Learned Robot Controllers.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Inferring User Intention using Gaze in Vehicles.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Learning a Policy for Opportunistic Active Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Link-based Parameterized Micro-tolling Scheme for Optimal Traffic Management.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

PETLON: Planning Efficiently for Task-Level-Optimal Navigation.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

A Stitch in Time - Autonomous Model Management via Reinforcement Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

State Abstraction Synthesis for Discrete Models of Continuous Domains.
Proceedings of the 2018 AAAI Spring Symposia, 2018

Towards a Data Efficient Off-Policy Policy Gradient.
Proceedings of the 2018 AAAI Spring Symposia, 2018

Robot Behavioral Exploration and Multi-modal Perception using Dynamically Constructed Controllers.
Proceedings of the 2018 AAAI Spring Symposia, 2018

Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Guiding Exploratory Behaviors for Multi-Modal Grounding of Linguistic Descriptions.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Traffic Optimization for a Mixture of Self-Interested and Compliant Agents.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Autonomous Model Management via Reinforcement Learning.
Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

DIPD: Gaze-Based Intention Inference in Dynamic Environments.
Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Adversarial Goal Generation for Intrinsic Motivation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

DyETC: Dynamic Electronic Toll Collection for Traffic Congestion Alleviation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Reinforcement Learning.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Q-Learning.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Machine Learning Capabilities of a Simulated Cerebellum.
IEEE Trans. Neural Netw. Learning Syst., 2017

BWIBots: A platform for bridging the gap between AI and human-robot interaction research.
I. J. Robotics Res., 2017

Multirobot Systems.
IEEE Intelligent Systems, 2017

Intrinsically motivated model learning for developing curious robots.
Artif. Intell., 2017

Making friends on the fly: Cooperating with new teammates.
Artif. Intell., 2017

Three years of the RoboCup standard platform league drop-in player competition - Creating and maintaining a large scale ad hoc teamwork robotics competition.
Autonomous Agents and Multi-Agent Systems, 2017

Special issue on multiagent interaction without prior coordination: guest editorial.
Autonomous Agents and Multi-Agent Systems, 2017

Fast and Precise Black and White Ball Detection for RoboCup Soccer.
Proceedings of the RoboCup 2017: Robot World Cup XXI [Nagoya, Japan, July 27-31, 2017]., 2017

UT Austin Villa: RoboCup 2017 3D Simulation League Competition and Technical Challenges Champions.
Proceedings of the RoboCup 2017: Robot World Cup XXI [Nagoya, Japan, July 27-31, 2017]., 2017

Leveraging commonsense reasoning and multimodal perception for robot spoken dialog systems.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Autonomous Task Sequencing for Customized Curriculum Design in Reinforcement Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Data-Efficient Policy Evaluation Through Behavior Policy Search.
Proceedings of the 34th International Conference on Machine Learning, 2017

CC-Log: Drastically Reducing Storage Requirements for Robots Using Classification and Compression.
Proceedings of the 9th USENIX Workshop on Hot Topics in Storage and File Systems, 2017

Opportunistic Active Learning for Grounding Natural Language Descriptions.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

Multirobot Symbolic Planning under Temporal Uncertainty.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

A Protocol for Mixed Autonomous and Human-Operated Vehicles at Intersections.
Proceedings of the Autonomous Agents and Multiagent Systems, 2017

Evaluating Ad Hoc Teamwork Performance in Drop-In Player Challenges.
Proceedings of the Autonomous Agents and Multiagent Systems, 2017

Autonomous Model Management via Reinforcement Learning: Extended Abstract.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Multi-Robot Human Guidance: Human Experiments and Multiple Concurrent Requests.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Agent Behaviors for Joining and Leaving a Flock.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Three Years of the RoboCup Standard Platform League Drop-In Player Competition: Creating and Maintaining a Large Scale Ad Hoc Teamwork Robotics Competition.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Reasoning about Hypothetical Agent Behaviours and their Parameters.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Mechanism Design with Unknown Correlated Distributions: Can We Learn Optimal Mechanisms?
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Dynamically Constructed (PO)MDPs for Adaptive Robot Planning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Automatic Curriculum Graph Generation for Reinforcement Learning Agents.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Designing Better Playlists with Monte Carlo Tree Search.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Grounded Action Transformation for Robot Learning in Simulation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Grounded Action Transformation for Robot Learning in Simulation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Automated Design of Robust Mechanisms.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
UT Austin Villa: Project-Driven Research in AI and Robotics.
IEEE Intelligent Systems, 2016

Deep Reinforcement Learning in Parameterized Action Space.
Proceedings of the 4th International Conference on Learning Representations, 2016

A synthesis of automated planning and reinforcement learning for efficient, robust decision-making.
Artif. Intell., 2016

UT Austin Villa: RoboCup 2016 3D Simulation League Competition and Technical Challenges Champions.
Proceedings of the RoboCup 2016: Robot World Cup XX [Leipzig, Germany, June 30, 2016

Prioritized Role Assignment for Marking.
Proceedings of the RoboCup 2016: Robot World Cup XX [Leipzig, Germany, June 30, 2016

UT Austin Villa RoboCup 3D Simulation Base Code Release.
Proceedings of the RoboCup 2016: Robot World Cup XX [Leipzig, Germany, June 30, 2016

Impact of Music on Decision Making in Quantitative Tasks.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Robot Scavenger Hunt: A Standardized Framework for Evaluating Intelligent Mobile Robots.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning Multi-Modal Grounded Linguistic Semantics by Playing "I Spy".
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning to Order Objects Using Haptic and Proprioceptive Exploratory Behaviors.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Delta-Tolling: Adaptive Tolling for Optimizing Traffic Throughput.
Proceedings of the Ninth International Workshop on Agents in Traffic and Transportation (ATT 2016) co-located with the 25th International Joint Conference On Artificial Intelligence (IJCAI 2016), 2016

On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Dynamic behaviors on the NAO robot with closed-loop whole body operational space control.
Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

Adaptation of Surrogate Tasks for Bipedal Walk Optimization.
Proceedings of the Genetic and Evolutionary Computation Conference, 2016

An MDP-Based Winning Approach to Autonomous Power Trading: Formalization and Empirical Analysis.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Autonomous Learning Agents: Layered Learning and Ad Hoc Teamwork.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Source Task Creation for Curriculum Learning.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Adding Influencing Agents to a Flock.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

An MDP-Based Winning Approach to Autonomous Power Trading: Formalization and Empirical Analysis.
Proceedings of the AI for Smart Grids and Smart Buildings, 2016

Autonomous Electricity Trading Using Time-of-Use Tariffs in a Competitive Market.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

What's Hot at RoboCup.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
TRANSIT - A model for simulating infrastructure and policy interventions in agriculture logistics: Application to the northern Australia beef industry.
Computers and Electronics in Agriculture, 2015

Who speaks for AI?
AI Matters, 2015

Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance.
Artif. Intell., 2015

Representative Selection in Nonmetric Datasets.
Applied Artificial Intelligence, 2015

Robot-Centric Activity Recognition 'in the Wild'.
Proceedings of the Social Robotics - 7th International Conference, 2015

UT Austin Villa: RoboCup 2015 3D Simulation League Competition and Technical Challenges Champions.
Proceedings of the RoboCup 2015: Robot World Cup XIX [papers from the 19th Annual RoboCup International Symposium, 2015

A Study of Layered Learning Strategies Applied to Individual Behaviors in Robot Soccer.
Proceedings of the RoboCup 2015: Robot World Cup XIX [papers from the 19th Annual RoboCup International Symposium, 2015

Mobile Robot Planning Using Action Language BC with an Abstraction Hierarchy.
Proceedings of the Logic Programming and Nonmonotonic Reasoning, 2015

How Music Alters Decision Making - Impact of Music Stimuli on Emotional Classification.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Benchmarking robot cooperation without pre-coordination in the RoboCup Standard Platform League drop-in player competition.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Learning to Interpret Natural Language Commands through Human-Robot Dialog.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

When Security Games Go Green: Designing Defender Strategies to Prevent Poaching and Illegal Fishing.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Learning Inter-Task Transferability in the Absence of Target Task Samples.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Monte Carlo Hierarchical Model Learning: (Doctoral Consortium).
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Monte Carlo Hierarchical Model Learning.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

DJ-MC: A Reinforcement-Learning Agent for Music Playlist Recommendation.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Leading the Way: An Efficient Multi-robot Guidance System.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Determining Placements of Influencing Agents in a Flock.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

The RoboCup 2014 SPL Drop-in Player Competition: Encouraging Teamwork without Pre-coordination.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Defender Strategies In Domains Involving Frequent Adversary Interaction.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

CORPP: Commonsense Reasoning and Probabilistic Planning, as Applied to Dialog with a Mobile Robot.
Proceedings of the 2015 AAAI Spring Symposia, 2015

Autonomous Electricity Trading Using Time-Of-Use Tariffs in a Competitive Market.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

Deep Recurrent Q-Learning for Partially Observable MDPs.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

CORPP: Commonsense Reasoning and Probabilistic Planning, as Applied to Dialog with a Mobile Robot.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

SCRAM: Scalable Collision-avoiding Role Assignment with Minimal-Makespan for Formational Positioning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

UT Austin Villa 2014: RoboCup 3D Simulation League Champion via Overlapping Layered Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

The Impact of Determinism on Learning Atari 2600 Games.
Proceedings of the Learning for General Competency in Video Games, 2015

Placing Influencing Agents in a Flock.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
A Neuroevolution Approach to General Atari Game Playing.
IEEE Trans. Comput. Intellig. and AI in Games, 2014

Drop-in games at RoboCup.
AI Matters, 2014

RoboCup Soccer Leagues.
AI Magazine, 2014

Multiagent learning in the presence of memory-bounded agents.
Autonomous Agents and Multi-Agent Systems, 2014

UT Austin Villa: RoboCup 2014 3D Simulation League Competition and Technical Challenge Champions.
Proceedings of the RoboCup 2014: Robot World Cup XVIII [papers from the 18th Annual RoboCup International Symposium, 2014

Keyframe Sampling, Optimization, and Behavior Integration: Towards Long-Distance Kicking in the RoboCup 3D Simulation League.
Proceedings of the RoboCup 2014: Robot World Cup XVIII [papers from the 18th Annual RoboCup International Symposium, 2014

The RoboCup 2013 drop-in player challenges: Experiments in ad hoc teamwork.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Communicating with Unknown Teammates.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

TacTex'13: a champion adaptive power trading agent.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

SCRAM: scalable collision-avoiding role assignment with minimal-makespan for formational positioning.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

The RoboCup 2013 drop-in player challenges: a testbed for ad hoc teamwork.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Orienting a flock via ad hoc teamwork.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Communicating with unknown teammates.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Semi-autonomous intersection management.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Modeling uncertainty in leading ad hoc teams.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Influencing a Flock via Ad Hoc Teamwork.
Proceedings of the Swarm Intelligence - 9th International Conference, 2014

Planning in Action Language BC while Learning Action Costs for Mobile Robots.
Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014

Planning in Answer Set Programming while Learning Action Costs for Mobile Robots.
Proceedings of the 2014 AAAI Spring Symposia, 2014

Multi-Robot Human Guidance Using Topological Graphs.
Proceedings of the 2014 AAAI Spring Symposia, 2014

Leading the Way: An Efficient Multi-Robot Guidance System.
Proceedings of the 2014 AAAI Fall Symposia, Arlington, Virginia, USA, November 13-15, 2014, 2014

TacTex'13: A Champion Adaptive Power Trading Agent.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Using a million cell simulation of the cerebellum: Network scaling and task generality.
Neural Networks, 2013

Teaching and leading an ad hoc teammate: Collaboration without pre-coordination.
Artif. Intell., 2013

Training a Robot via Human Feedback: A Case Study.
Proceedings of the Social Robotics - 5th International Conference, 2013

The Open-Source TEXPLORE Code Release for Reinforcement Learning on Robots.
Proceedings of the RoboCup 2013: Robot World Cup XVII [papers from the 17th Annual RoboCup International Symposium, 2013

The 2012 UT Austin Villa Code Release.
Proceedings of the RoboCup 2013: Robot World Cup XVII [papers from the 17th Annual RoboCup International Symposium, 2013

Model-Selection for Non-parametric Function Approximation in Continuous Control Problems: A Case Study in a Smart Energy System.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Teaching agents with human feedback: a demonstration of the TAMER framework.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Learning non-myopically from human-generated reward.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Auction-based autonomous intersection management.
Proceedings of the 16th International IEEE Conference on Intelligent Transportation Systems, 2013

A learning agent for heat-pump thermostat control.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Learning exploration strategies in model-based reinforcement learning.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Ad hoc teamwork for leading a flock.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Humanoid robots learning to walk faster: from the real world to simulation and back.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Cooperating with a markovian ad hoc teammate.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Teamwork with Limited Knowledge of Teammates.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
How Humans Teach Agents - A New Experimental Perspective.
I. J. Social Robotics, 2012

Ten Years of AAMAS: Introduction to the Special Issue.
AI Magazine, 2012

UT Austin Villa: RoboCup 2012 3D Simulation League Champion.
Proceedings of the RoboCup 2012: Robot Soccer World Cup XVI [papers from the 16th Annual RoboCup International Symposium, 2012

Positioning to Win: A Dynamic Role Assignment and Formation Positioning System.
Proceedings of the RoboCup 2012: Robot Soccer World Cup XVI [papers from the 16th Annual RoboCup International Symposium, 2012

UT Austin Villa 2012: Standard Platform League World Champions.
Proceedings of the RoboCup 2012: Robot Soccer World Cup XVI [papers from the 16th Annual RoboCup International Symposium, 2012

Reinforcement learning from human reward: Discounting in episodic tasks.
Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communication, 2012

Approximately Orchestrated Routing and Transportation Analyzer: Large-scale traffic simulation for autonomous vehicles.
Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012

Video: RoboCup robot soccer history 1997 - 2011.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Evasion planning for autonomous vehicles at intersections.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

RTMBA: A Real-Time Model-Based Reinforcement Learning Architecture for robot control.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Setpoint scheduling for autonomous vehicle controllers.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

On coordination in practical multi-robot patrol.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

PAC Subset Selection in Stochastic Multi-armed Bandits.
Proceedings of the 29th International Conference on Machine Learning, 2012

Intrinsically motivated model learning for a developing curious agent.
Proceedings of the 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics, 2012

A Platform for Evaluating Autonomous Intersection Management Policies.
Proceedings of the 2012 IEEE/ACM Third International Conference on Cyber-Physical Systems, 2012

HyperNEAT-GGP: a hyperNEAT-based atari general game player.
Proceedings of the Genetic and Evolutionary Computation Conference, 2012

UT Austin Villa 2011: a champion agent in the RoboCup 3D soccer simulation competition.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Reinforcement learning from simultaneous human and MDP reward.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Role selection in ad hoc teamwork.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

An analysis framework for ad hoc teamwork tasks.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Leading ad hoc agents in joint action settings with multiple teammates.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots.
Proceedings of the Designing Intelligent Robots, 2012

Design and Optimization of an Omnidirectional Humanoid Walk: A Winning Approach at the RoboCup 2011 3D Simulation Competition.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Positioning to Win: A Dynamic Role Assignment and Formation Positioning System.
Proceedings of the Multiagent Pathfinding, Papers from the 2012 AAAI Workshop, 2012

Learning and Using Models.
Proceedings of the Reinforcement Learning, 2012

2011
Designing adaptive trading agents.
SIGecom Exchanges, 2011

Characterizing reinforcement learning methods through parameterized learning problems.
Machine Learning, 2011

An Introduction to Intertask Transfer for Reinforcement Learning.
AI Magazine, 2011

Empowerment for continuous agent - environment systems.
Adaptive Behaviour, 2011

A Low Cost Ground Truth Detection System for RoboCup Using the Kinect.
Proceedings of the RoboCup 2011: Robot Soccer World Cup XV [papers from the 15th Annual RoboCup International Symposium, 2011

WrightEagle and UT Austin Villa: RoboCup 2011 Simulation League Champions.
Proceedings of the RoboCup 2011: Robot Soccer World Cup XV [papers from the 15th Annual RoboCup International Symposium, 2011

Dynamic lane reversal in traffic management.
Proceedings of the 14th International IEEE Conference on Intelligent Transportation Systems, 2011

Autonomous Intersection Management: Multi-intersection optimization.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function's In-Degree.
Proceedings of the 28th International Conference on Machine Learning, 2011

Invited Talk: PRISM - Practical RL: Representation, Interaction, Synthesis, and Mortality.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

On optimizing interdependent skills: a case study in simulated 3D humanoid robot soccer.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Batch reservations in autonomous intersection management.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Flood Disaster Mitigation: A Real-World Challenge Problem for Multi-agent Unmanned Surface Vehicles.
Proceedings of the Advanced Agent Technology, 2011

A particle filter for bid estimation in ad auctions with periodic ranking observations.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Empirical evaluation of ad hoc teamwork in the pursuit domain.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Ship patrol: multiagent patrol under complex environmental conditions.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Protecting against evaluation overfitting in empirical reinforcement learning.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

On learning with imperfect representations.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

Intersections of the Future: Using Fully Autonomous Vehicles.
Proceedings of the Agents and Data Mining Interaction, 2011

Reinforcement Learning with Human Feedback in Mountain Car.
Proceedings of the Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, 2011

Comparing Agents' Success against People in Security Domains.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Role-Based Ad Hoc Teamwork.
Proceedings of the Plan, Activity, and Intent Recognition, 2011

Role-Based Ad Hoc Teamwork.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Ad Hoc Teamwork in Variations of the Pursuit Domain.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Enforcing Liveness in Autonomous Traffic Management.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Multiagent Patrol Generalized to Complex Environmental Conditions.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Leading Multiple Ad Hoc Teammates in Joint Action Settings.
Proceedings of the Interactive Decision Theory and Game Theory, 2011

2010
Reinforcement Learning.
Proceedings of the Encyclopedia of Machine Learning, 2010

Q-Learning.
Proceedings of the Encyclopedia of Machine Learning, 2010

Adaptive Auction Mechanism Design and the Incorporation of Prior Knowledge.
INFORMS Journal on Computing, 2010

Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning.
Autonomous Agents and Multi-Agent Systems, 2010

Learning Powerful Kicks on the Aibo ERS-7: The Quest for a Striker.
Proceedings of the RoboCup 2010: Robot Soccer World Cup XIV [papers from the 14th annual RoboCup International Symposium, 2010

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

Bringing simulation to life: A mixed reality autonomous intersection.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Generalized model learning for Reinforcement Learning on a humanoid robot.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Boosting for Regression Transfer.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Efficient Selection of Multiple Bandit Arms: Theory and Practice.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Convergence, Targeted Optimality, and Safety in Multiagent Learning.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Real time targeted exploration in large domains.
Proceedings of the 2010 IEEE 9th International Conference on Development and Learning, 2010

To teach or not to teach?: decision making under uncertainty in ad hoc teams.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

MARIOnET: motion acquisition for robots through iterative online evaluative training.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

TacTex09: a champion bidding agent for ad auctions.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Training a Tetris agent via interactive shaping: a demonstration of the TAMER framework.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Combining manual feedback with subsequent MDP reward signals for reinforcement learning.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Online model learning in adversarial Markov decision processes.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Motion Planning Algorithms for Autonomous Intersection Management.
Proceedings of the Bridging the Gap Between Task and Motion Planning, 2010

Multi-Agent Social Simulation.
Proceedings of the Handbook of Ambient Intelligence and Smart Environments, 2010

2009
Color learning and illumination invariance on mobile robots: A survey.
Robotics and Autonomous Systems, 2009

Transfer Learning for Reinforcement Learning Domains: A Survey.
J. Mach. Learn. Res., 2009

Learning Complementary Multiagent Behaviors: A Case Study.
Proceedings of the RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29, 2009

Three Humanoid Soccer Platforms: Comparison and Synthesis.
Proceedings of the RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29, 2009

Feature Selection for Value Function Approximation Using Bayesian Model Selection.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Compositional Models for Reinforcement Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Interactively shaping agents via human reinforcement: the TAMER framework.
Proceedings of the 5th International Conference on Knowledge Capture (K-CAP 2009), 2009

Improving particle filter performance using SSE instructions.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Learning complementary multiagent behaviors: a case study.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

An empirical analysis of value function-based and policy search reinforcement learning.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

Generalized model learning for reinforcement learning in factored domains.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

A task specification language for bootstrap learning.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

Leading a Best-Response Teammate in an Ad Hoc Team.
Proceedings of the Agent-Mediated Electronic Commerce. Designing Trading Strategies and Mechanisms for Electronic Markets, 2009

Design Principles for Creating Human-Shapable Agents.
Proceedings of the Agents that Learn from Human Teachers, 2009

A Task Specification Language for Bootstrap Learning.
Proceedings of the Agents that Learn from Human Teachers, 2009

An Unmanaged Intersection Protocol and Improved Intersection Safety for Autonomous Vehicles.
Proceedings of the Multi-Agent Systems for Traffic and Transportation Engineering., 2009

2008
Book announcement: autonomous bidding agents.
SIGecom Exchanges, 2008

A Multiagent Approach to Autonomous Intersection Management.
J. Artif. Intell. Res., 2008

Open Source Software: A Key Component of E-Health in Developing Nations.
IJHISI, 2008

Comparing Two Action Planning Approaches for Color Learning on a Mobile Robot.
Proceedings of the VISAPP International Workshop on Robotic Perception, 2008

Long-Term vs. Greedy Action Planning for Color Learning on a Mobile Robot.
Proceedings of the VISAPP 2008: Proceedings of the Third International Conference on Computer Vision Theory and Applications, Funchal, Madeira, Portugal, January 22-25, 2008, 2008

Domestic Interaction on a Segway Base.
Proceedings of the RoboCup 2008: Robot Soccer World Cup XII [papers from the 12th annual RoboCup International Symposium, 2008

Transferring Instances for Model-Based Reinforcement Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

Online Multiagent Learning against Memory Bounded Adversaries.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

Maximum likelihood estimation of sensor and action model functions on a mobile robot.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Person tracking on a mobile robot with heterogeneous inter-characteristic feedback.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Person recognition on a Segway Robot: A video of UT Austin Villa Robocup@Home 2007 finals demonstration.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Negative information and line observations for Monte Carlo localization.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Online kernel selection for Bayesian reinforcement learning.
Proceedings of the Machine Learning, 2008

CARVE: A Cognitive Agent for Resource Value Estimation.
Proceedings of the 2008 International Conference on Autonomic Computing, 2008

Autonomous transfer for reinforcement learning.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Replacing the stop sign: unmanaged intersection control for autonomous vehicles.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

The utility of temporal abstraction in reinforcement learning.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Mitigating catastrophic failure at intersections of autonomous vehicles.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

The 2007 TAC SCM Prediction Challenge.
Proceedings of the Agent-Mediated Electronic Commerce and Trading Agent Design and Analysis, 2008

Transfer Learning and Intelligence: an Argument and Approach.
Proceedings of the Artificial General Intelligence 2008, 2008

2007
Intelligent Autonomous Robotics: A Robot Soccer Case Study
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, 2007

Transfer Learning via Inter-Task Mappings for Temporal Difference Learning.
J. Mach. Learn. Res., 2007

Structure-based color learning on a mobile robot under changing illumination.
Auton. Robots, 2007

Multiagent learning is not the answer. It is the question.
Artif. Intell., 2007

Empirical Studies in Action Selection with Reinforcement Learning.
Adaptive Behaviour, 2007

Model-Based Exploration in Continuous State Spaces.
Proceedings of the Abstraction, 2007

Model-Based Reinforcement Learning in a Complex Domain.
Proceedings of the RoboCup 2007: Robot Soccer World Cup XI, 2007

A Neural Network-Based Approach to Robot Motion Control.
Proceedings of the RoboCup 2007: Robot Soccer World Cup XI, 2007

Instance-Based Action Models for Fast Action Planning.
Proceedings of the RoboCup 2007: Robot Soccer World Cup XI, 2007

Global action selection for illumination invariant color modeling.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Machine Learning for On-Line Hardware Reconfiguration.
Proceedings of the IJCAI 2007, 2007

Learning and Multiagent Reasoning for Autonomous Agents.
Proceedings of the IJCAI 2007, 2007

Color Learning on a Mobile Robot: Towards Full Autonomy under Changing Illumination.
Proceedings of the IJCAI 2007, 2007

Sharing the Road: Autonomous Vehicles Meet Human Drivers.
Proceedings of the IJCAI 2007, 2007

General Game Learning Using Knowledge Transfer.
Proceedings of the IJCAI 2007, 2007

A Comparison of Two Approaches for Vision and Self-Localization on a Mobile Robot.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Cross-domain transfer for reinforcement learning.
Proceedings of the Machine Learning, 2007

Autonomous Return on Investment Analysis of Additional Processing Resources.
Proceedings of the Fourth International Conference on Autonomic Computing (ICAC'07), 2007

Graph-Based Domain Mapping for Transfer Learning in General Games.
Proceedings of the Machine Learning: ECML 2007, 2007

Transfer via inter-task mappings in policy search reinforcement learning.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Towards reinforcement learning representation transfer.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Adapting Price Predictions in TAC SCM.
Proceedings of the Agent-Mediated Electronic Commerce and Trading Agent Design and Analysis, 2007

Adapting in agent-based markets: a study from TAC SCM.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Batch reinforcement learning in a complex domain.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Model-based function approximation in reinforcement learning.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

IFSA: incremental feature-set augmentation for reinforcement learning tasks.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Representation Transfer for Reinforcement Learning.
Proceedings of the Computational Approaches to Representation Change during Learning and Development, 2007

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Representation Transfer via Elaboration.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Autonomous bidding agents - strategies and lessons from the trading agent competition.
MIT Press, ISBN: 978-0-262-23260-9, 2007

2006
From pixels to multi-robot decision-making: A study in uncertainty.
Robotics and Autonomous Systems, 2006

Evolutionary Function Approximation for Reinforcement Learning.
J. Mach. Learn. Res., 2006

Towards autonomous sensor and actuator model induction on a mobile robot.
Connect. Sci., 2006

Cobot in LambdaMOO: An Adaptive Social Statistics Agent.
Autonomous Agents and Multi-Agent Systems, 2006

Selective Visual Attention for Object Detection on a Legged Robot.
Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Autonomous Planned Color Learning on a Legged Robot.
Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Autonomous Learning of Stable Quadruped Locomotion.
Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study.
Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

The Chin Pinch: A Case Study in Skill Learning on a Legged Robot.
Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Polynomial Regression with Automated Degree: A Function Approximator for Autonomous Agents.
Proceedings of the 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2006), 2006

A Multi-robot System for Continuous Area Sweeping Tasks.
Proceedings of the 2006 IEEE International Conference on Robotics and Automation, 2006

Autonomous Planned Color Learning on a Mobile Robot Without Labeled Data.
Proceedings of the Ninth International Conference on Control, 2006

On-line evolutionary computation for reinforcement learning in stochastic domains.
Proceedings of the Genetic and Evolutionary Computation Conference, 2006

Comparing evolutionary and temporal difference methods in a reinforcement learning domain.
Proceedings of the Genetic and Evolutionary Computation Conference, 2006

Designing safe, profitable automated stock trading agents using evolutionary algorithms.
Proceedings of the Genetic and Evolutionary Computation Conference, 2006

A Distributed Biconnectivity Check.
Proceedings of the Distributed Autonomous Robotic Systems 7, 2006

TacTex-05: An Adaptive Agent for TAC SCM.
Proceedings of the Agent-Mediated Electronic Commerce. Automated Negotiation and Strategy Design for Electronic Markets, 2006

Predictive Planning for Supply Chain Management.
Proceedings of the Sixteenth International Conference on Automated Planning and Scheduling, 2006

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning.
Proceedings of the Proceedings, 2006

Inter-Task Action Correlation for Reinforcement Learning Tasks.
Proceedings of the Proceedings, 2006

Expectation-Based Vision for Self-Localization on a Legged Robot.
Proceedings of the Proceedings, 2006

TacTex-05: A Champion Supply Chain Management Agent.
Proceedings of the Proceedings, 2006

Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping.
Proceedings of the Proceedings, 2006

Automatic Heuristic Construction for General Game Playing.
Proceedings of the Proceedings, 2006

Automatic Heuristic Construction in a Complete General Game Player.
Proceedings of the Proceedings, 2006

Know Thine Enemy: A Champion RoboCup Coach Agent.
Proceedings of the Proceedings, 2006

Making Autonomous Intersection Management Backwards-Compatible.
Proceedings of the Proceedings, 2006

Traffic Intersections of the Future.
Proceedings of the Proceedings, 2006

Biconnected Structure for Multi-Robot Systems.
Proceedings of the Proceedings, 2006

Keeping in Touch: Maintaining Biconnected Structure by Homogeneous Robots.
Proceedings of the Proceedings, 2006

Adaptive mechanism design: a metalearning approach.
Proceedings of the 8th International Conference on Electronic Commerce: The new e-commerce, 2006

2005
Developing adaptive auction mechanisms.
SIGecom Exchanges, 2005

Evolving Soccer Keepaway Players Through Task Decomposition.
Machine Learning, 2005

The First International Trading Agent Competition: Autonomous Bidding Agents.
Electronic Commerce Research, 2005

Reinforcement Learning for RoboCup Soccer Keepaway.
Adaptive Behaviour, 2005

Function Approximation via Tile Coding: Automating Parameter Choice.
Proceedings of the Abstraction, 2005

Keepaway Soccer: From Machine Learning Testbed to Benchmark.
Proceedings of the RoboCup 2005: Robot Soccer World Cup IX, 2005

Towards Eliminating Manual Color Calibration at RoboCup.
Proceedings of the RoboCup 2005: Robot Soccer World Cup IX, 2005

Multiagent Traffic Management: Opportunities for Multiagent Learning.
Proceedings of the Learning and Adaption in Multi-Agent Systems, 2005

Multi-robot Learning for Continuous Area Sweeping.
Proceedings of the Learning and Adaption in Multi-Agent Systems, 2005

Real-time vision on a mobile robot platform.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

State Abstraction Discovery from Irrelevant State Variables.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Simultaneous Calibration of Action and Sensor Models on a Mobile Robot.
Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Practical Vision-Based Monte Carlo Localization on a Legged Robot.
Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Towards Self-Configuring Hardware for Distributed Computer Systems.
Proceedings of the Second International Conference on Autonomic Computing (ICAC 2005), 2005

Automatic feature selection in neuroevolution.
Proceedings of the Genetic and Evolutionary Computation Conference, 2005

Behavior transfer for value-function-based reinforcement learning.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Multiagent traffic management: an improved intersection control mechanism.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Value Functions for RL-Based Behavior Transfer: A Comparative Study.
Proceedings of the Proceedings, 2005

Autonomous Color Learning on a Mobile Robot.
Proceedings of the Proceedings, 2005

Improving Action Selection in MDP's via Knowledge Transfer.
Proceedings of the Proceedings, 2005

2004
TacTex-03: a supply chain management agent.
SIGecom Exchanges, 2004

Using RoboCup in university-level computer science education.
ACM Journal of Educational Resources in Computing, 2004

Adaptive job routing and scheduling.
Eng. Appl. of AI, 2004

A Model-Based Approach to Robot Joint Control.
Proceedings of the RoboCup 2004: Robot Soccer World Cup VIII, 2004

Towards Illumination Invariance in the Legged League.
Proceedings of the RoboCup 2004: Robot Soccer World Cup VIII, 2004

The UT Austin Villa 2003 Champion Simulator Coach: A Machine Learning Approach.
Proceedings of the RoboCup 2004: Robot Soccer World Cup VIII, 2004

Quantitative Analysis of Circumferential Plaque Distribution in Human Coronary Arteries in Relation to Local Vessel Curvature.
Proceedings of the 2004 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2004

Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion.
Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Towards Autonomic Computing: Adaptive Network Routing and Scheduling.
Proceedings of the 1st International Conference on Autonomic Computing (ICAC 2004), 2004

Towards On-Board Color Constancy on Mobile Robots.
Proceedings of the 1st Canadian Conference on Computer and Robot Vision (CRV 2004) 17-19 May 2004, 2004

Agent-Based Supply Chain Management: Bidding for Customer Orders.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Three Automated Stock-Trading Agents: A Comparative Study.
Proceedings of the Agent-Mediated Electronic Commerce VI, 2004

Bidding for Customer Orders in TAC SCM.
Proceedings of the Agent-Mediated Electronic Commerce VI, 2004

Towards Autonomic Computing: Adaptive Job Routing and Scheduling.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

Machine Learning for Fast Quadrupedal Locomotion.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

2003
Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions.
J. Artif. Intell. Res., 2003

Guest Editors' Introduction: Agents and Markets.
IEEE Intelligent Systems, 2003

The RoboCup Soccer Server and CMUnited Clients: Implemented Infrastructure for MAS Research.
Autonomous Agents and Multi-Agent Systems, 2003

Progress in learning 3 vs. 2 keepaway.
Proceedings of the IEEE International Conference on Systems, 2003

A polynomial-time nash equilibrium algorithm for repeated games.
Proceedings of the Proceedings 4th ACM Conference on Electronic Commerce (EC-2003), 2003

RoboCup as an Introduction to CS Research.
Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

RoboCup in Higher Education: A Preliminary Report.
Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

Progress in Learning 3 vs. 2 Keepaway
Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

Learning Predictive State Representations.
Proceedings of the Machine Learning, 2003

Evolving Keepaway Soccer Players through Task Decomposition.
Proceedings of the Genetic and Evolutionary Computation, 2003

Concurrent layered learning.
Proceedings of the Second International Joint Conference on Autonomous Agents & Multiagent Systems, 2003

Performance analysis of a counter-intuitive automated stock-trading agent.
Proceedings of the 5th International Conference on Electronic Commerce, 2003

2002
RoboCup-2001: The Fifth Robotic Soccer World Championships.
AI Magazine, 2002

The 2002 AAAI Spring Symposium Series.
AI Magazine, 2002

Multiagent Competitions and Research: Lessons from RoboCup and TAC.
Proceedings of the RoboCup 2002: Robot Soccer World Cup VI, 2002

Modeling Auction Price Uncertainty Using Boosting-based Conditional Density Estimation.
Proceedings of the Machine Learning, 2002

Randomized strategic demand reduction: getting more by asking for less.
Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002

ATTac-2001: A Learning, Autonomous Bidding Agent.
Proceedings of the Agent-Mediated Electronic Commerce IV, 2002

Self-Enforcing Strategic Demand Reduction.
Proceedings of the Agent-Mediated Electronic Commerce IV, 2002

The 2001 Trading Agent Competition.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2001
Autonomous Bidding Agents in the Trading Agent Competition.
IEEE Internet Computing, 2001

RoboCup-2000: The Fourth Robotic Soccer World Championships.
AI Magazine, 2001

FAucS : An FCC Spectrum Auction Simulator for Autonomous Bidding Agents.
Proceedings of the Electronic Commerce, Second International Workshop, 2001

Keepaway Soccer: A Machine Learning Testbed.
Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001

ATTUnited-2001: Using Heterogeneous Players.
Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001

Cobot: A Social Reinforcement Learning Agent.
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Scaling Reinforcement Learning toward RoboCup Soccer.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

Implicit Negotiation in Repeated Games.
Proceedings of the Intelligent Agents VIII, 8th International Workshop, 2001

An architecture for action selection in robotic soccer.
Proceedings of the Fifth International Conference on Autonomous Agents, 2001

ATTac-2000: an adaptive autonomous bidding agent.
Proceedings of the Fifth International Conference on Autonomous Agents, 2001

A social reinforcement learning agent.
Proceedings of the Fifth International Conference on Autonomous Agents, 2001

2000
Multiagent Systems: A Survey from a Machine Learning Perspective.
Auton. Robots, 2000

CMUNITED-98: RoboCup-98 Small-Robot World Champion Team.
AI Magazine, 2000

CMUNITED-98 Simulator Team.
AI Magazine, 2000

Overview of RoboCup-99.
AI Magazine, 2000

Reinforcement Learning for 3 vs. 2 Keepaway
Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Overview of RoboCup-2000.
Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

ATT-CMUnited-2000: Third Place Finisher in the RoboCup-2000 Simulator League.
Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Keeping the Ball from CMUnited-99.
Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Progress in RoboCup Soccer Research in 2000.
Proceedings of the Experimental Robotics VII [ISER 2000, 2000

TPOT-RL Applied to Network Routing.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Defining and Using Ideal Teammate and Opponent Agent Models: A Case Study in Robotic Soccer.
Proceedings of the 4th International Conference on Multi-Agent Systems, 2000

Layered Learning.
Proceedings of the Machine Learning: ECML 2000, 11th European Conference on Machine Learning, Barcelona, Catalonia, Spain, May 31, 2000

Layered Disclosure: Revealing Agents' Internals.
Proceedings of the Intelligent Agents VII. Agent Theories Architectures and Languages, 2000

Layered disclosure: why is the agent doing what it's doing?
Proceedings of the Fourth International Conference on Autonomous Agents, 2000

The RoboCup Soccer Server and CMUnited: Implemented Infrastructure for MAS Research.
Proceedings of the Infrastructure for Agents, 2000

Defining and Using Ideal Teammate and Opponent Agent Models.
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000

Cobot in LambdaMOO: A Social Statistics Agent.
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000

Layered learning in multiagent systems - a winning approach to robotic soccer.
Intelligent robotics and autonomous agents, MIT Press, ISBN: 978-0-262-19438-9, 2000

1999
The CMUnited-97 robotic soccer team: Perception and multi-agent control.
Robotics and Autonomous Systems, 1999

Task Decomposition, Dynamic Role Assignment, and Low-Bandwidth Communication for Real-Time Strategic Teamwork.
Artif. Intell., 1999

Overview of RoboCup-99.
Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

Layered Learning and Flexible Teamwork in RoboCup Simulation Agents.
Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

The CMUnited-99 Champion Simulator Team.
Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

Team-Partitioned, Opaque-Transition Reinforcement Learning.
Proceedings of the Third Annual Conference on Autonomous Agents, 1999

CMUnited-98: A Team of Robotic Soccer Agents.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998
Towards collaborative and adversarial learning: a case study in robotic soccer.
Int. J. Hum.-Comput. Stud., 1998

The CMUnited-98 champion small-robot team.
Advanced Robotics, 1998

CMUNITED-97: RoboCup-97 Small-Robot World Champion Team.
AI Magazine, 1998

Layered Approach to Learning Client Behaviors in the Robocup Soccer Server.
Applied Artificial Intelligence, 1998

The Robocup Physical Agent Challenge: Phase I.
Applied Artificial Intelligence, 1998

The CMUnited-98 Small-Robot Team.
Proceedings of the RoboCup-98: Robot Soccer World Cup II, 1998

The CMUnited-98 Champion Simulator Team.
Proceedings of the RoboCup-98: Robot Soccer World Cup II, 1998

Team-Partitioned, Opaque-Transition Reinforced Learning.
Proceedings of the RoboCup-98: Robot Soccer World Cup II, 1998

Individual and Collaborative Behaviors in a Team of Robotic Soccer Agents.
Proceedings of the Third International Conference on Multiagent Systems, 1998

Communication in Domains with Unreliable, Single-Channel, Low-Bandwidth Communication.
Proceedings of the Collective Robotics, First International Workshop, 1998

Task Decomposition and Dynamic Role Assignment for Real-Time Strategic Teamwork.
Proceedings of the Intelligent Agents V, 1998

The CMUnited-97 Robotic Socccer Team: Perception and Multiagent Control.
Proceedings of the Second International Conference on Autonomous Agents, 1998

Using Decision Tree Confidence Factors for Multi-Agent Control.
Proceedings of the Second International Conference on Autonomous Agents, 1998

1997
The CMUnited-97 Small Robot Team.
Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

The CMUnited-97 Simulator Team.
Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

Using Decision Tree Confidence Factors for Multiagent Control.
Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

The RoboCup Synthetic Agent Challenge 97.
Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

The RoboCup Physical Agent Challenge: Goals and Protocols for Phase 1.
Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

The RoboCup Synthetic Agent Challenge 97.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

A Layered Approach for an Autonomous Robotic Soccer System.
Proceedings of the First International Conference on Autonomous Agents, 1997

Layered Learning in Multiagent Systems.
Proceedings of the Fourteenth National Conference on Artificial Intelligence and Ninth Innovative Applications of Artificial Intelligence Conference, 1997

1995
FLECS: Planning with a Flexible Commitment Strategy.
J. Artif. Intell. Res., 1995

Beating a Defender in Robotic Soccer: Memory-Based Learning of a Continuous Function.
Proceedings of the Advances in Neural Information Processing Systems 8, 1995

1994
The Need for Different Domain-independent Heuristics.
Proceedings of the Second International Conference on Artificial Intelligence Planning Systems, 1994

1990
Developing Networked Services for Libraries: The U. K. Experience.
Computer Networks and ISDN Systems, 1990


  Loading...