Andrey Kolobov

Roland Siegwart

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Offline RL by Blending Heuristics.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Influence of Heat Loss on the Chaotic Dynamics of Reaction Waves in the Model with Chain-Branching Reaction.

[BibT_eX]

[DOI]

Int. J. Bifurc. Chaos, September, 2023

LLF-Bench: Benchmark for Interactive Learning from Language Feedback.

[BibT_eX]

[DOI]

CoRR, 2023

Interactive Robot Learning from Verbal Correction.

[BibT_eX]

[DOI]

CoRR, 2023

PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining.

[BibT_eX]

[DOI]

CoRR, 2023

Survival Instinct in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Exploring Levels of Control for a Navigation Assistant for Blind Travelers.

[BibT_eX]

[DOI]

Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, 2023

PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining.

[BibT_eX]

[DOI]

Garrett Thomas

Ricky Loynd

Felipe Vieira Frujeri

Vibhav Vineet

Mihai Jalobeanu

Proceedings of the Conference on Robot Learning, 2023

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control.

[BibT_eX]

[DOI]

Philippe Hansen-Estruch

Proceedings of the Conference on Robot Learning, 2023

2022

The Sandbox Environment for Generalizable Agent Research (SEGAR).

[BibT_eX]

[DOI]

CoRR, 2022

MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control.

[BibT_eX]

[DOI]

Nolan Wagener

Felipe Vieira Frujeri

Ricky Loynd

Matthew J. Hausknecht

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Heuristic-Guided Reinforcement Learning.

[BibT_eX]

[DOI]

Adith Swaminathan

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

Policy Improvement from Multiple Experts.

[BibT_eX]

[DOI]

Alekh Agarwal

CoRR, 2020

Safe Reinforcement Learning via Curriculum Induction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark.

[BibT_eX]

[DOI]

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

Policy Improvement via Imitation of Multiple Oracles.

[BibT_eX]

[DOI]

Alekh Agarwal

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Online Learning for Active Cache Synchronization.

[BibT_eX]

[DOI]

Sébastien Bubeck

Julian Zimmert

Proceedings of the 37th International Conference on Machine Learning, 2020

MultiPoint: Cross-spectral registration of thermal and optical aerial imagery.

[BibT_eX]

[DOI]

Nicholas R. J. Lawrance

Proceedings of the 4th Conference on Robot Learning, 2020

2019

Optimal Freshness Crawl Under Politeness Constraints.

[BibT_eX]

[DOI]

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Staying up to Date with Online Content Changes Using Reinforcement Learning for Scheduling.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

Autonomous Thermalling as a Partially Observable Markov Decision Process (Extended Version).

[BibT_eX]

[DOI]

CoRR, 2018

Autonomous Thermalling as a Partially Observable Markov Decision Process.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XIV, 2018

ArduSoar: An Open-Source Thermalling Controller for Resource-Constrained Autopilots.

[BibT_eX]

[DOI]

Samuel Tabor

Iain Guilliard

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2016

Interactive Teaching Strategies for Agent Training.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015

Metareasoning for Planning Under Uncertainty.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Selecting Robust Strategies in RTS Games via Concurrent Plan Augmentation.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

TODTLER: Two-Order-Deep Transfer Learning.

[BibT_eX]

[DOI]

Jan Van Haaren

Jesse Davis

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Parallel Task Routing for Crowdsourcing.

[BibT_eX]

[DOI]

Proceedings of the Second AAAI Conference on Human Computation and Crowdsourcing, 2014

Gauss meets Canadian traveler: shortest-path problems with correlated natural dynamics.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Saturated Path-Constrained MDP: Planning under Uncertainty and Deterministic Model-Checking Constraints.

[BibT_eX]

[DOI]

Jonathan Sprauel

Florent Teichteil-Königsbuch

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Scalable Methods and Expressive Models for Planning Under Uncertainty.

[BibT_eX]

[DOI]

PhD thesis, 2013

Joint Crowdsourcing of Multiple Tasks.

[BibT_eX]

[DOI]

Proceedings of the Human Computation and Crowdsourcing: Works in Progress and Demonstration Abstracts, 2013

2012

Planning with Markov Decision Processes: An AI Perspective

[BibT_eX]

[DOI]

Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01559-5, 2012

Discovering hidden structure in factored MDPs.

[BibT_eX]

[DOI]

Artif. Intell., 2012

A Theory of Goal-Oriented MDPs with Dead Ends.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Reverse Iterative Deepening for Finite-Horizon MDPs with Large Branching Factors.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Second International Conference on Automated Planning and Scheduling, 2012

LRTDP Versus UCT for Online Probabilistic Planning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011

Towards Scalable MDP Algorithms.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2011, 2011

Heuristic Search for Generalized Stochastic Shortest Path MDPs.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Automated Planning and Scheduling, 2011

2010

Classical Planning in MDP Heuristics: with a Little Help from Generalization.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Automated Planning and Scheduling, 2010

SixthSense: Fast and Reliable Recognition of Dead Ends in MDPs.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009

ReTrASE: Integrating Paradigms for Approximate Probabilistic Planning.

[BibT_eX]

[DOI]