Roy Fox

According to our database1, Roy Fox authored at least 50 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Reinforcement Learning from Delayed Observations via World Models.
CoRR, 2024

Moonwalk: Inverse-Forward Differentiation.
CoRR, 2024

Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills.
CoRR, 2024

2023
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors.
CoRR, 2023

Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling.
Proceedings of the International Conference on Machine Learning, 2023

Learning to Design Analog Circuits to Meet Threshold Specifications.
Proceedings of the International Conference on Machine Learning, 2023

2022
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments.
CoRR, 2022

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games.
CoRR, 2022

Learning to Query Internet Text for Informing Reinforcement Learning Agents.
CoRR, 2022

Anytime PSRO for Two-Player Zero-Sum Games.
CoRR, 2022

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks.
Proceedings of the International Conference on Machine Learning, 2022

Independent Natural Policy Gradient always converges in Markov Potential Games.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Target Entropy Annealing for Discrete Soft Actor-Critic.
CoRR, 2021

Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning.
CoRR, 2021

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates.
CoRR, 2021

Modular Framework for Visuomotor Language Grounding.
CoRR, 2021

Improving Social Welfare While Preserving Autonomy via a Pareto Mediator.
CoRR, 2021

XDO: A Double Oracle Algorithm for Extensive-Form Games.
CoRR, 2021

A* Search Without Expansions: Learning Heuristic Functions with Deep Q-Networks.
CoRR, 2021

XDO: A Double Oracle Algorithm for Extensive-Form Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
AutoPandas: neural-backed generators for program synthesis.
Proc. ACM Program. Lang., 2019

Hierarchical Variational Imitation Learning of Control Programs.
CoRR, 2019

Multi-Task Hierarchical Imitation Learning for Home Automation.
Proceedings of the 15th IEEE International Conference on Automation Science and Engineering, 2019

2018
Derivative-Free Failure Avoidance Control for Manipulation using Learned Support Constraints.
CoRR, 2018

Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models.
Proceedings of the Algorithmic Foundations of Robotics XIII, 2018

Fast and Reliable Autonomous Surgical Debridement with Cable-Driven Robots Using a Two-Phase Calibration Procedure.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Robustly Adjusting Indoor Drip Irrigation Emitters with the Toyota HSR Robot.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

RLlib: Abstractions for Distributed Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Parametrized Hierarchical Procedures for Neural Programming.
Proceedings of the 6th International Conference on Learning Representations, 2018

Constraint Estimation and Derivative-Free Recovery for Robot Learning from Demonstrations.
Proceedings of the 14th IEEE International Conference on Automation Science and Engineering, 2018

2017
Ray RLLib: A Composable and Scalable Reinforcement Learning Library.
CoRR, 2017

DDCO: Discovery of Deep Continuous Options forRobot Learning from Demonstrations.
CoRR, 2017

Iterative Noise Injection for Scalable Imitation Learning.
CoRR, 2017

Multi-Level Discovery of Deep Options.
CoRR, 2017

DART: Noise Injection for Robust Imitation Learning.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

Statistical data cleaning for deep learning of automation tasks from demonstrations.
Proceedings of the 13th IEEE Conference on Automation Science and Engineering, 2017

An algorithm and user study for teaching bilateral manipulation via iterated best response demonstrations.
Proceedings of the 13th IEEE Conference on Automation Science and Engineering, 2017

2016
Information-Theoretic Methods for Planning and Learning in Partially Observable Markov Decision Processes (שער נוסף בעברית: שיטות תורת-האינפורמציה לתכנון ולמידה בתהליכי החלטה מרקוב נצפים חלקית.).
PhD thesis, 2016

Principled Option Learning in Markov Decision Processes.
CoRR, 2016

Information-Theoretic Methods for Planning and Learning in Partially Observable Markov Decision Processes.
CoRR, 2016

Taming the Noise in Reinforcement Learning via Soft Updates.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Minimum-information LQG control part I: Memoryless controllers.
Proceedings of the 55th IEEE Conference on Decision and Control, 2016

Minimum-information LQG control Part II: Retentive controllers.
Proceedings of the 55th IEEE Conference on Decision and Control, 2016

2015
Optimal Selective Attention in Reactive Agents.
CoRR, 2015

G-Learning: Taming the Noise in Reinforcement Learning via Soft Updates.
CoRR, 2015

2013
A multi-agent control framework for co-adaptation in brain-computer interfaces.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012
Bounded Planning in Passive POMDPs.
Proceedings of the 29th International Conference on Machine Learning, 2012

2007
A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007


  Loading...