We stand with Ukraine

We stand with Ukraine

Josiah Hanna

Orcid: 0000-0002-7411-0398

According to our database¹, Josiah Hanna authored at least 67 papers between 2013 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Abstract Sim2Real Through Approximate Information States.

[DOI]

,

,

Josiah P. Hanna

IEEE Robotics Autom. Lett., June, 2026

Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling.

[DOI]

Nicholas E. Corrado

,

,

Josiah P. Hanna

CoRR, May, 2026

Articulated-Body Dynamics Network: Dynamics-Grounded Prior for Robot Learning.

[DOI]

,

,

,

Josiah P. Hanna

CoRR, March, 2026

Efficient and Versatile Quadrupedal Skating: Optimal Co-design via Reinforcement Learning and Bayesian Optimization.

[DOI]

,

,

,

CoRR, March, 2026

On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling.

[DOI]

Nicholas E. Corrado

,

Josiah P. Hanna

Trans. Mach. Learn. Res., 2026

2025

Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies.

[DOI]

Nicholas E. Corrado

,

Josiah P. Hanna

CoRR, August, 2025

When Can Model-Free Reinforcement Learning be Enough for Thinking?

[DOI]

Josiah P. Hanna

,

Nicholas E. Corrado

CoRR, June, 2025

Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning.

[DOI]

,

Jean-Charles Noirot Ferrand

,

,

,

,

Patrick D. McDaniel

CoRR, March, 2025

Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer.

[DOI]

,

,

Siddhant Agarwal

,

,

Geethika Hemkumar

,

Abhinav Narayan Harish

,

,

,

,

,

,

,

Josiah P. Hanna

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Multi-Robot Collaboration Through Reinforcement Learning and Abstract Simulation.

[DOI]

,

Josiah P. Hanna

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation.

[DOI]

,

Josiah P. Hanna

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Stable Offline Value Function Learning with Bisimulation-based Representations.

[DOI]

Brahma S. Pavse

,

,

,

Josiah P. Hanna

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

Conservative Evaluation of Offline Policy Learning.

[DOI]

Hager Radi Abdelwahed

,

Josiah P. Hanna

,

Matthew E. Taylor

Trans. Mach. Learn. Res., 2024

Data-Efficient Policy Evaluation Through Behavior Policy Search.

[DOI]

Josiah P. Hanna

,

,

Philip S. Thomas

,

,

,

J. Mach. Learn. Res., 2024

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning.

[DOI]

Subhojyoti Mukherjee

,

Josiah P. Hanna

,

,

Robert D. Nowak

CoRR, 2024

Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments.

[DOI]

,

,

Robert D. Nowak

,

CoRR, 2024

Toward the confident deployment of real-world reinforcement learning agents.

[DOI]

Josiah P. Hanna

AI Mag., 2024

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning.

[DOI]

Nicholas E. Corrado

,

,

,

,

Josiah P. Hanna

RLJ, 2024

Adaptive Exploration for Data-Efficient General Value Function Evaluations.

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces.

[DOI]

Brahma S. Pavse

,

,

,

,

Josiah P. Hanna

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP.

[DOI]

Subhojyoti Mukherjee

,

Josiah P. Hanna

,

Robert D. Nowak

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Understanding when Dynamics-Invariant Data Augmentations Benefit Model-free Reinforcement Learning Updates.

[DOI]

Nicholas Corrado

,

Josiah P. Hanna

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Reinforcement Learning via Auxiliary Task Distillation.

[DOI]

Abhinav Narayan Harish

,

,

Josiah P. Hanna

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits.

[DOI]

Subhojyoti Mukherjee

,

,

Josiah P. Hanna

,

Robert D. Nowak

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Scaling Offline Evaluation of Reinforcement Learning Agents through Abstraction.

[DOI]

Josiah P. Hanna

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Tackling Unbounded State Spaces in Continuing Task Reinforcement Learning.

[DOI]

Brahma S. Pavse

,

,

,

Josiah P. Hanna

CoRR, 2023

State-Action Similarity-Based Representations for Off-Policy Evaluation.

[DOI]

Brahma S. Pavse

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multi-task Representation Learning for Pure Exploration in Bilinear Bandits.

[DOI]

Subhojyoti Mukherjee

,

,

,

Robert D. Nowak

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Conditional Mutual Information for Disentangled Representations in Reinforcement Learning.

[DOI]

,

,

Kevin Sebastian Luck

,

,

Stefano V. Albrecht

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning.

[DOI]

,

,

Kevin Sebastian Luck

,

Josiah P. Hanna

,

Stefano V. Albrecht

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction.

[DOI]

Brahma S. Pavse

,

Josiah P. Hanna

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Safe Evaluation For Offline Learning: Are We Ready To Deploy?

[DOI]

,

Josiah P. Hanna

,

,

Matthew E. Taylor

CoRR, 2022

Multi-agent Databases via Independent Learning.

[DOI]

,

Olga Papaemmanouil

,

CoRR, 2022

ReVar: Strengthening policy evaluation via reduced variance sampling.

[DOI]

Subhojyoti Mukherjee

,

Josiah P. Hanna

,

Robert D. Nowak

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning.

[DOI]

,

,

,

Stefano V. Albrecht

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Simulation-Acquired Latent Action Spaces for Dynamics Generalization.

[DOI]

Nicholas Corrado

,

,

Josiah P. Hanna

Proceedings of the Conference on Lifelong Learning Agents, 2022

Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration.

[DOI]

,

Filippos Christianos

,

Josiah P. Hanna

,

Stefano V. Albrecht

Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021

Importance sampling in reinforcement learning with an estimated behavior policy.

[DOI]

Josiah P. Hanna

,

,

Mach. Learn., 2021

Grounded action transformation for sim-to-real reinforcement learning.

[DOI]

Josiah P. Hanna

,

Siddharth Desai

,

,

Garrett Warnell

,

Mach. Learn., 2021

Robust On-Policy Data Collection for Data-Efficient Policy Evaluation.

[DOI]

,

Josiah P. Hanna

,

,

Stefano V. Albrecht

CoRR, 2021

Decoupling Exploration and Exploitation in Reinforcement Learning.

[DOI]

,

Filippos Christianos

,

,

Stefano V. Albrecht

CoRR, 2021

Towards Quantum-Secure Authentication and Key Agreement via Abstract Multi-Agent Interaction.

[DOI]

,

Josiah P. Hanna

,

,

Stefano V. Albrecht

Proceedings of the Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. The PAAMS Collection, 2021

Interpretable Goal Recognition in the Presence of Occluded Factors for Autonomous Vehicles.

[DOI]

Josiah P. Hanna

,

,

,

Francisco Eiras

,

,

,

Subramanian Ramamoorthy

,

Stefano V. Albrecht

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret.

[DOI]

Sheelabhadra Dey

,

Sumedh Pendurkar

,

,

Josiah P. Hanna

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

2020

RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration.

[DOI]

Brahma S. Pavse

,

,

,

Garrett Warnell

,

IEEE Robotics Autom. Lett., 2020

An Imitation from Observation Approach to Sim-to-Real Transfer.

[DOI]

Siddharth Desai

,

,

,

Garrett Warnell

,

,

CoRR, 2020

Quantum-Secure Authentication via Abstract Multi-Agent Interaction.

[DOI]

,

Josiah P. Hanna

,

Stefano V. Albrecht

CoRR, 2020

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch.

[DOI]

Siddharth Desai

,

,

,

Garrett Warnell

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reinforced Grounded Action Transformation for Sim-to-Real Transfer.

[DOI]

,

Siddharth Desai

,

Josiah P. Hanna

,

Garrett Warnell

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Stochastic Grounded Action Transformation for Robot Learning in Simulation.

[DOI]

Siddharth Desai

,

,

Josiah P. Hanna

,

Garrett Warnell

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Reducing Sampling Error in Batch Temporal Difference Learning.

[DOI]

Brahma S. Pavse

,

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

Learning an Interpretable Traffic Signal Control Policy.

[DOI]

,

Josiah P. Hanna

,

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019

Importance Sampling Policy Evaluation with an Estimated Behavior Policy.

[DOI]

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Reducing Sampling Error in Policy Gradient Learning.

[DOI]

Josiah P. Hanna

,

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Selecting Compliant Agents for Opt-in Micro-Tolling.

[DOI]

Josiah P. Hanna

,

,

Stephen D. Boyles

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Towards a Data Efficient Off-Policy Policy Gradient.

[DOI]

Josiah P. Hanna

,

Proceedings of the 2018 AAAI Spring Symposia, 2018

DyETC: Dynamic Electronic Toll Collection for Traffic Congestion Alleviation.

[DOI]

,

,

,

Josiah P. Hanna

,

,

,

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Fast and Precise Black and White Ball Detection for RoboCup Soccer.

[DOI]

,

,

,

,

,

Sanmit Narvekar

,

,

Proceedings of the RoboCup 2017: Robot World Cup XXI [Nagoya, Japan, July 27-31, 2017]., 2017

Data-Efficient Policy Evaluation Through Behavior Policy Search.

[DOI]

Josiah P. Hanna

,

Philip S. Thomas

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

Bridging the Gap Between Simulation and Reality.

[DOI]

Josiah P. Hanna

Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation.

[DOI]

Josiah P. Hanna

,

,

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Grounded Action Transformation for Robot Learning in Simulation.

[DOI]

Josiah P. Hanna

,

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

UT Austin Villa: Project-Driven Research in AI and Robotics.

[DOI]

,

Patrick MacAlpine

,

,

,

,

Sanmit Narvekar

,

,

IEEE Intell. Syst., 2016

High Confidence Off-Policy Evaluation with Models.

[DOI]

Josiah P. Hanna

,

,

CoRR, 2016

Delta-Tolling: Adaptive Tolling for Optimizing Traffic Throughput.

[DOI]

,

,

,

,

,

Stephen D. Boyles

Proceedings of the Ninth International Workshop on Agents in Traffic and Transportation (ATT 2016) co-located with the 25th International Joint Conference On Artificial Intelligence (IJCAI 2016), 2016

2015

UT Austin Villa: RoboCup 2015 3D Simulation League Competition and Technical Challenges Champions.

[DOI]

Patrick MacAlpine

,

,

,

Proceedings of the RoboCup 2015: Robot World Cup XIX [papers from the 19th Annual RoboCup International Symposium, 2015

2013

Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes.

[DOI]

,

,

,

Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

Loading...