Bradly C. Stadie

According to our database1, Bradly C. Stadie authored at least 29 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
TEMPO: Temporal Enforcement via Mode-Separated Policy Optimization for Trustworthy LLM Backtesting.
CoRR, May, 2026

All Leaks Count, Some Count More: Interpretable Temporal Contamination Detection in LLM Backtesting.
CoRR, February, 2026

Evolutionary System Prompt Learning for Reinforcement Learning in LLMs.
CoRR, February, 2026

2025
AIA Forecaster: Technical Report.
CoRR, November, 2025

LAMP: Extracting Locally Linear Decision Surfaces from LLM World Models.
CoRR, May, 2025

D2 Actor Critic: Diffusion Actor Meets Distributional Critic.
Trans. Mach. Learn. Res., 2025

Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMs.
Trans. Mach. Learn. Res., 2025

Thoughts and Lessons on Using Visual Foundation Models for Manipulation.
Trans. Mach. Learn. Res., 2025

Expert of Experts Verification and Alignment (EVAL) Framework for Large Language Models Safety in Gastroenterology.
npj Digit. Medicine, 2025

Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Solving Robotics Problems in Zero-Shot with Vision-Language Models.
CoRR, 2024

2023
To the Noise and Back: Diffusion for Shared Autonomy.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States.
Proceedings of the Conference on Robot Learning, 2023

2022
Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization.
CoRR, 2022

Invariance Through Latent Alignment.
Proceedings of the Robotics: Science and Systems XVIII, New York City, NY, USA, June 27, 2022

2021
Invariance Through Inference.
CoRR, 2021

World Model as a Graph: Learning Latent Landmarks for Planning.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Learning Intrinsic Rewards as a Bi-Level Optimization Problem.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

One-Shot Pruning of Recurrent Neural Networks by Jacobian Spectrum Evaluation.
Proceedings of the 8th International Conference on Learning Representations, 2020

2018
Learning as a Sampling Problem.
PhD thesis, 2018

Transfer Learning for Estimating Causal Effects using Neural Networks.
CoRR, 2018

Some Considerations on Learning to Explore via Meta-Reinforcement Learning.
CoRR, 2018

Evolved Policy Gradients.
CoRR, 2018

The Importance of Sampling inMeta-Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Evolved Policy Gradients.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
One-Shot Imitation Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Third Person Imitation Learning.
Proceedings of the 5th International Conference on Learning Representations, 2017

2015
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models.
CoRR, 2015


  Loading...