Esther Derman

According to our database, Esther Derman authored at least 15 papers between 2018 and 2026.

Bibliography

2026
Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity.
CoRR, February, 2026

2025
Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism.
CoRR, December, 2025

Robust Reinforcement Learning for Discrete Compositional Generation via General Soft Operators.
CoRR, June, 2025

State Entropy Regularization for Robust Reinforcement Learning.
CoRR, June, 2025

Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024
Tree Search-Based Policy Optimization under Stochastic Execution Delay.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization.
CoRR, 2023

Policy Gradient for s-Rectangular Robust Markov Decision Processes.
CoRR, 2023

Policy Gradient for Rectangular Robust Markov Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2021
Twice regularized MDPs and the equivalence between robustness and regularization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Acting in Delayed Environments with Non-Stationary Markov Policies.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Distributional Robustness and Regularization in Reinforcement Learning.
CoRR, 2020

2019
A Bayesian Approach to Robust Reinforcement Learning.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

2018
Soft-Robust Actor-Critic Policy-Gradient.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

