Esther Derman

According to our database¹, Esther Derman authored at least 15 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity.

CoRR, February, 2026

2025

Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism.

CoRR, December, 2025

Robust Reinforcement Learning for Discrete Compositional Generation via General Soft Operators.

CoRR, June, 2025

State Entropy Regularization for Robust Reinforcement Learning.

CoRR, June, 2025

Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024

Tree Search-Based Policy Optimization under Stochastic Execution Delay.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization.

Uri Gadot

Esther Derman

Navdeep Kumar

Maxence Mohamed Elfatihi

Kfir Levy

Shie Mannor

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization.

CoRR, 2023

Policy Gradient for s-Rectangular Robust Markov Decision Processes.

CoRR, 2023

Policy Gradient for Rectangular Robust Markov Decision Processes.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2021

Twice regularized MDPs and the equivalence between robustness and regularization.

Esther Derman

Matthieu Geist

Shie Mannor

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Acting in Delayed Environments with Non-Stationary Markov Policies.

Esther Derman

Gal Dalal

Shie Mannor

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Distributional Robustness and Regularization in Reinforcement Learning.

Esther Derman

Shie Mannor

CoRR, 2020

2019

A Bayesian Approach to Robust Reinforcement Learning.

Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

2018

Soft-Robust Actor-Critic Policy-Gradient.

Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018
