Esther Derman

According to our database1, Esther Derman authored at least 13 papers between 2018 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Robust Reinforcement Learning for Discrete Compositional Generation via General Soft Operators.
CoRR, June, 2025

State Entropy Regularization for Robust Reinforcement Learning.
CoRR, June, 2025

Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024
Tree Search-Based Policy Optimization under Stochastic Execution Delay.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization.
CoRR, 2023

Policy Gradient for s-Rectangular Robust Markov Decision Processes.
CoRR, 2023

Policy Gradient for Rectangular Robust Markov Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2021
Twice regularized MDPs and the equivalence between robustness and regularization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Acting in Delayed Environments with Non-Stationary Markov Policies.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Distributional Robustness and Regularization in Reinforcement Learning.
CoRR, 2020

2019
A Bayesian Approach to Robust Reinforcement Learning.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

2018
Soft-Robust Actor-Critic Policy-Gradient.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018


  Loading...