Michal Nauman

According to our database, Michal Nauman authored at least 19 papers between 2020 and 2026.

Bibliography

2026
When Does Non-Uniform Replay Matter in Reinforcement Learning?
CoRR, May, 2026

Reward-Conditioned Reinforcement Learning.
CoRR, March, 2026

What Does Flow Matching Bring To TD Learning?
CoRR, March, 2026

2025
floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL.
CoRR, September, 2025

Relative Entropy Pathwise Policy Optimization.
CoRR, July, 2025

Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners.
CoRR, May, 2025

FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control.
CoRR, May, 2025

Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models.
CoRR, May, 2025

Compute-Optimal Scaling for Value-Based Deep RL.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

A Case for Validation Buffer in Pessimistic Actor-Critic.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Value-Based Deep RL Scales Predictably.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Decoupled Policy Actor-Critic: Bridging Pessimism and Risk Awareness in Reinforcement Learning.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models.
CoRR, 2024

Bigger, Regularized, Optimistic: scaling for compute and sample efficient continuous control.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Decoupled Actor-Critic.
CoRR, 2023

On Many-Actions Policy Gradient.
Proceedings of the International Conference on Machine Learning, 2023

2022
On All-Action Policy Gradients.
CoRR, 2022

2020
Low-Variance Policy Gradient Estimation with World Models.
CoRR, 2020

