Paavo Parmas

According to our database1, Paavo Parmas authored at least 7 papers between 2018 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Model-based Reinforcement Learning with Scalable Composite Policy Gradient Estimators.
Proceedings of the International Conference on Machine Learning, 2023

2022
Proppo: a Message Passing Framework for Customizable and Composable Learning Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
A unified view of likelihood ratio and reparameterization gradients.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme.
CoRR, 2019

2018
Total stochastic gradient algorithms and applications in reinforcement learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos.
Proceedings of the 35th International Conference on Machine Learning, 2018


  Loading...