Pedro P. Santos

Orcid: 0000-0002-4587-9528

According to our database1, Pedro P. Santos authored at least 11 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Entropic Risk-Aware Monte Carlo Tree Search.
CoRR, January, 2026

Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning via Predictive Observation Imputation (Abstract Reprint).
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning.
CoRR, May, 2025

A Computational Model of Inclusive Pedagogy: From Understanding to Application.
CoRR, May, 2025

Centralized training with hybrid execution in multi-agent reinforcement learning via predictive observation imputation.
Artif. Intell., 2025

The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
The impact of data distribution on Q-learning with function approximation.
Mach. Learn., September, 2024

Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Generalizing Objective-Specification in Markov Decision Processes.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2021
Understanding the Impact of Data Distribution on Q-learning with Function Approximation.
CoRR, 2021

A Methodology for the Development of RL-Based Adaptive Traffic Signal Controllers.
CoRR, 2021


  Loading...