Pedro P. Santos

Orcid: 0000-0002-4587-9528

According to our database¹, Pedro P. Santos authored at least 11 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Entropic Risk-Aware Monte Carlo Tree Search.

[BibT_eX]

[DOI]

CoRR, January, 2026

Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning via Predictive Observation Imputation (Abstract Reprint).

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning.

[BibT_eX]

[DOI]

Pedro P. Santos

Alberto Sardinha

Francisco S. Melo

CoRR, May, 2025

A Computational Model of Inclusive Pedagogy: From Understanding to Application.

[BibT_eX]

[DOI]

CoRR, May, 2025

Centralized training with hybrid execution in multi-agent reinforcement learning via predictive observation imputation.

[BibT_eX]

[DOI]

Artif. Intell., 2025

The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes.

[BibT_eX]

[DOI]

Pedro P. Santos

Alberto Sardinha

Francisco S. Melo

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

The impact of data distribution on Q-learning with function approximation.

[BibT_eX]

[DOI]

Mach. Learn., September, 2024

Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Generalizing Objective-Specification in Markov Decision Processes.

[BibT_eX]

[DOI]

Pedro P. Santos

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2021

Understanding the Impact of Data Distribution on Q-learning with Function Approximation.

[BibT_eX]

[DOI]

CoRR, 2021

A Methodology for the Development of RL-Based Adaptive Traffic Signal Controllers.

[BibT_eX]

[DOI]

CoRR, 2021

Pedro P. Santos

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...