Simon du Toit

According to our database1, Simon du Toit authored at least 6 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Self-Supervised On-Policy Reinforcement Learning via Contrastive Proximal Policy Optimisation.
CoRR, May, 2026

2025
Oryx: a Performant and Scalable Algorithm for Many-Agent Coordination in Offline MARL.
CoRR, May, 2025

Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies.
CoRR, May, 2025

Oryx: a Scalable Sequence Model for Many-Agent Coordination in Offline MARL.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Sable: a Performant, Efficient and Scalable Sequence Model for MARL.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Performant, Memory Efficient and Scalable Multi-Agent Reinforcement Learning.
CoRR, 2024


  Loading...