Timon Willi

Orcid: 0000-0003-4405-5700

According to our database1, Timon Willi authored at least 19 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Rethinking Rubric Generation for Improving LLM Judge and Reward Modeling for Open-ended Tasks.
CoRR, February, 2026

Balanced Accuracy: The Right Metric for Evaluating LLM Judges - Explained through Youden's J statistic.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Training AI Co-Scientists Using Rubric Rewards.
CoRR, December, 2025

The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind.
CoRR, June, 2025

2024
The Danger Of Arrogance: Welfare Equilibra As A Solution To Stackelberg Self-Play In Non-Coincidental Games.
CoRR, 2024

Mixture of Experts in a Mixture of RL settings.
RLJ, 2024

JaxMARL: Multi-Agent RL Environments and Algorithms in JAX.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Mixtures of Experts Unlock Parameter Scaling for Deep RL.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Mixtures of Experts for Scaling up Neural Networks in Order Execution.
Proceedings of the 5th ACM International Conference on AI in Finance, 2024


Scaling Opponent Shaping to High Dimensional Games.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Analysing the Sample Complexity of Opponent Shaping.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023
Leading the Pack: N-player Opponent Shaping.
CoRR, 2023

JaxMARL: Multi-Agent RL Environments in JAX.
CoRR, 2023

Adversarial Cheap Talk.
Proceedings of the International Conference on Machine Learning, 2023

2022
COLA: Consistent Learning with Opponent-Learning Awareness.
Proceedings of the International Conference on Machine Learning, 2022

Model-Free Opponent Shaping.
Proceedings of the International Conference on Machine Learning, 2022

2019
Recurrent Neural Processes.
CoRR, 2019


  Loading...