Kaito Ariu

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Global Behavior of Learning Dynamics in Zero-Sum Games with Memory Asymmetry.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model.

[BibT_eX]

[DOI]

Se-Young Yun

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Theoretical Guarantees for Minimum Bayes Risk Decoding.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Optimal clustering from noisy binary feedback.

[BibT_eX]

[DOI]

Mach. Learn., May, 2024

Rate-Optimal Bayesian Simple Regret in Best Arm Identification.

[BibT_eX]

[DOI]

Math. Oper. Res., 2024

Time-Varyingness in Auction Breaks Revenue Equivalence.

[BibT_eX]

[DOI]

CoRR, 2024

Last Iterate Convergence in Monotone Mean Field Games.

[BibT_eX]

[DOI]

Noboru Isobe

CoRR, 2024

Synchronization behind Learning in Periodic Zero-Sum Games Triggers Divergence from Nash equilibrium.

[BibT_eX]

[DOI]

CoRR, 2024

Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

Nash Equilibrium and Learning Dynamics in Three-Player Matching m-Action Games.

[BibT_eX]

[DOI]

CoRR, 2024

On Universally Optimal Algorithms for A/B Testing.

[BibT_eX]

[DOI]

Po-An Wang

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Matroid Semi-Bandits in Sublinear Time.

[BibT_eX]

[DOI]

Ruo-Chun Tzeng

Naoto Ohsaka

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Model-Based Minimum Bayes Risk Decoding for Text Generation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adaptively Perturbed Mirror Descent for Learning in Games.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Filtered Direct Preference Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding.

[BibT_eX]

[DOI]

Yuu Jinnai

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Memory Asymmetry Creates Heteroclinic Orbits to Nash Equilibrium in Learning in Zero-Sum Games.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Model-Based Minimum Bayes Risk Decoding.

[BibT_eX]

[DOI]

CoRR, 2023

On Uniformly Optimal Algorithms for Best Arm Identification in Two-Armed Bandits with Fixed Budget.

[BibT_eX]

[DOI]

Po-An Wang

CoRR, 2023

Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model.

[BibT_eX]

[DOI]

Se-Young Yun

CoRR, 2023

A Slingshot Approach to Learning in Monotone Games.

[BibT_eX]

[DOI]

CoRR, 2023

Memory Asymmetry: A Key to Convergence in Zero-Sum Games.

[BibT_eX]

[DOI]

CoRR, 2023

Exploration of Unranked Items in Safe Online Learning to Re-Rank.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium.

[BibT_eX]

[DOI]