Tong Yang

Affiliations:

Carnegie Mellon University (CMU), Pittsburgh, PA, USA

According to our database¹, Tong Yang authored at least 10 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Achieving Logarithmic Regret in KL-Regularized Zero-Sum Markov Games.

[BibT_eX]

[DOI]

CoRR, October, 2025

Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent.

[BibT_eX]

[DOI]

CoRR, August, 2025

Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL.

[BibT_eX]

[DOI]

CoRR, June, 2025

Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Faster WIND: Accelerating Iterative Best-of-N Distillation for LLM Alignment.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024

Faster WIND: Accelerating Iterative Best-of-<i>N</i> Distillation for LLM Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

In-Context Learning with Representations: Contextual Generalization of Trained Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

Federated Natural Policy Gradient Methods for Multi-task Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Tong Yang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...