Tong Yang

Affiliations:
  • Carnegie Mellon University (CMU), Pittsburgh, PA, USA


According to our database1, Tong Yang authored at least 9 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent.
CoRR, August, 2025

Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL.
CoRR, June, 2025

Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games.
CoRR, February, 2025

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Faster WIND: Accelerating Iterative Best-of-N Distillation for LLM Alignment.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024
Faster WIND: Accelerating Iterative Best-of-<i>N</i> Distillation for LLM Alignment.
CoRR, 2024

Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

In-Context Learning with Representations: Contextual Generalization of Trained Transformers.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Federated Natural Policy Gradient Methods for Multi-task Reinforcement Learning.
CoRR, 2023


  Loading...