Tianhao Wu

Affiliations:
  • University of California, Berkeley, CA, USA (PhD 2021)
  • Peking University, School of Mathematical Sciences, Beijing, China


According to our database, Tianhao Wu authored at least 8 papers between 2020 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment.
CoRR, 2023

A Reduction-based Framework for Sequential Decision Making with Delayed Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Statistical Inference on Multi-armed Bandits with Delayed Feedback.
Proceedings of the International Conference on Machine Learning, 2023

2022
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee.
Proceedings of the International Conference on Machine Learning, 2022

A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
A Unified Framework for Conservative Exploration.
CoRR, 2021

On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

