Hugh Zhang

According to our database1, Hugh Zhang authored at least 8 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models.
CoRR, 2024

Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization.
CoRR, 2024

2023
Chain-of-Thought Reasoning is a Policy Improvement Operator.
CoRR, 2023

No-regret Learning Dynamics for Sequential Correlated Equilibria.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
A Simple Adaptive Procedure Converging to Forgiving Correlated Equilibria.
CoRR, 2022

Equilibrium Finding in Normal-Form Games via Greedy Regret Minimization.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2020
Trading Off Diversity and Quality in Natural Language Generation.
CoRR, 2020

2019
Unifying Human and Statistical Evaluation for Natural Language Generation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019


  Loading...