Hanyang Zhao

According to our database1, Hanyang Zhao authored at least 16 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability.
CoRR, June, 2025

R3: Robust Rubric-Agnostic Reward Models.
CoRR, May, 2025

Fine-Tuning Diffusion Generative Models via Rich Preference Optimization.
CoRR, March, 2025

Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning.
CoRR, February, 2025

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey.
J. Artif. Intell. Res., 2025


RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MallowsPO: Fine-Tune Your LLM with Preference Dispersions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines.
CoRR, 2024

RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization.
CoRR, 2024

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey.
CoRR, 2024

Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning.
CoRR, 2024

Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions.
CoRR, 2024

Score-based Diffusion Models via Stochastic Differential Equations - a Technical Tutorial.
CoRR, 2024

Contractive Diffusion Probabilistic Models.
CoRR, 2024

2023
Policy Optimization for Continuous Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


  Loading...