Hui Yuan

Orcid: 0009-0008-0466-6332

Affiliations:
  • Princeton University, Department of Electrical and Computer Engineering, NJ, USA
  • University of Science and Technology of China, Hefei, China (former)


According to our database1, Hui Yuan authored at least 14 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations.
CoRR, February, 2025

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Adversarial Attacks on Online Learning to Rank with Stochastic Click Models.
Trans. Mach. Learn. Res., 2024

Diffusion Model for Data-Driven Black-Box Optimization.
CoRR, 2024

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences.
CoRR, 2024

Gradient Guidance for Diffusion Models: An Optimization Perspective.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Conversational Dueling Bandits in Generalized Linear Models.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

MaxMin-RLHF: Alignment with Diverse Human Preferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2020
Learning Entangled Single-Sample Gaussians in the Subset-of-Signals Model.
Proceedings of the Conference on Learning Theory, 2020

Learning Entangled Single-Sample Distributions via Iterative Trimming.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020


  Loading...