Gerald Shen

According to our database, Gerald Shen authored at least 7 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training.
CoRR, July, 2025

Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment.
CoRR, February, 2025

Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

HelpSteer2-Preference: Complementing Ratings with Preferences.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
HelpSteer2: Open-source dataset for training top-performing reward models.
CoRR, 2024

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment.
CoRR, 2024

HelpSteer 2: Open-source dataset for training top-performing reward models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

