Gerald Shen

According to our database1, Gerald Shen authored at least 11 papers between 2024 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, August, 2025

Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training.
CoRR, July, 2025

Llama-Nemotron: Efficient Reasoning Models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, May, 2025

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.
CoRR, April, 2025

Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment.
CoRR, February, 2025

Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

HelpSteer2-Preference: Complementing Ratings with Preferences.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Nemotron-4 340B Technical Report.
CoRR, 2024

HelpSteer2: Open-source dataset for training top-performing reward models.
CoRR, 2024

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment.
CoRR, 2024

HelpSteer 2: Open-source dataset for training top-performing reward models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024


  Loading...