Saiyong Yang
According to our database, Saiyong Yang authored at least 15 papers between 2011 and 2026.
Bibliography
2026
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation.
CoRR, February, 2026
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models.
CoRR, February, 2026
Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models.
CoRR, February, 2026
CoRR, January, 2026
2025
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control.
CoRR, November, 2025
DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation.
CoRR, November, 2025
CoRR, October, 2025
CoRR, October, 2025
Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners.
CoRR, September, 2025
Proceedings of the Natural Language Processing and Chinese Computing, 2025
Eliminating Retrieval Knowledge Conflicts: Cross-Validation Re-ranking with Large Language Models.
Proceedings of the International Joint Conference on Neural Networks, 2025
2012
Proceedings of the IEEE International Conference on Consumer Electronics, 2012
2011
IEEE Trans. Consumer Electron., 2011