Saiyong Yang

According to our database, Saiyong Yang authored at least 15 papers between 2011 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Tool Learning Needs Nothing More Than a Free 8B Language Model.
CoRR, April, 2026

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation.
CoRR, February, 2026

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models.
CoRR, February, 2026

Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models.
CoRR, February, 2026

ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning.
CoRR, January, 2026

2025
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control.
CoRR, November, 2025

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation.
CoRR, November, 2025

Think Outside the Policy: In-Context Steered Policy Optimization.
CoRR, October, 2025

Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error.
CoRR, October, 2025

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding.
CoRR, October, 2025

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners.
CoRR, September, 2025

RAG-Targeted SFT Improves RAG-Enhanced Math Reasoning.
Proceedings of the Natural Language Processing and Chinese Computing, 2025

Eliminating Retrieval Knowledge Conflicts: Cross-Validation Re-ranking with Large Language Models.
Proceedings of the International Joint Conference on Neural Networks, 2025

2012
A low-cost hand gesture human-computer interaction system.
Proceedings of the IEEE International Conference on Consumer Electronics, 2012

2011
Real-time skin color detection under rapidly changing illumination conditions.
IEEE Trans. Consumer Electron., 2011
