Xiaoliang Fu
Orcid: 0000-0003-2550-3600
According to our database1,
Xiaoliang Fu authored at least 8 papers
between 2018 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Placing Puzzle Pieces Where They Matter: A Question Augmentation Framework for Reinforcement Learning.
CoRR, April, 2026
SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting.
CoRR, April, 2026
From logπ to π: Taming Divergence in Soft Clipping via Bilateral Decoupled Decay of Probability Gradient Weight.
CoRR, March, 2026
Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training.
CoRR, February, 2026
How to Allocate, How to Learn? Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization.
CoRR, February, 2026
MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning.
CoRR, February, 2026
2024
Proceedings of the HCI in Business, Government and Organizations, 2024
2018
Proceedings of the IECON 2018, 2018