Bo Wang
Orcid: 0000-0003-0526-0533Affiliations:
- Fudan University, Shanghai, China
According to our database1,
Bo Wang authored at least 18 papers
between 2024 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.
CoRR, March, 2026
CoRR, January, 2026
CoRR, January, 2026
CoRR, January, 2026
Time-Frequency Token Advantage Clipping for Training Efficient Large Reasoning Model.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, July, 2025
ACM Trans. Manag. Inf. Syst., 2025
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
2024
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective.
CoRR, 2024
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling.
CoRR, 2024
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments.
CoRR, 2024
Aligning Large Language Models from Self-Reference AI Feedback with one General Principle.
CoRR, 2024
CoRR, 2024
Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024