Wang Zhang

Orcid: 0009-0001-2436-8761

Affiliations:
  • ByteDance Inc., San Jose, USA


According to our database, Wang Zhang authored at least 7 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
TensorHub: Scalable and Elastic Weight Transfer for LLM RL Training.
CoRR, April, 2026

Laminar: A Scalable Asynchronous RL Post-Training Framework.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks.
CoRR, April, 2025

DAPO: An Open-Source LLM Reinforcement Learning System at Scale.
CoRR, March, 2025


DAPO: An Open-Source LLM Reinforcement Learning System at Scale.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

HybridFlow: A Flexible and Efficient RLHF Framework.
Proceedings of the Twentieth European Conference on Computer Systems, 2025

