Hongze Tan

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2025
GTPO and GRPO-S: Token and Sequence-Level Reward Shaping with Policy Entropy.
CoRR, August, 2025

Improving DAPO from a Mixed-Policy Perspective.
CoRR, July, 2025


  Loading...