Han Zhang
Orcid: 0000-0001-5660-6237Affiliations:
- Peng Cheng Laboratory, Shenzhen, China
- Harbin Institute of Technology - Shenzhen (HIT-SZ), School of Computer Science and Technology, Shenzhen, China
According to our database1,
Han Zhang
authored at least 10 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on github.com
On csauthors.net:
Bibliography
2025
Group Expectation Policy Optimization for Stable Heterogeneous Reinforcement Learning in LLMs.
CoRR, August, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles.
CoRR, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
CLLE: A Benchmark for Continual Language Learning Evaluation in Multilingual Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
2021
PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation.
CoRR, 2021