Han Zhang

Orcid: 0000-0001-5660-6237

Affiliations:
  • Peng Cheng Laboratory, Shenzhen, China
  • Harbin Institute of Technology - Shenzhen (HIT-SZ), School of Computer Science and Technology, Shenzhen, China


According to our database1, Han Zhang authored at least 10 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Group Expectation Policy Optimization for Stable Heterogeneous Reinforcement Learning in LLMs.
CoRR, August, 2025

COPR: Continual Human Preference Learning via Optimal Policy Regularization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Correcting Large Language Model Behavior via Influence Function.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

BeyondGender: A Multifaceted Bilingual Dataset for Practical Sexism Detection.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles.
CoRR, 2024

CPPO: Continual Learning for Reinforcement Learning with Human Feedback.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
COPF: Continual Learning Human Preference through Optimal Policy Fitting.
CoRR, 2023

2022
Prompt-Based Prototypical Framework for Continual Relation Extraction.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

CLLE: A Benchmark for Continual Language Learning Evaluation in Multilingual Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation.
CoRR, 2021


  Loading...