Han Zhang

Orcid: 0000-0001-5660-6237

Affiliations:

Peng Cheng Laboratory, Shenzhen, China
Harbin Institute of Technology - Shenzhen (HIT-SZ), School of Computer Science and Technology, Shenzhen, China

According to our database, Han Zhang authored at least 10 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Group Expectation Policy Optimization for Stable Heterogeneous Reinforcement Learning in LLMs.

CoRR, August, 2025

COPR: Continual Human Preference Learning via Optimal Policy Regularization.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Correcting Large Language Model Behavior via Influence Function.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

BeyondGender: A Multifaceted Bilingual Dataset for Practical Sexism Detection.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles.

CoRR, 2024

CPPO: Continual Learning for Reinforcement Learning with Human Feedback.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

COPF: Continual Learning Human Preference through Optimal Policy Fitting.

CoRR, 2023

2022

Prompt-Based Prototypical Framework for Continual Relation Extraction.

IEEE ACM Trans. Audio Speech Lang. Process., 2022

CLLE: A Benchmark for Continual Language Learning Evaluation in Multilingual Machine Translation.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation.

CoRR, 2021
