Zhanhui Zhou

According to our database, Zhanhui Zhou authored at least 18 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
RePO: Replay-Enhanced Policy Optimization.
CoRR, June, 2025

SafeCoT: Improving VLM Safety with Minimal Reasoning.
CoRR, June, 2025

Mitigating Object Hallucination via Robust Local Perception Search.
CoRR, June, 2025

VLMs Can Aggregate Scattered Training Patches.
CoRR, June, 2025

Emergent Response Planning in LLM.
CoRR, February, 2025

2024
MaskMA: Towards Zero-Shot Multi-Agent Decision Making with Mask-Based Collaborative Learning.
Trans. Mach. Learn. Res., 2024

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level.
CoRR, 2024

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models.
CoRR, 2024

Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey.
CoRR, 2024

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Inference-Time Language Model Alignment via Integrated Value Guidance.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models.
CoRR, 2023

2022
INTENT: Interactive Tensor Transformation Synthesis.
Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022

