We stand with Ukraine

We stand with Ukraine

Zhiwei He

Orcid: 0000-0002-4807-0062

Affiliations:

Shanghai Jiao Tong University, China

According to our database¹, Zhiwei He authored at least 34 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2025

DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, October, 2025

Igniting Language Intelligence: The Hitchhiker's Guide from Chain-of-Thought Reasoning to Language Agents.

[BibT_eX]

[DOI]

Zhuosheng Zhang

,

,

,

,

,

,

,

,

,

,

ACM Comput. Surv., August, 2025

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, July, 2025

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Zhuosheng Zhang

,

,

,

,

CoRR, May, 2025

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, May, 2025

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Zhuosheng Zhang

,

,

,

,

CoRR, April, 2025

Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Zhuosheng Zhang

,

,

,

CoRR, March, 2025

The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Zhuosheng Zhang

,

,

,

,

CoRR, January, 2025

Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Zhuosheng Zhang

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RaSA: Rank-Sharing Low-Rank Adaptation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Zhuosheng Zhang

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Exploring Human-Like Translation Strategy with Large Language Models.

[BibT_eX]

[DOI]

,

,

,

Zhuosheng Zhang

,

,

,

,

,

Trans. Assoc. Comput. Linguistics, 2024

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Zhuosheng Zhang

,

,

,

,

CoRR, 2024

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Evaluating Knowledge-based Cross-lingual Inconsistency in Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

MarkLLM: An Open-Source Toolkit for LLM Watermarking.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model.

[BibT_eX]

[DOI]

,

,

,

Zhuosheng Zhang

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Improving Open-Ended Text Generation via Adaptive Decoding.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Zhuosheng Zhang

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method.

[BibT_eX]

[DOI]

,

,

,

,

Zhuosheng Zhang

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Unsupervised Sign Language Translation and Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Zhuosheng Zhang

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Discrete limited attentional collaborative filtering for fast social recommendation.

[BibT_eX]

[DOI]

,

,

,

,

,

Eng. Appl. Artif. Intell., 2023

CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2023

ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

TeCS: A Dataset and Benchmark for Tense Consistency of Machine Translation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022

Tencent AI Lab - Shanghai Jiao Tong University Low-Resource Translation System for the WMT22 Translation Task.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Seventh Conference on Machine Translation, 2022

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Loading...