Zhiwei He
Orcid: 0000-0002-4807-0062Affiliations:
- Shanghai Jiao Tong University, China
According to our database1,
Zhiwei He
authored at least 32 papers
between 2022 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Igniting Language Intelligence: The Hitchhiker's Guide from Chain-of-Thought Reasoning to Language Agents.
ACM Comput. Surv., August, 2025
CoRR, July, 2025
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning.
CoRR, May, 2025
Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training.
CoRR, May, 2025
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards.
CoRR, May, 2025
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning.
CoRR, April, 2025
Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique.
CoRR, March, 2025
The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models.
CoRR, March, 2025
CoRR, January, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
Trans. Assoc. Comput. Linguistics, 2024
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding.
CoRR, 2024
CoRR, 2024
Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality.
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Eng. Appl. Artif. Intell., 2023
CoRR, 2023
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
2022
Tencent AI Lab - Shanghai Jiao Tong University Low-Resource Translation System for the WMT22 Translation Task.
Proceedings of the Seventh Conference on Machine Translation, 2022
Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022