Wenqi Zhang

ORCID: 0000-0002-8312-0184

Affiliations:

Zhejiang University (ZJU), College of Computer Science and Technology, Hangzhou, China

According to our database, Wenqi Zhang authored at least 40 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number of five.
Erdős number of four.

Bibliography

2025

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering.

CoRR, September, 2025

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts.

CoRR, September, 2025

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks.

CoRR, August, 2025

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models.

CoRR, August, 2025

GUI-G²: Gaussian Reward Modeling for GUI Grounding.

CoRR, July, 2025

SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation.

CoRR, June, 2025

TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence.

CoRR, May, 2025

Let LLMs Break Free from Overthinking via Self-Braking Tuning.

CoRR, May, 2025

Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency.

CoRR, April, 2025

A Survey on (M)LLM-Based GUI Agents.

CoRR, April, 2025

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks.

CoRR, March, 2025

Think Twice, Click Once: Enhancing GUI Grounding via Fast and Slow Systems.

CoRR, March, 2025

DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL.

CoRR, March, 2025

AskToAct: Enhancing LLMs Tool Use via Self-Correcting Clarification.

CoRR, March, 2025

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding.

CoRR, January, 2025

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining.

CoRR, January, 2025

ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Scaling LLMs' Social Reasoning: Sprinkle Cognitive "Aha Moment" into Fundamental Long-thought Logical Capabilities.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

STaR-SQL: Self-Taught Reasoner for Text-to-SQL.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Specialized Mathematical Solving by a Step-By-Step Expression Chain Generation.

IEEE/ACM Trans. Audio Speech Lang. Process., 2024

GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation.

CoRR, 2024

Entering Real Social World! Benchmarking the Theory of Mind and Socialization Capabilities of LLMs from a First-person Perspective.

CoRR, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs.

CoRR, 2024

TaskBench: Benchmarking Large Language Models for Task Automation.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Advancing Process Verification for Large Language Models via Tree-Based Preference Learning.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Learning Global Controller in Latent Space for Parameter-Efficient Fine-Tuning.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow.

CoRR, 2023

An Expression Tree Decoding Strategy for Mathematical Equation Generation.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enhancing Emotion Recognition in Conversation via Multi-view Feature Alignment and Memorization.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

PromptNER: Prompt Locating and Typing for Named Entity Recognition.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

A Closed-Loop Perception, Decision-Making and Reasoning Mechanism for Human-Like Navigation.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Query-based Instance Discrimination Network for Relational Triple Extraction.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert Approach.

CoRR, 2021

Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert Approach.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Deep Reinforcement Learning for Multi-contact Motion Planning of Hexapod Robots.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
