Zhenhailong Wang

ORCID: 0000-0002-4704-5455

According to our database, Zhenhailong Wang authored at least 38 papers between 2021 and 2026.

Bibliography

2026
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs.
CoRR, May, 2026

Advancing Creative Physical Intelligence in Large Multimodal Models.
CoRR, May, 2026

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing.
CoRR, May, 2026

From Context to Skills: Can Language Models Learn from Context Skillfully?
CoRR, April, 2026

OSExpert: Computer-Use Agents Learning Professional Skills via Exploration.
CoRR, March, 2026

Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning.
CoRR, February, 2026

A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence.
Trans. Mach. Learn. Res., 2026

Compositional Reasoning via Joint Image and Language Decomposition.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning.
CoRR, October, 2025

Analyzing and Internalizing Complex Policy Documents for LLM Agents.
CoRR, October, 2025

Multimodal Policy Internalization for Conversational Agents.
CoRR, October, 2025

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence.
CoRR, July, 2025

Perception-Aware Policy Optimization for Multimodal Reasoning.
CoRR, July, 2025

DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs.
CoRR, April, 2025

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents.
CoRR, March, 2025

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks.
CoRR, January, 2025

Visually Descriptive Language Model for Vector Graphics Reasoning.
Trans. Mach. Learn. Res., 2025

Infogent: An Agent-Based Framework for Web Information Aggregation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SYNTHIA: Novel Concept Design with Affordance Composition.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Text-Based Reasoning About Vector Graphics.
CoRR, 2024

Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

MIRACLE: An Online, Explainable Multimodal Interactive Concept Learning System.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration.
CoRR, 2023

RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought.
CoRR, 2023

Paxion: Patching Action Knowledge in Video-Language Foundation Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Language-First Approach for Procedure Planning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Model-Agnostic Multitask Fine-tuning for Few-shot Vision-Language Transfer Learning.
CoRR, 2022

Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

NewsClaims: A New Benchmark for Claim Detection from News with Attribute Knowledge.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Open Vocabulary Electroencephalography-to-Text Decoding and Zero-Shot Sentiment Classification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
NewsClaims: A New Benchmark for Claim Detection from News with Background Knowledge.
CoRR, 2021

Future is not One-dimensional: Graph Modeling based Complex Event Schema Induction for Event Prediction.
CoRR, 2021

RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, 2021

The Future is not One-dimensional: Complex Event Schema Induction by Graph Modeling for Event Prediction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

