Wenhao Huang

Affiliations:

01.AI, Beijing, China
Peking University, Key Laboratory of Machine Perception, Beijing, China (former)
Beijing University of Posts and Telecommunications, School of Software Engineering, Beijing, China (former)

According to our database¹, Wenhao Huang authored at least 111 papers between 2010 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2025

RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization.

[BibT_eX]

[DOI]

CoRR, November, 2025

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity.

[BibT_eX]

[DOI]

CoRR, November, 2025

Scaling Latent Reasoning via Looped Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures.

[BibT_eX]

[DOI]

CoRR, October, 2025

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation.

[BibT_eX]

[DOI]

CoRR, September, 2025

VideoScore2: Think before You Score in Generative Video Evaluation.

[BibT_eX]

[DOI]

CoRR, September, 2025

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning.

[BibT_eX]

[DOI]

CoRR, September, 2025

Reverse-Engineered Reasoning for Open-Ended Generation.

[BibT_eX]

[DOI]

CoRR, September, 2025

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling.

[BibT_eX]

[DOI]

CoRR, August, 2025

MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents.

[BibT_eX]

[DOI]

CoRR, August, 2025

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction.

[BibT_eX]

[DOI]

CoRR, August, 2025

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference.

[BibT_eX]

[DOI]

CoRR, August, 2025

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving.

[BibT_eX]

[DOI]

CoRR, July, 2025

First Return, Entropy-Eliciting Explore.

[BibT_eX]

[DOI]

CoRR, July, 2025

A Systematic Analysis of Hybrid Linear Attention.

[BibT_eX]

[DOI]

CoRR, July, 2025

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization.

[BibT_eX]

[DOI]

CoRR, July, 2025

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World.

[BibT_eX]

[DOI]

CoRR, June, 2025

SciDA: Scientific Dynamic Assessor of LLMs.

[BibT_eX]

[DOI]

CoRR, June, 2025

ScaleLong: A Multi-Timescale Benchmark for Long Video Understanding.

[BibT_eX]

[DOI]

CoRR, May, 2025

MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation.

[BibT_eX]

[DOI]

CoRR, May, 2025

P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark.

[BibT_eX]

[DOI]

CoRR, May, 2025

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation.

[BibT_eX]

[DOI]

CoRR, May, 2025

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs.

[BibT_eX]

[DOI]

CoRR, April, 2025

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, April, 2025

A Comprehensive Survey on Long Context Language Modeling.

[BibT_eX]

[DOI]

CoRR, March, 2025

FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis.

[BibT_eX]

[DOI]

CoRR, March, 2025

YuE: Scaling Open Foundation Models for Long-Form Music Generation.

[BibT_eX]

[DOI]

CoRR, March, 2025

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models.

[BibT_eX]

[DOI]

CoRR, February, 2025

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, February, 2025

Aligning Instruction Tuning with Pre-training.

[BibT_eX]

[DOI]

CoRR, January, 2025

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Steering Protein Family Design through Profile Bayesian Flow.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

LIME: Less Is More for MLLM Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Can MLLMs Understand the Deep Implication Behind Chinese Images?

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions.

[BibT_eX]

[DOI]

CoRR, 2024

Can MLLMs Understand the Deep Implication Behind Chinese Images?

[BibT_eX]

[DOI]

CoRR, 2024

PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model.

[BibT_eX]

[DOI]

CoRR, 2024

PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness.

[BibT_eX]

[DOI]

CoRR, 2024

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet.

[BibT_eX]

[DOI]

CoRR, 2024

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks.

[BibT_eX]

[DOI]

CoRR, 2024

MIO: A Foundation Model on Multimodal Tokens.

[BibT_eX]

[DOI]

CoRR, 2024

OmniBench: Towards The Future of Universal Omni-Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

LIME: Less Is More for MLLM Evaluation.

[BibT_eX]

[DOI]

CoRR, 2024

Foundation Models for Music: A Survey.

[BibT_eX]

[DOI]

CoRR, 2024

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm.

[BibT_eX]

[DOI]

CoRR, 2024

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association.

[BibT_eX]

[DOI]

CoRR, 2024

LongIns: A Challenging Long-context Instruction-based Exam for LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents.

[BibT_eX]

[DOI]

CoRR, 2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series.

[BibT_eX]

[DOI]

CoRR, 2024

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning.

[BibT_eX]

[DOI]

CoRR, 2024

Yi: Open Foundation Models by 01.AI.

[BibT_eX]

[DOI]

CoRR, 2024

m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers.

[BibT_eX]

[DOI]

CoRR, 2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM.

[BibT_eX]

[DOI]

CoRR, 2024

LLM Agents for Psychology: A Study on Gamified Assessments.

[BibT_eX]

[DOI]

CoRR, 2024

GenDec: A robust generative Question-decomposition method for Multi-hop reasoning.

[BibT_eX]

[DOI]

CoRR, 2024

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark.

[BibT_eX]

[DOI]

CoRR, 2024

Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation.

[BibT_eX]

[DOI]

CoRR, 2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

LLaSM: Large Language and Speech Model.

[BibT_eX]

[DOI]

CoRR, 2023

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.

[BibT_eX]

[DOI]

CoRR, 2023

Chinese Open Instruction Generalist: A Preliminary Release.

[BibT_eX]

[DOI]

CoRR, 2023

2022

PEMP: Leveraging Physics Properties to Enhance Molecular Property Prediction.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2020

Real-time Transportation Prediction Correction using Reconstruction Error in Deep Learning.

[BibT_eX]

[DOI]

Shuai Liu

Guojie Song

Wenhao Huang

ACM Trans. Knowl. Discov. Data, 2020

2019

Text Assisted Insight Ranking Using Context-Aware Memory Network.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning Personalized End-to-End Goal-Oriented Dialog.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2015

Mining Dependencies Considering Time Lag in Spatio-Temporal Traffic Data.

[BibT_eX]

[DOI]

Proceedings of the Web-Age Information Management - 16th International Conference, 2015

Traffic Flow Decomposition and Prediction Based on Robust Principal Component Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE 18th International Conference on Intelligent Transportation Systems, 2015

Hybrid Multi-metric K-Nearest Neighbor Regression for Traffic Flow Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE 18th International Conference on Intelligent Transportation Systems, 2015

Probabilistic dynamic causal model for temporal data.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Improving deep neural network ensembles using reconstruction error.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Learning Common Metrics for Homogenous Tasks in Traffic Flow Prediction.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

Incorporating temporal smoothness and group structure in learning with incomplete data.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015

Short-term traffic flow forecasting: Multi-metric KNN with related station discovery.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015

2014

Deep Architecture for Traffic Flow Prediction: Deep Belief Networks With Multitask Learning.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., 2014

A Spatial-temporal Topic Segmentation Model for Human Mobile Behavior.

[BibT_eX]

[DOI]

Proceedings of the Web-Age Information Management - 15th International Conference, 2014

Metric-Based Multi-Task Grouping Neural Network for Traffic Flow Forecasting.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Networks - ISNN 2014, 2014

Dynamic boosting in deep learning using reconstruction error.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Deep process neural network for temporal deep learning.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Traffic zone division using mobile billing data.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on Fuzzy Systems and Knowledge Discovery, 2014

2013

Adaptive Weight Optimization for Classification of Imbalanced Data.

[BibT_eX]

[DOI]

Proceedings of the Intelligence Science and Big Data Engineering, 2013

Cost sensitive GPS-based activity recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Fuzzy Systems and Knowledge Discovery, 2013

Hierarchical destination prediction based on GPS history.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Fuzzy Systems and Knowledge Discovery, 2013

Automated urban location annotation on mobile records.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Fuzzy Systems and Knowledge Discovery, 2013

Deep Architecture for Traffic Flow Prediction.

[BibT_eX]

[DOI]

Proceedings of the Advanced Data Mining and Applications - 9th International Conference, 2013

2011

Discrete Trajectory Prediction on Mobile Data.

[BibT_eX]

[DOI]

Proceedings of the Web Technologies and Applications - 13th Asia-Pacific Web Conference, 2011

2010

Systematic Improvement of Monte-Carlo Tree Search with Self-generated Neural-Networks Controllers.

[BibT_eX]

[DOI]

Proceedings of the Learning and Intelligent Optimization, 4th International Conference, 2010

Anchor Points Seeking of Large Urban Crowd Based on the Mobile Billing Data.

[BibT_eX]

[DOI]

Proceedings of the Advanced Data Mining and Applications - 6th International Conference, 2010

Wenhao Huang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...