Hao Liang

Orcid: 0009-0000-2963-2210

Affiliations:
  • Peking University, Center for Data Science, Beijing, China


According to our database1, Hao Liang authored at least 67 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning.
CoRR, May, 2026

Uni-Synergy: Bridging Understanding and Generation for Personalized Reasoning via Co-operative Reinforcement Learning.
CoRR, May, 2026

FLARE: Full-Modality Long-Video Audiovisual Retrieval Benchmark with User-Simulated Queries.
CoRR, May, 2026

K12-KGraph: A Curriculum-Aligned Knowledge Graph for Benchmarking and Training Educational LLMs.
CoRR, May, 2026

TraceAV-Bench: Benchmarking Multi-Hop Trajectory Reasoning over Long Audio-Visual Videos.
CoRR, May, 2026

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models.
CoRR, April, 2026

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models.
CoRR, March, 2026

Towards Next-Generation LLM Training: From the Data-Centric Perspective.
CoRR, March, 2026

One-Eval: An Agentic System for Automated and Traceable LLM Evaluation.
CoRR, March, 2026

BrowseComp-V<sup>3</sup>: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents.
CoRR, February, 2026

Canvas-of-Thought: Grounding Reasoning via Mutable Structured States.
CoRR, February, 2026

M2A: Multimodal Memory Agent with Dual-Layer Hybrid Memory for Long-Term Personalized Interactions.
CoRR, February, 2026

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks.
CoRR, February, 2026

Data Preparation for Large Language Models.
J. Comput. Sci. Technol., January, 2026

MathMixup: Boosting LLM Mathematical Reasoning with Difficulty-Controllable Data Synthesis and Curriculum Learning.
CoRR, January, 2026

LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts.
Proceedings of the ACM Web Conference 2026, 2026

Let's Verify Math Questions Step by Step.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

2025
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI.
CoRR, December, 2025

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling.
CoRR, December, 2025

BRACE: A Benchmark for Robust Audio Caption Quality Evaluation.
CoRR, December, 2025

VABench: A Comprehensive Benchmark for Audio-Video Generation.
CoRR, December, 2025

DataGovBench: Benchmarking LLM Agents for Real-World Data Governance Workflows.
CoRR, December, 2025

VCU-Bridge: Hierarchical Visual Connotation Understanding via Semantic Bridging.
CoRR, November, 2025

FlipVQA-Miner: Cross-Page Visual Question-Answer Mining from Textbooks.
CoRR, November, 2025

Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL.
CoRR, November, 2025

Rethinking Text-to-SQL: Dynamic Multi-turn SQL Interaction for Real-world Database Exploration.
CoRR, October, 2025

Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval.
CoRR, October, 2025

LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding.
CoRR, October, 2025

MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning.
CoRR, October, 2025

CapGeo: A Caption-Assisted Approach to Geometric Reasoning.
CoRR, October, 2025

DARO: Difficulty-Aware Reweighting Policy Optimization.
CoRR, October, 2025

Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge.
CoRR, September, 2025

Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models.
CoRR, June, 2025

Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions.
CoRR, June, 2025

LogicPuzzleRL: Cultivating Robust Mathematical Reasoning in LLMs via Reinforcement Learning.
CoRR, June, 2025

LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts.
CoRR, May, 2025

Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning.
CoRR, May, 2025

Concept-as-Tree: Synthetic Data is All You Need for VLM Personalization.
CoRR, March, 2025

Evaluating and Predicting Distorted Human Body Parts for Generated Images.
CoRR, March, 2025

MathClean: A Benchmark for Synthetic Mathematical Data Cleaning.
CoRR, February, 2025

MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification.
CoRR, February, 2025

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

SynthVLM: Towards High-Quality and Efficient Synthesis of Image-Caption Datasets for Vision-Language Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MathScape: Benchmarking Multimodal Large Language Models in Real-World Mathematical Contexts.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PAS: Plug-and-Play Prompt Augmentation System.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

Training Data Distribution Estimation for Optimized Pre-training Data Management.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

QAEncoder: Towards Aligned Representation Learning in Question Answering Systems.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
MC-LLaVA: Multi-Concept Personalized Vision-Language Model.
CoRR, 2024

EVQAScore: Efficient Video Question Answering Data Evaluation.
CoRR, 2024

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction.
CoRR, 2024

Baichuan Alignment Technical Report.
CoRR, 2024

Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models.
CoRR, 2024

QAEncoder: Towards Aligned Representation Learning in Question Answering System.
CoRR, 2024

BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search.
CoRR, 2024

Data Proportion Detection for Optimized Data Management for Large Language Models.
CoRR, 2024

MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark.
CoRR, 2024

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs.
CoRR, 2024

Are Bigger Encoders Always Better in Vision Large Models?
CoRR, 2024

Synth-Empathy: Towards High-Quality Synthetic Empathy Data.
CoRR, 2024

SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models.
CoRR, 2024

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System.
CoRR, 2024

KeyVideoLLM: Towards Large-scale Video Keyframe Selection.
CoRR, 2024

Efficient-Empathy: Towards Efficient and Effective Selection of Empathy Data.
CoRR, 2024

A Survey of Multimodal Large Language Model from A Data-centric Perspective.
CoRR, 2024


  Loading...