Hao Liang
Orcid: 0009-0000-2963-2210Affiliations:
- Peking University, Center for Data Science, Beijing, China
According to our database1,
Hao Liang authored at least 67 papers
between 2024 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on github.com
On csauthors.net:
Bibliography
2026
LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning.
CoRR, May, 2026
Uni-Synergy: Bridging Understanding and Generation for Personalized Reasoning via Co-operative Reinforcement Learning.
CoRR, May, 2026
FLARE: Full-Modality Long-Video Audiovisual Retrieval Benchmark with User-Simulated Queries.
CoRR, May, 2026
K12-KGraph: A Curriculum-Aligned Knowledge Graph for Benchmarking and Training Educational LLMs.
CoRR, May, 2026
TraceAV-Bench: Benchmarking Multi-Hop Trajectory Reasoning over Long Audio-Visual Videos.
CoRR, May, 2026
CoRR, April, 2026
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models.
CoRR, March, 2026
CoRR, March, 2026
CoRR, March, 2026
BrowseComp-V<sup>3</sup>: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents.
CoRR, February, 2026
CoRR, February, 2026
M2A: Multimodal Memory Agent with Dual-Layer Hybrid Memory for Long-Term Personalized Interactions.
CoRR, February, 2026
Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks.
CoRR, February, 2026
MathMixup: Boosting LLM Mathematical Reasoning with Difficulty-Controllable Data Synthesis and Curriculum Learning.
CoRR, January, 2026
Proceedings of the ACM Web Conference 2026, 2026
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026
2025
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI.
CoRR, December, 2025
Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling.
CoRR, December, 2025
CoRR, December, 2025
CoRR, November, 2025
CoRR, November, 2025
CoRR, November, 2025
Rethinking Text-to-SQL: Dynamic Multi-turn SQL Interaction for Real-world Database Exploration.
CoRR, October, 2025
CoRR, October, 2025
LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding.
CoRR, October, 2025
CoRR, October, 2025
Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge.
CoRR, September, 2025
Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models.
CoRR, June, 2025
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions.
CoRR, June, 2025
LogicPuzzleRL: Cultivating Robust Mathematical Reasoning in LLMs via Reinforcement Learning.
CoRR, June, 2025
CoRR, May, 2025
CoRR, March, 2025
CoRR, March, 2025
CoRR, February, 2025
CoRR, February, 2025
UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
SynthVLM: Towards High-Quality and Efficient Synthesis of Image-Caption Datasets for Vision-Language Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
MathScape: Benchmarking Multimodal Large Language Models in Real-World Mathematical Contexts.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction.
CoRR, 2024
Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models.
CoRR, 2024
CoRR, 2024
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search.
CoRR, 2024
CoRR, 2024
MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark.
CoRR, 2024
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models.
CoRR, 2024
CoRR, 2024
CoRR, 2024