Haibo Qiu

Orcid: 0000-0001-6179-2695

According to our database, Haibo Qiu authored at least 29 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Seirênes: Adversarial Self-Play with Evolving Distractions for LLM Reasoning.
CoRR, May, 2026

SkillFlow: Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents.
CoRR, April, 2026

Omni-I2C: A Holistic Benchmark for High-Fidelity Image-to-Code Generation.
CoRR, March, 2026

Flexible Entropy Control in RLVR with Gradient-Preserving Perspective.
CoRR, February, 2026

TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution.
CoRR, February, 2026

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR.
CoRR, February, 2026

Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR.
CoRR, January, 2026

MobileDreamer: Generative Sketch World Model for GUI Agent.
CoRR, January, 2026

2025
Learning When to Look: A Disentangled Curriculum for Strategic Perception in Multimodal Reasoning.
CoRR, December, 2025

Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning.
CoRR, November, 2025

Context Cascade Compression: Exploring the Upper Limits of Text Compression.
CoRR, November, 2025

VinciCoder: Unifying Multimodal Code Generation via Coarse-to-fine Visual Reinforcement Learning.
CoRR, November, 2025

Metis-SPECS: Decoupling Multimodal Learning via Self-distilled Preference-based Cold Start.
CoRR, October, 2025

Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning.
CoRR, October, 2025

Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy.
CoRR, October, 2025

DeepSketcher: Internalizing Visual Manipulation for Multimodal Reasoning.
CoRR, September, 2025

STAGE: Stable and Generalizable GRPO for Autoregressive Image Generation.
CoRR, September, 2025

OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds.
CoRR, September, 2025

Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning.
CoRR, June, 2025

GUIRoboTron-Speech: Towards Automated GUI Agents Based on Speech Instructions.
CoRR, June, 2025

UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024
Development and validation of a deep interpretable network for continuous acute kidney injury prediction in critically ill patients.
Artif. Intell. Medicine, 2024

2023
PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation.
CoRR, 2023

Collect-and-Distribute Transformer for 3D Point Cloud Analysis.
CoRR, 2023

2022
GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation.
Trans. Mach. Learn. Res., 2022

End2End Occluded Face Recognition by Masking Corrupted Features.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

2021
SynFace: Face Recognition with Synthetic Data.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2019
Cross View Fusion for 3D Human Pose Estimation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Basis Representation to Refine 3D Human Pose Estimations.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
