Hao Luo

Orcid: 0000-0003-4612-5450

Affiliations:
  • Peking University, Beijing, China


According to our database1, Hao Luo authored at least 18 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Being-H0.7: A Latent World-Action Model from Egocentric Videos.
CoRR, May, 2026

Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models.
CoRR, April, 2026

OpenT2M: No-frill Motion Generation with Open-source,Large-scale, High-quality Data.
CoRR, March, 2026

Conservative Offline Robot Policy Learning via Posterior-Transition Reweighting.
CoRR, March, 2026

Joint-Aligned Latent Action: Towards Scalable VLA Pretraining in the Wild.
CoRR, February, 2026

Rethinking Visual-Language-Action Model Scaling: Alignment, Mixture, and Regularization.
CoRR, February, 2026

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization.
CoRR, January, 2026

2025
Spatial-Aware VLA Pretraining through Visual-Physical Alignment from Human Videos.
CoRR, December, 2025

DiG-Flow: Discrepancy-Guided Flow Matching for Robust VLA Models.
CoRR, December, 2025

Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos.
CoRR, July, 2025

Learning Video-Conditioned Policy on Unlabelled Data with Joint Embedding Predictive Transformer.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Unified Multimodal Understanding via Byte-Pair Visual Encoding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

VideoOrion: Tokenizing Object Dynamics in Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Pre-trained Visual Dynamics Representations for Efficient Policy Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Reinforcement Learning Friendly Vision-Language Model for Minecraft.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
A Survey on Transformers in Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

CLIP4MC: An RL-Friendly Vision-Language Model for Minecraft.
CoRR, 2023

Model-Based Decentralized Policy Optimization.
CoRR, 2023


  Loading...