Hao Chen

Orcid: 0000-0002-8400-3780

Affiliations:

Chinese University of Hong Kong, Hong Kong

According to our database¹, Hao Chen authored at least 17 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

HarmoWAM: Harmonizing Generalizable and Precise Manipulation via Adaptive World Action Models.

[BibT_eX]

[DOI]

CoRR, May, 2026

LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models.

[BibT_eX]

[DOI]

CoRR, April, 2026

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models.

[BibT_eX]

[DOI]

CoRR, March, 2026

LaST<sub>0</sub>: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model.

[BibT_eX]

[DOI]

CoRR, January, 2026

Generative Archetype-Grounded Item Representations for Sequential Recommendation.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2026, 2026

2025

ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, December, 2025

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, September, 2025

Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning.

[BibT_eX]

[DOI]

CoRR, June, 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model.

[BibT_eX]

[DOI]

CoRR, March, 2025

SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems.

[BibT_eX]

[DOI]

CoRR, March, 2025

Fast-in-Slow: A Dual-System VLA Model Unifying Fast Manipulation within Slow Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

SFANet: Spatial-Frequency Attention Network for Weather Forecasting.

[BibT_eX]

[DOI]

CoRR, 2024

SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the 35th British Machine Vision Conference, 2024

2023

Traj-MAE: Masked Autoencoders for Trajectory Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Hao Chen

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...