Hao Chen

Orcid: 0000-0002-8400-3780

Affiliations:
  • Chinese University of Hong Kong, Hong Kong


According to our database, Hao Chen authored at least 14 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models.
CoRR, March, 2026

LaST₀: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model.
CoRR, January, 2026

Generative Archetype-Grounded Item Representations for Sequential Recommendation.
Proceedings of the ACM Web Conference 2026, 2026

2025
ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation.
CoRR, December, 2025

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation.
CoRR, September, 2025

Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning.
CoRR, June, 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model.
CoRR, March, 2025

SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems.
CoRR, March, 2025

MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
SFANet: Spatial-Frequency Attention Network for Weather Forecasting.
CoRR, 2024

SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning.
Proceedings of the 35th British Machine Vision Conference, 2024

2023
Traj-MAE: Masked Autoencoders for Trajectory Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

