Xiaoshuai Hao

ORCID: 0009-0007-4209-6695

According to our database, Xiaoshuai Hao authored at least 56 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
M3-Net: A Cost-Effective Graph-Free MLP-Based Model for Traffic Prediction.
CoRR, August, 2025

NavA³: Understanding Any Instruction, Navigating Anywhere, Finding Anything.
CoRR, August, 2025

VisualTrans: A Benchmark for Real-World Visual Transformation Reasoning.
CoRR, August, 2025

Synergistic Prompting for Robust Visual Recognition with Missing Modalities.
CoRR, July, 2025

Training-free Generation of Temporally Consistent Rewards from VLMs.
CoRR, July, 2025

RoboBrain 2.0 Technical Report.
CoRR, July, 2025

What Really Matters for Robust Multi-Sensor HD Map Construction?
CoRR, July, 2025

SafeMap: Robust HD Map Construction from Incomplete Observations.
CoRR, July, 2025

I²S-TFCKD: Intra-Inter Set Knowledge Distillation with Time-Frequency Calibration for Speech Enhancement.
CoRR, June, 2025

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought.
CoRR, June, 2025

SVD: Spatial Video Dataset.
CoRR, June, 2025

Uneven Event Modeling for Partially Relevant Video Retrieval.
CoRR, June, 2025

Your Classifier Can Do More: Towards Bridging the Gaps in Classification, Robustness, and Generation.
CoRR, May, 2025

VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation.
CoRR, May, 2025

RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration.
CoRR, May, 2025

MEGA: Second-Order Gradient Alignment for Catastrophic Forgetting Mitigation in GFSCIL.
CoRR, April, 2025

FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird's Eye View.
CoRR, April, 2025

STViT+: improving self-supervised multi-camera depth estimation with spatial-temporal context and adversarial geometry regularization.
Appl. Intell., April, 2025

Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.
CoRR, March, 2025

TLA: Tactile-Language-Action Model for Contact-Rich Manipulation.
CoRR, March, 2025

AffordGrasp: In-Context Affordance Reasoning for Open-Vocabulary Task-Oriented Grasping in Clutter.
CoRR, March, 2025

AS-GCL: Asymmetric Spectral Augmentation on Graph Contrastive Learning.
CoRR, February, 2025

MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception.
CoRR, January, 2025

Multi-Modal Molecular Representation Learning via Structure Awareness.
IEEE Trans. Image Process., 2025

A hierarchical reinforcement learning framework for multi-UAV combat using leader-follower strategy.
Knowl. Based Syst., 2025

BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation.
Inf. Fusion, 2025

MapFusion: A novel BEV feature fusion network for multi-modal map construction.
Inf. Fusion, 2025

A universal sampling method based on feature and structural comprehensive proximity measure.
Neurocomputing, 2025

ESC-MISR: Enhancing Spatial Correlations for Multi-image Super-Resolution in Remote Sensing.
Proceedings of the MultiMedia Modeling, 2025

Enhancing Adversarial Robustness of Vision-Language Models through Low-Rank Adaptation.
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025

TASAR: Transfer-based Attack on Skeletal Action Recognition.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Communication-Efficient Personalized Federal Graph Learning via Low-Rank Decomposition.
CoRR, 2024

DWCL: Dual-Weighted Contrastive Learning for Multi-View Clustering.
CoRR, 2024

ESC-MISR: Enhancing Spatial Correlations for Multi-Image Super-Resolution in Remote Sensing.
CoRR, 2024

BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation.
CoRR, 2024

Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track.
CoRR, 2024

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition.
CoRR, 2024

What Foundation Models can Bring for Robot Learning in Manipulation : A Survey.
CoRR, 2024

DOR3D-Net: Dense Ordinal Regression Network for 3D Hand Pose Estimation.
CoRR, 2024

Is Your HD Map Constructor Reliable under Sensor Corruptions?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

FTF-ER: Feature-Topology Fusion-Based Experience Replay Method for Continual Graph Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MBFusion: A New Multi-modal BEV Feature Fusion Method for HD Map Construction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Customized Treatment Per Pixel for Blind Image Super-Resolution.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

MapDistill: Boosting Efficient Camera-Based HD Map Construction via Camera-LiDAR Fusion Model Distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Enhancing 3D Hand Pose Estimation via Dense Ordinal Regression Network.
Proceedings of the 35th British Machine Vision Conference, 2024

2023
Team AcieLee: Technical Report for EPIC-SOUNDS Audio-Based Interaction Recognition Challenge 2023.
CoRR, 2023

MixGen: A New Multi-Modal Data Augmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2023

Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Listen and Look: Multi-Modal Aggregation and Co-Attention Network for Video-Audio Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

2021
Multi-Feature Graph Attention Network for Cross-Modal Video-Text Retrieval.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

What Matters: Attentive and Relational Feature Aggregation Network for Video-Text Retrieval.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2020
The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020).
CoRR, 2020

