Shuanghao Bai

Orcid: 0009-0002-6047-0242

According to our database1, Shuanghao Bai authored at least 22 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Assistance Without Interruption: A Benchmark and LLM-based Framework for Non-Intrusive Human-Robot Assistance.
CoRR, May, 2026

HEX: Humanoid-Aligned Experts for Cross-Embodiment Whole-Body Manipulation.
CoRR, April, 2026

Reshaping Action Error Distributions for Reliable Vision-Language-Action Models.
CoRR, February, 2026

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models.
CoRR, February, 2026

2025
RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion.
CoRR, December, 2025

Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives.
CoRR, December, 2025

Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey.
CoRR, October, 2025

VCoT-Grasp: Grasp Foundation Models with Visual Chain-of-Thought Reasoning for Language-driven Grasp Generation.
CoRR, October, 2025

Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation.
CoRR, August, 2025

Dual-Path Stable Soft Prompt Generation for Domain Generalization.
CoRR, May, 2025

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation.
CoRR, May, 2025

Rethinking Latent Representations in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation.
CoRR, February, 2025

An information-theoretic approach for heterogeneous differentiable causal discovery.
Neural Networks, 2025

Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

VLAS: Vision-Language-Action Model with Speech Instructions for Customized Robot Manipulation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PromptTA: Prompt-driven Text Adapter for Source-free Domain Generalization.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Enhancing Robot Task Planning and Execution through Multi-Layer Large Language Models.
Sensors, March, 2024

Revisiting the Adversarial Robustness of Vision Language Models: a Multimodal Perspective.
CoRR, 2024

Jacobian Regularizer-based Neural Granger Causality.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Cross-Domain Few-Shot Classification with Multilayer Perceptron.
Proceedings of the IEEE International Conference on Acoustics, 2024

Soft Prompt Generation for Domain Generalization.
Proceedings of the Computer Vision - ECCV 2024, 2024

Prompt-Based Distribution Alignment for Unsupervised Domain Adaptation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024


  Loading...