Xinghang Li

According to our database1, Xinghang Li authored at least 24 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
RotVLA: Rotational Latent Action for Vision-Language-Action Model.
CoRR, May, 2026

Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising.
CoRR, April, 2026

Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model.
CoRR, April, 2026

Scaling World Model for Hierarchical Manipulation Policies.
CoRR, February, 2026

Implicit Chain-of-Thought Reasoning via Task-Aware Latent Motion for transferable VLA.
Pattern Recognit., 2026

2025
RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation.
CoRR, November, 2025

Emu3.5: Native Multimodal Models are World Learners.
CoRR, October, 2025

From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors.
CoRR, October, 2025

Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots.
CoRR, September, 2025

Unified Vision-Language-Action Model.
CoRR, June, 2025

SafeGenBench: A Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code.
CoRR, June, 2025

Sparse Spectral Training and Inference on Euclidean and Hyperbolic Neural Networks.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

ProcWorld: Benchmarking Large Model Planning in Reachability-Constrained Environments.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models.
CoRR, 2024

SInViG: A Self-Evolving Interactive Visual Agent for Human-Robot Interaction.
CoRR, 2024

Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Vision-Language Foundation Models as Effective Robot Imitators.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Mixed Neural Voxels for Fast Multi-view Video Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
REVE-CE: Remote Embodied Visual Referring Expression in Continuous Environment.
IEEE Robotics Autom. Lett., 2022

Mixed Neural Voxels for Fast Multi-view Video Synthesis.
CoRR, 2022

Embodied Multi-Agent Task Planning from Ambiguous Instruction.
Proceedings of the Robotics: Science and Systems XVIII, New York City, NY, USA, June 27, 2022

2021
Robotic Indoor Scene Captioning from Streaming Video.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

MCTS-Based Robotic Exploration for Scene Graph Generation.
Proceedings of the Cognitive Systems and Information Processing, 2021

Embodied Semantic Scene Graph Generation.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021


  Loading...