Yingbo Tang

Orcid: 0009-0001-1657-256X

According to our database1, Yingbo Tang authored at least 18 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Walk With Me: Long-Horizon Social Navigation for Human-Centric Outdoor Assistance.
CoRR, April, 2026

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation.
CoRR, April, 2026

Diff-COPE: Diffusion-Based Category-Level 6D Object Pose Estimation.
IEEE Trans. Circuits Syst. Video Technol., March, 2026

Embodied Spatial Affordance: Spatial-Aware Affordance Learning for Embodied Navigation and Manipulation.
IEEE Trans. Image Process., 2026

NavA³: Understanding Any Instruction, Navigating Anywhere, Finding Anything.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
MiMo-Embodied: X-Embodied Foundation Model Technical Report.
CoRR, November, 2025

Is your VLM Sky-Ready? A Comprehensive Spatial Intelligence Benchmark for UAV Navigation.
CoRR, November, 2025

RoboAfford++: A Generative AI-Enhanced Dataset for Multimodal Affordance Learning in Robotic Manipulation and Navigation.
CoRR, November, 2025

Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track.
CoRR, October, 2025

NavA<sup>3</sup>: Understanding Any Instruction, Navigating Anywhere, Finding Anything.
CoRR, August, 2025

RoboBrain 2.0 Technical Report.
CoRR, July, 2025

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought.
CoRR, June, 2025

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

RoboAfford: A Dataset and Benchmark for Enhancing Object and Spatial Affordance Learning in Robot Manipulation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

AffordGrasp: In-Context Affordance Reasoning for Open-Vocabulary Task-Oriented Grasping in Clutter.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

2024
Semi-Supervised Few-Shot Object Detection via Adaptive Pseudo Labeling.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

2022
Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention.
IEEE Trans. Circuits Syst. Video Technol., 2022

Visual Grasping with Spectral Clustering and Heuristic Searching for Robot in Cluttered Environments.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2022


  Loading...