Dong Wang
Affiliations:
- Shanghai Artificial Intelligence Laboratory (Shanghai AI Lab), China
- Northwestern Polytechnical University, Xi'an, China

According to our database, Dong Wang authored at least 57 papers between 2017 and 2025.
  
  
Bibliography
  2025
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control.
    
  
    CoRR, August, 2025
    
  
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation.
    
  
    CoRR, May, 2025
    
  
AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations.
    
  
    CoRR, April, 2025
    
  
MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation.
    
  
    CoRR, March, 2025
    
  
OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation.
    
  
    CoRR, February, 2025
    
  
    CoRR, January, 2025
    
  
AlignBot: Aligning VLM-Powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots.
    
  
    Proceedings of the IEEE International Conference on Robotics and Automation, 2025
    
  
COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models.
    
  
    Proceedings of the IEEE International Conference on Robotics and Automation, 2025
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
    
  
Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction.
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
    
  
    Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
    
  
  2024
    CoRR, 2024
    
  
Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning.
    
  
    CoRR, 2024
    
  
    CoRR, 2024
    
  
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control.
    
  
    CoRR, 2024
    
  
    CoRR, 2024
    
  
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Control and Rendering.
    
  
    Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
    
  
Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection.
    
  
    Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
    
  
Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs.
    
  
    Proceedings of the IEEE International Conference on Robotics and Automation, 2024
    
  
    Proceedings of the Computer Vision - ECCV 2024, 2024
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
    
  
    Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany, 2024
    
  
    Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
    
  
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer.
    
  
    Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
    
  
    Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
    
  
  2023
Calibration-free quantitative phase imaging in multi-core fiber endoscopes using end-to-end deep learning.
    
  
    CoRR, 2023
    
  
Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs.
    
  
    CoRR, 2023
    
  
ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance.
    
  
    CoRR, 2023
    
  
    Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
    
  
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.
    
  
    Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
    
  
    Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
    
  
    Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
    
  
    Proceedings of the IEEE Global Communications Conference, 2023
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
    
  
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
    
  
    Proceedings of the Conference on Robot Learning, 2023
    
  
  2022
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
    
  
  2021
    Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
    
  
  2020
  2019
    Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
    
  
    Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
    
  
  2018
    Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
    
  
  2017
    IEEE Trans. Intell. Transp. Syst., 2017