We stand with Ukraine

We stand with Ukraine

Dong Wang

Affiliations:

Shanghai Artificial Intelligence Laboratory (Shanghai AI Lab), China
Northwestern Polytechnical University, Xi'an, China

According to our database¹, Dong Wang authored at least 59 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

FastUMI-100K: Advancing Data-driven Robotic Manipulation with a Large-scale UMI-style Dataset.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

Trajectory Conditioned Cross-embodiment Skill Transfer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, October, 2025

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

Hume: Introducing System-2 Thinking in Visual-Language-Action Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, April, 2025

MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, March, 2025

OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, February, 2025

SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, January, 2025

AlignBot: Aligning VLM-Powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Efficient Diffusion as Low Light Enhancer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Open-Vocabulary Octree-Graph for 3D Scene Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Night-to-Day Translation via Illumination Degradation Disentanglement.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Towards Flexible and Efficient Diffusion Low Light Enhancer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding.

[BibT_eX]

[DOI]

,

,

,

,

Shanghang Zhang

,

,

CoRR, 2024

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Control and Rendering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Any2Point: Empowering Any-Modality Large Models for Efficient 3D Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Implicit Event-RGBD Neural SLAM.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Gerald Schaefer

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Color Event Enhanced Single-Exposure HDR Imaging.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Calibration-free quantitative phase imaging in multi-core fiber endoscopes using end-to-end deep learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Nektarios Koukourakis

,

Jürgen W. Czarske

,

CoRR, 2023

Implicit Event-RGBD Neural SLAM.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Cross-Domain Policy Adaptation via Value-Guided Data Filtering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-Modal Beam Selection: A Transfer Methodology for Multi-Frequency.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Global Communications Conference, 2023

Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fully Self-Supervised Depth Estimation from Defocus Clue.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Affordance-Driven Next-Best-View Planning for Robotic Grasping.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 2023

2022

Balanced Multimodal Learning via On-the-fly Gradient Modulation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Temporal Relational Modeling with Self-Supervision for Action Segmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Gated forward refinement network for action segmentation.

[BibT_eX]

[DOI]

,

,

Neurocomputing, 2020

Curriculum Audiovisual Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2020

2019

C^3 Framework: An Open-source PyTorch Code for Crowd Counting.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2019

Early Action Prediction With Generative Adversarial Networks.

[BibT_eX]

[DOI]

,

,

IEEE Access, 2019

Listen to the Image.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Memory-Augmented Temporal Dynamic Learning for Action Recognition.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Cross-Modal Message Passing for Two-Stream Fusion.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Anomaly Detection in Traffic Scenes via Spatial-Aware Motion Reconstruction.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Intell. Transp. Syst., 2017

Loading...