Pengwei Wang

Orcid: 0000-0002-6412-5449

Affiliations:
  • South China University of Technology, Guangzhou, China


According to our database, Pengwei Wang authored at least 46 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Mask World Model: Predicting What Matters for Robust Robot Policy Learning.
CoRR, April, 2026

PRM-as-a-Judge: A Dense Evaluation Paradigm for Fine-Grained Robotic Auditing.
CoRR, March, 2026

SaPaVe: Towards Active Perception and Manipulation in Vision-Language-Action Models for Robotics.
CoRR, March, 2026

AnyTouch 2: General Optical Tactile Representation Learning For Dynamic Tactile Perception.
CoRR, February, 2026

Reshaping Action Error Distributions for Reliable Vision-Language-Action Models.
CoRR, February, 2026

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models.
CoRR, February, 2026

RoboBrain 2.5: Depth in Sight, Time in Mind.
CoRR, January, 2026

Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation.
CoRR, January, 2026

2025
Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation.
CoRR, December, 2025

Do You Have Freestyle? Expressive Humanoid Locomotion via Audio Control.
CoRR, December, 2025

RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion.
CoRR, December, 2025

RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics.
CoRR, December, 2025

RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation.
CoRR, November, 2025

METIS: Multi-Source Egocentric Training for Integrated Dexterous Vision-Language-Action Model.
CoRR, November, 2025

PIGEON: VLM-Driven Object Navigation via Points of Interest Selection.
CoRR, November, 2025

RoboOS-NeXT: A Unified Memory-based Framework for Lifelong, Scalable, and Robust Multi-Robot Collaboration.
CoRR, October, 2025

Robobench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain.
CoRR, October, 2025

From Language to Locomotion: Retargeting-free Humanoid Control via Motion Latent Guidance.
CoRR, October, 2025

OmniSAT: Compact Action Token, Faster Auto Regression.
CoRR, October, 2025

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics.
CoRR, October, 2025

MathSticks: A Benchmark for Visual Symbolic Compositional Reasoning with Matchstick Puzzles.
CoRR, October, 2025

NavA³: Understanding Any Instruction, Navigating Anywhere, Finding Anything.
CoRR, August, 2025

RoboBrain 2.0 Technical Report.
CoRR, July, 2025

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation.
CoRR, July, 2025

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought.
CoRR, June, 2025

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics.
CoRR, June, 2025

RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration.
CoRR, May, 2025

Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.
CoRR, March, 2025

AffordGrasp: In-Context Affordance Reasoning for Open-Vocabulary Task-Oriented Grasping in Clutter.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

SafeMap: Robust HD Map Construction from Incomplete Observations.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation.
CoRR, 2024

2019
Logic Rules Powered Knowledge Graph Embedding.
CoRR, 2019

Dual Adversarial Learning Based Network Alignment.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Semi-supervised Classification-based Local Vertex Ranking via Dual Generative Adversarial Nets.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Integrating Local Vertex/Edge Embedding via Deep Matrix Fusion and Siamese Multi-label Classification.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
Concept and Attention-Based CNN for Question Retrieval in Multi-View Learning.
ACM Trans. Intell. Syst. Technol., 2018

Density-Adaptive Local Edge Representation Learning with Generative Adversarial Network Multi-label Edge Classification.
Proceedings of the IEEE International Conference on Data Mining, 2018

Density-aware Local Siamese Autoencoder Network Embedding with Autoencoder Graph Clustering.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Large-scale extraction of drug-disease pairs from the medical literature.
J. Assoc. Inf. Sci. Technol., 2017

Can Machines Intelligently Propose Novel and Reasonable Scientific Hypotheses?
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Concept Embedded Convolutional Semantic Model for Question Retrieval.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

2016
Learning to Extract Conditional Knowledge for Question Answering using Dialogue.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2013
Depth Camera Based Real-Time Fingertip Detection Using Multi-view Projection.
Proceedings of the Human-Computer Interaction. Towards Intelligent and Implicit Interaction, 2013

