Pengxiang Ding

Orcid: 0000-0002-4049-7467

According to our database1, Pengxiang Ding authored at least 36 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.
CoRR, June, 2025

RationalVLA: A Rational Vision-Language-Action Model with Dual System.
CoRR, June, 2025

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning.
CoRR, May, 2025

Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions.
CoRR, May, 2025

ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning.
CoRR, May, 2025

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation.
CoRR, May, 2025

Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach.
CoRR, April, 2025

Unicorn: Text-Only Data Synthesis for Vision Language Model Training.
CoRR, March, 2025

Exploring the Evolution of Physics Cognition in Video Generation: A Survey.
CoRR, March, 2025

MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models.
CoRR, March, 2025

Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding.
CoRR, March, 2025

Humanoid-VLA: Towards Universal Humanoid Control with Visual Integration.
CoRR, February, 2025

Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport.
CoRR, February, 2025

Rethinking Latent Representations in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation.
CoRR, February, 2025

Enhancing Adversarial Transferability via Component-Wise Augmentation Method.
CoRR, January, 2025

VLAS: Vision-Language-Action Model with Speech Instructions for Customized Robot Manipulation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Instance-Incremental Scene Graph Generation From Real-World Point Clouds via Normalizing Flows.
IEEE Trans. Circuits Syst. Video Technol., February, 2024

DHRNet: A Dual-path Hierarchical Relation Network for multi-person pose estimation.
Knowl. Based Syst., 2024

QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning.
CoRR, 2024

Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation.
CoRR, 2024

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction.
CoRR, 2024

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration.
CoRR, 2024

RL2AC: Reinforcement Learning-based Rapid Online Adaptive Control for Legged Robot Robust Locomotion.
Proceedings of the Robotics: Science and Systems XX, 2024

ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

PiTe: Pixel-Temporal Alignment for Large Video-Language Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.
Proceedings of the Computer Vision - ECCV 2024, 2024

Expressive Forecasting of 3D Whole-Body Human Motions.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.
CoRR, 2023

2022
Towards More Realistic Human Motion Prediction With Attention to Motion Coordination.
IEEE Trans. Circuits Syst. Video Technol., 2022

DC-net: Dual-Consistency semi-supervised learning for 3D left atrium segmentation from MRI.
Biomed. Signal Process. Control., 2022

2021
TrajectoryCNN: A New Spatio-Temporal Feature Learning Network for Human Motion Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2021

Uncertainty-aware Human Motion Prediction.
CoRR, 2021

An Attractor-Guided Neural Networks for Skeleton-Based Human Motion Prediction.
CoRR, 2021


  Loading...