Xiaoyang Wu

ORCID: 0009-0002-2277-7104

Affiliations:
  • The University of Hong Kong, SAR, China


According to our database, Xiaoyang Wu authored at least 31 papers between 2022 and 2025.

Bibliography

2025
PonderV2: Improved 3D Representation With a Universal Pre-Training Paradigm.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

Train Once, Deploy Anywhere: Realize Data-Efficient Dynamic Object Manipulation.
CoRR, August, 2025

LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans.
CoRR, July, 2025

DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation.
CoRR, July, 2025

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning.
CoRR, June, 2025

Sonata: Self-Supervised Learning of Reliable Point Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
VIRT: Vision Instructed Transformer for Robotic Manipulation.
CoRR, 2024

Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation.
CoRR, 2024

Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images.
CoRR, 2024

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding.
CoRR, 2024

LiT: Unifying LiDAR "Languages" with LiDAR Translator.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

UniPAD: A Universal Pre-Training Paradigm for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DreamComposer: Controllable 3D Object Generation via Multi-View Conditions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Large-Scale 3D Representation Learning with Multi-Dataset Point Prompt Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GroupContrast: Semantic-Aware Self-Supervised Representation Learning for 3D Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GPT4Point: A Unified Framework for Point-Language Understanding and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multi-Space Alignments Towards Universal LiDAR Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Point Transformer V3: Simpler, Faster, Stronger.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OCBEV: Object-Centric BEV Transformer for Multi-View 3D Object Detection.
Proceedings of the International Conference on 3D Vision, 2024

2023
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm.
CoRR, 2023

Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract.
CoRR, 2023

SAM3D: Segment Anything in 3D Scenes.
CoRR, 2023

Hierarchical Dense Correlation Distillation for Few-Shot Segmentation.
CoRR, 2023

GeoSpark: Sparking up Point Cloud Segmentation with Geometry Clue.
CoRR, 2023

Understanding Imbalanced Semantic Segmentation Through Neural Collapse.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hierarchical Dense Correlation Distillation for Few-Shot Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

