Na Zhao

CoRR, May, 2026

PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, April, 2026

VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation.

[BibT_eX]

[DOI]

CoRR, March, 2026

Dual-Supervised Asymmetric Co-Training for Semi-Supervised Medical Domain Generalization.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2026

Toward Generative Understanding: Incremental Few-Shot Semantic Segmentation With Diffusion Models.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2026

GAS: Geometry-Appearance Synergy for Consistent Video Customization.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling, 2026

TAVEN: Task-driven Adaptive Viewpoint Exploration for Training-Free 3D Spatial Reasoning and Understanding.

[BibT_eX]

[DOI]

Shuyi Jiang

Zhihao Yuan

Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026

Graph Smoothing for Enhanced Local Geometry Learning in Point Cloud Analysis.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point Clouds.

[BibT_eX]

[DOI]

Jingyun Fu

Zhiyu Xiang

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Artemis: Structured Visual Reasoning for Perception Policy Learning.

[BibT_eX]

[DOI]

CoRR, December, 2025

Agentic Learner with Grow-and-Refine Multimodal Semantic Memory.

[BibT_eX]

[DOI]

CoRR, November, 2025

Late-decoupled 3D Hierarchical Semantic Segmentation with Semantic Prototype Discrimination based Bi-branch Supervision.

[BibT_eX]

[DOI]

CoRR, November, 2025

TokenSwap: Backdoor Attack on the Compositional Understanding of Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

CT3D++: Improving 3D Object Detection with Keypoint-Induced Channel-wise Transformer.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., July, 2025

Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations.

[BibT_eX]

[DOI]

CoRR, June, 2025

Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion.

[BibT_eX]

[DOI]

CoRR, January, 2025

Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2025

SDCoT++: Improved Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2025

AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Graph Embedded Contrastive Learning for Multi-View Clustering.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

OcSplats: Rendering Occluded Humans with Prior Knowledge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction.

[BibT_eX]

[DOI]

Heng Jia

Linchao Zhu

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection.

[BibT_eX]

[DOI]

Jiangyi Wang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2024

GS<sup>2</sup>-GNeSF: Geometry-Semantics Synergy for Generalizable Neural Semantic Fields.

[BibT_eX]

[DOI]

Chengshun Wang

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

On-the-fly Point Feature Representation for Point Clouds Analysis.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Improving 3D Occupancy Prediction through Class-Balancing Loss and Multi-Scale Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Artificial Intelligence, 2024

End-to-End Semi-Supervised 3D Instance Segmentation with PCTeacher.

[BibT_eX]

[DOI]

Linfeng Li

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

View-Consistent 3D Editing with Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

LASO: Language-Guided Affordance Segmentation on 3D Object.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds.

[BibT_eX]

[DOI]

Yuyang Zhao

Proceedings of the 35th British Machine Vision Conference, 2024

Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection.

[BibT_eX]

[DOI]

Yunsong Wang

Proceedings of the 35th British Machine Vision Conference, 2024

Dual-Perspective Knowledge Enrichment for Semi-supervised 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Robust Visual Recognition with Class-Imbalanced Open-World Noisy Data.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding.

[BibT_eX]

[DOI]

Yunsong Wang

Proceedings of the International Conference on 3D Vision, 2024

2023

PDR: Progressive Depth Regularization for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., December, 2023

Refining 6-DoF Grasps with Context-Specific Classifiers.

[BibT_eX]

[DOI]

IROS, 2023

Generalized Few-Shot Point Cloud Segmentation Via Geometric Words.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Robust Few-shot Point Cloud Semantic Segmentation.

[BibT_eX]

[DOI]

Yating Xu

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022

Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking IoU-based Optimization for Single-stage 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Static-Dynamic Co-teaching for Class-Incremental 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Few-Shot 3D Point Cloud Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

PS2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

SESS: Self-Ensembling Semi-Supervised 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2019

2018

End2End Semantic Segmentation for 3D Indoor Scenes.

[BibT_eX]

[DOI]