Na Zhao

Orcid: 0000-0003-2329-7014

Affiliations:
  • Singapore University of Technology and Design, Singapore
  • National University of Singapore, Singapore (PhD 2021)


According to our database, Na Zhao authored at least 63 papers between 2014 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Weakly Supervised Cross-Modal Learning for 4D Radar Scene Flow Estimation.
CoRR, May, 2026

PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving.
CoRR, April, 2026

VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation.
CoRR, March, 2026

Dual-Supervised Asymmetric Co-Training for Semi-Supervised Medical Domain Generalization.
IEEE Trans. Multim., 2026

Toward Generative Understanding: Incremental Few-Shot Semantic Segmentation With Diffusion Models.
IEEE Trans. Image Process., 2026

GAS: Geometry-Appearance Synergy for Consistent Video Customization.
Proceedings of the MultiMedia Modeling, 2026

TAVEN: Task-driven Adaptive Viewpoint Exploration for Training-Free 3D Spatial Reasoning and Understanding.
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026

Graph Smoothing for Enhanced Local Geometry Learning in Point Cloud Analysis.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point Clouds.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Artemis: Structured Visual Reasoning for Perception Policy Learning.
CoRR, December, 2025

Agentic Learner with Grow-and-Refine Multimodal Semantic Memory.
CoRR, November, 2025

Late-decoupled 3D Hierarchical Semantic Segmentation with Semantic Prototype Discrimination based Bi-branch Supervision.
CoRR, November, 2025

TokenSwap: Backdoor Attack on the Compositional Understanding of Large Vision-Language Models.
CoRR, September, 2025

CT3D++: Improving 3D Object Detection with Keypoint-Induced Channel-wise Transformer.
Int. J. Comput. Vis., July, 2025

Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations.
CoRR, June, 2025

Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion.
CoRR, January, 2025

Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization.
IEEE Trans. Multim., 2025

SDCoT++: Improved Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection.
IEEE Trans. Image Process., 2025

AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Graph Embedded Contrastive Learning for Multi-View Clustering.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

OcSplats: Rendering Occluded Humans with Prior Knowledge.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization.
Int. J. Comput. Vis., 2024

GS^2-GNeSF: Geometry-Semantics Synergy for Generalizable Neural Semantic Fields.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

On-the-fly Point Feature Representation for Point Clouds Analysis.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Improving 3D Occupancy Prediction through Class-Balancing Loss and Multi-Scale Representation.
Proceedings of the IEEE Conference on Artificial Intelligence, 2024

End-to-End Semi-Supervised 3D Instance Segmentation with PCTeacher.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

View-Consistent 3D Editing with Gaussian Splatting.
Proceedings of the Computer Vision - ECCV 2024, 2024

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image.
Proceedings of the Computer Vision - ECCV 2024, 2024

LASO: Language-Guided Affordance Segmentation on 3D Object.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds.
Proceedings of the 35th British Machine Vision Conference, 2024

Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection.
Proceedings of the 35th British Machine Vision Conference, 2024

Dual-Perspective Knowledge Enrichment for Semi-supervised 3D Object Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Robust Visual Recognition with Class-Imbalanced Open-World Noisy Data.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding.
Proceedings of the International Conference on 3D Vision, 2024

2023
PDR: Progressive Depth Regularization for Monocular 3D Object Detection.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Refining 6-DoF Grasps with Context-Specific Classifiers.
IROS, 2023

Generalized Few-Shot Point Cloud Segmentation Via Geometric Words.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Robust Few-shot Point Cloud Semantic Segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking IoU-based Optimization for Single-stage 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions.
Proceedings of the Computer Vision - ECCV 2022, 2022

Static-Dynamic Co-teaching for Class-Incremental 3D Object Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Few-Shot 3D Point Cloud Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

SESS: Self-Ensembling Semi-Supervised 3D Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation.
CoRR, 2019

2018
End2End Semantic Segmentation for 3D Indoor Scenes.
Proceedings of the 2018 ACM Multimedia Conference, 2018

2017
VideoWhisper: Toward Discriminative Unsupervised Video Feature Learning With Attention-Based Recurrent Neural Networks.
IEEE Trans. Multim., 2017

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Learning content-social influential features for influence analysis.
Int. J. Multim. Inf. Retr., 2016

Discrete Image Hashing Using Large Weakly Annotated Photo Collections.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2014
Automatic image annotation by semi-supervised manifold kernel density estimation.
Inf. Sci., 2014

Searching for Recent Celebrity Images in Microblog Platform.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

