Hengshuang Zhao

ORCID: 0000-0001-8277-2706

According to our database, Hengshuang Zhao authored at least 90 papers between 2016 and 2024.

Bibliography

2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation.
CoRR, 2024

Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting.
CoRR, 2024

OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation.
CoRR, 2024

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding.
CoRR, 2024

UniMODE: Unified Monocular 3D Object Detection.
CoRR, 2024

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding.
CoRR, 2024

Memory Consistency Guided Divide-and-Conquer Learning for Generalized Category Discovery.
CoRR, 2024

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data.
CoRR, 2024

2023
Patch-Based Separable Transformer for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Open World Entity Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

PhysFormer++: Facial Video-Based Physiological Measurement with SlowFast Temporal Difference Transformer.
Int. J. Comput. Vis., June, 2023

Fully Convolutional Networks for Panoptic Segmentation With Point-Based Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Adaptive Perspective Distillation for Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases.
CoRR, 2023

Self-supervised Learning for Enhancing Geometrical Modeling in 3D-Aware Generative Adversarial Network.
CoRR, 2023

Point Transformer V3: Simpler, Faster, Stronger.
CoRR, 2023

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation.
CoRR, 2023

DreamComposer: Controllable 3D Object Generation via Multi-View Conditions.
CoRR, 2023

GPT4Point: A Unified Framework for Point-Language Understanding and Generation.
CoRR, 2023

LivePhoto: Real Image Animation with Text-guided Motion Control.
CoRR, 2023

A Lightweight Clustering Framework for Unsupervised Semantic Segmentation.
CoRR, 2023

Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding.
CoRR, 2023

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm.
CoRR, 2023

UniPAD: A Universal Pre-training Paradigm for Autonomous Driving.
CoRR, 2023

Uni3DETR: Unified 3D Detection Transformer.
CoRR, 2023

DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model.
CoRR, 2023

OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation.
CoRR, 2023

Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training.
CoRR, 2023

InsightMapper: A Closer Look at Inner-instance Information for Vectorized High-Definition Mapping.
CoRR, 2023

AnyDoor: Zero-shot Object-level Image Customization.
CoRR, 2023

GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping.
CoRR, 2023

SAM3D: Segment Anything in 3D Scenes.
CoRR, 2023

OCBEV: Object-Centric BEV Transformer for Multi-View 3D Object Detection.
CoRR, 2023

VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection.
CoRR, 2023

Influencer Backdoor Attack on Semantic Segmentation.
CoRR, 2023

ScribbleSeg: Scribble-based Interactive Image Segmentation.
CoRR, 2023

GeoSpark: Sparking up Point Cloud Segmentation with Geometry Clue.
CoRR, 2023

TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CorresNeRF: Image Correspondence Priors for Neural Radiance Fields.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Uni3DETR: Unified 3D Detection Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Universal Adaptive Data Augmentation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

BT²: Backward-compatible Training with Basis Transformation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Shrinking Class Space for Enhanced Certainty in Semi-Supervised Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Open-vocabulary Panoptic Segmentation with Embedding Modulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Detecting Everything in the Open World: Towards Universal Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Prior Guided Feature Enrichment Network for Few-Shot Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners.
CoRR, 2022

General Adversarial Defense Against Black-box Attacks via Pixel Level and Feature Level Distribution Alignments.
CoRR, 2022

Universal Adaptive Data Augmentation.
CoRR, 2022

Point Transformer V2: Grouped Vector Attention and Partition-based Pooling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning.
Proceedings of the Computer Vision - ECCV 2022, 2022

DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness.
Proceedings of the Computer Vision - ECCV 2022, 2022

PhysFormer: Facial Video-based Physiological Measurement with Temporal Difference Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Generalized Few-shot Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Stratified Transformer for 3D Point Cloud Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FocalClick: Towards Practical Interactive Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

LAVT: Language-Aware Vision Transformer for Referring Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Adversarial Examples on Segmentation Models Can be Easy to Transfer.
CoRR, 2021

Do Different Tracking Tasks Require Different Appearance Models?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dual-Cross Central Difference Network for Face Anti-Spoofing.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Point Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PAConv: Position Adaptive Convolution With Dynamic Kernel Assembling on Point Clouds.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Fully Convolutional Networks for Panoptic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bidirectional Projection Network for Cross Dimension Scene Understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Distilling Knowledge via Knowledge Review.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Hierarchical Interaction Network for Video Object Segmentation from Referring Expressions.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Generalized Few-Shot Semantic Segmentation.
CoRR, 2020

GridMask Data Augmentation.
CoRR, 2020

Exploring Self-Attention for Image Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Region Refinement Network for Salient Object Detection.
CoRR, 2019

Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

PointWeb: Enhancing Local Neighborhood Features for Point Cloud Processing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

UPSNet: A Unified Panoptic Segmentation Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
PSANet: Point-wise Spatial Attention Network for Scene Parsing.
Proceedings of the Computer Vision - ECCV 2018, 2018

Compositing-Aware Image Search.
Proceedings of the Computer Vision - ECCV 2018, 2018

ICNet for Real-Time Semantic Segmentation on High-Resolution Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

SegStereo: Exploiting Semantic Information for Disparity Estimation.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Automatic Real-time Background Cut for Portrait Videos.
CoRR, 2017

Pyramid Scene Parsing Network.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Augmented Feedback in Semantic Segmentation Under Image Level Supervision.
Proceedings of the Computer Vision - ECCV 2016, 2016

