Xu Zheng

Orcid: 0000-0003-4008-8951

Affiliations:
  • AI Thrust, Hong Kong University of Science and Technology (HKUST), Guangzhou, China


According to our database1, Xu Zheng authored at least 50 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation.
CoRR, June, 2025

Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation.
CoRR, June, 2025

Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection.
CoRR, June, 2025

BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation.
CoRR, June, 2025

Shifting AI Efficiency From Model-Centric to Data-Centric Compression.
CoRR, May, 2025

Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding.
CoRR, May, 2025

Manifold-aware Representation Learning for Degradation-agnostic Image Restoration.
CoRR, May, 2025

MLLMs are Deeply Affected by Modality Bias.
CoRR, May, 2025

Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?
CoRR, May, 2025

Adversarial Robustness for Unified Multi-Modal Encoders via Efficient Calibration.
CoRR, May, 2025

Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization.
CoRR, May, 2025

Split Matching for Inductive Zero-shot Semantic Segmentation.
CoRR, May, 2025

Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook.
CoRR, March, 2025

OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation.
CoRR, March, 2025

MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation.
CoRR, March, 2025

Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance.
CoRR, March, 2025

360SFUDA++: Towards Source-Free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2025

RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning.
CoRR, February, 2025

Distilling efficient Vision Transformers from CNNs for semantic segmentation.
Pattern Recognit., 2025

Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Frozen is better than learning: A new design of prototype-based classifier for semantic segmentation.
Pattern Recognit., 2024

MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection.
CoRR, 2024

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges.
CoRR, 2024

Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation.
CoRR, 2024

EIT-1M: One Million EEG-Image-Text Pairs for Human Visual-textual Recognition and More.
CoRR, 2024

OmniBind: Teach to Build Unequal-Scale Modality Interaction for Omni-Bind of All.
CoRR, 2024

GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation.
CoRR, 2024

EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition.
CoRR, 2024

ExACT: Language-guided Conceptual Reasoning and Uncertainty Estimation for Event-based Action Recognition and More.
CoRR, 2024

UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All.
CoRR, 2024

Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation.
CoRR, 2024

Chasing Day and Night: Towards Robust and Efficient All-Day Object Detection Guided by an Event Camera.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Transformer-CNN Cohort: Semi-supervised Semantic Segmentation by the Best of Both Students.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

EventBind: Learning a Unified Representation to Bind Them All for Event-Based Open-World Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

Centering the Value of Every Modality: Towards Efficient and Resilient Modality-Agnostic Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Modality-Agnostic Representation for Semantic Segmentation from Any Modalities.
Proceedings of the Computer Vision - ECCV 2024, 2024

Semantics, Distortion, and Style Matter: Towards Source-Free UDA for Panoramic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Adversarial co-training for semantic segmentation over medical images.
Comput. Biol. Medicine, May, 2023

CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation.
CoRR, 2023

E-CLIP: Towards Label-efficient Event-based Open-world Understanding by CLIP.
CoRR, 2023

Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks.
CoRR, 2023

A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Look at the Neighbor: Distortion-aware Unsupervised Domain Adaptation for Panoramic Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Transformer-CNN Cohort: Semi-supervised Semantic Segmentation by the Best of Both Students.
CoRR, 2022

All One Needs to Know about Priors for Deep Image Restoration and Enhancement: A Survey.
CoRR, 2022

Uncertainty-aware deep co-training for semi-supervised medical image segmentation.
Comput. Biol. Medicine, 2022

Uncertainty teacher with dense focal loss for semi-supervised medical image segmentation.
Comput. Biol. Medicine, 2022


  Loading...