Xu Zheng
Orcid: 0000-0003-4008-8951Affiliations:
- AI Thrust, Hong Kong University of Science and Technology (HKUST), Guangzhou, China
According to our database1,
Xu Zheng
authored at least 50 papers
between 2022 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
CoRR, June, 2025
CoRR, June, 2025
Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection.
CoRR, June, 2025
BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation.
CoRR, June, 2025
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization.
CoRR, May, 2025
Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook.
CoRR, March, 2025
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation.
CoRR, March, 2025
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation.
CoRR, March, 2025
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance.
CoRR, March, 2025
360SFUDA++: Towards Source-Free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2025
RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning.
CoRR, February, 2025
Pattern Recognit., 2025
Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
Frozen is better than learning: A new design of prototype-based classifier for semantic segmentation.
Pattern Recognit., 2024
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection.
CoRR, 2024
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges.
CoRR, 2024
CoRR, 2024
EIT-1M: One Million EEG-Image-Text Pairs for Human Visual-textual Recognition and More.
CoRR, 2024
CoRR, 2024
GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation.
CoRR, 2024
EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition.
CoRR, 2024
ExACT: Language-guided Conceptual Reasoning and Uncertainty Estimation for Event-based Action Recognition and More.
CoRR, 2024
CoRR, 2024
Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation.
CoRR, 2024
Chasing Day and Night: Towards Robust and Efficient All-Day Object Detection Guided by an Event Camera.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Transformer-CNN Cohort: Semi-supervised Semantic Segmentation by the Best of Both Students.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
EventBind: Learning a Unified Representation to Bind Them All for Event-Based Open-World Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024
Centering the Value of Every Modality: Towards Efficient and Resilient Modality-Agnostic Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Learning Modality-Agnostic Representation for Semantic Segmentation from Any Modalities.
Proceedings of the Computer Vision - ECCV 2024, 2024
Semantics, Distortion, and Style Matter: Towards Source-Free UDA for Panoramic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Comput. Biol. Medicine, May, 2023
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation.
CoRR, 2023
CoRR, 2023
CoRR, 2023
A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Look at the Neighbor: Distortion-aware Unsupervised Domain Adaptation for Panoramic Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Transformer-CNN Cohort: Semi-supervised Semantic Segmentation by the Best of Both Students.
CoRR, 2022
All One Needs to Know about Priors for Deep Image Restoration and Enhancement: A Survey.
CoRR, 2022
Comput. Biol. Medicine, 2022
Uncertainty teacher with dense focal loss for semi-supervised medical image segmentation.
Comput. Biol. Medicine, 2022