Xianwei Zhuang

Orcid: 0009-0004-4392-6126

According to our database, Xianwei Zhuang authored at least 28 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination Mitigation.
CoRR, June, 2025

VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning.
CoRR, April, 2025

Do we really have to filter out random noise in pre-training data for language models?
CoRR, February, 2025

VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model.
CoRR, January, 2025

VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification.
CoRR, January, 2025

SemiGMMPoint: Semi-supervised point cloud segmentation based on Gaussian mixture models.
Pattern Recognit., 2025

UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

HCoTT: Hierarchical Chain-of-Thought Distillation.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Towards Multimodal-augmented Pre-trained Language Models via Self-balanced Expectation-Maximization Iteration.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

GPA: Global and Prototype Alignment for Audio-Text Retrieval.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

TFCD: Towards Multi-modal Sarcasm Detection via Training-Free Counterfactual Debiasing.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

What are the Generator Preferences for End-to-end Task-Oriented Dialog System?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval.
Proceedings of the Computer Vision - ECCV 2024, 2024

Uncertainty-Aware Sign Language Video Retrieval with Probability Distribution Modeling.
Proceedings of the Computer Vision - ECCV 2024, 2024

PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2022
Residual Swin Transformer Unet with Consistency Regularization for Automatic Breast Ultrasound Tumor Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

