Xianwei Zhuang
Orcid: 0009-0004-4392-6126
According to our database1,
Xianwei Zhuang
authored at least 28 papers
between 2022 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination Mitigation.
CoRR, June, 2025
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning.
CoRR, April, 2025
Do we really have to filter out random noise in pre-training data for language models?
CoRR, February, 2025
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model.
CoRR, January, 2025
VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification.
CoRR, January, 2025
SemiGMMPoint: Semi-supervised point cloud segmentation based on Gaussian mixture models.
Pattern Recognit., 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Towards Multimodal-augmented Pre-trained Language Models via Self-balanced Expectation-Maximization Iteration.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
TFCD: Towards Multi-modal Sarcasm Detection via Training-Free Counterfactual Debiasing.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Uncertainty-Aware Sign Language Video Retrieval with Probability Distribution Modeling.
Proceedings of the Computer Vision - ECCV 2024, 2024
PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2022
Residual Swin Transformer Unet with Consistency Regularization for Automatic Breast Ultrasound Tumor Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022