Yuxin Xie

Affiliations:
  • Peking University, School of Electronic and Computer Engineering, Beijing, China


According to our database1, Yuxin Xie authored at least 12 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning.
CoRR, April, 2025

Do we really have to filter out random noise in pre-training data for language models?
CoRR, February, 2025

VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model.
CoRR, January, 2025

VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification.
CoRR, January, 2025

Towards Zero-shot Cross-lingual SLU with Syntax-aware Multi-view Contrastive Learning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

GPA: Global and Prototype Alignment for Audio-Text Retrieval.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval.
Proceedings of the Computer Vision - ECCV 2024, 2024

PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024


  Loading...