We stand with Ukraine

Yuxin Xie

Affiliations:

Peking University, School of Electronic and Computer Engineering, Beijing, China

According to our database¹, Yuxin Xie authored at least 14 papers between 2024 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2025

SupCLAP: Controlling Optimization Trajectory Drift in Audio-Text Contrastive Learning with Support Vector Regularization.

CoRR, September, 2025

VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning.

CoRR, April, 2025

Do we really have to filter out random noise in pre-training data for language models?

CoRR, February, 2025

VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model.

CoRR, January, 2025

VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification.

CoRR, January, 2025

SpeechSEC: A Unified Multi-Task Framework for Speech Synthesis, Editing, and Continuation.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Towards Zero-shot Cross-lingual SLU with Syntax-aware Multi-view Contrastive Learning.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

GPA: Global and Prototype Alignment for Audio-Text Retrieval.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval.

Proceedings of the Computer Vision - ECCV 2024, 2024

PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
