Xianghu Yue

Orcid: 0000-0003-3527-6034

According to our database¹, Xianghu Yue authored at least 29 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

HalluAudio: A Comprehensive Benchmark for Hallucination Detection in Large Audio-Language Models.

[BibT_eX]

[DOI]

CoRR, April, 2026

Whisper-MLA: Reducing GPU Memory Consumption of ASR Models based on MHA2MLA Conversion.

[BibT_eX]

[DOI]

CoRR, March, 2026

VoiceBench: Benchmarking LLM-Based Voice Assistants.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2026

Listening for "You": Enhancing Speech Image Retrieval via Target Speaker Extraction.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2026

PAL: Prompting analytic learning with missing modality for multi-modal class-incremental learning.

[BibT_eX]

[DOI]

Pattern Recognit., 2026

EA-VAE: Learning to Reconstruct Dysarthric Speech via Variational Autoencoder with Encoding Alignment.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture.

[BibT_eX]

[DOI]

CoRR, April, 2025

Analytic Class Incremental Learning for Sound Source Localization With Privacy Protection.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2025

EventLip: Enhancing Event-Based Lip Reading via Frequency-Aware Spatiotemporal Hypergraph Modeling.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Text-Guided HuBERT: Self-Supervised Speech Pre-Training via Generative Adversarial Networks.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2024

CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing.

[BibT_eX]

[DOI]

CoRR, 2024

Language Without Borders: A Dataset and Benchmark for Code-Switching Lip Reading.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

PD-Refiner: An Underlying Surface Inheritance Refiner with Adaptive Edge-Aware Supervision for Point Cloud Denoising.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MMAL: Multi-Modal Analytic Learning for Exemplar-Free Audio-Visual Class Incremental Tasks.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023

Self-Supervised Acoustic Word Embedding Learning via Correspondence Transformer Encoder.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Token2vec: A Joint Self-Supervised Pre-Training Framework Using Unpaired Speech and Text.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Transriber: Few-Shot Lyrics Transcription With Self-Training.

[BibT_eX]

[DOI]

Xiaoxue Gao

Xianghu Yue

Haizhou Li

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Self-Supervised Learning With Segmental Masking for Speech Representation.

[BibT_eX]

[DOI]

Xianghu Yue

Jingru Lin

Fabian Ritter Gutierrez

Haizhou Li

IEEE J. Sel. Top. Signal Process., 2022

2021

Phonetically Motivated Self-Supervised Speech Representation Learning.

[BibT_eX]

[DOI]

Xianghu Yue

Haizhou Li

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Data augmentation in fault diagnosis based on the Wasserstein generative adversarial network with gradient penalty.

[BibT_eX]

[DOI]

Xin Gao

Fang Deng

Xianghu Yue

Neurocomputing, 2020

2019

Multisource Energy Harvesting System for a Wireless Sensor Network Node in the Field Environment.

[BibT_eX]

[DOI]

IEEE Internet Things J., 2019

Multidimensional zero-crossing interval points: a low sampling rate acoustic fingerprint recognition method.

[BibT_eX]

[DOI]

Xianghu Yue

Fang Deng

Yue Xu

Sci. China Inf. Sci., 2019

Multi-Graph Decoding for Code-Switching ASR.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Linguistically Motivated Parallel Data Augmentation for Code-Switch Language Modeling.

[BibT_eX]

[DOI]

Grandee Lee

Xianghu Yue

Haizhou Li

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

End-to-End Code-Switching ASR for Low-Resourced Language Pairs.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2017

Energy-Based Sound Source Localization with Low Power Consumption in Wireless Sensor Networks.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Electron., 2017

A Novel Robust Trilateration Method Applied to Ultra-Wide Bandwidth Location Systems.

[BibT_eX]

[DOI]

Sensors, 2017

Xianghu Yue

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...