Xun Gong

Orcid: 0000-0002-3364-8407

Affiliations:

Shanghai Jiao Tong University, X-LANCE, Department of Computer Science and Engineering, MoE Key Laboratory of Artificial Intelligence, Shanghai, China

According to our database, Xun Gong authored at least 24 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Bagpiper: Solving Open-Ended Audio Tasks via Rich Captions.

CoRR, February, 2026

UrgentMOS: Unified Multi-Metric and Preference Learning for Robust Speech Quality Assessment.

CoRR, January, 2026

2025

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM.

CoRR, May, 2025

Ranking and Selection of Bias Words for Contextual Bias Speech Recognition.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2024

SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Advanced Long-Content Speech Recognition With Factorized Neural Transducer.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition.

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

ConMamba: A Convolution-Augmented Mamba Encoder Model for Efficient End-to-End ASR Systems.

Haoxiang Hou

Xun Gong

Yanmin Qian

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Contextual Biasing Speech Recognition in Speech-enhanced Large Language Model.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR.

CoRR, 2023

Text Only Domain Adaptation with Phoneme Guided Data Splicing for End-to-End Speech Recognition.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Joint Discriminator and Transfer Based Fast Domain Adaptation For End-To-End Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Factorized AED: Factorized Attention-Based Encoder-Decoder for Text-Only Domain Adaptive ASR.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Efficient Text-Only Domain Adaptation For CTC-Based ASR.

Chang Chen

Xun Gong

Yanmin Qian

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition.

Yanmin Qian

Xun Gong

Houjun Huang

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition.

Xun Gong

Zhikai Zhou

Yanmin Qian

CoRR, 2022

Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition.

Xun Gong

Zhikai Zhou

Yanmin Qian

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

The SJTU System for Multimodal Information Based Speech Processing Challenge 2021.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2021

Speaker Embedding Augmentation with Noise Distribution Matching.

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

End-to-End Texture-Aware and Depth-Aware Embedded Advertising for Videos.

Jiasen Li

Xun Gong

Boning Li

Proceedings of the ICCTA 2020: 6th International Conference on Computer and Technology Applications, 2020

Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings.

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020
