Xun Gong

ORCID: 0000-0002-3364-8407

Affiliations:
  • Shanghai Jiao Tong University, X-LANCE, Department of Computer Science and Engineering, MoE Key Laboratory of Artificial Intelligence, Shanghai, China


According to our database, Xun Gong authored at least 14 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Advanced Long-Content Speech Recognition With Factorized Neural Transducer.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Advanced Long-Content Speech Recognition With Factorized Neural Transducer.
CoRR, 2024

2023
Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR.
CoRR, 2023

Joint Discriminator and Transfer Based Fast Domain Adaptation For End-To-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Factorized AED: Factorized Attention-Based Encoder-Decoder for Text-Only Domain Adaptive ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition.
CoRR, 2022

Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition.
Proceedings of the Interspeech 2022, 2022

The SJTU System for Multimodal Information Based Speech Processing Challenge 2021.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Speaker Embedding Augmentation with Noise Distribution Matching.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
End-to-End Texture-Aware and Depth-Aware Embedded Advertising for Videos.
Proceedings of the ICCTA 2020: 6th International Conference on Computer and Technology Applications, 2020

Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

