Xurong Xie
Orcid: 0000-0002-6714-6296
  According to our database1,
  Xurong Xie
  authored at least 67 papers
  between 2014 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
  2025
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding.
    
  
    CoRR, June, 2025
    
  
Towards LLM-Empowered Fine-Grained Speech Descriptors for Explainable Emotion Recognition.
    
  
    CoRR, May, 2025
    
  
On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition.
    
  
    CoRR, May, 2025
    
  
Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models.
    
  
    CoRR, May, 2025
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
Emotionally Challenging Games Can Satisfy Older Adults' Psychological Needs: From Empirical Study to Design Guidelines.
    
  
    Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems, 2025
    
  
"Imitating at the Last Minute": Exploring Approaches to Enhance Public Speaking Performance in Limited Time.
    
  
    Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025
    
  
  2024
Survey of neurocognitive disorder detection methods based on speech, visual, and virtual reality technologies.
    
  
    Virtual Real. Intell. Hardw., 2024
    
  
Self-Supervised ASR Models and Features for Dysarthric and Elderly Speech Recognition.
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2024
    
  
Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition.
    
  
    CoRR, 2024
    
  
Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines.
    
  
    CoRR, 2024
    
  
Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation.
    
  
    CoRR, 2024
    
  
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask.
    
  
    CoRR, 2024
    
  
    Proceedings of the Social Robotics - 16th International Conference, 2024
    
  
Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM<sup>+</sup> Guidelines.
    
  
    Proceedings of the Social Robotics - 16th International Conference, 2024
    
  
Investigation of Cross Modality Feature Fusion for Audio-Visual Dysarthric Speech Assessment.
    
  
    Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
    
  
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask.
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition.
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition.
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
Towards High-Performance and Low-Latency Feature-Based Speaker Adaptation of Conformer Speech Recognition Systems.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
  2023
Probing Lexical Ambiguity in Chinese Characters via Their Word Formations: Convergence of Perceived and Computed Metrics.
    
  
    Cogn. Sci., November, 2023
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2023
    
  
Exploiting Cross-Domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition.
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition.
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems.
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
Unsupervised Model-Based Speaker Adaptation of End-To-End Lattice-Free MMI Model for Speech Recognition.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2023
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2023
    
  
Exploring Self-Supervised Pre-Trained ASR Models for Dysarthric and Elderly Speech Recognition.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2023
    
  
ChallengeDetect: Investigating the Potential of Detecting In-Game Challenge Experience from Physiological Measures.
    
  
    Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023
    
  
  2022
    IEEE ACM Trans. Audio Speech Lang. Process., 2022
    
  
Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition.
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2022
    
  
Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition.
    
  
    CoRR, 2022
    
  
On-the-fly Feature Based Speaker Adaptation for Dysarthric and Elderly Speech Recognition.
    
  
    CoRR, 2022
    
  
Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition.
    
  
    CoRR, 2022
    
  
A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition.
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems.
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
Exploiting Cross Domain Acoustic-to-Articulatory Inverted Features for Disordered Speech Recognition.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2022
    
  
Detecting challenge from physiological signals: A primary study with a typical game scenario.
    
  
    Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022
    
  
  2021
    IEEE ACM Trans. Audio Speech Lang. Process., 2021
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2021
    
  
Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition.
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2021
    
  
    Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021
    
  
Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition.
    
  
    Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
    
  
    Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
    
  
    Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
    
  
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition.
    
  
    Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
    
  
Development of the Cuhk Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the Dementiabank Corpus.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2021
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2021
    
  
  2020
    Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
    
  
    Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
    
  
  2019
Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features.
    
  
    Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
    
  
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.
    
  
    Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
    
  
BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2019
    
  
Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2019
    
  
  2018
Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion.
    
  
    Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
    
  
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.
    
  
    Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
    
  
    Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
    
  
  2017
    Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
    
  
  2016
Deep Neural Network Based Acoustic-to-Articulatory Inversion Using Phone Sequence Information.
    
  
    Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
    
  
  2015
    Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
    
  
Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition.
    
  
    Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
    
  
  2014
    Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014