Mengzhe Geng

Orcid: 0000-0002-7886-439X

According to our database1, Mengzhe Geng authored at least 32 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation.
CoRR, 2024

2023
Audio-Visual End-to-End Multi-Channel Speech Separation, Dereverberation and Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

A Survey of Reasoning with Foundation Models.
CoRR, 2023

Towards Automatic Data Augmentation for Disordered Speech Recognition.
CoRR, 2023

Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition.
CoRR, 2023

Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems.
CoRR, 2023

Use of Speech Impairment Severity for Dysarthric Speech Recognition.
CoRR, 2023

Adversarial Data Augmentation Using VAE-GAN for Disordered Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Exploring Self-Supervised Pre-Trained ASR Models for Dysarthric and Elderly Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Bayesian Neural Network Language Modeling for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition.
CoRR, 2022

On-the-fly Feature Based Speaker Adaptation for Dysarthric and Elderly Speech Recognition.
CoRR, 2022

Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection.
Proceedings of the Interspeech 2022, 2022

Confidence Score Based Conformer Speaker Adaptation for Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems.
Proceedings of the Interspeech 2022, 2022

Exploiting Cross Domain Acoustic-to-Articulatory Inverted Features for Disordered Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Audio-Visual Multi-Channel Integration and Recognition of Overlapped Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Recent Progress in the CUHK Dysarthric Speech Recognition System.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Adversarial Data Augmentation for Disordered Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Development of the Cuhk Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the Dementiabank Corpus.
Proceedings of the IEEE International Conference on Acoustics, 2021

Bayesian Transformer Language Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Neural Architecture Search for Speech Recognition.
CoRR, 2020

Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Investigation of Data Augmentation Techniques for Disordered Speech Recognition.
Proceedings of the Interspeech 2020, 2020

2019
The CUHK Dysarthric Speech Recognition Systems for English and Cantonese.
Proceedings of the Interspeech 2019, 2019


  Loading...