Rongfeng Su

Orcid: 0000-0002-7228-5768

According to our database1, Rongfeng Su authored at least 22 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data.
CoRR, 2024

2023
Audio-video database from subacute stroke patients for dysarthric speech intelligence assessment and preliminary analysis.
Biomed. Signal Process. Control., 2023

2022
On-the-fly Feature Based Speaker Adaptation for Dysarthric and Elderly Speech Recognition.
CoRR, 2022

Respiratory and laryngeal influences on voice in post-stroke dysarthria: a pilot study.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

A Phone-Level Speaker Embedding Extraction Framework with Multi-Gate Mixture-of-Experts Based Multi-Task Learning.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition.
Proceedings of the Interspeech 2022, 2022

A New Method for Predicting Severity Level of Dysarthric Speech Based on Joint Feature-Sample Selection using Audio-Visual Data.
Proceedings of the International Conference on Asian Language Processing, 2022

An Investigation of Magnitude-Based and Phase-Based Features for Large-Scale Speaker Identification.
Proceedings of the International Conference on Asian Language Processing, 2022

2020
Cross-Domain Deep Visual Feature Generation for Mandarin Audio-Visual Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.
Proceedings of the Interspeech 2020, 2020

2019
Towards the Speech Features of Early-Stage Dementia: Design and Application of the Mandarin Elderly Cognitive Speech Database.
Proceedings of the Interspeech 2019, 2019

Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.
Proceedings of the Interspeech 2019, 2019

2018
Semi-supervised Cross-domain Visual Feature Learning for Audio-Visual Broadcast Speech Transcription.
Proceedings of the Interspeech 2018, 2018

Gaussian Process Neural Networks for Speech Recognition.
Proceedings of the Interspeech 2018, 2018

2017
Multimodal learning using 3D audio-visual data for audio-visual speech recognition.
Proceedings of the 2017 International Conference on Asian Language Processing, 2017

2016
A multi-channel/multi-speaker interactive 3D audio-visual speech corpus in Mandarin.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Convolutional neural network bottleneck features for bi-directional generalized variable parameter HMMs.
Proceedings of the IEEE International Conference on Information and Automation, 2016

2015
Automatic Complexity Control of Generalized Variable Parameter HMMs for Noise Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Generalized variable parameter HMMs based acoustic-to-articulatory inversion.
Proceedings of the INTERSPEECH 2015, 2015

Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

2014
Deep neural network bottleneck features for generalized variable parameter HMMs.
Proceedings of the INTERSPEECH 2014, 2014

2013
Automatic model complexity control for generalized variable parameter HMMs.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013


  Loading...