Leyuan Qu

Orcid: 0000-0001-6694-5355

According to our database1, Leyuan Qu authored at least 14 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Disentangling Prosody Representations With Unsupervised Speech Reconstruction.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Emphasizing unseen words: New vocabulary acquisition for end-to-end speech recognition.
Neural Networks, April, 2023

Few Shot Learning Guided by Emotion Distance for Cross-corpus Speech Emotion Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Data Augmentation with Unsupervised Speaking Style Transfer for Speech Emotion Recognition.
CoRR, 2022

A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
Neural Network Learning for Robust Speech Recognition.
PhD thesis, 2021

Hearing Faces: Target Speaker Text-to-Speech Synthesis from a Face.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Multimodal Target Speech Separation with Voice and Face References.
Proceedings of the Interspeech 2020, 2020

Variational Autoencoder with Global- and Medium Timescale Auxiliaries for Emotion Recognition from Speech.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2020, 2020

2019
LipSound: Neural Mel-Spectrogram Reconstruction for Lip Reading.
Proceedings of the Interspeech 2019, 2019

2018
Combining Articulatory Features with End-to-End Learning in Speech Recognition.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018

2016
Senone log-likelihood ratios based articulatory features in pronunciation erroneous tendency detecting.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Landmark of Mandarin nasal codas and its application in pronunciation error detection.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016


  Loading...