Tao Li

Orcid: 0000-0001-5578-3960

Affiliations:
  • Northwestern Polytechnical University, School of Computer Science, Audio, Speech and Language Processing Group, Xi'an, China


According to our database1, Tao Li authored at least 15 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

U-Style: Cascading U-Nets With Multi-Level Speaker and Style Modeling for Zero-Shot Voice Cloning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling.
CoRR, 2024

2023
MSM-VC: High-Fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-Scale Style Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Vec-Tok Speech: speech vectorization and tokenization for neural speech generation.
CoRR, 2023

Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling.
Proceedings of the IEEE International Conference on Acoustics, 2023

HIGNN-TTS: Hierarchical Prosody Modeling With Graph Neural Networks for Expressive Long-Form TTS.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Cross-Speaker Emotion Disentangling and Transfer for End-to-End Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Multi-speaker Multi-style Text-to-speech Synthesis with Single-speaker Single-style Training Data Scenarios.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

One-Shot Voice Conversion For Style Transfer Based On Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Controllable cross-speaker emotion transfer for end-to-end speech synthesis.
CoRR, 2021

Controllable Emotion Transfer For End-to-End Speech Synthesis.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Enriching Source Style Transfer in Recognition-Synthesis Based Non-Parallel Voice Conversion.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021


  Loading...