Tao Li

Orcid: 0000-0001-5578-3960

According to our database, Tao Li authored at least 15 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

U-Style: Cascading U-Nets With Multi-Level Speaker and Style Modeling for Zero-Shot Voice Cloning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling.
CoRR, 2024

2023
MSM-VC: High-Fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-Scale Style Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Vec-Tok Speech: speech vectorization and tokenization for neural speech generation.
CoRR, 2023

Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

HIGNN-TTS: Hierarchical Prosody Modeling With Graph Neural Networks for Expressive Long-Form TTS.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Cross-Speaker Emotion Disentangling and Transfer for End-to-End Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Multi-speaker Multi-style Text-to-speech Synthesis with Single-speaker Single-style Training Data Scenarios.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

One-Shot Voice Conversion For Style Transfer Based On Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2021
Controllable cross-speaker emotion transfer for end-to-end speech synthesis.
CoRR, 2021

Controllable Emotion Transfer For End-to-End Speech Synthesis.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Enriching Source Style Transfer in Recognition-Synthesis Based Non-Parallel Voice Conversion.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

