Yiwei Guo

Orcid: 0000-0002-2681-717X

According to our database, Yiwei Guo authored at least 19 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.



Bibliography

2024
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech.
CoRR, 2024

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Speaker Adaptive Text-to-Speech With Timbre-Normalized Vector-Quantized Feature.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention.
CoRR, 2023

Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations.
CoRR, 2023

Acoustic BPE for Speech Generation with Discrete Tokens.
CoRR, 2023

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition.
CoRR, 2023

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching.
CoRR, 2023

DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech.
CoRR, 2023

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding.
CoRR, 2023

Joint Node Representation Learning and Clustering for Attributed Graph via Graph Diffusion Convolution.
Proceedings of the International Joint Conference on Neural Networks, 2023

DiffVoice: Text-to-Speech with Latent Diffusion.
Proceedings of the IEEE International Conference on Acoustics, 2023

Emodiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Spatio-Temporal Dynamics of Entropy in EEGs during Music Stimulation of Alzheimer's Disease Patients with Different Degrees of Dementia.
Entropy, 2022

BiasedWalk: Learning Global-aware Node Embeddings via Biased Sampling.
CoRR, 2022

VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature.
Proceedings of the Interspeech 2022, 2022

Unsupervised Word-Level Prosody Tagging for Controllable Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
A Reinforcement Learning Approach to Train Timetabling for Inter-City High Speed Railway Lines.
Proceedings of the 5th IEEE International Conference on Intelligent Transportation Engineering, 2020

