Kun Zhou
Orcid: 0000-0002-7869-4474Affiliations:
- Alibaba DAMO Academy, Singapore
- National University of Singapore, Department of Electrical and Computer Engineering, Singapore (PhD 2023)
According to our database1,
Kun Zhou
authored at least 34 papers
between 2019 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Multi-Step Prediction and Control of Hierarchical Emotion Distribution in Text-to-Speech Synthesis.
CoRR, July, 2025
Plug-and-Play Co-Occurring Face Attention for Robust Audio-Visual Speaker Extraction.
CoRR, May, 2025
CoRR, May, 2025
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation.
CoRR, March, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
2024
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions.
CoRR, 2024
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with A Conditional Diffusion Model.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024
2023
IEEE Trans. Affect. Comput., 2023
2022
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity.
CoRR, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-Stage Sequence-to-Sequence Training.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Blizzard Challenge 2021, virtual, October 23, 2021, 2021
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019