Kun Zhou
Orcid: 0000-0002-7869-4474Affiliations:
- Alibaba DAMO Academy, Singapore
- National University of Singapore, Department of Electrical and Computer Engineering, Singapore (PhD 2023)
  According to our database1,
  Kun Zhou
  authored at least 34 papers
  between 2019 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
- 
    on orcid.org
On csauthors.net:
Bibliography
  2025
Multi-Step Prediction and Control of Hierarchical Emotion Distribution in Text-to-Speech Synthesis.
    
  
    CoRR, July, 2025
    
  
Plug-and-Play Co-Occurring Face Attention for Robust Audio-Visual Speaker Extraction.
    
  
    CoRR, May, 2025
    
  
    CoRR, May, 2025
    
  
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation.
    
  
    CoRR, March, 2025
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution.
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models.
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
  2024
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions.
    
  
    CoRR, 2024
    
  
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with A Conditional Diffusion Model.
    
  
    Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
    
  
    Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
    Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024
    
  
  2023
    IEEE Trans. Affect. Comput., 2023
    
  
  2022
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion.
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
  2021
Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity.
    
  
    CoRR, 2021
    
  
    Proceedings of the IEEE Spoken Language Technology Workshop, 2021
    
  
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-Stage Sequence-to-Sequence Training.
    
  
    Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
    
  
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2021
    
  
    Proceedings of the Blizzard Challenge 2021, virtual, October 23, 2021, 2021
    
  
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer.
    
  
    Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
    
  
  2020
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data.
    
  
    Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
    
  
    Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
    
  
    Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020
    
  
    Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
    
  
    Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
    
  
  2019
    Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019