Minchuan Chen

Orcid: 0009-0001-1512-6672

According to our database1, Minchuan Chen authored at least 15 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Relative Boundary Modeling: A High-Resolution Cricket Bowl Release Detection Framework with I3D Features.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Exploring Loss Function and Rank Fusion for Enhanced Person Re-identification.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Image- and Instance-Level Data Augmentation for Occluded Instance Segmentation.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

2022
A compact transformer-based GAN vocoder.
Proceedings of the Interspeech 2022, 2022

2021
EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture.
Proceedings of the 38th International Conference on Machine Learning, 2021

Unsupervised Learning for Multi-Style Speech Synthesis with Limited Data.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Neural Text Normalization with Partial Parameter Generator and Pointer-Generator Network.
Proceedings of the IEEE International Conference on Acoustics, 2021

PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks.
Proceedings of the Interspeech 2020, 2020

Nonparallel Emotional Speech Conversion Using VAE-GAN.
Proceedings of the Interspeech 2020, 2020

Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Cross-Lingual, Multi-Speaker Text-To-Speech Synthesis Using Neural Speaker Embedding.
Proceedings of the Interspeech 2019, 2019


  Loading...