We stand with Ukraine

We stand with Ukraine

Minchuan Chen

Orcid: 0009-0001-1512-6672

According to our database¹, Minchuan Chen authored at least 21 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation.

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

LEF-TTS: Lightweight and Efficient End-to-End Text-to-Speech Synthesis With Multi-Stream Generator.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion.

[DOI]

,

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Improving Multilingual Text-to-Speech with Mixture-of-Language-Experts and Accent Disentanglement.

[DOI]

,

,

,

,

,

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform Generation.

[DOI]

,

,

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

ESVC: Combining Adaptive Style Fusion and Multi-Level Feature Disentanglement for Expressive Singing Voice Conversion.

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Relative Boundary Modeling: A High-Resolution Cricket Bowl Release Detection Framework with I3D Features.

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Exploring Loss Function and Rank Fusion for Enhanced Person Re-identification.

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Image- and Instance-Level Data Augmentation for Occluded Instance Segmentation.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Exploring multi-task learning and data augmentation in dementia detection with self-supervised pretrained models.

[DOI]

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

A compact transformer-based GAN vocoder.

[DOI]

,

,

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder.

[DOI]

,

,

,

,

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention.

[DOI]

,

,

,

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture.

[DOI]

,

,

,

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Unsupervised Learning for Multi-Style Speech Synthesis with Limited Data.

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Neural Text Normalization with Partial Parameter Generator and Pointer-Generator Network.

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check.

[DOI]

,

,

,

,

,

,

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks.

[DOI]

,

,

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Nonparallel Emotional Speech Conversion Using VAE-GAN.

[DOI]

,

,

,

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow.

[DOI]

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Cross-Lingual, Multi-Speaker Text-To-Speech Synthesis Using Neural Speaker Embedding.

[DOI]

,

,

,

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Loading...