Wen-Chin Huang

ORCID: 0000-0003-3172-3335

According to our database, Wen-Chin Huang authored at least 51 papers between 2018 and 2023.

Bibliography

2023
Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders.
CoRR, 2023

AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion.
CoRR, 2023

The Singing Voice Conversion Challenge 2023.
CoRR, 2023

Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Comparative Study of Voice Conversion Models With Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

The Singing Voice Conversion Challenge 2023.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Improving Severity Preservation of Healthy-to-Pathological Voice Conversion With Global Style Tokens.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

The VoiceMOS Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
A Comparative Study of Self-Supervised Speech Representation Based Voice Conversion.
IEEE J. Sel. Top. Signal Process., 2022

Investigating Self-supervised Pretraining Frameworks for Pathological Speech Recognition.
Proceedings of the Interspeech 2022, 2022

End-to-End Binaural Speech Synthesis.
Proceedings of the Interspeech 2022, 2022

The VoiceMOS Challenge 2022.
Proceedings of the Interspeech 2022, 2022

Direct Noisy Speech Modeling for Noisy-To-Noisy Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2022

S3PRL-VC: Open-Source Voice Conversion Framework with Self-Supervised Speech Representations.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Identity Preserving Normal to Dysarthric Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2022

LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

Generalization Ability of MOS Prediction Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Many-to-Many Voice Transformer Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Pretraining Techniques for Sequence-to-Sequence Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

The AS-NU System for the M2VoC Challenge.
CoRR, 2021

EMA2S: An End-to-End Multimodal Articulatory-to-Speech System.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder.
Proceedings of the IEEE International Conference on Acoustics, 2021

Speech Recognition by Simply Fine-Tuning BERT.
Proceedings of the IEEE International Conference on Acoustics, 2021

Any-to-One Sequence-to-Sequence Voice Conversion Using Self-Supervised Discrete Speech Representations.
Proceedings of the IEEE International Conference on Acoustics, 2021

Non-Autoregressive Sequence-To-Sequence Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2021

Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

On Prosody Modeling for ASR+TTS Based Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Noisy-to-Noisy Voice Conversion Framework with Denoising Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Investigation of Text-to-Speech-based Synthetic Parallel Data for Sequence-to-Sequence Non-Parallel Voice Conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Unsupervised Representation Disentanglement Using Cross Domain Features and Adversarial Learning in Variational Autoencoder Based Voice Conversion.
IEEE Trans. Emerg. Top. Comput. Intell., 2020

Examining the effects of developer familiarity on bug fixing.
J. Syst. Softw., 2020

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.
CoRR, 2020

Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations.
CoRR, 2020

The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders.
CoRR, 2020

The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS.
CoRR, 2020

Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions.
CoRR, 2020

Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion.
CoRR, 2020

An Empirical Study on Issue Knowledge Transfer from Python to R for Machine Learning Software.
Proceedings of the 32nd International Conference on Software Engineering and Knowledge Engineering, 2020

Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining.
Proceedings of the Interspeech 2020, 2020

2019
The ASVspoof 2019 database.
CoRR, 2019

MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion.
Proceedings of the Interspeech 2019, 2019

Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion.
Proceedings of the Interspeech 2019, 2019

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
WaveNet 聲碼器及其於語音轉換之應用 (WaveNet Vocoder and its Applications in Voice Conversion) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

