Hoon-Young Cho

Orcid: 0000-0002-6850-6580

According to our database1, Hoon-Young Cho authored at least 19 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Mels-Tts : Multi-Emotion Multi-Lingual Multi-Speaker Text-To-Speech System Via Disentangled Style Tokens.
Proceedings of the IEEE International Conference on Acoustics, 2024

Latent Filling: Latent Space Data Augmentation for Zero-Shot Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Hierarchical Timbre-Cadence Speaker Encoder for Zero-shot Speech Synthesis.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space.
CoRR, 2022

2021
GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis.
CoRR, 2021

GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

N-Singer: A Non-Autoregressive Korean Singing Voice Synthesis System for Pronunciation Enhancement.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

FastPitchFormant: Source-Filter Based Decomposed Modeling for Speech Synthesis.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Effective Emotion Transplantation in an End-to-End Text-to-Speech System.
IEEE Access, 2020

VocGAN: A High-Fidelity Real-Time Vocoder with a Hierarchically-Nested Adversarial Network.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaking Speed Control of End-to-End Speech Synthesis Using Sentence-Level Conditioning.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Detecting Mismatch Between Text Script and Voice-Over Using Utterance Verification Based on Phoneme Recognition Ranking.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2011
Zero-Crossing-Based Channel Attentive Weighting of Cepstral Features for Robust Speech Recognition: The ETRI 2011 CHiME Challenge System.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2007
Data-Driven Subvector Clustering using the Cross-Entropy Method.
Proceedings of the IEEE International Conference on Acoustics, 2007

2004
On the use of channel-attentive MFCC for robust recognition of partially corrupted speech.
IEEE Signal Process. Lett., 2004

Emotion verification for emotion detection and unknown emotion rejection.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

1998
A Robust Front-End for Telephone Speech Recognition.
Proceedings of the PRICAI'98, 1998


  Loading...