Byoung Jin Choi

Orcid: 0000-0003-1319-8215

According to our database¹, Byoung Jin Choi authored at least 17 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

Efficient Streaming TTS Acoustic Model with Depthwise RVQ Decoding Strategies in a Mamba Framework.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2024

Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Variable-Length Speaker Conditioning in Flow-Based Text-to-Speech.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2024

Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction.

[BibT_eX]

[DOI]

CoRR, 2024

MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

Transduce and Speak: Neural Transducer for Text-To-Speech with Semantic Token Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

SNAC: Speaker-Normalized Affine Coupling Layer in Flow-Based Architecture for Zero-Shot Multi-Speaker Text-to-Speech.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2022

A Controllable Multi-Lingual Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2022

Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech.

[BibT_eX]

[DOI]

CoRR, 2022

Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Expressive Text-to-Speech Using Style Tag.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Diff-TTS: A Denoising Diffusion Model for Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Memory Attention: Robust Alignment Using Gating Mechanism for End-to-End Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2020

Convex feasibility problems on uniformly convex metric spaces.

[BibT_eX]

[DOI]

Byoung Jin Choi

Un Cig Ji

Yongdo Lim

Optim. Methods Softw., 2020

WaveNODE: A Continuous Normalizing Flow for Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, 2020

Reformer-TTS: Neural Speech Synthesis with Reformer Network.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018

Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Byoung Jin Choi

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...