Eunwoo Song

According to our database1, Eunwoo Song authored at least 37 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Unified Speech-Text Pretraining for Spoken Dialog Modeling.
CoRR, 2024

2023
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech.
CoRR, 2023

Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems.
Proceedings of the Interspeech 2022, 2022

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
Proceedings of the Interspeech 2022, 2022

TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder.
Proceedings of the Interspeech 2022, 2022

Effective Data Augmentation Methods for Neural Text-to-Speech Systems.
Proceedings of the International Conference on Electronics, Information, and Communication, 2022

Linear Prediction-based Parallel WaveGAN Speech Synthesis.
Proceedings of the International Conference on Electronics, Information, and Communication, 2022

2021
Improved Parallel Wavegan Vocoder with Perceptually Weighted Spectrogram Loss.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators.
Proceedings of the IEEE International Conference on Acoustics, 2021

TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Speaker-Adaptive Neural Vocoders for Parametric Speech Synthesis Systems.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020

Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder.
Proceedings of the Interspeech 2020, 2020

Parallel Wavegan: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ExcitGlow: Improving a WaveGlow-based Neural Vocoder with Linear Prediction Analysis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems.
CoRR, 2019

Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation.
Proceedings of the Interspeech 2019, 2019

ExcitNet Vocoder: A Neural Excitation Model for Parametric Speech Synthesis Systems.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Speaker-adaptive neural vocoders for statistical parametric speech synthesis systems.
CoRR, 2018

Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis.
Proceedings of the Interspeech 2018, 2018

A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems.
Proceedings of the Interspeech 2018, 2018

Modeling-By-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis.
Proceedings of the Interspeech 2016, 2016

Multi-class learning algorithm for deep neural network-based statistical parametric speech synthesis.
Proceedings of the 24th European Signal Processing Conference, 2016

Area-efficient one-cycle correction scheme for timing errors in flip-flop based pipelines.
Proceedings of the IEEE Asian Solid-State Circuits Conference, 2016

2015
Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model.
Proceedings of the INTERSPEECH 2015, 2015

Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A constrained two-layer compression technique for ECG waves.
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

2014
Fixed-point implementation of MPEG-D unified speech and audio coding decoder.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

2013
Speech enhancement for pathological voice using time-frequency trajectory excitation modeling.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013


  Loading...