Keiichiro Oura

Orcid: 0009-0004-7611-9435

According to our database¹, Keiichiro Oura authored at least 70 papers between 2006 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

Deep Hidden Semi-Markov Model-Based Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Access, 2026

2023

SPTK4: An Open-Source Software Toolkit for Speech Signal Processing.

[BibT_eX]

[DOI]

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis System.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2021

Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components.

[BibT_eX]

[DOI]

IEEE Access, 2021

Periodnet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fast and High-Quality Singing Voice Synthesis System Based on Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Semi-Supervised Learning Based on Hierarchical Generative Models for End-to-End Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Singing voice synthesis based on convolutional neural networks.

[BibT_eX]

[DOI]

CoRR, 2019

Low computational cost speech synthesis based on deep neural networks using hidden semi-Markov model structures.

[BibT_eX]

[DOI]

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Deep neural network based real-time speech vocoder with periodic and aperiodic inputs.

[BibT_eX]

[DOI]

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Speaker-dependent Wavenet-based Delay-free Adpcm Speech Coding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Singing Voice Synthesis Based on Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Mel-Cepstrum-Based Quantization Noise Shaping Applied to Neural-Network-Based Speech Waveform Synthesis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

WaveNet-Based Zero-Delay Lossless Speech Coding.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Statistical Voice Conversion Based on Wavenet.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

The NITech text-to-speech system for the Blizzard Challenge 2018.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2018, Hyderabad, India, September 8, 2018, 2018

Discriminative Feature Extraction Based on Sequential Variational Autoencoder for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Singing Voice Conversion Using Posted Waveform Data on Music Social Media.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Speaker Adaptation for Speech Synthesis Based on Deep Neural Networks Using Hidden Semi-Markov Model Structures.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Image Recognition Based on Convolutional Neural Networks Using Features Generated from Separable Lattice Hidden Markov Models.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Recent Development of the DNN-based Singing Voice Synthesis System - Sinsy.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Speech Synthesis Using WaveNet Vocoder Based on Periodic/Aperiodic Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

The NITech text-to-speech system for the Blizzard Challenge 2017.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2017, Stockholm, Sweden, August 25, 2017, 2017

Generalization of Thai tone contour in HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

User Generated Dialogue Systems: uDialogue.

[BibT_eX]

[DOI]

Proceedings of the Human-Harmonized Information Technology, Volume 2, 2017

2016

Temporal modeling in neural network based statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Singing Voice Synthesis Based on Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Voice Conversion Based on Trajectory Model Training of Neural Networks Considering Global Variance.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Redefining the Linguistic Context Feature Set for HMM and DNN TTS Through Position and Parsing.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Trajectory training considering global variance for speech synthesis based on neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

The NITech text-to-speech system for the Blizzard Challenge 2016.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2016, Cuppertino, CA, USA, September 16, 2016, 2016

2015

Using speaker adaptive training to realize Mandarin-Tibetan cross-lingual speech synthesis.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2015

The effect of neural networks in statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The NITECH HMM-based text-to-speech system for the Blizzard Challenge 2015.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2015, 2015

2014

A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

HMM-Based singing voice synthesis and its application to Japanese and English.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Voice interaction system with 3D-CG virtual agent for stand-alone smartphones.

[BibT_eX]

[DOI]

Proceedings of the second international conference on Human-agent interaction, 2014

Overview of NITECH HMM-based text-to-speech system for Blizzard Challenge 2014.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014

2013

Speech Synthesis Based on Hidden Markov Models.

[BibT_eX]

[DOI]

Proc. IEEE, 2013

Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2013

Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Real-time control of expressive speech synthesis using kinect body tracking.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Mmdagent - A fully open-source toolkit for voice interaction systems.

[BibT_eX]

[DOI]

Akinobu Lee

Keiichiro Oura

Keiichi Tokuda

Proceedings of the IEEE International Conference on Acoustics, 2013

Overview of NITECH HMM-based speech synthesis system for Blizzard Challenge 2013.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2013, 2013

Realizing Tibetan speech synthesis by speaker adaptive training.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping.

[BibT_eX]

[DOI]

Speech Commun., 2012

Pitch adaptive training for hmm-based singing voice synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2012.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012

2011

Large-Scale Subjective Evaluations of Speech Rate Control Methods for HMM-Based Speech Synthesizers.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An optimization algorithm of independent mean and variance parameter tying structures for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2011.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011

2010

Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2010

A Covariance-Tying Technique for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project.

[BibT_eX]

[DOI]

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Recent development of the HMM-based singing voice synthesis system - Sinsy.

[BibT_eX]

[DOI]

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

HMM-based singing voice synthesis system using pitch-shifted pseudo training data.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2010.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010

Personalising Speech-To-Speech Translation in the EMIME Project.

[BibT_eX]

[DOI]

Proceedings of the ACL 2010, 2010

2009

Thousands of voices for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2009.

[BibT_eX]

[DOI]

Keiichiro Oura

Yi-Jian Wu

Keiichi Tokuda

Proceedings of the Blizzard Challenge 2009, Edinburgh, Scotland, UK, September 4, 2009, 2009

2008

A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

2006

Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Keiichiro Oura

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...