Keiichiro Oura

According to our database1, Keiichiro Oura authored at least 55 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis System.
Proceedings of the IEEE International Conference on Acoustics, 2023

2021
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components.
IEEE Access, 2021

Periodnet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis.
Proceedings of the Interspeech 2020, 2020

Fast and High-Quality Singing Voice Synthesis System Based on Convolutional Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Semi-Supervised Learning Based on Hierarchical Generative Models for End-to-End Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Singing voice synthesis based on convolutional neural networks.
CoRR, 2019

Speaker-dependent Wavenet-based Delay-free Adpcm Speech Coding.
Proceedings of the IEEE International Conference on Acoustics, 2019

Singing Voice Synthesis Based on Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Mel-Cepstrum-Based Quantization Noise Shaping Applied to Neural-Network-Based Speech Waveform Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

WaveNet-Based Zero-Delay Lossless Speech Coding.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Statistical Voice Conversion Based on Wavenet.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Discriminative Feature Extraction Based on Sequential Variational Autoencoder for Speaker Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Singing Voice Conversion Using Posted Waveform Data on Music Social Media.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Speaker Adaptation for Speech Synthesis Based on Deep Neural Networks Using Hidden Semi-Markov Model Structures.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Image Recognition Based on Convolutional Neural Networks Using Features Generated from Separable Lattice Hidden Markov Models.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Recent Development of the DNN-based Singing Voice Synthesis System - Sinsy.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Speech Synthesis Using WaveNet Vocoder Based on Periodic/Aperiodic Decomposition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Generalization of Thai tone contour in HMM-based speech synthesis.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

User Generated Dialogue Systems: uDialogue.
Proceedings of the Human-Harmonized Information Technology, Volume 2, 2017

2016
Temporal modeling in neural network based statistical parametric speech synthesis.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Singing Voice Synthesis Based on Deep Neural Networks.
Proceedings of the Interspeech 2016, 2016

Voice Conversion Based on Trajectory Model Training of Neural Networks Considering Global Variance.
Proceedings of the Interspeech 2016, 2016

Redefining the Linguistic Context Feature Set for HMM and DNN TTS Through Position and Parsing.
Proceedings of the Interspeech 2016, 2016

Trajectory training considering global variance for speech synthesis based on neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Using speaker adaptive training to realize Mandarin-Tibetan cross-lingual speech synthesis.
Multim. Tools Appl., 2015

The effect of neural networks in statistical parametric speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech.
Proceedings of the INTERSPEECH 2014, 2014

Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

HMM-Based singing voice synthesis and its application to Japanese and English.
Proceedings of the IEEE International Conference on Acoustics, 2014

Voice interaction system with 3D-CG virtual agent for stand-alone smartphones.
Proceedings of the second international conference on Human-agent interaction, 2014

2013
Speech Synthesis Based on Hidden Markov Models.
Proc. IEEE, 2013

Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis.
Comput. Speech Lang., 2013

Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Real-time control of expressive speech synthesis using kinect body tracking.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Mmdagent - A fully open-source toolkit for voice interaction systems.
Proceedings of the IEEE International Conference on Acoustics, 2013

Realizing Tibetan speech synthesis by speaker adaptive training.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping.
Speech Commun., 2012

Pitch adaptive training for hmm-based singing voice synthesis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Large-Scale Subjective Evaluations of Speech Rate Control Methods for HMM-Based Speech Synthesizers.
Proceedings of the INTERSPEECH 2011, 2011

An optimization algorithm of independent mean and variance parameter tying structures for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora.
IEEE Trans. Speech Audio Process., 2010

A Covariance-Tying Technique for HMM-Based Speech Synthesis.
IEICE Trans. Inf. Syst., 2010

Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Recent development of the HMM-based singing voice synthesis system - Sinsy.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

HMM-based singing voice synthesis system using pitch-shifted pseudo training data.
Proceedings of the INTERSPEECH 2010, 2010

Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2010


2009
Thousands of voices for HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2009, 2009

Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems.
Proceedings of the INTERSPEECH 2009, 2009

2008
A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System.
IEICE Trans. Inf. Syst., 2008

Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

2006
Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006


  Loading...