Yoshinori Shiga

According to our database1, Yoshinori Shiga authored at least 49 papers between 1994 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
Neural speech-rate conversion with multispeaker WaveNet vocoder.
Speech Commun., 2022

2021
Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU.
IEEE Access, 2021

Noise Level Limited Sub-Modeling for Diffusion Probabilistic Vocoders.
Proceedings of the IEEE International Conference on Acoustics, 2021

High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Text-to-Speech Synthesis.
Proceedings of the Speech-to-Speech Translation, 2020

Multilingualization of Speech Processing.
Proceedings of the Speech-to-Speech Translation, 2020

Transformer-Based Text-to-Speech with Weighted Forced Attention.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders.
Proceedings of the Interspeech 2019, 2019

Duration Modeling with Global Phoneme-Duration Vectors.
Proceedings of the Interspeech 2019, 2019

Investigations of Real-time Gaussian Fftnet and Parallel Wavenet Neural Vocoders with Simple Acoustic Features.
Proceedings of the IEEE International Conference on Acoustics, 2019

HMM-based TTS System Framework.
Proceedings of the IEEE Conference on Computational Intelligence for Financial Engineering & Economics, 2019

Tacotron-Based Acoustic Model Using Phoneme Alignment for Practical Neural Text-to-Speech Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Improving FFTNet Vocoder with Noise Shaping and Subband Approaches.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multilingual Grapheme-to-Phoneme Conversion with Global Character Vectors.
Proceedings of the Interspeech 2018, 2018

An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Sentence Selection Based on Extended Entropy Using Phonetic and Prosodic Contexts for Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Development of the "VoiceTra" Multi-Lingual Speech Translation System.
IEICE Trans. Inf. Syst., 2017

Global Syllable Vectors for Building TTS Front-End with Deep Learning.
Proceedings of the Interspeech 2017, 2017

Subband wavenet with overlapped single-sideband filterbanks.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Superpositional HMM-Based Intonation Synthesis Using a Functional F0 Model.
J. Signal Process. Syst., 2016

Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework.
Proceedings of the Interspeech 2016, 2016

Using Zero-Frequency Resonator to Extract Multilingual Intonation Structure.
Proceedings of the Interspeech 2016, 2016

2015
A cloud robotics approach towards dialogue-oriented robot speech.
Adv. Robotics, 2015

HMM based myanmar text to speech system.
Proceedings of the INTERSPEECH 2015, 2015

Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts.
Proceedings of the INTERSPEECH 2015, 2015

Extraction of pitch register from expressive speech in Japanese.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis.
IEEE J. Sel. Top. Signal Process., 2014

Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Tuning intonation with pitch accent decomposition for HMM-based expressive speech synthesis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Multilingual Speech-to-Speech Translation System: VoiceTra.
Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013

Improvements to HMM-based speech synthesis based on parameter generation with rich context models.
Proceedings of the INTERSPEECH 2013, 2013

A targets-based superpositional model of fundamental frequency contours applied to HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2013, 2013

2012
Experiments on unsupervised statistical parametric speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Resonance-based spectral deformation in HMM-based speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-Based Speech Synthesis.
Proceedings of the INTERSPEECH 2012, 2012

Effect of anti-aliasing filtering on the quality of speech from an HMM-based synthesizer.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Toward Construction of Spoken Dialogue System that Evokes Users' Spontaneous Backchannels.
Proceedings of the SIGDIAL 2011 Conference, 2011

Analysis on Effects of Text-to-Speech and Avatar Agent in Evoking Users' Spontaneous Listener's Reactions.
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

2010
Improved training of excitation for HMM-based parametric speech synthesis.
Proceedings of the INTERSPEECH 2010, 2010

2009
Pulse density representation of spectrum for statistical speech processing.
Proceedings of the INTERSPEECH 2009, 2009

2007
An F<sub>0</sub> contour control model using an F<sub>0</sub> contour codebook.
Syst. Comput. Jpn., 2007

2004
Accurate spectral envelope estimation for articulation-to-speech synthesis.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Source-filter separation for articulation-to-speech synthesis.
Proceedings of the INTERSPEECH 2004, 2004

Estimating detailed spectral envelopes using articulatory clustering.
Proceedings of the INTERSPEECH 2004, 2004

2003
Estimation of voice source and vocal tract characteristics based on multi-frame analysis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Estimating the spectral envelope of voiced speech using multi-frame analysis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

1998
Segmental duration control based on an articulatory model.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1994
A novel segment-concatenation algorithm for a cepstrum-based synthesizer.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994


  Loading...