Masami Akamine

According to our database1, Masami Akamine authored at least 50 papers between 1990 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2019
Transfer Learning for Unseen Slots in End-to-End Dialogue State Tracking.
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

2018
Out-of-Domain Slot Value Detection for Spoken Dialogue Systems with Context Information.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Dialog State Tracking for Unseen Values Using an Extended Attention Mechanism.
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018

2016
Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time-Frequency Energy Reallocation Approach.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Voice Activity Detection: Merging Source and Filter-based Information.
IEEE Signal Process. Lett., 2016

Statistical Bandwidth Extension for Speech Synthesis Based on Gaussian Mixture Model with Sub-Band Basis Spectrum Model.
IEICE Trans. Inf. Syst., 2016

2015
Emotional transplant in statistical speech synthesis based on emotion additive model.
Proceedings of the INTERSPEECH 2015, 2015

A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization.
Proceedings of the INTERSPEECH 2015, 2015

2014
Building HMM-TTS Voices on Diverse Data.
IEEE J. Sel. Top. Signal Process., 2014

Integrated Expression Prediction and Speech Synthesis From Text.
IEEE J. Sel. Top. Signal Process., 2014

On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis.
Comput. Speech Lang., 2014

GMM-based bandwidth extension using sub-band basis spectrum model.
Proceedings of the INTERSPEECH 2014, 2014

2013
Complex cepstrum for statistical parametric speech synthesis.
Speech Commun., 2013


Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis.
Proceedings of the INTERSPEECH 2013, 2013

Complex cepstrum analysis based on the minimum mean squared error.
Proceedings of the IEEE International Conference on Acoustics, 2013

Training a supra-segmental parametric F0 model without interpolating F0.
Proceedings of the IEEE International Conference on Acoustics, 2013

Integrated automatic expression prediction and speech synthesis from text.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Decision tree-based acoustic models for speech recognition.
EURASIP J. Audio Speech Music. Process., 2012

Combining multiple high quality corpora for improving HMM-TTS.
Proceedings of the INTERSPEECH 2012, 2012

HMM-based speech synthesis using sub-band basis spectrum model.
Proceedings of the INTERSPEECH 2012, 2012

Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP.
Proceedings of the INTERSPEECH 2012, 2012

Speech factorization for HMM-TTS based on cluster adaptive training.
Proceedings of the INTERSPEECH 2012, 2012

Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training.
Proceedings of the INTERSPEECH 2012, 2012

Complex cepstrum as phase information in statistical parametric speech synthesis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Decision Tree-Based Acoustic Models for Speech Recognition with Improved Smoothness.
IEICE Trans. Inf. Syst., 2011

One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model.
Proceedings of the IEEE International Conference on Acoustics, 2011

Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Sub-band basis spectrum model for pitch-synchronous log-spectrum and phase based on approximation of sparse coding.
Proceedings of the INTERSPEECH 2010, 2010

Unit selection speech synthesis using multiple speech units at non-adjacent segments for prosody and waveform generation.
Proceedings of the IEEE International Conference on Acoustics, 2010

Covariance clustering on Riemannian manifolds for acoustic model compression.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Feedback loop for prosody prediction in concatenative speech synthesis.
Proceedings of the INTERSPEECH 2009, 2009

Decision tree acoustic models for ASR.
Proceedings of the INTERSPEECH 2009, 2009

Bayesian feature enhancement using a mixture of unscented transformation for uncertainty decoding of noisy speech.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Multilevel parametric-base F0 model for speech synthesis.
Proceedings of the INTERSPEECH 2008, 2008

Comparative evaluation of different methods for voice activity detection.
Proceedings of the INTERSPEECH 2008, 2008

Speech recognition using soft decision trees.
Proceedings of the INTERSPEECH 2008, 2008

Feature enhancement by speaker-normalized splice for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
An F<sub>0</sub> contour control model using an F<sub>0</sub> contour codebook.
Syst. Comput. Jpn., 2007

HMM-based speech recognition using decision trees instead of GMMs.
Proceedings of the INTERSPEECH 2007, 2007

1999
Automatic generation of synthesis units by unit selection based on closed-loop training.
Syst. Comput. Jpn., 1999

Toshiba English text-to-speech synthesizer (TESS).
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

CELP speech coding based on an adaptive pulse position codebook.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS system.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

An F0 contour control model for totally speaker driven text to speech system.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS).
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A 2.4 kbps variable bit rate ADP-CELP speech coder.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Automatic generation of speech synthesis units based on closed loop training.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1991
Adaptive bit-allocation between the pole-zero synthesis filter and excitation in CELP.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
CELP coding with an adaptive density pulse excitation model.
Proceedings of the 1990 International Conference on Acoustics, 1990


  Loading...