Hisashi Kawai

Proceedings of the Blizzard Challenge 2009, Edinburgh, Scotland, UK, September 4, 2009, 2009

2008

Investigation of Optimum Electrode Locations by Using an Automatized Surface Electromyography Analysis Technique.

IEEE Trans. Biomed. Eng., 2008

Phone duration modeling using gradient tree boosting.

Junichi Yamagishi

Takao Kobayashi

Speech Commun., 2008

Unit database pruning based on the cost degradation criterion for concatenative speech synthesis.

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2008

2007

Communicative speech synthesis with XIMERA: a first step.

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

A preselection method based on cost degradation from the optimal sequence for concatenative speech synthesis.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Reduction of correlation computation in the permutation of the frequency domain ICA by selecting DOAs estimated in subarrays.

Hao Yuan

Toshiharu Horiuchi

Proceedings of the 15th European Signal Processing Conference, 2007

ATRECSS - ATR English speech corpus for speech synthesis.

Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007

2006

The ATR multilingual speech-to-speech translation system.

IEEE Trans. Speech Audio Process., 2006

An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis.

Speech Commun., 2006

A text-prompted distributed speaker verification system implemented on a cellular phone and a mobile terminal.

Tsuneo Kato

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Quick individual fitting methods of simplified hearing compensation for elderly people.

Kengo Fujita

Tsuneo Kato

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

A Short-Latency Unit Selection Method with Redundant Search for Concatenative Speech Synthesis.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Constructing a Phonetic-Rich Speech Corpus While Controlling Time-Dependent Voice Quality Variability for English Speech Synthesis.

Toshio Hirai

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A DOA estimation method for 3D multiple source signals using independent component analysis.

Hao Yuan

Makoto Yamada

Proceedings of the 14th European Signal Processing Conference, 2006

Evaluation result of transmission control mechanism for multimedia streams based on the multi-RTCP scheme over multiple IP-based networks.

Norihiro Fukumoto

Hideaki Yamada

Proceedings of the 3rd IEEE Consumer Communications and Networking Conference, 2006

Developing a Test Bed of English Text-to-Speech System XIMERA for the Blizzard Challenge 2006.

Proceedings of the Blizzard Challenge 2006, Pittsburgh, PA, USA, September 16, 2006, 2006

2005

Discriminative training and explicit duration modeling for HMM-based automatic segmentation.

Speech Commun., 2005

Improvement of rejection performance of keyword spotting using anti-keywords derived from large vocabulary considering acoustical similarity to keywords.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Estimation of intonation variation with constrained tone transformations.

Keikichi Hirose

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Analysis of major factors of naturalness degradation in concatenative synthesis.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

SNR-dependent background noise compensation of PESQ values for cellular phone speech.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

XIMERA: a new TTS from ATR based on corpus-based technologies.

Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

A study on automatic detection of Japanese vowel devoicing for speech synthesis.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Using a depth-restricted search to reduce delays in unit selection.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Formulating contextual tonal variations in Mandarin.

Keikichi Hirose

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Minimum segmentation error based discriminative training for speech synthesis application.

Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004

Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesis.

Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004

Scaling of waveform segments along the time axis for concatenative speech synthesis.

Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004

An evaluation of automatic phone segmentation for concatenative speech synthesis.

Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004

2003

Optimizing integrated cost function for segment selection in concatenative speech synthesis based on perceptual evaluations.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Tone pattern discrimination combining parametric modeling and maximum likelihood estimation.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Segment selection considering local degradation of naturalness in concatenative speech synthesis.

Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003

Tone feature extraction through parametric modeling and analysis-by-synthesis-based pattern matching.

Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003

2002

Feature extraction for unit selection in concatenative speech synthesis: comparison between AIM, LPC, and MFCC.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Design of a Mandarin sentence set for corpus-based speech synthesis by use of a multi-tier algorithm taking account of the varied prosodic and spectral characteristics.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Perceptual evaluation of naturalness due to substitution of Chinese syllable for concatenative speech synthesis.

Jinlin Lu

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Acoustic measures vs. phonetic features as predictors of audible discontinuity in concatenative speech synthesis.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Unit selection algorithm for Japanese speech synthesis based on both phoneme unit and diphone unit.

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002

2000

A design method of speech corpus for text-to-speech synthesis taking account of prosody.

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1998

Recognition of connected digit speech in Japanese collected over the telephone network.
