Yoshinori Sagisaka

According to our database1, Yoshinori Sagisaka authored at least 150 papers between 1986 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2020
Sentence-Final Prosody Analysis of Japanese Communicative Speech Based on the Command-Response Model.
Proceedings of the IEEE Conference on Computer Applications, 2020

2018
Analysis of L2 Learners' Progress of Distinguishing Mandarin Tone 2 and Tone 3.
Proceedings of the Interspeech 2018, 2018

2017
Developing a speech corpus from web news for Myanmar (Burmese) language.
Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017

Cross-Modal Analysis Between Phonation Differences and Texture Images Based on Sentiment Correlations.
Proceedings of the Interspeech 2017, 2017

2016
Comparison of Grapheme-to-Phoneme Conversion Methods on a Myanmar Pronunciation Dictionary.
Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016

Analysis of Chinese Syllable Durations in Running Speech of Japanese L2 Learners.
Proceedings of the Interspeech 2016, 2016

2015
Analysis on L2 learners' perception errors between geminate and singleton of Japanese consonants using loudness related parameters.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

A study of the production of unstressed vowels by Japanese speakers of English using the J-AESOP corpus.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Cross-modal description of sentiment information embedded in speech.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

2014
Integrating Dictionaries into an Unsupervised Model for Myanmar Word Segmentation.
Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing, 2014

Communicative F0 generation based on impressions.
Proceedings of the 5th IEEE Conference on Cognitive Infocommunications, 2014

Sentiment analysis of color attributes derived from vowel sound impression for multimodal expression.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Global F0 control parameter prediction based on impressions for communicative prosody generation.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

A Purely Monotonic Approach to Machine Translation for Similar Languages.
Proceedings of the 2013 International Conference on Asian Language Processing, 2013

Density Maximization in Context-Sense Metric Space for All-words WSD.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Trans-disciplinary spoken language processing studies for scientific understanding of second language learner's characteristics.
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

2011
Perceptual Training of Vowel Length Contrast of Japanese by L2 Listeners: Effects of an Isolated Word versus a Word Embedded in Sentences.
Proceedings of the INTERSPEECH 2011, 2011

Perceptual Studies of Japanese Geminate Insertion Phenomena Based on Timing Control Characteristics.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

A Requirement of Texts for Evaluation of Rhythm in English Speech by Learners.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

2010
The effect of a word embedded in a sentence and speaking rate variation on the perceptual training of geminate and singleton consonant distinction.
Proceedings of the INTERSPEECH 2010, 2010

Computational Modeling of Timing Control and its Application to Objective Evaluation of the Second Language Proficiency.
Proceedings of the Electronic Speech Signal Processing, 2010

2009
Analysis on paralinguistic prosody control in perceptual impression space using multiple dimensional scaling.
Speech Commun., 2009

Perceptual training of singleton and geminate stops in Japanese language by Korean learners.
Proceedings of the INTERSPEECH 2009, 2009

Effects of mora-timing in English rhythm control by Japanese learners.
Proceedings of the INTERSPEECH 2009, 2009

Model-based automatic evaluation of L2 learner's English timing.
Proceedings of the INTERSPEECH 2009, 2009

Objective evaluation of English learners' timing control based on a measure reflecting perceptual characteristics.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Corpus-based speech synthesis from reading speech to communicative speech.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Three-sectional-staff characterization of Cantonese level tones.
Proceedings of the INTERSPEECH 2008, 2008

Objective evaluation of second language learner<sup>2</sup>s translation proficiency using statistical translation measures.
Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2008

Model-based duration analysis on English natives and Thai learners.
Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2008

2007
Syllable-based Thai duration model using multi-level linear regression and syllable accommodation.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank.
Proceedings of the INTERSPEECH 2007, 2007

Inter-language prosodic style modification experiment using word impression vector for communicative speech generation.
Proceedings of the INTERSPEECH 2007, 2007

F<sub>0</sub> analysis of perceptual distance among Cantonese level tones.
Proceedings of the INTERSPEECH 2007, 2007

2006
Speech recognition of foreign out-of-vocabulary words using a hierarchical language model.
Proceedings of the INTERSPEECH 2006, 2006

Language modeling of Chinese personal names based on character units for continuous Chinese speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

2005
Generation and perception of F<sub>0</sub> markedness for communicative speech synthesis.
Speech Commun., 2005

Effect of speaking rate on the acceptability of change in segment duration.
Speech Commun., 2005

Effect of intra-phrase position on acceptability of change in segment duration in sentence speech.
Speech Commun., 2005

Editorial.
Speech Commun., 2005

Application of auditory image model for speech event detection.
Proceedings of the INTERSPEECH 2005, 2005

Analysis on command sequences of a F0 generation model for Mandarin speech and its application to their automatic extraction.
Proceedings of the INTERSPEECH 2005, 2005

Communicative speech synthesis using constituent word attributes.
Proceedings of the INTERSPEECH 2005, 2005

Improved speech recognition word lattice translation by confidence measure.
Proceedings of the INTERSPEECH 2005, 2005

Speech recognition of a named entity.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

F0 control characterization by perceptual impressions on speaking attitudes using Multiple Dimensional Scaling analysis.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Mis-recognized utterance detection using hierarchical language model.
Proceedings of the INTERSPEECH 2004, 2004

Analysis of the phone level contributions to objective evaluation of English speech by non-natives.
Proceedings of the INTERSPEECH 2004, 2004

2003
Multi-class composite N-gram language model.
Speech Commun., 2003

Multiclass composite N-gram language model based on connection direction.
Syst. Comput. Jpn., 2003

Generation and perception of f_0 markedness in conversational speech with adverbs expressing degrees.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Word class modeling for speech recognition with out-of-task words using a hierarchical language model.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Corpus-based modeling of naturalness estimation in timing control for non-native speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Analysis and modeling of syllable duration for Thai speech synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Speaker clustering for speech recognition using vocal tract parameters.
Speech Commun., 2002

A stochastic speech understanding method to generate interlingual representations.
Syst. Comput. Jpn., 2002

Effects of intra-phrase position on acceptability of changes in segmental duration in sentence speech.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Unit selection synthesis.
Proceedings of the 4th ITRW on Speech Synthesis, Perthshire, Scotland, UK, August 29, 2001

A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Structured language model for class identification of out-of-vocabulary words arising from multiple wordclasses.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Pronunciation variant analysis using speaking style parallel corpus.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

New language models using phrase structures extracted from parse trees.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Multi-class composite n-gram language model using multiple word clusters and word successions.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters.
Proceedings of the Association for Computational Linguistic, 2001

2000
Statistical language modeling with a class-basedn-multigram model.
Comput. Speech Lang., 2000

An embedded knowledge integration for hybrid language modelling.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A tagger-aided language model with a stack decoder.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A language model for conversational speech recognition using information designed for speech translation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Fine keyword clustering using a thesaurus and example sentences for speech translation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A hierarchical language model incorporating class-dependent word models for OOV words recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Evaluation of the ATR-matrix speech translation system with a pair comparison method between the system and humans.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Rules, but what for? - rule description as efficient and robust abstraction of corpora and optimal fitting to applications -.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Pronunciation variants description using recognition error modeling with phonetic derivation hypotheses.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Analysis of acoustic models trained on a large-scale Japanese speech database.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Cellular-phone based speech-to-speech translation system ATR-MATRIX.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Integrating detailed information into a language model.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Automatic generation of multiple pronunciations based on neural networks.
Speech Commun., 1999

Phoneme boundary estimation using bidirectional recurrent neural networks and its applications.
Syst. Comput. Jpn., 1999

Multiple pronunciation dictionary using HMM-state confusion characteristics.
Comput. Speech Lang., 1999

Improving n-gram modeling using distance-related unit association maximum entropy language modeling.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Part-of-speech n-gram and word n-gram fused language model.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Model-based speaker normalization methods for speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Japanese spontaneous speech database with wide regional and age distribution.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Multi-class composite N-gram based on connection direction.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Reliable utterance segment recognition by integrating a grammar with statistical language constraints.
Speech Commun., 1998

Model parameter estimation for mixture density polynomial segment models.
Comput. Speech Lang., 1998

Grammatical word graph re-generation for spontaneous speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A Japanese-to-English speech translation system: ATR-MATRIX.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Effects of phonetic quality and duration on perceptual acceptability of temporal changes in speech.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Neural network based pronunciation modeling with applications to speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Speaker clustering for speech recognition using the parameters characterizing vocal-tract dimensions.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Speaker normalized acoustic modeling based on 3-D Viterbi decoding.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Learning a Syntagmatic and Paradigmatic Structure from Language Data with a Bi-Multigram Model.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997
Automatic extraction of fundamental frequency control rules by statistical analysis.
Syst. Comput. Jpn., 1997

ATR Speech Translation Research Project in Japan.
Künstliche Intell., 1997

Speech recognition using HMM-state confusion characteristics.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Integration of grammar and statistical language constraints for partial word-sequence recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Cyclic autocorrelation-based linear prediction analysis of speech.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Automatic generation of a pronunciation dictionary based on a pronunciation network.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Segment boundary estimation using recurrent neural networks.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Fast word-graph generation for spontaneous conversational speech translation.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Task adaptation using MAP estimation in N-gram language modeling.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Accent Phrase Segmentation by F<sub>0</sub> Clustering Using Superpositional Modelling.
Proceedings of the Computing Prosody, 1997

Measuring temporal compensation effect in speech perception.
Proceedings of the Computing Prosody, 1997

Comparison of <i>F</i>0 Control Rules Derived from Multiple Speech Databases.
Proceedings of the Computing Prosody, 1997

Prediction of Major Phrase Boundary Location and Pause Insertion Using a Stochastic Context-free Grammar.
Proceedings of the Computing Prosody, 1997

1996
Japanese speech databases for robust speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Speech recognition based on acoustically derived segment units.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Spontaneous dialogue speech recognition using cross-word context constrained word graphs.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Variable-order N-gram generation by word-class splitting and consecutive word grouping.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Design of a speech recognition system based on acoustically derived segmental units.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Editorial.
Speech Commun., 1995

Acoustic characteristics of speaker individuality: Control and conversion.
Speech Commun., 1995

Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks.
Speech Commun., 1995

Speech segment network approach for optimization of synthesis unit set.
Comput. Speech Lang., 1995

Effect of rasta-type processing for speech recognition with speaking-rate mismatches.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Minimum classification error training algorithm for feature extractor and pattern classifier in speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Automatic detection of major phrase boundaries using statistical properties of superpositional F0 control model parameters.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Automatic prosodic segmentation by F<sub>0</sub> clustering using superpositional modeling.
Proceedings of the 1995 International Conference on Acoustics, 1995

Stochastic modeling of pause insertion using context-free grammar.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Automatic extraction of FO control parameters using statistical analysis.
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

Effect of speaking style on parameters of fundamental frequency contour.
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

A speech and language database for speech translation research.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Acceptability of temporal modification in consonant and vowel onsets.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Voice adaptation using multi-functional transformation with weighting by radial basis function networks.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Prediction of prosodic phrase boundaries using stochastic context-free grammar.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Speech spectrum transformation by speaker interpolation.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Duration modelling with multiple split regression.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Tree-based unit selection for English speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
ATR μ-talk speech synthesis system.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Acceptability and discrimination threshold for distortion of segmental duration in Japanese words.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Pause characteristics and local phrase-dependency structure in Japanese.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Speech segment network approach for an optimal synthesis unit set.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Optimization of intonation control using statistical F<sub>0</sub> resetting characteristics.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Concatenative speech synthesis by minimum distortion criteria.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Statistical modeling of segmental duration and power control for Japanese.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1990
ATR Japanese speech database as a tool of speech recognition and synthesis.
Speech Commun., 1990

Speech synthesis from text.
IEEE Commun. Mag., 1990

On unit selection algorithms and their evaluation in non-uniform unit speech synthesis.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990

The control of segmental duration in speech synthesis using linguistic properties.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990

On the unit search criteria and algorithms for speech synthesis using non-uniform units.
Proceedings of the First International Conference on Spoken Language Processing, 1990

A large-scale Japanese speech database.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Statistical analysis for segmental duration rules in Japanese speech synthesis.
Proceedings of the First International Conference on Spoken Language Processing, 1990

On the prediction of global F<sub>0</sub> shape for Japanese text-to-speech.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
Adaptive manipulation of non-uniform synthesis units using multi-level unit transcription.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

Construction of a large-scale Japanese speech database and its management system.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
Speech synthesis by rule using an optimal selection of non-uniform synthesis units.
Proceedings of the IEEE International Conference on Acoustics, 1988

1987
Acoustic-phonetic labels in a Japanese speech database.
Proceedings of the European Conference on Speech Technology, 1987

1986
Composite phoneme units for the speech synthesis of Japanese.
Speech Commun., 1986

Word identification method for Japanese text-to-speech conversion system.
Proceedings of the IEEE International Conference on Acoustics, 1986


  Loading...