Seiichi Nakagawa

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Elimination of person names in spoken documents for privacy protection.

[BibT_eX]

[DOI]

Ryo Kawaguchi

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric.

[BibT_eX]

[DOI]

Speech Commun., 2013

Development and Evaluation of Spoken Dialog Systems with One or Two Agents through Two Domains.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Spoken Term Detection by N-gram Index with Exact Distance for NTCIR-SpokenDoc2.

[BibT_eX]

[DOI]

Nagisa Sakamoto

Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Overview of the NTCIR-10 SpokenDoc-2 Task.

[BibT_eX]

[DOI]

Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Development and evaluation of spoken dialog systems with one or two agents.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Robust/fast out-of-vocabulary spoken term detection by N-gram index with exact distance through text/speech input.

[BibT_eX]

[DOI]

Nagisa Sakamoto

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Single channel dereverberation method in log-melspectral domain using limited stereo data for distant speaker identification.

[BibT_eX]

[DOI]

Aditya Arie Nugraha

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Speaker identification using pseudo pitch synchronized phase information in noisy environments.

[BibT_eX]

[DOI]

Yuta Kawakami

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Topic-Dependent-Class-Based $n$-Gram Language Model.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

Speaker Identification and Verification by Combining MFCC and Phase Information.

[BibT_eX]

[DOI]

Shinji Ohtsuka

IEEE Trans. Speech Audio Process., 2012

Class-Based N-Gram Language Model for New Words Using Out-of-Vocabulary to In-Vocabulary Similarity.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Risk-Based Semi-Supervised Discriminative Language Modeling for Broadcast Transcription.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Improving the Readability of ASR Results for Lectures Using Multiple Hypotheses and Sentence-Level Knowledge.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Development of large vocabulary continuous speech recognition system for Mongolian language.

[BibT_eX]

[DOI]

Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Developing Partially-Transcribed Speech Corpus from Edited Transcriptions.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Multi-objective optimization for semi-supervised discriminative language modeling.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fast NMF based approach and improved VQ based approach for speech recognition from mixed sound.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An online evaluation system for English pronunciation intelligibility for Japanese English learners.

[BibT_eX]

[DOI]

Hiroshi Kibishi

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

On the use of phase information-based joint factor analysis for speaker verification under channel mismatch condition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Soft-clustering technique for training data in Age-and gender-independent speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2011

High speed spoken term detection by combination of n-gram array of a syllable lattice and LVCSR result for NTCIR-SpokenDoc.

[BibT_eX]

[DOI]

Keisuke Iwami

Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Speech Recognition in Mixed Sound of Speech and Music Based on Vector Quantization and Non-Negative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Lattice-Based Risk Minimization Training for Unsupervised Language Model Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

New Feature Parameters for Pronunciation Evaluation in English Presentations at International Conferences.

[BibT_eX]

[DOI]

Hiroshi Kibishi

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Hidden Boosted MMI and Hierarchical State Posterior Feature for Automatic Speech Recognition Based on Hidden Conditional Neural Fields.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Efficient out-of-vocabulary term detection by n-gram array indices with distance from a syllable lattice.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Automatic speech recognition using Hidden Conditional Neural Fields.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Detection of precisely transcribed parts from inexact transcribed corpus.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Topic-Dependent Language Model with Voting on Noun History.

[BibT_eX]

[DOI]

ACM Trans. Asian Lang. Inf. Process., 2010

Speaker Recognition by Combining MFCC and Phase Information in Noisy Conditions.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Distant Speech Recognition Using a Microphone Array Network.

[BibT_eX]

[DOI]

Alberto Yoshihiro Nakano

IEICE Trans. Inf. Syst., 2010

Topic dependent class based language model evaluation on automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Out-of-vocabulary term detection by n-gram array with distance from continuous syllable recognition results.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Evaluation of Privacy Protection Techniques for Speech Signals.

[BibT_eX]

[DOI]

Proceedings of the Information Processing and Management of Uncertainty in Knowledge-Based Systems. Applications, 2010

Speech recognition using long-term phase information.

[BibT_eX]

[DOI]

Eiichi Sueyoshi

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Integration of cache-based model and topic dependent class model with soft clustering and soft voting.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Lecture subtopic retrieval by retrieval keyword expansion using subordinate concept.

[BibT_eX]

[DOI]

Noboru Kanedera

Tetsuo Funada

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Constructing Japanese test collections for spoken term detection.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Automatic evaluation of English pronunciation by Japanese speakers using various acoustic features and pattern recognition techniques.

[BibT_eX]

[DOI]

Kuniaki Hirabayashi

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Improving the readability of class lecture ASR results using a confusion network.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker identification by combining MFCC and phase information in noisy environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Effective use of pause information in language modelling for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Topic dependent language model based on topic voting on noun history.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Estimating the position and orientation of an acoustic source with a microphone array network.

[BibT_eX]

[DOI]

Alberto Yoshihiro Nakano

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

High improvement of speaker identification and verification by combining MFCC and phase information.

[BibT_eX]

[DOI]

Shinji Ohtsuka

Proceedings of the IEEE International Conference on Acoustics, 2009

Language Model Based on Word Order Sensitive Matrix Representation in Latent Semantic Analysis for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

Response timing generation and response type selection for a spontaneous spoken dialog system.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Analysis and Robust Extraction of Changing Named Entities.

[BibT_eX]

[DOI]

Shoko Endo

Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, 2009

Privacy Protection for Speech Information.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Information Assurance and Security, 2009

2008

Robust Speech Recognition by Combining Short-Term and Long-Term Spectrum Based Position-Dependent CMN with Conventional CMN.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Linear Discriminant Analysis Using a Generalized Mean of Class Covariances and Its Application to Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Noisy Speech Recognition Based on Integration/Selection of Multiple Noise Suppression Methods Using Noise GMMs.

[BibT_eX]

[DOI]

Souta Hamaguchi

IEICE Trans. Inf. Syst., 2008

Developing Corpus of Japanese Classroom Lecture Speech Contents.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2008

Blind dereverberation based on CMN and spectral subtraction by multi-channel LMS algorithm.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A browsing system for classroom lecture speech.

[BibT_eX]

[DOI]

Shingo Togashi

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Evaluating spoken language model based on filler prediction model in speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Analysis of relationship between impression of human-to-human conversations and prosodic change and its modeling.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speech recognition performance of CJLC: corpus of Japanese lecture contents.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Class lecture summarization taking into account consecutiveness of important sentences.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Robust Extraction of Named Entity Including Unfamiliar Word.

[BibT_eX]

[DOI]

Shinya Hida

Proceedings of the ACL 2008, 2008

2007

Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM.

[BibT_eX]

[DOI]

Speech Commun., 2007

Indonesian-Japanese Transitive Translation using English for CLIR.

[BibT_eX]

[DOI]

Inf. Media Technol., 2007

A Machine Learning Approach for an Indonesian-English Cross Language Question Answering System.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2007

A Spoken Dialog System for Chat-Like Conversations Considering Response Timing.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Analysis of effect of compensation parameter estimation for CMN on speech/speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Power linear discriminant analysis.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

One-pass LVCSR algorithm using linear lexicon search and 1-best approximation tree-structured lexicon search.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Selection of optimal dimensionality reduction method using chernoff bound for segmental unit input HMM.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Construction of spoken language model including fillers using filler prediction model.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Prosody change and response timing analysis in spontaneously spoken dialogs and their modeling in a spoken dialog system.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A statistical method of evaluating pronunciation proficiency for presentation in English.

[BibT_eX]

[DOI]

Kei Ohta

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Speaker recognition by combining MFCC and phase information.

[BibT_eX]

[DOI]

Kouhei Asakawa

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Robust Distant Speech Recognition by Combining Position-Dependent CMN with Conventional CMN.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Generalization of Linear Discriminant Analysis used in Segmental Unit Input HMM for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

A machine learning approach for indonesian question answering system.

[BibT_eX]

Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, 2007

Expanding Indonesian-Japanese Small Translation Dictionary Using a Pivot Language.

[BibT_eX]

[DOI]

Proceedings of the ACL 2007, 2007

2006

Response Timing Detection Using Prosodic and Linguistic Information for Human-friendly Spoken Dialog Systems.

[BibT_eX]

[DOI]

Inf. Media Technol., 2006

Text-Independent/Text-Prompted Speaker Recognition by Combining Speaker-Specific GMM with Speaker Adapted Syllable-Based HMM.

[BibT_eX]

[DOI]

Wei Zhang

Mitsuo Takahashi

IEICE Trans. Inf. Syst., 2006

Robust Distant Speech Recognition by Combining Multiple Microphone-Array Processing with Position-Dependent CMN.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2006

Summarization of spoken Lectures Based on Linguistic Surface and prosodic Information.

[BibT_eX]

[DOI]

Shingo Togashi

Masaru Yamaguchi

Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

A spoken Dialog System with Automatic Recovery Mechanism from misrecognition.

[BibT_eX]

[DOI]

Hirotoshi Yano

Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs.

[BibT_eX]

[DOI]

Souta Hamaguchi

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005

Combining outputs of multiple LVCSR models by machine learning.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2005

Large-vocabulary continuous speech recognition using linear lexicon search and 1-best approximation tree-structured lexicon search.

[BibT_eX]

[DOI]

Nobutoshi Takahashi

Syst. Comput. Jpn., 2005

Detection and recognition of correction utterances on misrecognition of spoken dialog system.

[BibT_eX]

[DOI]

Naoko Kakutani

Syst. Comput. Jpn., 2005

An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2005

Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2005

Robust distant speech recognition based on position dependent CMN using a novel multiple microphone processing technique.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Robust distant speaker recognition based on position dependent cepstral mean normalization.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A statistical method of evaluating pronunciation proficiency for Japanese words.

[BibT_eX]

[DOI]

Kei Ohta

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Multimodal interface for organization name input based on combination of isolated word recognition and continuous base-word recognition.

[BibT_eX]

[DOI]

Hironori Oshikawa

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Query Transitive Translation Using IR Score for Indonesian-Japanese CLIR.

[BibT_eX]

[DOI]

Proceedings of the Information Retrieval Technology, 2005

2004

Estimating high-confidence portions based on agreement among outputs of multiple LVCSR models.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2004

Robust spoken document retrieval methods for misrecognition and out-of-vocabulary keywords.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2004

Confidence measure and rejection based on correctness probability of recognition candidates.

[BibT_eX]

[DOI]

Ichiro Akahori

Syst. Comput. Jpn., 2004

An Empirical Study on Multiple LVCSR Model Combination by Machine Learning.

[BibT_eX]

[DOI]

Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Unsupervised speaker adaptation using high confidence portion recognition results by multiple recognition systems.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Speech interface for name input based on combination of recognition methods using syllable-based n-gram and word dictionary.

[BibT_eX]

[DOI]

Hironori Oshikawa

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Keyword recognition and extraction by multiple-LVCSRs with 60, 000 words in speech-driven WEB retrieval task.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Robust distant speech recognition based on position dependent CMN.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM.

[BibT_eX]

[DOI]

Wei Zhang

Mitsuo Takahashi

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Integrating Cross-Lingually Relevant News Articles and Monolingual Web Documents in Bilingual Lexicon Acquisition.

[BibT_eX]

[DOI]

Proceedings of the COLING 2004, 2004

2003

Speaker change detection and speaker clustering using VQ distortion measure.

[BibT_eX]

[DOI]

Kazumasa Mori

Syst. Comput. Jpn., 2003

Interpreter for Highly Portable Spoken Dialogue System.

[BibT_eX]

[DOI]

Masamitsu Umeda

Proceedings of the SIGDIAL 2003 Workshop, 2003

Generation of natural response timing using decision tree based on prosodic and linguistic information.

[BibT_eX]

[DOI]

Masashi Takeuchi

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Text-independent speaker recognition by speaker-specific GMM and speaker adapted syllable-based HMM.

[BibT_eX]

[DOI]

Wei Zhang

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A statistical method of evaluating pronunciation proficiency for English words spoken by Japanese.

[BibT_eX]

[DOI]

Kazumasa Mori

Naoki Nakamura

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Evaluating multiple LVCSR model combination in NTCIR-3 speech-driven web retrieval task.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Comparison of effects of acoustic and language knowledge on spontaneous speech perception/recognition between human and automatic speech recognizer.

[BibT_eX]

[DOI]

Masahisa Shingu

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Detection and recognition of correction utterance in spontaneously spoken dialog.

[BibT_eX]

[DOI]

Naoko Kakutani

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Confidence of agreement among multiple LVCSR models and model combination by SVM.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Speech recognition under noisy environments using segmental unit input HMM.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2002

Differences of speech rate, interphoneme distance and likelihood caused by speaking style, their relationship, and recognition performance.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2002

English Speech Database Read by Japanese Learners for CALL System Development.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Syllable recognition using syllable-segment statistics and syllable-based HMM.

[BibT_eX]

[DOI]

Nobutoshi Takahashi

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Speaker independent speech recognition using features based on glottal sound source.

[BibT_eX]

[DOI]

Daisuke Yamada

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Evaluation of spectral subtraction with smoothing of time direction on the Aurora 2 task.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Detection and recognition of repaired speech on misrecognized utterances for speech input of car navigation system.

[BibT_eX]

[DOI]

Naoko Kakutani

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

A Development Tool for Spoken Dialogue Systems and Its Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 4th International Conference, 2001

Automatic construction of CALL system from TV news program with captions.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Instantaneous estimation of accentuation habits for Japanese students to learn English pronunciation.

[BibT_eX]

[DOI]

Naoki Nakamura

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A fast calculation method in LVCSRS by time-skipping and clustering of probability density distributions.

[BibT_eX]

[DOI]

Yukihisa Horibe

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Experimental evaluation on confidence of agreement among multiple Japanese LVCSR models.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition.

[BibT_eX]

[DOI]

Kazumasa Mori

Proceedings of the IEEE International Conference on Acoustics, 2001

Discriminative training of HMM using maximum normalized likelihood algorithm.

[BibT_eX]

[DOI]

Konstantin Markov

Satoshi Nakamura

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

A Semantic Interpreter and a Cooperative Response Generator for a Robust Spoken Dialogue System.

[BibT_eX]

[DOI]

Toshihiko Itoh

Int. J. Pattern Recognit. Artif. Intell., 2000

Relationship among speaking style, inter-phoneme's distance and speech recognition performance.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A system for retrieving broadcast news speech documents using voice input keywords and similarity between words.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Instantaneous estimation of prosodic pronunciation habits for Japanese students to learn English pronunciation.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Quality improvement of PSOLA analysis-synthesis using partial zero-phase conversion.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Performance comparison among HMM, DTW, and human abilities in terms of identifying stress patterns of word utterances.

[BibT_eX]

[DOI]

Yukiko Fujisawa

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A portable development tool for spoken dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Overview of an intelligent system for information retrieval based on human-machine dialogue through spoken language.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Usability of Browser-Based Pen-Touch/Speech User Interfaces for Form-Based Application in Mobile Environment.

[BibT_eX]

[DOI]

Takahiro Nakano

Proceedings of the Advances in Multimodal Interfaces, 2000

1999

A Retrieval System of Broadcast News Speech Documents through Keyboard and Voice.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

HMM composition of segmental unit input HMM for noisy speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998

Text-independent speaker recognition using non-linear frame likelihood transformation.

[BibT_eX]

[DOI]

Speech Commun., 1998

Comparison of continuous speech recognition systems with unknown-word processing for speech disfluencies.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1998

Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Discriminative training of GMM using a modified EM algorithm for speaker recognition.

[BibT_eX]

[DOI]

Text-independent speaker recognition using multiple information sources.

[BibT_eX]

[DOI]

Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system.

[BibT_eX]

[DOI]

Yoshifumi Hirose

Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependency.

[BibT_eX]

[DOI]

Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique.

[BibT_eX]

[DOI]

Yukiko Fujisawa

1997

Speech recognition using hidden Markov models based on segmental statistics.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1997

An English conversation and pronunciation CAI system using speech recognition technology.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Automatic detection of accent in English words spoken by Japanese students.

[BibT_eX]

[DOI]

Nariaki Ohashi

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speaker verification using frame and utterance level likelihood normalization.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

A Robust Dialogue System with Spontaneous Speech Understanding and Cooperative Response.

[BibT_eX]

[DOI]

Proceedings of the Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications@ACL/EACL 1997, 1997

1996

Prosodic manipulation system of speech material for perceptual experiments.

[BibT_eX]

[DOI]

Keikichi Hirose

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Automatic detection of accent nuclei at the head of words for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Evaluation of segmental unit input HMM.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

A Comparative Study of Output Probability Functions in HMMs.

[BibT_eX]

[DOI]

Li Zhao

IEICE Trans. Inf. Syst., 1995

Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 1995

Comparative evaluation of segmental unit input HMM and conditional density HMM.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Investigation on unknown word processing and strategies for spontaneous speech understanding.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994

Estimation of the probability density function and a <i>posteriori</i> probability by neural networks, and applications to vowel recognition.

[BibT_eX]

[DOI]

Yoshiyuki Ono

Syst. Comput. Jpn., 1994

A context-free grammar-driven, one-pass HMM-based continuous speech recognition method.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1994

A comparison study of output probability functions in HMMs through spoken digit recognition.

[BibT_eX]

[DOI]

Li Zhao

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation.

[BibT_eX]

[DOI]

Yutaka Tsurumi

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Three language identification methods based on HMMs.

[BibT_eX]

[DOI]

Allan A. Reyes

Takashi Seino

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Concept and grammar acquisition based on combining with visual and auditory information.

[BibT_eX]

[DOI]

Mikio Masukata

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Evaluation of unknown word processing in a spoken word recognition system.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993

Spoken language identification using ergodic HMM with emphasized state transition.

[BibT_eX]

[DOI]

Takashi Seino

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Evaluation of VQ-distortion based HMM.

[BibT_eX]

[DOI]

Li Zhao

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A new speech recognition method based on VQ-distortion measure and HMM.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1993

1992

Speech recognition using various sequential networks.

[BibT_eX]

[DOI]

Isao Hayakawa

Syst. Comput. Jpn., 1992

Speaker-independent, text-independent language identification by HMM.

[BibT_eX]

[DOI]

Yoshio Ueda

Takashi Seino

Proceedings of the Second International Conference on Spoken Language Processing, 1992

A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Relationship among phoneme/word recognition rate, perplexity and sentence recognition and comparison of language models.

[BibT_eX]

[DOI]

Isao Murase

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991

The syntax-oriented spoken Japanese understanding system SPOJOS-SYNO II.

[BibT_eX]

[DOI]

Isao Murase

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1990

Segmentation of continuous speech by HMM and bayesian probability.

[BibT_eX]

[DOI]

Yasuhide Hashimoto

Syst. Comput. Jpn., 1990

Diction for phoneme/syllable/word-category and identification of language using HMM.

[BibT_eX]

[DOI]

Yoshio Ueda

Proceedings of the First International Conference on Spoken Language Processing, 1990

Sentence recognition method using word cooccurrence probability and its evaluation.

[BibT_eX]

[DOI]

Isao Murase

Proceedings of the First International Conference on Spoken Language Processing, 1990

Speaker adaptation of continuous parameter HMM.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Comparison among time-delay neural networks, LVQ2 discrete parameter HMM and continuous parameter HMM.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

The syntax-oriented speech understanding system - SPOJUS-SYNO.

[BibT_eX]

[DOI]

Yoshihisa Ohguro

Yasuhide Hashimoto

Proceedings of the First European Conference on Speech Communication and Technology, 1989

A lOObit/s speech coding using a speech recognition technique.

[BibT_eX]

[DOI]

Proceedings of the First European Conference on Speech Communication and Technology, 1989

1988

A method for continuous speech segmentation using HMM.

[BibT_eX]

[DOI]

Yasuhide Hashimoto

Proceedings of the 9th International Conference on Pattern Recognition, 1988

1987

Speaker-independent word recognition by less cost and stochastic dynamic time warping method.

[BibT_eX]

[DOI]

Hirobumi Nakanishi

Proceedings of the European Conference on Speech Technology, 1987

Spoken sentence recognition by time-synchronous parsing algorithm of context-free grammar.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1987

1986

Syllable-based connected spoken word recognition by two pass O(n) DP matching and hidden Markov models.

[BibT_eX]

[DOI]

Mohammed M. Jilan

Proceedings of the IEEE International Conference on Acoustics, 1986

On quick word spotting techniques.

[BibT_eX]

[DOI]

Alexander G. Hauptmann

Masaru Tomita

Proceedings of the IEEE International Conference on Acoustics, 1986

1985

A connected spoken word recognition algorithm by augmented continuous DP matching.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1985

1984

Connected spoken word recognition algorithms by constant time delay DP, O(n) DP and augmented continuous DP matching.

[BibT_eX]

[DOI]

Inf. Sci., 1984

1983

A Recognition Method of Connected Spoken Words With Syntactical Constraints by Augmented Continuous DP Algorithm.

[BibT_eX]

[DOI]

Proceedings of the 8th International Joint Conference on Artificial Intelligence. Karlsruhe, 1983

A connected spoken word recognition method by O(n) dynamic programming pattern matching algorithm.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1983

1979

A Parallel Tree Search Method.

[BibT_eX]

Toshiyuki Sakai

Proceedings of the Sixth International Joint Conference on Artificial Intelligence, 1979

1978

A word recognition method from a classified phoneme string in the Lithan speech understanding system.

[BibT_eX]

[DOI]