Satoru Hayamizu

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Toward a High Performance Piano Practice Support System for Beginners.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Lipreading using deep bottleneck features for optical and depth images.

[BibT_eX]

[DOI]

Koichi Miyazaki

Proceedings of the 14th International Conference on Auditory-Visual Speech Processing, 2017

Toward effective noise reduction for sub-Nyquist high-frame-rate MRI techniques with deep learning.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Swallowing function evaluation using deep-learning-based acoustic signal processing.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Investigation of DNN-Based Audio-Visual Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2016

Spoken Document Retrieval Using Neighboring Documents and Extended Language Models for Query Likelihood Model.

[BibT_eX]

[DOI]

Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016

Investigation of clinical process visualization using EMR data in clinics.

[BibT_eX]

[DOI]

Proceedings of the AMIA 2016, 2016

2015

Multi-modal service operation estimation using DNN-based acoustic bag-of-features.

[BibT_eX]

[DOI]

Proceedings of the 23rd European Signal Processing Conference, 2015

Stream weight estimation using higher order statistics in multi-modal speech recognition.

[BibT_eX]

[DOI]

Kazuto Ukai

Proceedings of the Auditory-Visual Speech Processing, 2015

Audio-visual speech recognition using deep bottleneck features and high-performance lipreading.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Data collection for mobile audio-visual speech recognition in various environments.

[BibT_eX]

[DOI]

Seko Takumi

Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Segmented Spoken Document Retrieval Using Word Co-occurrence Information.

[BibT_eX]

[DOI]

Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

Audio-visual voice conversion using noise-robust features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Improvement of utterance clustering by using employees' sound and area data.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Analysis of customer communication by employee in restaurant and lead time estimation.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

Probabilistic expression of Polynomial Semantic Indexing and its application for classification.

[BibT_eX]

[DOI]

Kentaro Minoura

Pattern Recognit. Lett., 2013

Measurement and analysis of speech data toward improving service in restaurant.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Spoken Document Retrieval Using Extended Query Model and Web Documents.

[BibT_eX]

[DOI]

Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Hidden Markov Model for Analyzing Time-Series Health Checkup Data.

[BibT_eX]

[DOI]

Proceedings of the MEDINFO 2013, 2013

Improvement of lipreading performance using discriminative feature and speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2013

Audio-visual interaction in sparse representation features for noise robust audio-visual speech recognition.

[BibT_eX]

[DOI]

Peng Shen

Proceedings of the Auditory-Visual Speech Processing, 2013

Confidence estimation and keyword extraction from speech recognition result based on Web information.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Time-series analysis of health checkup data using Hidden-Markov model.

[BibT_eX]

[DOI]

Proceedings of the AMIA 2013, 2013

Improvement of Lip Reading Performance in Real Environments Using Speaker and Environmental Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

2012

Visual Analysis of Health Checkup Data Using Multidimensional Scaling.

[BibT_eX]

[DOI]

J. Adv. Comput. Intell. Intell. Informatics, 2012

Sparse representation of audio features for sputum detection from lung sounds.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Pattern Recognition, 2012

GIF-LR: GA-based informative feature for lipreading.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

GIF-SP: GA-based informative feature for noisy speech recognition.

[BibT_eX]

[DOI]

Yoji Tagami

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Multi-stream acoustic model adaptation for noisy speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Feature reconstruction using sparse imputation for noise robust audio-visual speech recognition.

[BibT_eX]

[DOI]

Peng Shen

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Statistical voice conversion using GA-based informative feature.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Toward polyphonic musical instrument identification using example-based sparse representation.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Toward improvement of SDR accuracy using LDA and query expansion for SpokenDoc.

[BibT_eX]

[DOI]

Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

2010

A robust audio-visual speech recognition using audio-visual voice activity detection.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Template-based spectral estimation using microphone array for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Topic-dependent n-gram models based on optimization of context lengths in LDA.

[BibT_eX]

[DOI]

Akira Nakamura

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Decision fusion by boosting method for multi-modal voice activity detection.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2010

Evaluation of real-time audio-visual speech recognition.

[BibT_eX]

[DOI]

Peng Shen

Proceedings of the Auditory-Visual Speech Processing, 2010

2009

Voice activity detection based on fusion of audio and visual information.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2009

2008

CENSREC-AV: evaluation frameworks for audio-visual speech recognition.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007

GEMSIS - a novel application of speech recognition to emergency and disaster medicine.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006

Note-Taking Support for Nurses Using Digital Pen Character Recognition System.

[BibT_eX]

[DOI]

Proceedings of the Interactive Technologies and Sociotechnical Systems, 2006

Automatic metadata generation and video editing based on speech and image recognition for medical education contents.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005

An Auto-Regressive, Non-Stationary Excited Signal Parameter Estimation Method and an Evaluation of a Singing-Voice Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2002

Speech completion: on-demand completion assistance using filled pauses for speech input interfaces.

[BibT_eX]

[DOI]

Masataka Goto

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Jijo-2: An Office Robot that Communicates and Learns.

[BibT_eX]

[DOI]

IEEE Intell. Syst., 2001

2000

Speech enhancement based on the subspace method.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2000

Interactive Learing and Management of Visual Inforamtion via Human-like Software Robot.

[BibT_eX]

[DOI]

Osamu Hasegawa

Katsuhiko Sakaue

New Gener. Comput., 2000

Multimodal corpora for human-machine interaction research.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999

A multimodal database of gestures and speech.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A real-time filled pause detection system for spontaneous speech recognition.

[BibT_eX]

[DOI]

Masataka Goto

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A spoken dialog system for a mobile office robot.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998

Gesture Recognition Using HLAC Features of PARCOR Images and HMM Based Recognizer.

[BibT_eX]

Takio Kurita

Proceedings of the 3rd International Conference on Face & Gesture Recognition (FG '98), 1998

1997

Socially Embedded Learning of the Office-Conversant Mobil Robot Jijo-2.

[BibT_eX]

[DOI]

Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

Information Integration of the Office-Conversant Mobile Robot Jijo-2.

[BibT_eX]

Proceedings of the Progress in Connectionist-Based Information Systems: Proceedings of the 1997 International Conference on Neural Information Processing and Intelligent Information Systems, 1997

Speech enhancement using CSS-based array processing.

[BibT_eX]

[DOI]

Futoshi Asano

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Are Listeners Paying Attention to the Hand Gestures of an Anthropomorphic Agent? An Evaluation Using a Gaze Tracking Method.

[BibT_eX]

[DOI]

Proceedings of the Gesture and Sign Language in Human-Computer Interaction, 1997

1996

Combining probabilistic map and dialog for robust life-long office navigation.

[BibT_eX]

[DOI]

Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS 1996, 1996

Pitch pattern clustering of user utterances in human-machine dialogue.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

RWC multimodal database for interactions by integration of spoken language and visual information.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995

Active Agent Oriented Multimodal Interface System.

[BibT_eX]

[DOI]

Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

1994

Annotating illocutionary force types and phonological features into a spontaneous dialogue corpus: an experimental study.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Generating phoneme models for forming phonological concepts.

[BibT_eX]

[DOI]

Hiroaki Kojima

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Collecting and analyzing nonverbal elements for maintenance of dialog using a wizard of oz simulation.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Statistical modeling and recognition of rhythm in speech.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Word accent patterns modelling by concatenation of mora hidden Markov models.

[BibT_eX]

[DOI]

Takashi Yoshimura

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993

Prediction of protein secondary structure by the hidden Markov model.

[BibT_eX]

[DOI]

Kiyoshi Asai

Ken'ichi Handa

Comput. Appl. Biosci., 1993

Detection of unknown words in large vocabulary speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992

Formation of phonological concept structures from spoken word samples.

[BibT_eX]

[DOI]

Hiroaki Kojima

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Detection of unknown words and automatic estimation of their transcriptions in continuous speech recognition.

[BibT_eX]

[DOI]

Hozumi Tanaka

Proceedings of the Second International Conference on Spoken Language Processing, 1992

A spoken language dialogue system for automatic collection of spontaneous speech.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Continuous speech recognition by context-dependent phonetic HMM and an efficient algorithm for finding N-Best sentence hypotheses.

[BibT_eX]

[DOI]

Hozumi Tanaka

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Dividing the distributions of HMM and linear interpolation in speech recognition.

[BibT_eX]

[DOI]

Kiyoshi Asai

Ken'ichi Handa

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1990

Improved Hidden Markov Modeling for Speaker-Independent Continuous Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

The ETL speech database for speech analysis and recognition research.

[BibT_eX]

[DOI]

Kozo Ohta

Proceedings of the First International Conference on Spoken Language Processing, 1990

Description of acoustic variations by tree-based phone modeling.

[BibT_eX]

[DOI]

Kai-Fu Lee

Hsiao-Wuen Hon

Proceedings of the First International Conference on Spoken Language Processing, 1990

Allophone clustering for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1988

A large vocabulary word recognition system using rule-based network representation of acoustic characteristic variations.

[BibT_eX]

[DOI]

Kozo Ohta

Proceedings of the IEEE International Conference on Acoustics, 1988

1986

A demiphoneme network representation of speech and automatic labeling techniques for speech data base construction.

[BibT_eX]

[DOI]