Satoru Hayamizu

According to our database, Satoru Hayamizu authored at least 86 papers between 1986 and 2021.

Bibliography

2021
Multi-Angle Lipreading with Angle Classification-Based Feature Extraction and Its Application to Audio-Visual Speech Recognition.
Future Internet, 2021

Combination of temporal and spatial denoising methods for cine MRI.
Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021

Speech Recognition using Deep Canonical Correlation Analysis in Noisy Environments.
Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods, 2021

Anomalous Sound Detection Based On Attention Mechanism.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Multi-angle lipreading using angle classification and angle-specific feature integration.
Proceedings of the International Conference on Communications, 2020

2018
A Deep Learning-Based Approach for Road Pothole Detection in Timor Leste.
Proceedings of the 2018 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI), Singapore, Singapore, July 31, 2018

Audio-visual Voice Conversion Using Deep Canonical Correlation Analysis for Deep Bottleneck Features.
Proceedings of the Interspeech 2018, 2018

Toward a High Performance Piano Practice Support System for Beginners.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Lipreading using deep bottleneck features for optical and depth images.
Proceedings of the Auditory-Visual Speech Processing, 2017

Toward effective noise reduction for sub-Nyquist high-frame-rate MRI techniques with deep learning.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Swallowing function evaluation using deep-learning-based acoustic signal processing.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Investigation of DNN-Based Audio-Visual Speech Recognition.
IEICE Trans. Inf. Syst., 2016

Spoken Document Retrieval Using Neighboring Documents and Extended Language Models for Query Likelihood Model.
Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016

Investigation of clinical process visualization using EMR data in clinics.
Proceedings of the AMIA 2016, 2016

2015
Multi-modal service operation estimation using DNN-based acoustic bag-of-features.
Proceedings of the 23rd European Signal Processing Conference, 2015

Stream weight estimation using higher order statistics in multi-modal speech recognition.
Proceedings of the Auditory-Visual Speech Processing, 2015

Audio-visual speech recognition using deep bottleneck features and high-performance lipreading.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Data collection for mobile audio-visual speech recognition in various environments.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Segmented Spoken Document Retrieval Using Word Co-occurrence Information.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

Audio-visual voice conversion using noise-robust features.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improvement of utterance clustering by using employees' sound and area data.
Proceedings of the IEEE International Conference on Acoustics, 2014

Analysis of customer communication by employee in restaurant and lead time estimation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Probabilistic expression of Polynomial Semantic Indexing and its application for classification.
Pattern Recognit. Lett., 2013

Measurement and analysis of speech data toward improving service in restaurant.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Spoken Document Retrieval Using Extended Query Model and Web Documents.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Hidden Markov Model for Analyzing Time-Series Health Checkup Data.
Proceedings of the MEDINFO 2013, 2013

Improvement of lipreading performance using discriminative feature and speaker adaptation.
Proceedings of the Auditory-Visual Speech Processing, 2013

Audio-visual interaction in sparse representation features for noise robust audio-visual speech recognition.
Proceedings of the Auditory-Visual Speech Processing, 2013

Confidence estimation and keyword extraction from speech recognition result based on Web information.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Time-series analysis of health checkup data using Hidden-Markov model.
Proceedings of the AMIA 2013, 2013

Improvement of Lip Reading Performance in Real Environments Using Speaker and Environmental Adaptation.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

2012
Visual Analysis of Health Checkup Data Using Multidimensional Scaling.
J. Adv. Comput. Intell. Intell. Informatics, 2012

Sparse representation of audio features for sputum detection from lung sounds.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

GIF-LR: GA-based informative feature for lipreading.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

GIF-SP: GA-based informative feature for noisy speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Multi-stream acoustic model adaptation for noisy speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Feature reconstruction using sparse imputation for noise robust audio-visual speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Statistical voice conversion using GA-based informative feature.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Toward polyphonic musical instrument identification using example-based sparse representation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Toward improvement of SDR accuracy using LDA and query expansion for SpokenDoc.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

2010
A robust audio-visual speech recognition using audio-visual voice activity detection.
Proceedings of the INTERSPEECH 2010, 2010

Template-based spectral estimation using microphone array for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Topic-dependent n-gram models based on optimization of context lengths in LDA.
Proceedings of the INTERSPEECH 2010, 2010

Decision fusion by boosting method for multi-modal voice activity detection.
Proceedings of the Auditory-Visual Speech Processing, 2010

Evaluation of real-time audio-visual speech recognition.
Proceedings of the Auditory-Visual Speech Processing, 2010

2009
Voice activity detection based on fusion of audio and visual information.
Proceedings of the Auditory-Visual Speech Processing, 2009

2008
CENSREC-AV: evaluation frameworks for audio-visual speech recognition.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007
GEMSIS - a novel application of speech recognition to emergency and disaster medicine.
Proceedings of the INTERSPEECH 2007, 2007

2006
Note-Taking Support for Nurses Using Digital Pen Character Recognition System.
Proceedings of the Interactive Technologies and Sociotechnical Systems, 2006

Automatic metadata generation and video editing based on speech and image recognition for medical education contents.
Proceedings of the INTERSPEECH 2006, 2006

2005
An Auto-Regressive, Non-Stationary Excited Signal Parameter Estimation Method and an Evaluation of a Singing-Voice Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2002
Speech completion: on-demand completion assistance using filled pauses for speech input interfaces.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Jijo-2: An Office Robot that Communicates and Learns.
IEEE Intell. Syst., 2001

2000
Speech enhancement based on the subspace method.
IEEE Trans. Speech Audio Process., 2000

Interactive Learning and Management of Visual Information via Human-like Software Robot.
New Gener. Comput., 2000

Multimodal corpora for human-machine interaction research.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
A multimodal database of gestures and speech.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A real-time filled pause detection system for spontaneous speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A spoken dialog system for a mobile office robot.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Gesture Recognition Using HLAC Features of PARCOR Images and HMM Based Recognizer.
Proceedings of the 3rd International Conference on Face & Gesture Recognition (FG '98), 1998

1997
Socially Embedded Learning of the Office-Conversant Mobile Robot Jijo-2.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

Information Integration of the Office-Conversant Mobile Robot Jijo-2.
Proceedings of the Progress in Connectionist-Based Information Systems: Proceedings of the 1997 International Conference on Neural Information Processing and Intelligent Information Systems, 1997

Speech enhancement using CSS-based array processing.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Are Listeners Paying Attention to the Hand Gestures of an Anthropomorphic Agent? An Evaluation Using a Gaze Tracking Method.
Proceedings of the Gesture and Sign Language in Human-Computer Interaction, 1997

1996
Combining probabilistic map and dialog for robust life-long office navigation.
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS 1996, 1996

Pitch pattern clustering of user utterances in human-machine dialogue.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

RWC multimodal database for interactions by integration of spoken language and visual information.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
Active Agent Oriented Multimodal Interface System.
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

1994
Annotating illocutionary force types and phonological features into a spontaneous dialogue corpus: an experimental study.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Generating phoneme models for forming phonological concepts.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Collecting and analyzing nonverbal elements for maintenance of dialog using a wizard of oz simulation.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Statistical modeling and recognition of rhythm in speech.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Word accent patterns modelling by concatenation of mora hidden Markov models.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Prediction of protein secondary structure by the hidden Markov model.
Comput. Appl. Biosci., 1993

Detection of unknown words in large vocabulary speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Formation of phonological concept structures from spoken word samples.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Detection of unknown words and automatic estimation of their transcriptions in continuous speech recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

A spoken language dialogue system for automatic collection of spontaneous speech.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Continuous speech recognition by context-dependent phonetic HMM and an efficient algorithm for finding N-Best sentence hypotheses.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Dividing the distributions of HMM and linear interpolation in speech recognition.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1990
Improved Hidden Markov Modeling for Speaker-Independent Continuous Speech Recognition.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

The ETL speech database for speech analysis and recognition research.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Description of acoustic variations by tree-based phone modeling.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Allophone clustering for continuous speech recognition.
Proceedings of the 1990 International Conference on Acoustics, 1990

1988
A large vocabulary word recognition system using rule-based network representation of acoustic characteristic variations.
Proceedings of the IEEE International Conference on Acoustics, 1988

1986
A demiphoneme network representation of speech and automatic labeling techniques for speech data base construction.
Proceedings of the IEEE International Conference on Acoustics, 1986

