Masafumi Nishimura

Orcid: 0000-0001-7633-9340

Affiliations:
  • IBM Research


According to our database1, Masafumi Nishimura authored at least 80 papers between 1984 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Utterance-style-dependent Speaker Verification by Utilizing Emotions.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Eating and Drinking Behavior Recognition Using Multimodal Fusion.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Multi-Self-Supervised Learning Model-Based Throat Microphone Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Automatic Detection of Crushing Completion Timing of Food.
Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Identification of vocal tract state before and after swallowing using acoustic features.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

Throat microphone speech recognition using wav2vec 2.0 and feature mapping.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021
Automatic Detection of Chewing and Swallowing.
Sensors, 2021

Tablet-Based Automatic Assessment for Early Detection of Alzheimer's Disease Using Speech Responses to Daily Life Questions.
Frontiers Digit. Health, 2021

A Study for Detecting Mild Cognitive Impairment by Analyzing Conversations with Humanoid Robots.
Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021

Automatic Detection of Chewing and Swallowing Using Multichannel Sound Information.
Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021

Automatic Detection of Chewing and Swallowing Using Attention-Based Fusion.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

Question Generation using Knowledge Graphs with the T5 Language Model and Masked Self-Attention.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

2020
Automatic Detection of the Chewing Side Using Two-channel Recordings under the Ear.
Proceedings of the 2nd IEEE Global Conference on Life Sciences and Technologies, 2020

A data augmentation-based technique to classify chewing and swallowing using LSTM.
Proceedings of the 2nd IEEE Global Conference on Life Sciences and Technologies, 2020

Automatic Detection of Chewing and Swallowing Using Hybrid CTC/Attention.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

BERT-based Automatic Text Scoring for Collaborative Learning.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

A Data Augmentation Technique for Automatic Detection of Chewing Side and Swallowing.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Multimodal Behavior Analysis Towards Detecting Mild Cognitive Impairment: Preliminary Results on Gait and Speech.
Proceedings of the MEDINFO 2019: Health and Wellbeing e-Networks for All, 2019

Knowledge Distillation for Throat Microphone Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Effects of Mounting Position on Throat Microphone Speech Recognition.
Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019

Estimation of Number of Chewing Strokes and Swallowing Events by Using LSTM-CTC and Throat Microphone.
Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019

2018
Detecting breathing sounds in realistic Japanese telephone conversations and its application to automatic speech recognition.
Speech Commun., 2018

Dialogue Breakdown Detection Based on Nonlinguistic Acoustic Information.
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018

Bottleneck feature-mediated DNN-based feature mapping for throat microphone speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Deep learning-based water-intake estimation method using second half of swallowing sound.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

A Deep-Learning-Based Method of Estimating Water Intake.
Proceedings of the 41st IEEE Annual Computer Software and Applications Conference, 2017

DNN-based feature transformation for speech recognition using throat microphone.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2015
Discriminative re-ranking for automatic speech recognition by leveraging invariant structures.
Speech Commun., 2015

A metric for evaluating speech recognizer output based on human-perception model.
Proceedings of the INTERSPEECH 2015, 2015

2014
Regularized feature-space discriminative adaptation for robust ASR.
Proceedings of the INTERSPEECH 2014, 2014

Leveraging phonetic context dependent invariant structure for continuous speech recognition.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

2013
Channel-mapping for speech corpus recycling.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Acoustically discriminative language model training with pseudo-hypothesis.
Speech Commun., 2012

Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech.
Speech Commun., 2012

Discriminative Reranking for LVCSR Leveraging Invariant Structure.
Proceedings of the INTERSPEECH 2012, 2012

Model-based noise reduction leveraging frequency-wise confidence metric for in-car speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Agglomerative Hierarchical Clustering of Emotions in Speech Based on Subjective Relative Similarity.
Proceedings of the INTERSPEECH 2011, 2011

Continuous Digits Recognition Leveraging Invariant Structure.
Proceedings of the INTERSPEECH 2011, 2011

Acoustic Model Training with Detecting Transcription Errors in the Training Data.
Proceedings of the INTERSPEECH 2011, 2011

Breath-Detection-Based Telephony Speech Phrasing.
Proceedings of the INTERSPEECH 2011, 2011

Combining Feature Space Discriminative Training with Long-Term Spectro-Temporal Features for Noise-Robust Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Training of error-corrective model for ASR without using audio data.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Dynamic Features in the Linear-Logarithmic Hybrid Domain for Automatic Speech Recognition in a Reverberant Environment.
IEEE J. Sel. Top. Signal Process., 2010

Long-Term Spectro-Temporal and Static Harmonic Features for Voice Activity Detection.
IEEE J. Sel. Top. Signal Process., 2010

DOA Estimation with Local-Peak-Weighted CSP.
EURASIP J. Adv. Signal Process., 2010

Speech synthesis by modeling harmonics structure with multiple function.
Proceedings of the INTERSPEECH 2010, 2010

Improved voice activity detection using static harmonic features.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Japanese pitch conversion for voice morphing based on differential modeling.
Proceedings of the INTERSPEECH 2009, 2009

Dynamic features in the linear domain for robust automatic speech recognition in a reverberant environment.
Proceedings of the INTERSPEECH 2009, 2009

Acoustically discriminative training for language models.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Local Peak Enhancement for In-Car Speech Recognition in Noisy Environment.
IEICE Trans. Inf. Syst., 2008

Short- and long-term dynamic features for robust speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detection.
Proceedings of the INTERSPEECH 2008, 2008

Improving phoneme and accent estimation by leveraging a dictionary for a stochastic TTS front-end.
Proceedings of the IEEE International Conference on Acoustics, 2008

Local peak enhancement combined with noise reduction algorithms for robust automatic speech recognition in automobiles.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Automatic Prosody Labeling Using Multiple Models for Japanese.
IEICE Trans. Inf. Syst., 2007

Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone.
Proceedings of the INTERSPEECH 2007, 2007

Determining Recording Location Based on Synchronization Positions of Audiowatermarking.
Proceedings of the IEEE International Conference on Acoustics, 2007

Unsupervised Lexicon Acquisition from Speech and Text.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Acoustic Model Adaptation Using First-Order Linear Prediction for Reverberant Speech.
IEICE Trans. Inf. Syst., 2006

Estimation of recording location using audio watermarking.
Proceedings of the 8th workshop on Multimedia & Security, 2006

Unsupervised Adaptation of a Stochastic Language Model Using a Japanese Raw Corpus.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Simultaneous Adaptation of Echo Cancellation and Spectral Subtraction for In-Car Speech Recognition.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005

A stochastic approach to phoneme and accent estimation.
Proceedings of the INTERSPEECH 2005, 2005

2004
Improved HMM Separation for Distant-Talking Speech Recognition.
IEICE Trans. Inf. Syst., 2004

Sound Source Localization Using a Profile Fitting Method with Sound Reflectors.
IEICE Trans. Inf. Syst., 2004

Acoustic model adaptation using first order prediction for reverberant speech.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Language model adaptation using word clustering.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2001
Improvement of a structured language model: arbori-context tree.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
A method for style adaptation to spontaneous speech by using a semi-linear interpolation technique.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A Stochastic Parser Based on a Structural Word Prediction Model.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

1998
Word clustering for a word bi-gram model.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1991
Speaker adaptation method for fenonic markov model-based speech recognition.
Syst. Comput. Jpn., 1991

1989
HMM-based speech recognition using dynamic spectral feature.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
Speaker adaptation method for HMM-based speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1988

1987
HMM-Based speech recognition using multi-dimensional multi-labeling.
Proceedings of the IEEE International Conference on Acoustics, 1987

1986
Speaker adaptation for a hidden Markov model.
Proceedings of the IEEE International Conference on Acoustics, 1986

1985
Isolated word recognition using hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 1985

1984
A method for recognizing Japanese monosyllables by using intermediate cumulative distance.
Proceedings of the IEEE International Conference on Acoustics, 1984


  Loading...