Ken'ichi Kumatani

According to our database1, Ken'ichi Kumatani authored at least 48 papers between 2001 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
Deploying self-supervised learning in the wild for hybrid automatic speech recognition.
CoRR, 2022

2021
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition.
CoRR, 2021

Tackling Dynamics in Federated Incremental Learning with Variational Embedding Rehearsal.
CoRR, 2021

Multilingual Speech Recognition using Knowledge Transfer across Learning Processes.
CoRR, 2021

Dynamic Gradient Aggregation for Federated Domain Adaptation.
CoRR, 2021

One-Shot Voice Conversion with Speaker-Agnostic StarGAN.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data.
Proceedings of the 38th International Conference on Machine Learning, 2021

Ensemble Combination between Different Time Segmentations.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Federated Transfer Learning with Dynamic Gradient Aggregation.
CoRR, 2020

Sequence-Level Self-Learning with Multiple Hypotheses.
Proceedings of the Interspeech 2020, 2020

A Federated Approach in Training Acoustic Models.
Proceedings of the Interspeech 2020, 2020

Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Robust Multi-Channel Speech Recognition Using Frequency Aligned Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Time-Delayed Bottleneck Highway Networks Using a DFT Feature for Keyword Spotting.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Direct modeling of raw audio with DNNS for wake word detection.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2015
Free energy for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2013
Speaker tracking with spherical microphone arrays.
Proceedings of the IEEE International Conference on Acoustics, 2013

Joint constrained maximum likelihood regression for overlapping speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors.
IEEE Signal Process. Mag., 2012

A signal-separation-based array postfilter for distant speech recognition.
Proceedings of the INTERSPEECH 2012, 2012

Microphone Array Post-filter based on Spatially-Correlated Noise Measurements for Distant Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Microphone array processing for distant speech recognition: Spherical arrays.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Microphone array processing for distant speech recognition: Towards real-world deployment.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Microphone Arrays.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011
On the combination of voice prompt suppression with maximum kurtosis beamforming.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Block-wise incremental adaptation algorithm for maximum kurtosis beamforming.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Maximum kurtosis beamforming with a subspace filter for distant speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

An information filter for voice prompt suppression.
Proceedings of the Conference Record of the Forty Fifth Asilomar Conference on Signals, 2011

2010
Subband beamforming with higher order statistics for distant speech recognition.
PhD thesis, 2010

Maximum negentropy beamforming with superdirectivity.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Beamforming With a Maximum Negentropy Criterion.
IEEE Trans. Speech Audio Process., 2009

2008
A Neural Network Based Regression Approach for Recognizing Simultaneous Speech.
Proceedings of the Machine Learning for Multimodal Interaction, 5th International Workshop, 2008

Maximum kurtosis beamforming with the generalized sidelobe canceller.
Proceedings of the INTERSPEECH 2008, 2008

Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Adaptive Beamforming With a Minimum Mutual Information Criterion.
IEEE Trans. Speech Audio Process., 2007

To Separate Speech.
Proceedings of the Machine Learning for Multimodal Interaction , 2007

State Synchronous Modeling on Phone Boundary for Audio Visual Speech Recognition and Application to Muti-View Face Images.
Proceedings of the IEEE International Conference on Acoustics, 2007

Minimum mutual information beamforming for simultaneous active speakers.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
The ISL RT-06S Speech-to-Text System.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Mouth Region Localization Method Based on Gaussian Mixture Model.
Proceedings of the Advances in Machine Vision, 2006

Advances in lecture recognition: the ISL RT-06s evaluation system.
Proceedings of the INTERSPEECH 2006, 2006

2002
Multi-Modal Temporal Asynchronicity Modeling by Product HMMs for Robust.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Robust bi-modal speech recognition based on state synchronous modeling and stream weight optimization.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Speech Detection By Facial Image For Multimodal Speech Recognition.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

An Adaptive Integration Based On Product Hmm For Audio-Visual Speech Recognition.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001


  Loading...