Ken'ichi Kumatani

CoRR, 2021

Multilingual Speech Recognition using Knowledge Transfer across Learning Processes.

CoRR, 2021

Dynamic Gradient Aggregation for Federated Domain Adaptation.

CoRR, 2021

One-Shot Voice Conversion with Speaker-Agnostic StarGAN.

Sefik Emre Eskimez

Robert Gmyr

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data.

Proceedings of the 38th International Conference on Machine Learning, 2021

Ensemble Combination between Different Time Segmentations.

Jeremy Heng Meng Wong

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Federated Transfer Learning with Dynamic Gradient Aggregation.

CoRR, 2020

Sequence-Level Self-Learning with Multiple Hypotheses.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Federated Approach in Training Acoustic Models.

Sree Hari Krishnan Parthasarathi

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Robust Multi-Channel Speech Recognition Using Frequency Aligned Network.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning.

Ladislav Mosner

Minhua Wu

Anirudh Raju

Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Time-Delayed Bottleneck Highway Networks Using a DFT Feature for Keyword Spotting.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Direct modeling of raw audio with DNNs for wake word detection.

Sankaran Panchapagesan

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2015

Free energy for speech recognition.

Rita Singh

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2013

Speaker tracking with spherical microphone arrays.

Proceedings of the IEEE International Conference on Acoustics, 2013

Joint constrained maximum likelihood regression for overlapping speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors.

IEEE Signal Process. Mag., 2012

A signal-separation-based array postfilter for distant speech recognition.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Microphone Array Post-filter based on Spatially-Correlated Noise Measurements for Distant Speech Recognition.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Microphone array processing for distant speech recognition: Spherical arrays.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Microphone array processing for distant speech recognition: Towards real-world deployment.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Microphone Arrays.

Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011

On the combination of voice prompt suppression with maximum kurtosis beamforming.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Block-wise incremental adaptation algorithm for maximum kurtosis beamforming.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Maximum kurtosis beamforming with a subspace filter for distant speech recognition.

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

An information filter for voice prompt suppression.

Proceedings of the Conference Record of the Forty Fifth Asilomar Conference on Signals, 2011

2010

Subband beamforming with higher order statistics for distant speech recognition.

PhD thesis, 2010

Maximum negentropy beamforming with superdirectivity.

Proceedings of the 18th European Signal Processing Conference, 2010

2009

Beamforming With a Maximum Negentropy Criterion.

IEEE Trans. Speech Audio Process., 2009

2008

A Neural Network Based Regression Approach for Recognizing Simultaneous Speech.

Proceedings of the Machine Learning for Multimodal Interaction, 5th International Workshop, 2008

Maximum kurtosis beamforming with the generalized sidelobe canceller.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming.

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Adaptive Beamforming With a Minimum Mutual Information Criterion.

IEEE Trans. Speech Audio Process., 2007

To Separate Speech.

Proceedings of the Machine Learning for Multimodal Interaction, 2007

State Synchronous Modeling on Phone Boundary for Audio Visual Speech Recognition and Application to Multi-View Face Images.

Rainer Stiefelhagen

Proceedings of the IEEE International Conference on Acoustics, 2007

Minimum mutual information beamforming for simultaneous active speakers.

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

The ISL RT-06S Speech-to-Text System.

Proceedings of the Machine Learning for Multimodal Interaction, 2006

Mouth Region Localization Method Based on Gaussian Mixture Model.

Rainer Stiefelhagen

Proceedings of the Advances in Machine Vision, 2006

Advances in lecture recognition: the ISL RT-06s evaluation system.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2002

Multi-Modal Temporal Asynchronicity Modeling by Product HMMs for Robust.

Satoshi Tamura

Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Robust bi-modal speech recognition based on state synchronous modeling and stream weight optimization.

Satoshi Tamura

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Speech Detection By Facial Image For Multimodal Speech Recognition.

Kazumasa Murai

Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

An Adaptive Integration Based On Product HMM For Audio-Visual Speech Recognition.
