Shinji Watanabe

Tomohiro Narita

EURASIP J. Adv. Signal Process., 2015

Uncertainty training and decoding methods of deep neural networks based on stochastic representation of enhanced features.

Proceedings of the INTERSPEECH 2015, 2015

Efficient learning for spoken language understanding tasks with word embedding based pre-training.

Yi Luan

Ramón Fernandez Astudillo

Bret Harsham

Proceedings of the INTERSPEECH 2015, 2015

Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks.

Proceedings of the INTERSPEECH 2015, 2015

Robust speech processing using observation uncertainty and uncertainty propagation: session and paper overview.

Ahmed Hussen Abdelaziz

Dorothea Kolossa

Proceedings of the INTERSPEECH 2015, 2015

Uncertainty propagation through deep neural networks.

Ahmed Hussen Abdelaziz

Proceedings of the INTERSPEECH 2015, 2015

Discriminative method for recurrent neural network language models.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Structure discovery of deep neural network based on evolutionary algorithms.

Takahiro Shinozaki

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR.

Proceedings of the Latent Variable Analysis and Signal Separation, 2015

Automation of system building for state-of-the-art large vocabulary speech recognition using evolution strategy.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Robust speech recognition in unknown reverberant and noisy conditions.

Sri Harish Reddy Mallidi

Hynek Hermansky

Stavros Tsakalidis

Richard M. Schwartz

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The MERL/SRI system for the 3rd CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Feature-space structural MAPLR with regression tree-based multiple transformation matrices for DNN.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Bayesian Speech and Language Processing

Jen-Tzung Chien

Cambridge University Press, ISBN: 9781107295360, 2015

2014

Structural Bayesian Linear Regression for Hidden Markov Models.

Biing-Hwang Fred Juang

J. Signal Process. Syst., 2014

Discriminative NMF and its application to single-channel source separation.

Proceedings of the INTERSPEECH 2014, 2014

Cost-level integration of statistical and rule-based dialog managers.

Proceedings of the INTERSPEECH 2014, 2014

Sequential maximum mutual information linear discriminant analysis for speech recognition.

Proceedings of the INTERSPEECH 2014, 2014

Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2014

Recurrent deep neural networks for robust speech recognition.

Chao Weng

Dong Yu

Biing-Hwang Fred Juang

Proceedings of the IEEE International Conference on Acoustics, 2014

Black box optimization for automatic speech recognition.

Jonathan Le Roux

Proceedings of the IEEE International Conference on Acoustics, 2014

Log-linear dialog manager.

Proceedings of the IEEE International Conference on Acoustics, 2014

Ensemble integration of calibrated speaker localization and statistical speech detection in domestic environments.

Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Sequence discriminative training for low-rank deep neural networks.

Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013

Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting.

IEEE Trans. Speech Audio Process., 2013

Influence relation estimation based on lexical entrainment in conversation.

Speech Commun., 2013

Prior-shared feature and model space speaker adaptation by consistently employing map estimation.

Speech Commun., 2013

Training data selection with user's physical characteristics data for acceleration-based activity modeling.

Takuya Maekawa

Pers. Ubiquitous Comput., 2013

Cluster-based dynamic variance adaptation for interconnecting speech enhancement pre-processor and speech recognizer.

Comput. Speech Lang., 2013

Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds.

Comput. Speech Lang., 2013

Ensemble learning for speech enhancement.

Jonathan Le Roux

John R. Hershey

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Blocked Gibbs sampling based multi-scale mixture model for speaker clustering on noisy data.

Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Discriminative training of acoustic models for system combination.

Proceedings of the INTERSPEECH 2013, 2013

Statistical Dialogue Management using Intention Dependency Graph.

Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Stereo-based feature enhancement using dictionary learning.

John R. Hershey

Proceedings of the IEEE International Conference on Acoustics, 2013

The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines.

Proceedings of the IEEE International Conference on Acoustics, 2013

Effectiveness of discriminative training and feature transformation for reverberated and noisy speech.

John R. Hershey

Proceedings of the IEEE International Conference on Acoustics, 2013

The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes.

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

A generalized discriminative training framework for system combination.

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Statistical Voice Conversion Based on Noisy Channel Model.

IEEE Trans. Speech Audio Process., 2012

Structural Classification Methods Based on Weighted Finite-State Transducers for Automatic Speech Recognition.

IEEE Trans. Speech Audio Process., 2012

Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera.

IEEE Trans. Speech Audio Process., 2012

Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection.

Speech Commun., 2012

Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model.

Proceedings of the INTERSPEECH 2012, 2012

Bag of Arcs: New representation of speech segment features based on finite state machines.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

MFCC enhancement using joint corrupted and noise feature space for highly non-stationary noise environments.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Effect of dialog acts on word use in polylogue.

Roland Roller

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Basis vector orthogonalization for an improved kernel gradient matching pursuit method.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Decoding network optimization using minimum transition error training.

Yotaro Kubo

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Noise suppression with unsupervised joint speaker adaptation and noise mixture model estimation.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Discriminative feature transforms using differenced maximum mutual information.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Handling uncertain observations in unsupervised topic-mixture language model adaptation.

Ekapol Chuangsuwanich

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Topic tracking language model for speech recognition.

Comput. Speech Lang., 2011

Bayesian linear regression for Hidden Markov Model based on optimizing variational bounds.

Biing-Hwang Juang

Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Unsupervised Activity Recognition with User's Physical Characteristics Data.

Takuya Maekawa

Proceedings of the 15th IEEE International Symposium on Wearable Computers (ISWC 2011), 2011

Model Adaptation for Automatic Speech Recognition Based on Multiple Time Scale Evolution.

Biing-Hwang Juang

Proceedings of the INTERSPEECH 2011, 2011

Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model.

Proceedings of the INTERSPEECH 2011, 2011

Learning Influences from Word Use in Polylogue.

Proceedings of the INTERSPEECH 2011, 2011

A Robust Estimation Method of Noise Mixture Model for Noise Suppression.

Proceedings of the INTERSPEECH 2011, 2011

Fashion Coordinates Recommender System Using Photographs from Fashion Magazines.

Hiroshi Sawada

Proceedings of the IJCAI 2011, 2011

Gibbs sampling based Multi-scale Mixture Model for speaker clustering.

Proceedings of the IEEE International Conference on Acoustics, 2011

High accurate model-integration-based voice conversion using dynamic features and model structure optimization.

Proceedings of the IEEE International Conference on Acoustics, 2011

Subspace pursuit method for kernel-log-linear models.

Proceedings of the IEEE International Conference on Acoustics, 2011

Non-stationary noise estimation method based on bias-residual component decomposition for robust speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2011

Variance Compensation for Recognition of Reverberant Speech with Dereverberation Preprocessing.

Marc Delcroix

Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

2010

Predictor-Corrector Adaptation by Using Time Evolution System With Macroscopic Time Scale.

IEEE Trans. Speech Audio Process., 2010

A Sequential Pattern Classifier Based on Hidden Markov Kernel Machine and Its Application to Phoneme Classification.

IEEE J. Sel. Top. Signal Process., 2010

Online Unsupervised Classification With Model Comparison in the Variational Bayes Framework for Voice Activity Detection.

IEEE J. Sel. Top. Signal Process., 2010

Application of topic tracking model to language model adaptation and meeting analysis.

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Real-time meeting recognition and understanding using distant microphones and omni-directional camera.

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data.

Takaaki Hori

Proceedings of the INTERSPEECH 2010, 2010

Probabilistic integration of joint density model and speaker model for voice conversion.

Proceedings of the INTERSPEECH 2010, 2010

A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination.

Proceedings of the INTERSPEECH 2010, 2010

Improvements of search error risk minimization in Viterbi beam search for speech recognition.

Takaaki Hori

Proceedings of the INTERSPEECH 2010, 2010

Voice activity detection using frame-wise model re-estimation method based on Gaussian pruning with weight normalization.

Proceedings of the INTERSPEECH 2010, 2010

Minimum Error Classification with geometric margin control.

Proceedings of the IEEE International Conference on Acoustics, 2010

A discriminative model for continuous speech recognition based on Weighted Finite State Transducers.

Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative training based on an integrated view of MPE and MMI in margin and error space.

Erik McDermott

Proceedings of the IEEE International Conference on Acoustics, 2010

Search error risk minimization in Viterbi beam search for speech recognition.

Takaaki Hori

Proceedings of the IEEE International Conference on Acoustics, 2010

Using online model comparison in the Variational Bayes framework for online unsupervised Voice Activity Detection.

Proceedings of the IEEE International Conference on Acoustics, 2010

Fast similarity search on a large speech data set with neighborhood graph indexing.

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Static and Dynamic Variance Compensation for Recognition of Reverberant Speech With Dereverberation Preprocessing.

Marc Delcroix

IEEE Trans. Speech Audio Process., 2009

Margin-space integration of MPE loss via differencing of MMI functionals for generalized error-weighted discriminative training.

Erik McDermott

Proceedings of the INTERSPEECH 2009, 2009

Stereo-input speech recognition using sparseness-based time-frequency masking in a reverberant environment.

Proceedings of the INTERSPEECH 2009, 2009

Topic Tracking Model for Analyzing Consumer Purchase Behavior.

Proceedings of the IJCAI 2009, 2009

On-line adaptation and Bayesian detection of environmental changes based on a macroscopic time evolution system.

Proceedings of the IEEE International Conference on Acoustics, 2009

A unified view for discriminative objective functions based on negative exponential of difference measure between strings.

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches.

Proceedings of the IEEE International Conference on Acoustics, 2008

Combined static and dynamic variance adaptation for efficient interconnection of speech enhancement pre-processor with speech recognizer.

Marc Delcroix

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Incremental Adaptation Based on a Macroscopic Time Evolution System.

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition.

Atsushi Sako

IEEE Trans. Speech Audio Process., 2006

Speech Recognition Based on Student's t-Distribution Derived from Total Bayesian Framework.
