Tomohiro Nakatani

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multiplicative Updates and Joint Diagonalization Based Acceleration for Under-Determined BSS Using a Full-Rank Spatial Covariance Model.

Proceedings of the 2018 IEEE Global Conference on Signal and Information Processing, 2018

Noisy cGMM: Complex Gaussian Mixture Model with Non-Sparse Noise Model for Joint Source Separation and Denoising.

Proceedings of the 26th European Signal Processing Conference, 2018

FastFCA: Joint Diagonalization Based Acceleration of Audio Source Separation Using a Full-Rank Spatial Covariance Model.

Proceedings of the 26th European Signal Processing Conference, 2018

Feature-Based Learning Hidden Unit Contributions for Domain Adaptation of RNN-LMs.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Factorised Hidden Layer Based Domain Adaptation for Recurrent Neural Network Language Models.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Online MVDR Beamformer Based on Complex Gaussian Mixture Model With Spatial Prior for Noise Robust ASR.

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Integration of Spatial Cue-Based Noise Reduction and Speech Model-Based Source Restoration for Real Time Speech Enhancement.

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Speaker-Aware Neural Network Based Beamformer for Speaker Extraction in Speech Mixtures.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Uncertainty Decoding with Adaptive Sampling for Noise Robust DNN-Based Acoustic Modeling.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Unfolded Deep Recurrent Convolutional Neural Network with Jump Ahead Connections for Acoustic Modeling.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Neural Network-Based Spectrum Estimation for Online WPE Dereverberation.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Forward-Backward Convolutional LSTM for Acoustic Modeling.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Deep Clustering-Based Beamforming for Separation with Unknown Number of Sources.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Feedback connection for deep neural network-based acoustic modeling.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Cumulative moving averaged bottleneck speaker vectors for online speaker adaptation of CNN-based acoustic models.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Integrating DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Deep mixture density network for statistical model-based feature enhancement.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Probabilistic spatial dictionary based online adaptive beamforming for meeting recognition in noisy and reverberant environments.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Online environmental adaptation of CNN-based acoustic models using spatial diffuseness features.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Unsupervised utterance-wise beamformer estimation with speech recognition-level criterion.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming.

Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

Data-driven and physical model-based designs of probabilistic spatial dictionary for online meeting diarization and adaptive beamforming.

Proceedings of the 25th European Signal Processing Conference, 2017

Learning speaker representation for neural network based multichannel speaker extraction.

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Adversarial training for data-driven speech enhancement without parallel corpus.

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Exploiting imbalanced textual and acoustic data for training prosodically-enhanced RNNLMs.

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques.

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

Multichannel Speech Enhancement Approaches to DNN-Based Far-Field Speech Recognition.

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2016

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research.

EURASIP J. Adv. Signal Process., 2016

Differenced maximum mutual information criterion for robust unsupervised acoustic model adaptation.

Comput. Speech Lang., 2016

Sparseness-based multichannel nonnegative matrix factorization for blind source separation.

Takuya Higuchi

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Modeling audio directional statistics using a probabilistic spatial dictionary for speaker diarization in real meetings.

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Speech Intelligibility Prediction Based on the Envelope Power Spectrum Model with the Dynamic Compressive Gammachirp Auditory Filterbank.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Factorized Linear Input Network for Acoustic Model Adaptation in Noisy Conditions.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Optimization of Speech Enhancement Front-End with Speech Recognition-Level Criterion.

Takuya Higuchi

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Noise robust speech recognition using recent developments in neural networks for computer vision.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A generative-discriminative hybrid approach to multi-channel noise reduction for robust automatic speech recognition.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Real-time integration of statistical model-based speech enhancement with unsupervised noise PSD estimation using microphone array.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Modeling audio directional statistics using a complex Bingham mixture model for blind source extraction from diffuse noise.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multi-pass feature enhancement based on generative-discriminative hybrid approach for noise robust speech recognition.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Spatial correlation model based observation vector clustering and MVDR beamforming for meeting recognition.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Reverberation-robust underdetermined source separation with non-negative tensor double deconvolution.

Proceedings of the 24th European Signal Processing Conference, 2016

Complex angular central Gaussian mixture model for directional statistics in mask-based microphone array signal processing.

Proceedings of the 24th European Signal Processing Conference, 2016

2015

Blind Suppression of Nonstationary Diffuse Acoustic Noise Based on Spatial Covariance Matrix Decomposition.

J. Signal Process. Syst., 2015

Acoustic Event Detection in Speech Overlapping Scenarios Based on High-Resolution Spectral Input and Deep Learning.

Miquel Espi

IEICE Trans. Inf. Syst., 2015

Strategies for distant speech recognition in reverberant environments.

EURASIP J. Adv. Signal Process., 2015

Exploiting spectro-temporal locality in deep learning based acoustic event detection.

EURASIP J. Audio Speech Music. Process., 2015

Robust i-vector extraction for neural network adaptation in noisy environment.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Text-informed speech enhancement with deep neural networks.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Feature extraction strategies in deep learning based acoustic event detection.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Far-field speech recognition using CNN-DNN-HMM with convolution in time.

Shigeki Karita

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Modeling inter-node acoustic dependencies with Restricted Boltzmann Machine for distributed microphone array based BSS.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Feature enhancement based on generative-discriminative hybrid approach with GMMs and DNNs for noise robust speech recognition.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Context adaptive deep neural networks for fast acoustic model adaptation.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Exploring multi-channel features for denoising-autoencoder-based speech enhancement.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Permutation-free clustering of relative transfer function features for blind source separation.

Proceedings of the 23rd European Signal Processing Conference, 2015

The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays.

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Relaxed disjointness based clustering for joint blind source separation and dereverberation.

Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

Fast segment search for corpus-based speech enhancement based on speech recognition technology.

Proceedings of the IEEE International Conference on Acoustics, 2014

Probabilistic integration of diffuse noise suppression and dereverberation.

Proceedings of the IEEE International Conference on Acoustics, 2014

Unsupervised non-parametric Bayesian modeling of non-stationary noise for model-based noise suppression.

Yotaro Kubo

Proceedings of the IEEE International Conference on Acoustics, 2014

Spectrogram patch based acoustic event detection and classification in speech overlapping conditions.

Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition.

Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013

Noise Model Transfer: Novel Approach to Robustness Against Nonstationary Noise.

IEEE Trans. Speech Audio Process., 2013

A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction.

IEEE Trans. Speech Audio Process., 2013

Dominance Based Integration of Spatial and Spectral Features for Speech Enhancement.

IEEE ACM Trans. Audio Speech Lang. Process., 2013

Cluster-based dynamic variance adaptation for interconnecting speech enhancement pre-processor and speech recognizer.

Comput. Speech Lang., 2013

Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds.

Comput. Speech Lang., 2013

The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Source number estimation based on clustering of speech activity sequences for microphone array processing.

Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Microphone-location dependent mask estimation for BSS using spatially distributed asynchronous microphones.

Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2013

On the robustness of distributed EM based BSS in asynchronous distributed microphone array scenarios.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Conditional emission densities for combining speech enhancement and recognition systems.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Blind source separation using spatially distributed microphones based on microphone-location dependent source activities.

Mehrez Souden

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Model-based noise suppression using unsupervised estimation of hidden Markov model for non-stationary noise.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Is speech enhancement pre-processing still relevant when using deep neural networks for acoustic modeling?

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Formulation of the REMOS concept from an uncertainty decoding perspective.

Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Noise model transfer using affine transformation with application to large vocabulary reverberant speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2013

An integration of source location cues for speech clustering in distributed microphone arrays.

Mehrez Souden

Proceedings of the IEEE International Conference on Acoustics, 2013

Coupling beamforming with spatial and spectral feature based spectral enhancement and its application to meeting recognition.

Proceedings of the IEEE International Conference on Acoustics, 2013

Permutation-free convolutive blind source separation via full-band clustering based on frequency-independent source presence priors.

Proceedings of the IEEE International Conference on Acoustics, 2013

Unsupervised discriminative adaptation using differenced maximum mutual information based linear regression.

Proceedings of the IEEE International Conference on Acoustics, 2013

Dereverberation for reverberation-robust microphone arrays.

Proceedings of the 21st European Signal Processing Conference, 2013

2012

Generalization of Multi-Channel Linear Prediction Methods for Blind MIMO Impulse Response Shortening.

IEEE Trans. Speech Audio Process., 2012

Probabilistic Speaker Diarization With Bag-of-Words Representations of Speaker Angle Information.

IEEE Trans. Speech Audio Process., 2012

Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera.

IEEE Trans. Speech Audio Process., 2012

Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition.

IEEE Signal Process. Mag., 2012

Noise Power Spectral Density Tracking: A Maximum Likelihood Perspective.

IEEE Signal Process. Lett., 2012

Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection.

Speech Commun., 2012

Distributed microphone array processing for speech source separation with classifier fusion.

Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2012

Dynamic variance adaptation using differenced maximum mutual information.

Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Example-based speech enhancement with joint utilization of spatial, spectral & temporal cues of speech and noise.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Time-varying residual noise feature model estimation for multi-microphone speech recognition.

Emmanuel Ternon

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A multichannel MMSE-based framework for joint blind source separation and noise reduction.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

LogMax observation model with MFCC-based spectral prior for reduction of highly nonstationary ambient noise.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

New analytical update rule for TDOA inference for underdetermined BSS in noisy environments.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Introduction of speech log-spectral priors into dereverberation based on Itakura-Saito distance minimization.

Yasuaki Iwata

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Noise suppression with unsupervised joint speaker adaptation and noise mixture model estimation.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Discriminative feature transforms using differenced maximum mutual information.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Sparse vector factorization for underdetermined BSS using wrapped-phase GMM and source log-spectral prior.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Survey on approaches to speech recognition in reverberant environments.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

New analytical calculation and estimation of TDOA for underdetermined BSS in noisy environments.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization.

IEEE Trans. Speech Audio Process., 2011

A Multichannel Feature-Based Processing for Robust Speech Recognition.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Reduction of Highly Nonstationary Ambient Noise by Integrating Spectral and Locational Characteristics of Speech and Noise for Robust ASR.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Single Channel Dereverberation Using Example-Based Speech Enhancement with Uncertainty Decoding Technique.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Robust Estimation Method of Noise Mixture Model for Noise Suppression.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speech enhancement based on log spectral envelope model and harmonicity-derived spectral mask, and its coupling with feature compensation.

Proceedings of the IEEE International Conference on Acoustics, 2011

Joint unsupervised learning of hidden Markov source models and source location models for multichannel source separation.

Proceedings of the IEEE International Conference on Acoustics, 2011

Non-stationary noise estimation method based on bias-residual component decomposition for robust speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2011

Hybrid approach for multichannel source separation combining time-frequency mask with multi-channel Wiener filter.

Proceedings of the IEEE International Conference on Acoustics, 2011

Variance Compensation for Recognition of Reverberant Speech with Dereverberation Preprocessing.

Marc Delcroix

Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

2010

Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction.

IEEE Trans. Speech Audio Process., 2010

Introduction to the Special Issue on Processing Reverberant Speech: Methodologies and Applications.

IEEE Trans. Speech Audio Process., 2010

Noise robust voice activity detection based on periodic to aperiodic component ratio.

Speech Commun., 2010

Real-time meeting recognition and understanding using distant microphones and omni-directional camera.

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Cepstral smoothing of separated signals for underdetermined speech separation.

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Multichannel source separation based on source location cue with log-spectral shaping by hidden Markov source model.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Voice activity detection using frame-wise model re-estimation method based on Gaussian pruning with weight normalization.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure.

Proceedings of the IEEE International Conference on Acoustics, 2010

Music dereverberation using harmonic structure source model and Wiener filter.

Proceedings of the IEEE International Conference on Acoustics, 2010

Single channel source separation based on sparse source observation model with harmonic constraint.

Proceedings of the IEEE International Conference on Acoustics, 2010

Blind upmix of stereo music signals using multi-step linear prediction based reverberation extraction.

Proceedings of the IEEE International Conference on Acoustics, 2010

Simultaneous clustering of mixing and spectral model parameters for blind sparse source separation.

Hiroshi Sawada

Proceedings of the IEEE International Conference on Acoustics, 2010

Inverse Filtering for Speech Dereverberation Without the Use of Room Acoustics Information.

Proceedings of the Speech Dereverberation, 2010

2009

Integrated Speech Enhancement Method Using Noise Suppression and Dereverberation.

IEEE Trans. Speech Audio Process., 2009

Suppression of Late Reverberation Effect on Speech Signal Using Long-Term Multiple-step Linear Prediction.

IEEE Trans. Speech Audio Process., 2009

Static and Dynamic Variance Compensation for Recognition of Reverberant Speech With Dereverberation Preprocessing.

Marc Delcroix

IEEE Trans. Speech Audio Process., 2009

Development of Japanese infant speech database from longitudinal recordings.

Speech Commun., 2009

Statistical models for speech dereverberation.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

A probabilistic speaker clustering for DOA-based diarization.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

A study of mutual front-end processing method based on statistical model for noise robust speech recognition.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Stereo Source Separation and Source Counting with MAP Estimation with Dirichlet Prior Considering Spatial Aliasing Problem.

Proceedings of the Independent Component Analysis and Signal Separation, 2009

A speaker diarization method based on the probabilistic fusion of audio-visual location information.

Proceedings of the 11th International Conference on Multimodal Interfaces, 2009

Adaptive dereverberation of speech signals with speaker-position change detection.

Proceedings of the IEEE International Conference on Acoustics, 2009

Real-time speech enhancement in noisy reverberant multi-talker environments based on a location-independent room acoustics model.

Proceedings of the IEEE International Conference on Acoustics, 2009

Robust speech dereverberation based on non-negativity and sparse nature of speech spectrograms.

Hirokazu Kameoka

Proceedings of the IEEE International Conference on Acoustics, 2009

Blind sparse source separation for unknown number of sources using Gaussian mixture model fitting with Dirichlet prior.

Proceedings of the IEEE International Conference on Acoustics, 2009

Fast algorithm for conditional separation and dereverberation.

Proceedings of the 17th European Signal Processing Conference, 2009

2008

Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model.

IEEE Trans. Speech Audio Process., 2008

A method for fundamental frequency estimation and voicing decision: Application to infant utterances recorded in real acoustical environments.

Speech Commun., 2008

Missing feature speech recognition in a meeting situation with maximum SNR beamforming.

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Study of integration of statistical model-based voice activity detection and noise suppression.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Maximum likelihood approach to speech enhancement for noisy reverberant signals.

Proceedings of the IEEE International Conference on Acoustics, 2008

Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation.

Proceedings of the IEEE International Conference on Acoustics, 2008

A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme.

Proceedings of the IEEE International Conference on Acoustics, 2008

Combined static and dynamic variance adaptation for efficient interconnection of speech enhancement pre-processor with speech recognizer.

Marc Delcroix

Proceedings of the IEEE International Conference on Acoustics, 2008

An integrated method for blind separation and dereverberation of convolutive audio mixtures.

Proceedings of the 2008 16th European Signal Processing Conference, 2008

Principles and applications of dereverberation for noisy and reverberant audio signals.

Proceedings of the 42nd Asilomar Conference on Signals, Systems and Computers, 2008

2007

Harmonicity-Based Blind Dereverberation for Single-Channel Speech Signals.

IEEE Trans. Speech Audio Process., 2007

Robust blind dereverberation of speech signals based on characteristics of short-time speech segments.

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Joint Source-Channel Modeling and Estimation for Speech Dereverberation.

Biing-Hwang Juang

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Multi-step linear prediction based speech dereverberation in noisy reverberant environment.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Two-Microphone Voice Activity Detection Based on the Homogeneity of the Direction of Arrival Estimates.

Proceedings of the IEEE International Conference on Acoustics, 2007

Study on Speech Dereverberation with Autocorrelation Codebook.

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition.

Speech Commun., 2006

Blind dereverberation of monaural speech signals based on harmonic structure.

Syst. Comput. Jpn., 2006

Study of noise robust voice activity detection based on periodic component to aperiodic component ratio.

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Speech Dereverberation Based on Probabilistic Models of Source and Room Acoustics.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Spectral Subtraction Steered by Multi-Step Forward Linear Prediction For Single Channel Speech Dereverberation.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Harmonicity Based Dereverberation for Improving Automatic Speech Recognition Performance and Speech Intelligibility.

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005

Efficient blind dereverberation framework for automatic speech recognition.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Fast Estimation of a Precise Dereverberation Filter based on Speech Harmonicity.

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Speech Signal Analysis with Exponential Autoregressive Model.

Hiroko Kato Solvang

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Automatic Sound-Imitation Word Recognition from Environmental Sounds Focusing on Ambiguity Problem in Determining Phonemes.

Proceedings of the PRICAI 2004: Trends in Artificial Intelligence, 2004

Harmonicity based blind dereverberation with time warping.

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Harmonicity based monaural speech dereverberation with time warping and F0 adaptive window.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Improving automatic speech recognition performance and speech intelligibility with harmonicity based dereverberation.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Improvement in robustness of speech feature extraction method using sub-band based periodicity and aperiodicity decomposition.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Developmental changes in voiced-segment ratio for Japanese infants and parents.

Shigeaki Amano

Tadahisa Kondo

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

One Microphone Blind Dereverberation Based on Quasi-periodicity of Speech Signals.

Proceedings of the Advances in Neural Information Processing Systems 16, 2003

Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Dominance spectrum based V/UV classification and F0 estimation.

Toshio Irino

Parham Zolfaghari

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Blind dereverberation of single channel speech signal based on harmonic structure.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Robust fundamental frequency estimation against background noise and spectral distortion.

Toshio Irino

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Evaluation of a speech recognition / generation method based on HMM and STRAIGHT.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

1999

Listening to two simultaneous speeches.

Speech Commun., 1999

Harmonic sound stream segregation using localization and its application to speech stream segregation.

Speech Commun., 1999

1998

Sound Ontology for Computational Auditory Scene Analysis.

Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

1997

Understanding Three Simultaneous Speeches.

Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

1996

A new speech enhancement: speech stream segregation.

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Localization by harmonic structure and its application to harmonic sound stream segregation.

Masataka Goto

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Interfacing Sound Stream Segregation to Automatic Speech Recognition - Preliminary Results on Listening to Several Sounds Simultaneously.

Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, 1996

1995

Residue-Driven Architecture for Computational Auditory Scene Analysis.

Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

A computational model of sound stream segregation with multi-agent paradigm.
