Takuya Yoshioka

According to our database, Takuya Yoshioka authored at least 101 papers between 2004 and 2021.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2021
Continuous Speech Separation with Ad Hoc Microphone Arrays.
CoRR, 2021

Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings.
CoRR, 2021

2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription.
CoRR, 2020

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR.
CoRR, 2020

Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis.
CoRR, 2020

Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer.
CoRR, 2020

Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020.
CoRR, 2020

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings.
CoRR, 2020

Continuous speech separation: dataset and analysis.
CoRR, 2020

An End-to-End Architecture of Online Multi-Channel Speech Separation.
Proceedings of the Interspeech 2020, 2020

Neural Speech Separation Using Spatially Distributed Microphones.
Proceedings of the Interspeech 2020, 2020

Serialized Output Training for End-to-End Overlapped Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers.
Proceedings of the Interspeech 2020, 2020

Dual-Path RNN: Efficient Long Sequence Modeling for Time-Domain Single-Channel Speech Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Continuous Speech Separation: Dataset and Analysis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Meeting Transcription Using Virtual Microphone Arrays.
CoRR, 2019

Meeting Transcription Using Asynchronous Distant Microphones.
Proceedings of the Interspeech 2019, 2019

Low-latency Speaker-independent Continuous Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Single-channel Speech Extraction Using Speaker Inventory and Attention Network.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Separation Using Speaker Inventory.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Dover: A Method for Combining Diarization Outputs.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks.
Proceedings of the Interspeech 2018, 2018

Investigations on Data Augmentation and Loss Functions for Deep Learning Based Speech-Background Separation.
Proceedings of the Interspeech 2018, 2018

Multi-Microphone Neural Speech Separation for Far-Field Multi-Talker Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Efficient Integration of Fixed Beamformers and Speech Separation Networks for Multi-Channel Far-Field Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Online MVDR Beamformer Based on Complex Gaussian Mixture Model With Spatial Prior for Noise Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Unsupervised utterance-wise beamformer estimation with speech recognition-level criterion.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

Cracking the cocktail party problem by multi-beam deep attractor network.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Multichannel Speech Enhancement Approaches to DNN-Based Far-Field Speech Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research.
EURASIP J. Adv. Signal Process., 2016

Sparseness-based multichannel nonnegative matrix factorization for blind source separation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement.
Proceedings of the Interspeech 2016, 2016

Optimization of Speech Enhancement Front-End with Speech Recognition-Level Criterion.
Proceedings of the Interspeech 2016, 2016

Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models.
Proceedings of the Interspeech 2016, 2016

Noise robust speech recognition using recent developments in neural networks for computer vision.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Strategies for distant speech recognition in reverberant environments.
EURASIP J. Adv. Signal Process., 2015

Environmentally robust ASR front-end for deep neural network acoustic models.
Comput. Speech Lang., 2015

Robust i-vector extraction for neural network adaptation in noisy environment.
Proceedings of the INTERSPEECH 2015, 2015

Far-field speech recognition using CNN-DNN-HMM with convolution in time.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Multichannel sound source dereverberation and separation for arbitrary number of sources based on Bayesian nonparametrics.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Relaxed disjointness based clustering for joint blind source separation and dereverberation.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

Investigation of unsupervised adaptation of DNN acoustic models with filter bank input.
Proceedings of the IEEE International Conference on Acoustics, 2014

Impact of single-microphone dereverberation on DNN-based meeting transcription systems.
Proceedings of the IEEE International Conference on Acoustics, 2014

Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013
Noise Model Transfer: Novel Approach to Robustness Against Nonstationary Noise.
IEEE Trans. Speech Audio Process., 2013

Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting.
IEEE Trans. Speech Audio Process., 2013

Dominance Based Integration of Spatial and Spectral Features for Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2013

Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds.
Comput. Speech Lang., 2013

The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Conditional emission densities for combining speech enhancement and recognition systems.
Proceedings of the INTERSPEECH 2013, 2013

Formulation of the REMOS concept from an uncertainty decoding perspective.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Noise model transfer using affine transformation with application to large vocabulary reverberant speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Coupling beamforming with spatial and spectral feature based spectral enhancement and its application to meeting recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Dereverberation for reverberation-robust microphone arrays.
Proceedings of the 21st European Signal Processing Conference, 2013

2012
Generalization of Multi-Channel Linear Prediction Methods for Blind MIMO Impulse Response Shortening.
IEEE Trans. Speech Audio Process., 2012

Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera.
IEEE Trans. Speech Audio Process., 2012

Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition.
IEEE Signal Process. Mag., 2012

Noise Power Spectral Density Tracking: A Maximum Likelihood Perspective.
IEEE Signal Process. Lett., 2012

Log-normal matrix factorization with application to speech-music separation.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Time-varying residual noise feature model estimation for multi-microphone speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

MFCC enhancement using joint corrupted and noise feature space for highly non-stationary noise environments.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

LogMax observation model with MFCC-based spectral prior for reduction of highly nonstationary ambient noise.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Survey on approaches to speech recognition in reverberant environments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization.
IEEE Trans. Speech Audio Process., 2011

Reduction of Highly Nonstationary Ambient Noise by Integrating Spectral and Locational Characteristics of Speech and Noise for Robust ASR.
Proceedings of the INTERSPEECH 2011, 2011

Speech enhancement based on log spectral envelope model and harmonicity-derived spectral mask, and its coupling with feature compensation.
Proceedings of the IEEE International Conference on Acoustics, 2011

I-Divergence-based dereverberation method with auxiliary function approach.
Proceedings of the IEEE International Conference on Acoustics, 2011

Joint unsupervised learning of hidden Markov source models and source location models for multichannel source separation.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction.
IEEE Trans. Speech Audio Process., 2010

Real-time meeting recognition and understanding using distant microphones and omni-directional camera.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Multichannel source separation based on source location cue with log-spectral shaping by hidden Markov source model.
Proceedings of the INTERSPEECH 2010, 2010

Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure.
Proceedings of the IEEE International Conference on Acoustics, 2010

Music dereverberation using harmonic structure source model and Wiener filter.
Proceedings of the IEEE International Conference on Acoustics, 2010

Statistical Model of Speech Signals Based on Composite Autoregressive System with Application to Blind Source Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

Inverse Filtering for Speech Dereverberation Without the Use of Room Acoustics Information.
Proceedings of the Speech Dereverberation., 2010

2009
Integrated Speech Enhancement Method Using Noise Suppression and Dereverberation.
IEEE Trans. Speech Audio Process., 2009

Statistical models for speech dereverberation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Adaptive dereverberation of speech signals with speaker-position change detection.
Proceedings of the IEEE International Conference on Acoustics, 2009

Real-time speech enhancement in noisy reverberant multi-talker environments based on a location-independent room acoustics model.
Proceedings of the IEEE International Conference on Acoustics, 2009

Robust speech dereverberation based on non-negativity and sparse nature of speech spectrograms.
Proceedings of the IEEE International Conference on Acoustics, 2009

Fast algorithm for conditional separation and dereverberation.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model.
IEEE Trans. Speech Audio Process., 2008

Maximum likelihood approach to speech enhancement for noisy reverberant signals.
Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptive suppression of non-stationary noise by using the variational Bayesian method.
Proceedings of the IEEE International Conference on Acoustics, 2008

Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation.
Proceedings of the IEEE International Conference on Acoustics, 2008

An integrated method for blind separation and dereverberation of convolutive audio mixtures.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
Dereverberation by Using Time-Variant Nature of Speech Production System.
EURASIP J. Adv. Signal Process., 2007

Robust blind dereverberation of speech signals based on characteristics of short-time speech segments.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Study on Speech Dereverberation with Autocorrelation Codebook.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2006

Robust decomposition of inverse filter of channel and prediction error filter of speech signal for dereverberation.
Proceedings of the 14th European Signal Processing Conference, 2006

2004
Automatic Chord Transcription with Concurrent Recognition of Chord Symbols and Boundaries.
Proceedings of the ISMIR 2004, 2004

