Shinji Watanabe

According to our database, Shinji Watanabe authored at least 209 papers between 1995 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
Evolution-Strategy-Based Automation of System Development for High-Performance Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Dry, Focus, and Transcribe: End-to-End Integration of Dereverberation, Beamforming, and ASR.
CoRR, 2019

Massively Multilingual Adversarial Speech Recognition.
CoRR, 2019

2018
Low Resource Multi-modal Data Augmentation for End-to-end ASR.
CoRR, 2018

Stream attention-based multi-array end-to-end speech recognition.
CoRR, 2018

Multi-encoder multi-resolution framework for end-to-end speech recognition.
CoRR, 2018

Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognition.
CoRR, 2018

Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling.
CoRR, 2018

Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection.
CoRR, 2018

Analysis of Multilingual Sequence-to-Sequence speech recognition systems.
CoRR, 2018

Promising Accurate Prefix Boosting for sequence-to-sequence ASR.
CoRR, 2018

CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments.
CoRR, 2018

Building Corpora for Single-Channel Speech Separation Across Multiple Domains.
CoRR, 2018

Language model integration based on memory control for sequence to sequence speech recognition.
CoRR, 2018

Transfer learning of language-independent end-to-end ASR with language model fusion.
CoRR, 2018

End-to-End Monaural Multi-speaker ASR System without Pretraining.
CoRR, 2018

Cycle-consistency training for end-to-end speech recognition.
CoRR, 2018

Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling.
CoRR, 2018

Phasebook and Friends: Leveraging Discrete Representations for Source Separation.
CoRR, 2018

End-to-end Speech Recognition with Word-based RNN Language Models.
CoRR, 2018

Back-Translation-Style Data Augmentation for End-to-End ASR.
CoRR, 2018

Low-Resource Contextual Topic Identification on Speech.
CoRR, 2018

Weakly Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation.
CoRR, 2018

A Purely End-to-end System for Multi-speaker Speech Recognition.
CoRR, 2018

Multi-Head Decoder for End-to-End Speech Recognition.
CoRR, 2018

ESPnet: End-to-End Speech Processing Toolkit.
CoRR, 2018

The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines.
CoRR, 2018

Multi-Modal Data Augmentation for End-to-end ASR.
CoRR, 2018

Building state-of-the-art distant speech recognition using the CHiME-4 challenge with a setup of speech enhancement baseline.
CoRR, 2018

Student-Teacher Learning for BLSTM Mask-based Speech Enhancement.
CoRR, 2018

Low-Resource Contextual Topic Identification on Speech.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

End-to-end Speech Recognition with Word-Based RNN Language Models.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Back-Translation-Style Data Augmentation for end-to-end ASR.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018


Student-Teacher Learning for BLSTM Mask-based Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.
Proceedings of the Interspeech 2018, 2018

Multi-Modal Data Augmentation for End-to-end ASR.
Proceedings of the Interspeech 2018, 2018

Semi-Supervised End-to-End Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Multi-Head Decoder for End-to-End Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Effectiveness of Single-Channel BLSTM Enhancement for Language Identification.
Proceedings of the Interspeech 2018, 2018

Auxiliary Feature Based Adaptation of End-to-end ASR Systems.
Proceedings of the Interspeech 2018, 2018

Building State-of-the-art Distant Speech Recognition Using the CHiME-4 Challenge with a Setup of Speech Enhancement Baseline.
Proceedings of the Interspeech 2018, 2018

The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines.
Proceedings of the Interspeech 2018, 2018

End-to-End Multi-Speaker Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An End-to-End Language-Tracking Speech Recognizer for Mixed-Language Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speaker Adaptation for Multichannel End-to-End Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Purely End-to-End System for Multi-speaker Speech Recognition.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Duration-Controlled LSTM for Polyphonic Sound Event Detection.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

Hybrid CTC/Attention Architecture for End-to-End Speech Recognition.
J. Sel. Topics Signal Processing, 2017

Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming.
J. Sel. Topics Signal Processing, 2017

Prior-based Binary Masking and Discriminative Methods for Reverberant and Noisy Speech Recognition Using Distant Stereo Microphones.
JIP, 2017

An analysis of environment, microphone and data simulation mismatches in robust speech recognition.
Computer Speech & Language, 2017

Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend.
Computer Speech & Language, 2017

The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes.
Computer Speech & Language, 2017

Multi-microphone speech recognition in everyday environments.
Computer Speech & Language, 2017

Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition.
CoRR, 2017

Multichannel End-to-end Speech Recognition.
CoRR, 2017

Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM.
CoRR, 2017

Does speech enhancement work with end-to-end ASR objectives?: Experimental analysis of multichannel end-to-end ASR.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Coupled Initialization of Multi-Channel Non-Negative Matrix Factorization Based on Spatial and Spectral Information.
Proceedings of the Interspeech 2017, 2017

Semi-Supervised Learning of a Pronunciation Dictionary from Disjoint Phonemic Transcripts and Text.
Proceedings of the Interspeech 2017, 2017

Advances in Joint CTC-Attention Based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM.
Proceedings of the Interspeech 2017, 2017

Multichannel End-to-end Speech Recognition.
Proceedings of the 34th International Conference on Machine Learning, 2017

Student-teacher network learning with enhanced features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Deep long short-term memory adaptive beamforming networks for multichannel robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Joint CTC-attention based end-to-end speech recognition using multi-task learning.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Language independent end-to-end architecture for joint language identification and speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Composite embedding systems for ZeroSpeech2017 Track1.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Multi-level language modeling and decoding for open vocabulary end-to-end speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Joint CTC/attention decoding for end-to-end speech recognition.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Discriminative Beamforming with Phase-Aware Neural Networks for Speech Enhancement and Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

Toolkits for Robust Speech Processing.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

Preliminaries.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

Training Data Augmentation and Data Selection.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

Novel Deep Architectures in Speech Processing.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

Deep Recurrent Networks for Separation and Recognition of Single-Channel Speech in Nonstationary Background Audio.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

The CHiME Challenges: Robust Speech Recognition in Everyday Environments.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2016
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning.
CoRR, 2016

Single-Channel Multi-Speaker Separation using Deep Clustering.
CoRR, 2016

Automated structure discovery and parameter tuning of neural network language model based on evolution strategy.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Dialog state tracking with attention-based sequence-to-sequence learning.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training.
Proceedings of the Interspeech 2016, 2016

Single-Channel Multi-Speaker Separation Using Deep Clustering.
Proceedings of the Interspeech 2016, 2016

Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs.
Proceedings of the Interspeech 2016, 2016

Improved MVDR Beamforming Using Single-Channel Mask Prediction Networks.
Proceedings of the Interspeech 2016, 2016

Driver confusion status detection using recurrent neural networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Deep beamforming networks for multi-channel speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep unfolding for multichannel source separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Sequence summarizing neural network for speaker adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Minimum word error training of long short-term memory recurrent neural network language models for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep clustering: Discriminative embeddings for segmentation and separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

High-accuracy user identification using EEG biometrics.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016

Beamforming networks using spatial covariance features for far-field speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Effectiveness of dereverberation, feature transformation, discriminative training methods, and system combination approach for various reverberant environments.
EURASIP J. Adv. Sig. Proc., 2015

Deep clustering: Discriminative embeddings for segmentation and separation.
CoRR, 2015

Uncertainty training and decoding methods of deep neural networks based on stochastic representation of enhanced features.
Proceedings of the INTERSPEECH 2015, 2015

Efficient learning for spoken language understanding tasks with word embedding based pre-training.
Proceedings of the INTERSPEECH 2015, 2015

Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks.
Proceedings of the INTERSPEECH 2015, 2015

Robust speech processing using observation uncertainty and uncertainty propagation: session and paper overview.
Proceedings of the INTERSPEECH 2015, 2015

Uncertainty propagation through deep neural networks.
Proceedings of the INTERSPEECH 2015, 2015

Discriminative method for recurrent neural network language models.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Structure discovery of deep neural network based on evolutionary algorithms.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

Automation of system building for state-of-the-art large vocabulary speech recognition using evolution strategy.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Robust speech recognition in unknown reverberant and noisy conditions.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The MERL/SRI system for the 3rd CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Feature-space structural MAPLR with regression tree-based multiple transformation matrices for DNN.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Bayesian Speech and Language Processing
Cambridge University Press, ISBN: 9781107295360, 2015

2014
Structural Bayesian Linear Regression for Hidden Markov Models.
Signal Processing Systems, 2014

Research note: Residents' Assessment of Local Government Information Systems.
The Review of Socionetwork Strategies, 2014

Discriminative NMF and its application to single-channel source separation.
Proceedings of the INTERSPEECH 2014, 2014

Cost-level integration of statistical and rule-based dialog managers.
Proceedings of the INTERSPEECH 2014, 2014

Sequential maximum mutual information linear discriminant analysis for speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Recurrent deep neural networks for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Black box optimization for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Log-linear dialog manager.
Proceedings of the IEEE International Conference on Acoustics, 2014

Ensemble integration of calibrated speaker localization and statistical speech detection in domestic environments.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Sequence discriminative training for low-rank deep neural networks.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013
Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting.
IEEE Trans. Audio, Speech & Language Processing, 2013

Influence relation estimation based on lexical entrainment in conversation.
Speech Communication, 2013

Prior-shared feature and model space speaker adaptation by consistently employing map estimation.
Speech Communication, 2013

Are Depositors Aware of the Governance of their Banks?
The Review of Socionetwork Strategies, 2013

Training data selection with user's physical characteristics data for acceleration-based activity modeling.
Personal and Ubiquitous Computing, 2013

Cluster-based dynamic variance adaptation for interconnecting speech enhancement pre-processor and speech recognizer.
Computer Speech & Language, 2013

Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds.
Computer Speech & Language, 2013

A comprehensive map of the influenza A virus replication cycle.
BMC Systems Biology, 2013

Ensemble learning for speech enhancement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Blocked Gibbs sampling based multi-scale mixture model for speaker clustering on noisy data.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Discriminative training of acoustic models for system combination.
Proceedings of the INTERSPEECH 2013, 2013

Statistical Dialogue Management using Intention Dependency Graph.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Stereo-based feature enhancement using dictionary learning.
Proceedings of the IEEE International Conference on Acoustics, 2013

The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines.
Proceedings of the IEEE International Conference on Acoustics, 2013

Effectiveness of discriminative training and feature transformation for reverberated and noisy speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

A generalized discriminative training framework for system combination.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

High density and reliable packaging technology with Non Conductive Film for 3D/TSV.
Proceedings of the 2013 IEEE International 3D Systems Integration Conference (3DIC), 2013

2012
Statistical Voice Conversion Based on Noisy Channel Model.
IEEE Trans. Audio, Speech & Language Processing, 2012

Structural Classification Methods Based on Weighted Finite-State Transducers for Automatic Speech Recognition.
IEEE Trans. Audio, Speech & Language Processing, 2012

Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera.
IEEE Trans. Audio, Speech & Language Processing, 2012

Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection.
Speech Communication, 2012

Integrated network analysis reveals a novel role for the cell cycle in 2009 pandemic influenza virus-induced inflammation in macaque lungs.
BMC Systems Biology, 2012

Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model.
Proceedings of the INTERSPEECH 2012, 2012

Bag Of ARCS: New representation of speech segment features based on finite state machines.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

MFCC enhancement using joint corrupted and noise feature space for highly non-stationary noise environments.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Effect of dialog acts on word use in polylogue.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Basis vector orthogonalization for an improved kernel gradient matching pursuit method.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Decoding network optimization using minimum transition error training.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Noise suppression with unsupervised joint speaker adaptation and noise mixture model estimation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Discriminative feature transforms using differenced maximum mutual information.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Handling uncertain observations in unsupervised topic-mixture language model adaptation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Topic tracking language model for speech recognition.
Computer Speech & Language, 2011

Bayesian linear regression for Hidden Markov Model based on optimizing variational bounds.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Unsupervised Activity Recognition with User's Physical Characteristics Data.
Proceedings of the 15th IEEE International Symposium on Wearable Computers (ISWC 2011), 2011

Model Adaptation for Automatic Speech Recognition Based on Multiple Time Scale Evolution.
Proceedings of the INTERSPEECH 2011, 2011

Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model.
Proceedings of the INTERSPEECH 2011, 2011

Learning Influences from Word Use in Polylogue.
Proceedings of the INTERSPEECH 2011, 2011

A Robust Estimation Method of Noise Mixture Model for Noise Suppression.
Proceedings of the INTERSPEECH 2011, 2011

Gibbs sampling based Multi-scale Mixture Model for speaker clustering.
Proceedings of the IEEE International Conference on Acoustics, 2011

High accurate model-integration-based voice conversion using dynamic features and model structure optimization.
Proceedings of the IEEE International Conference on Acoustics, 2011

Subspace pursuit method for kernel-log-linear models.
Proceedings of the IEEE International Conference on Acoustics, 2011

Non-stationary noise estimation method based on bias-residual component decomposition for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Variance Compensation for Recognition of Reverberant Speech with Dereverberation Preprocessing.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

2010
Predictor-Corrector Adaptation by Using Time Evolution System With Macroscopic Time Scale.
IEEE Trans. Audio, Speech & Language Processing, 2010

A Sequential Pattern Classifier Based on Hidden Markov Kernel Machine and Its Application to Phoneme Classification.
J. Sel. Topics Signal Processing, 2010

Online Unsupervised Classification With Model Comparison in the Variational Bayes Framework for Voice Activity Detection.
J. Sel. Topics Signal Processing, 2010

Application of topic tracking model to language model adaptation and meeting analysis.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Real-time meeting recognition and understanding using distant microphones and omni-directional camera.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data.
Proceedings of the INTERSPEECH 2010, 2010

Probabilistic integration of joint density model and speaker model for voice conversion.
Proceedings of the INTERSPEECH 2010, 2010

A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination.
Proceedings of the INTERSPEECH 2010, 2010

Improvements of search error risk minimization in Viterbi beam search for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Voice activity detection using frame-wise model re-estimation method based on Gaussian pruning with weight normalization.
Proceedings of the INTERSPEECH 2010, 2010

Minimum Error Classification with geometric margin control.
Proceedings of the IEEE International Conference on Acoustics, 2010

A discriminative model for continuous speech recognition based on Weighted Finite State Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative training based on an integrated view of MPE and MMI in margin and error space.
Proceedings of the IEEE International Conference on Acoustics, 2010

Search error risk minimization in Viterbi beam search for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Using online model comparison in the Variational Bayes framework for online unsupervised Voice Activity Detection.
Proceedings of the IEEE International Conference on Acoustics, 2010

Fast similarity search on a large speech data set with neighborhood graph indexing.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Static and Dynamic Variance Compensation for Recognition of Reverberant Speech With Dereverberation Preprocessing.
IEEE Trans. Audio, Speech & Language Processing, 2009

Margin-space integration of MPE loss via differencing of MMI functionals for generalized error-weighted discriminative training.
Proceedings of the INTERSPEECH 2009, 2009

Stereo-input speech recognition using sparseness-based time-frequency masking in a reverberant environment.
Proceedings of the INTERSPEECH 2009, 2009

Topic Tracking Model for Analyzing Consumer Purchase Behavior.
Proceedings of the IJCAI 2009, 2009

On-line adaptation and Bayesian detection of environmental changes based on a macroscopic time evolution system.
Proceedings of the IEEE International Conference on Acoustics, 2009

A unified view for discriminative objective functions based on negative exponential of difference measure between strings.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches.
Proceedings of the IEEE International Conference on Acoustics, 2008

Combined static and dynamic variance adaptation for efficient interconnection of speech enhancement pre-processor with speech recognizer.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Incremental Adaptation Based on a Macroscopic Time Evolution System.
Proceedings of the IEEE International Conference on Acoustics, 2007

Performance Evaluation to Optimize the UMP System Focusing on Network Transmission Speed.
Proceedings of the Japan-China Joint Workshop on Frontier of Computer Science and Technology, 2007

2006
Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition.
IEEE Trans. Audio, Speech & Language Processing, 2006

Speech Recognition Based on Student's t-Distribution Derived from Total Bayesian Framework.
IEICE Transactions, 2006

Acoustic Model Adaptation Based on Coarse/Fine Training of Transfer Vectors Using Directional Statistics.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Selection of Shared-State Hidden Markov Model Structure Using Bayesian Criterion.
IEICE Transactions, 2005

Effects of Bayesian predictive classification using variational Bayesian posteriors for sparse training data in speech recognition.
Proceedings of the INTERSPEECH 2005, 2005

2004
Variational Bayesian estimation and clustering for speech recognition.
IEEE Trans. Speech and Audio Processing, 2004

Acoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task.
Proceedings of the INTERSPEECH 2004, 2004

Bayesian modelling of the speech spectrum using mixture of Gaussians.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Automatic determination of acoustic model topology using variational Bayesian estimation and clustering.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Application of variational Bayesian estimation and clustering to acoustic model adaptation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Application of Variational Bayesian Approach to Speech Recognition.
Advances in Neural Information Processing Systems 15, 2002

Constructing shared-state hidden Markov models based on a Bayesian approach.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

1996
Computerized analysis for classification of heart diseases in echocardiographic images.
Proceedings of the 1996 International Conference on Image Processing, 1996

A new stabilized zero-crossing representation in the wavelet transform domain and its application to image processing.
Proceedings of the 8th European Signal Processing Conference, 1996

1995
A new stabilized zero-crossing representation in the wavelet transform domain and signal reconstruction.
Proceedings of the 1995 International Conference on Image Processing, 1995

