Kiyohiro Shikano

According to our database1, Kiyohiro Shikano authored at least 312 papers between 1976 and 2014.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2007, "For contributions to speech recognition, dialog systems, voice conversion, and acoustic field realization".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2014
Alaryngeal Speech Enhancement Based on One-to-Many Eigenvoice Conversion.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

Musical-noise-free blind speech extraction integrating microphone array and iterative spectral subtraction.
Signal Processing, 2014

Music Signal Separation Based on Supervised Nonnegative Matrix Factorization with Orthogonality and Maximum-Divergence Penalties.
IEICE Transactions, 2014

Topic Classification of Spoken Inquiries Using Transductive Support Vector Machine.
Proceedings of the Natural Interaction with Robots, 2014

Evaluation of Invalid Input Discrimination Using Bag-of-Words for Speech-Oriented Guidance System.
Proceedings of the Natural Interaction with Robots, 2014

Development of a Toolkit Handling Multiple Speech-Oriented Guidance Agents for Mobile Applications.
Proceedings of the Natural Interaction with Robots, 2014

2013
Comparison of Methods for Topic Classification of Spoken Inquiries.
JIP, 2013

Robust music signal separation based on supervised nonnegative matrix factorization with prevention of basis sharing.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2013

Musical noise analysis for Bayesian minimum mean-square error speech amplitude estimators based on higher-order statistics.
Proceedings of the INTERSPEECH 2013, 2013

Music signal separation by supervised nonnegative matrix factorization with basis deformation.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Superresolution-based stereo signal separation via supervised nonnegative matrix factorization.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Toward musical-noise-free blind speech extraction: Concept and its applications.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Semi-blind algorithm for joint noise suppression and dereverberation based on higher-order statistics and acoustic model likelihood.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Statistical Voice Conversion Techniques for Body-Conducted Unvoiced Speech Enhancement.
IEEE Trans. Audio, Speech & Language Processing, 2012

Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction.
IEEE Trans. Audio, Speech & Language Processing, 2012

Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech.
Speech Communication, 2012

Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator.
IEICE Transactions, 2012

Theoretical Analysis of Amounts of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array.
IEICE Transactions, 2012

Musical-Noise-Free Blind Speech Extraction Using ICA-Based Noise Estimation with Channel Selection.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

Theoretical Analysis of Musical Noise Generation in Noise Reduction Methods with Decision-Directed a Priori SNR Estimator.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

Blind speech extraction for Non-Audible Murmur speech with speaker's movement noise.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2012

Musical-noise-free blind speech extraction using ICA-based noise estimation and iterative spectral subtraction.
Proceedings of the 11th International Conference on Information Science, 2012

Spoken Inquiry Discrimination Using Bag-of-Words for Speech-Oriented Guidance System.
Proceedings of the INTERSPEECH 2012, 2012

Evaluation of Many-to-Many Alignment Algorithm by Automatic Pronunciation Annotation Using Web Text Mining.
Proceedings of the INTERSPEECH 2012, 2012

Statistical approach to voice quality control in esophageal speech enhancement.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Speech kurtosis estimation from observed noisy signal based on generalized Gaussian distribution prior and additivity of cumulants.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Musical-noise-free speech enhancement: Theory and evaluation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Sound-localization-preserved binaural MMSE STSA estimator with explicit and implicit binaural cues.
Proceedings of the 20th European Signal Processing Conference, 2012

Object-based stereo up-mixer for wave field synthesis based on spatial information clustering.
Proceedings of the 20th European Signal Processing Conference, 2012

Real-time semi-blind speech extraction with speaker direction tracking on Kinect.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Response generation based on statistical machine translation for speech-oriented guidance system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Comparative study on various noise reduction methods with decision-directed a priori SNR estimator via higher-order statistics.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Optimization scheme of joint noise suppression and dereverberation based on higher-order statistics.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Musical Noise Controllable Algorithm of Channelwise Spectral Subtraction and Adaptive Beamforming Based on Higher Order Statistics.
IEEE Trans. Audio, Speech & Language Processing, 2011

Theoretical Analysis of Musical Noise in Generalized Spectral Subtraction Based on Higher Order Statistics.
IEEE Trans. Audio, Speech & Language Processing, 2011

Sound Field Reproduction by Wavefront Synthesis Using Directly Aligned Multi Point Control.
IEICE Transactions, 2011

Semi-blind speech extraction for robot using visual information and noise statistics.
Proceedings of the 2011 IEEE International Symposium on Signal Processing and Information Technology, 2011

Blind Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator.
Proceedings of the INTERSPEECH 2011, 2011

Theoretical Analysis of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array.
Proceedings of the INTERSPEECH 2011, 2011

Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation.
Proceedings of the INTERSPEECH 2011, 2011

Automatic musical thumbnailing based on audio object localization and its evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Robust sound field reproduction integrating multi-point sound field control and wave field synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011

Theoretical analysis of musical noise in Wiener filtering family via higher-order statistics.
Proceedings of the IEEE International Conference on Acoustics, 2011

An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques.
Proceedings of the IEEE International Conference on Acoustics, 2011

Acoustic model training for non-audible murmur recognition using transformed normal speech data.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Analysis and Recognition of NAM Speech Using HMM Distances and Visual Information.
IEEE Trans. Audio, Speech & Language Processing, 2010

Silent-speech enhancement using body-conducted vocal-tract resonance signals.
Speech Communication, 2010

Improvements of the One-to-Many Eigenvoice Conversion System.
IEICE Transactions, 2010

Adaptive Training for Voice Conversion Based on Eigenvoices.
IEICE Transactions, 2010

Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion.
IEICE Transactions, 2010

Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models.
IEICE Transactions, 2010

Musical-Noise Analysis in Methods of Integrating Microphone Array and Spectral Subtraction Based on Higher-Order Statistics.
EURASIP J. Adv. Sig. Proc., 2010

Linear transformation approaches to many-to-one voice conversion.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Improvement of speech recognition performance for spoken-oriented robot dialog system using end-fire array.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Comparison of methods for topic classification in a speech-oriented guidance system.
Proceedings of the INTERSPEECH 2010, 2010

Adaptive voice-quality control based on one-to-many eigenvoice conversion.
Proceedings of the INTERSPEECH 2010, 2010

The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion.
Proceedings of the INTERSPEECH 2010, 2010

MMSE STSA estimator with nonstationary noise estimation based on ICA for high-quality speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2010

Non-parallel training for many-to-many eigenvoice conversion.
Proceedings of the IEEE International Conference on Acoustics, 2010

Speech enhancement in presence of diffuse background noise: Why using blind signal extraction?
Proceedings of the IEEE International Conference on Acoustics, 2010

Complex Newton algorithm for blind signal extraction of speech in diffuse noise.
Proceedings of the IEEE International Conference on Acoustics, 2010

Statistical approach to enhancing esophageal speech based on Gaussian mixture models.
Proceedings of the IEEE International Conference on Acoustics, 2010

Blind Speech Extraction Combining Generalized MMSE STSA Estimator and ICA-Based Noise and Speech Probability Density Function Estimations.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

Theoretical analysis of musical noise in generalized spectral subtraction: Why should not use power/amplitude subtraction?
Proceedings of the 18th European Signal Processing Conference, 2010

Blind signal extraction based joint suppression of diffuse background noise and late reverberation.
Proceedings of the 18th European Signal Processing Conference, 2010

Musical noise controllable algorithm of channelwise spectral subtraction and beamforming based on higher-order statistics criterion.
Proceedings of the 2nd International Workshop on Cognitive Information Processing, 2010

2009
Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment.
IEEE Trans. Audio, Speech & Language Processing, 2009

Techniques in rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics.
Speech Communication, 2009

Enhancement of speech signals separated from their convolutive mixture by FDICA algorithm.
Digital Signal Processing, 2009

Temporal quantization of spatial information using directional clustering for multichannel audio coding.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Semi-blind suppression of internal noise for hands-free robot spoken dialog system.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Technologies for processing body-conducted speech detected with non-audible murmur microphone.
Proceedings of the INTERSPEECH 2009, 2009

Many-to-many eigenvoice conversion with reference voice.
Proceedings of the INTERSPEECH 2009, 2009

Electrolaryngeal speech enhancement based on statistical voice conversion.
Proceedings of the INTERSPEECH 2009, 2009

Target Speech Enhancement in Presence of Jammer and Diffuse Background Noise.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

Musical noise generation analysis for noise reduction methods based on spectral subtraction and MMSE STSA estimation.
Proceedings of the IEEE International Conference on Acoustics, 2009

Voice conversion for various types of body transmitted speech.
Proceedings of the IEEE International Conference on Acoustics, 2009

Musical noise analysis based on higher order statistics for microphone array and nonlinear signal processing.
Proceedings of the IEEE International Conference on Acoustics, 2009

Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system.
Proceedings of the IEEE International Conference on Acoustics, 2009

Hands-free speech recognition challenge for real-world speech dialogue systems.
Proceedings of the IEEE International Conference on Acoustics, 2009

Acoustic compensation methods for body transmitted speech conversion.
Proceedings of the IEEE International Conference on Acoustics, 2009

Kernel-based nonlinear independent component analysis for underdetermined blind source separation.
Proceedings of the IEEE International Conference on Acoustics, 2009

Multiple ICA-based real-time blind source extraction applied to handy size microphone.
Proceedings of the IEEE International Conference on Acoustics, 2009

Enhanced wiener post-processing based on partial projection back of the blind signal separation noise estimate.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Rapid Compensation of Temperature Fluctuation Effect for Multichannel Sound Field Reproduction System.
IEICE Transactions, 2008

Fast Convergence Blind Source Separation Using Frequency Subband Interpolation by Null Beamforming.
IEICE Transactions, 2008

Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method.
IEICE Transactions, 2008

Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training.
IEICE Transactions, 2008

Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System.
IEICE Transactions, 2008

Language model for the web search task in a spoken dialogue system for children.
Proceedings of the 1st Workshop on Child, Computer and Interaction, 2008

Real-time implementation of blind spatial subtraction array for hands-free robot spoken dialogue system.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

An improved permutation solver for blind signal separation based front-ends in robot audition.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Maximum a posteriori adaptation for many-to-one eigenvoice conversion.
Proceedings of the INTERSPEECH 2008, 2008

Question and answer database optimization using speech recognition results.
Proceedings of the INTERSPEECH 2008, 2008

Development and evaluation of hands-free spoken dialogue system for railway station guidance.
Proceedings of the INTERSPEECH 2008, 2008

Speaker verification with non-audible murmur segments by combining global alignment kernel and penalized logistic regression machine.
Proceedings of the INTERSPEECH 2008, 2008

An improved one-to-many eigenvoice conversion system.
Proceedings of the INTERSPEECH 2008, 2008

Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments.
Proceedings of the INTERSPEECH 2008, 2008

Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory.
Proceedings of the INTERSPEECH 2008, 2008

Rapid unsupervised speaker adaptation robust in reverberant environment conditions.
Proceedings of the INTERSPEECH 2008, 2008

Hybrid structure of inverse filtering and DOA-parameterized wavefront synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2008

Source-oriented localization control of stereo audio signals based on blind source separation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Distant talking robust speech recognition using late reflection components of room impulse response.
Proceedings of the IEEE International Conference on Acoustics, 2008

Frequency domain semi-blind signal separation: application to the rejection of internal noises.
Proceedings of the IEEE International Conference on Acoustics, 2008

Extension of score function difference for frequency domain blind source separation.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics.
IEICE Transactions, 2007

Interface for Barge-in Free Spoken Dialogue System Based on Sound Field Reproduction and Microphone Array.
EURASIP J. Adv. Sig. Proc., 2007

Unvoiced Speech Recognition Using Tissue-Conductive Acoustic Sensor.
EURASIP J. Adv. Sig. Proc., 2007

An evaluation of many-to-one voice conversion algorithms with pre-stored speaker data sets.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Regression approaches to voice quality controll based on one-to-many eigenvoice conversion.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Voice activity detection applied to hands-free spoken dialogue robot based on decoding using acoustic and language model.
Proceedings of the 1st International Conference on Robot Communication and Coordination, 2007

Real-Time Continuous Speech Recognition System on SH-4A Microprocessor.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

Robust spatial subtraction array with independent component analysis for speech enhancement.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Noise-robust hands-free speech recognition using SIMO-model-based blind source separation.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Study on speaker verification with non-audible murmur segments.
Proceedings of the INTERSPEECH 2007, 2007

Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model.
Proceedings of the INTERSPEECH 2007, 2007

Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees.
Proceedings of the INTERSPEECH 2007, 2007

How to judge reusability of existing speech corpora for target task by utilizing statistical multidimensional scaling.
Proceedings of the INTERSPEECH 2007, 2007

Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection.
Proceedings of the INTERSPEECH 2007, 2007

Development of preschool children subsystem for ASR and q&a in a real-environment speech-oriented guidance task.
Proceedings of the INTERSPEECH 2007, 2007

One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices.
Proceedings of the IEEE International Conference on Acoustics, 2007

Permutation-Robust Structure for ICA-Based Blind Source Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2007

Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA.
Proceedings of the IEEE International Conference on Acoustics, 2007

High-Presence Hearing-Aid System using DSP-Based Real-Time Blind Source Separation Module.
Proceedings of the IEEE International Conference on Acoustics, 2007

Insights Gained from Development and Long-Term Operation of a Real-Environment Speech-Oriented Guidance System.
Proceedings of the IEEE International Conference on Acoustics, 2007

Barge-in- and noise-free spoken dialogue interface based on sound field control and semi-blind source separation.
Proceedings of the 15th European Signal Processing Conference, 2007

Development and portability of ASR and Q&A modules for real-environment speech-oriented guidance systems.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Blind source separation based on a fast-convergence algorithm combining ICA and beamforming.
IEEE Trans. Audio, Speech & Language Processing, 2006

An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis.
Speech Communication, 2006

Non-Audible Murmur (NAM) Recognition.
IEICE Transactions, 2006

Interface for Barge-in Free Spoken Dialogue System Using Nullspace Based Sound Field Control and Beamforming.
IEICE Transactions, 2006

Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models.
IEICE Transactions, 2006

Utterance-Based Selective Training for the Automatic Creation of Task-Dependent Acoustic Models.
IEICE Transactions, 2006

Blind Separation of Acoustic Signals Combining SIMO-Model-Based Independent Component Analysis and Binary Masking.
EURASIP J. Adv. Sig. Proc., 2006

Embedded Julius: Continuous Speech Recognition Software for Microprocessor.
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Eigenvoice conversion based on Gaussian mixture model.
Proceedings of the INTERSPEECH 2006, 2006

Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation.
Proceedings of the INTERSPEECH 2006, 2006

Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech.
Proceedings of the INTERSPEECH 2006, 2006

Improving body transmitted unvoiced speech with statistical voice conversion.
Proceedings of the INTERSPEECH 2006, 2006

Speaker verification with non-audible murmur segments.
Proceedings of the INTERSPEECH 2006, 2006

Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training.
Proceedings of the INTERSPEECH 2006, 2006

Double-Talk Free Spoken Dialogue Interface Combining Sound Field Control With Semi-Blind Source Separation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Improving Rapid Unsupervised Speaker Adaptation Based On Hmm Sufficient Statistics.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

ICA and Binary-Mask-Based Blind Source Separation with Small Directional Microphones.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

Two-stage blind separation of moving sound sources with pocket-size real-time DSP module.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Estimation of Shape Parameter of GGD Function by Negentropy Matching.
Neural Processing Letters, 2005

Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA.
IEICE Transactions, 2005

On-Line Relaxation Algorithm Applicable to Acoustic Fluctuation for Inverse Filter in Multichannel Sound Reproduction System.
IEICE Transactions, 2005

A Self-Generator Method for Initial Filters of SIMO-ICA Applied to Blind Separation of Binaural Sound Mixtures.
IEICE Transactions, 2005

Foreword.
IEICE Transactions, 2005

Blind Separation and Deconvolution for Convolutive Mixture of Speech Combining SIMO-Model-Based ICA and Multichannel Inverse Filtering.
IEICE Transactions, 2005

Blind Separation of Speech by Fixed-Point ICA with Source Adaptive Negentropy Approximation.
IEICE Transactions, 2005

Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array.
IEICE Transactions, 2005

Designing Target Cost Function Based on Prosody of Speech Database.
IEICE Transactions, 2005

Blind sound scene decomposition for robot audition using SIMO-model-based ICA.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Two-stage blind source separation based on ICA and binary masking for real-time robot audition system.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

NAM-to-speech conversion with Gaussian mixture models.
Proceedings of the INTERSPEECH 2005, 2005

Operating a public spoken guidance system in real environment.
Proceedings of the INTERSPEECH 2005, 2005

Remodeling of the sensor for non-audible murmur (NAM).
Proceedings of the INTERSPEECH 2005, 2005

Applications of NAM microphones in speech recognition for privacy in human-machine communication.
Proceedings of the INTERSPEECH 2005, 2005

Investigating the role of the Lombard reflex in non-audible murmur (NAM) recognition.
Proceedings of the INTERSPEECH 2005, 2005

Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments.
Proceedings of the INTERSPEECH 2005, 2005

Speech Enhancement Based on Blind Source Separation in Car Environments.
Proceedings of the 21st International Conference on Data Engineering Workshops, 2005

Blind separation of binaural sound mixtures using SIMO-ICA with self-generator for initial filter.
Proceedings of the 13th European Signal Processing Conference, 2005

Two-stage blind source separation combining SIMO-model-based ICA and adaptive beamforming.
Proceedings of the 13th European Signal Processing Conference, 2005

Blind separation of more than two sources based on high-convergence algorithm combining ICA and beamforming.
Proceedings of the 13th European Signal Processing Conference, 2005

Barge-in free spoken dialogue interface using nullspace-based sound field control and beamforming.
Proceedings of the 13th European Signal Processing Conference, 2005

2004
Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers Based on the 3-D N-Best Search Method.
VLSI Signal Processing, 2004

Negentropy based voice-activity detection for noise estimation in very low SNR condition.
IEICE Electronic Express, 2004

Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Rapid EM training based on model-integration.
Proceedings of the INTERSPEECH 2004, 2004

MAP estimation of speech spectral component under GGD a priori.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs.
Proceedings of the INTERSPEECH 2004, 2004

Recent progress of open-source LVCSR engine julius and Japanese model repository.
Proceedings of the INTERSPEECH 2004, 2004

Non-audible murmur (NAM) speech recognition using a stethoscopic NAM microphone.
Proceedings of the INTERSPEECH 2004, 2004

Robust speech recognition with spectral subtraction in low SNR.
Proceedings of the INTERSPEECH 2004, 2004

Interface for barge-in free spoken dialogue system using adaptive sound field control.
Proceedings of the INTERSPEECH 2004, 2004

Blind separation of binaural sound mixtures using SIMO-model-based independent component analysis.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Public speech-oriented guidance system with adult and child discrimination capability.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Overdetermined blind separation for convolutive mixtures of speech based on multistage ICA using subarray processing.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Real-time word confidence scoring using local posterior probabilities on tree trellis search.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Evaluation of Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Single Channel Speech Enhancement: MAP Estimation Using GGD Prior Under Blind Setup.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Stable and Low-Distortion Algorithm Based on Overdetermined Blind Separation for Convolutive Mixtures of Speech.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Evaluation of blind separation and deconvolution for binaural-sound mixtures using SIMO-model-based ICA.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

On-line adaptive algorithm to acoustic fluctuation for inverse filter relaxation in sound reproduction system.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

Audible (normal) speech and inaudible murmur recognition using NAM microphone.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
Multiple beamforming with source localization based on CSP analysis.
Systems and Computers in Japan, 2003

Blind Source Separation Combining Independent Component Analysis and Beamforming.
EURASIP J. Adv. Sig. Proc., 2003

Stable learning algorithm for low-distortion blind separation of real speech mixture combining multistage ICA and linear prediction.
Proceedings of the ITRW on Non-Linear Speech Processing, 2003

Blind separation and deconvolution of MIMO system driven by colored inputs using SIMO-model-based ICA with information-geometric learning.
Proceedings of the NNSP 2003, 2003

High-fidelity blind separation for convolutive mixture of acoustic signals using SIMO-model-based independent component analysis.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

Model-integration rapid training based on maximum likelihood for speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Blind separation and deconvolution for convolutive mixture of speech using SIMO-model-based ICA and multichannel inverse filtering.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Unsupervised speaker adaptation based on HMM sufficient statistics in various noisy environments.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Simple designing methods of corpus-based visual speech synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Environmental sound source identification based on hidden Markov model for robust speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Non-audible murmur recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

GMM-based voice conversion applied to emotional speech synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A semi-blind source separation method for hands-free speech recognition of multiple talkers.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Segment selection considering local degradation of naturalness in concatenative speech synthesis.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Non-audible murmur recognition input interface using stethoscopic microphone attached to the skin.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Interface for barge-in free spoken dialogue system based on sound field control and microphone array.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array.
IEEE Trans. Speech and Audio Processing, 2002

Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Designing speech database with prosodic variety for expressive TTS system.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

ASKA: receptionist robot with speech dialogue system.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002

Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Speech enhancement in car environment using blind source separation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Evaluation of cross-language voice conversion using bilingual and non-bilingual databases.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Selective multi-path acoustic model based on database likelihoods.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Using start/end timings of spectral transitions between phonemes in concatenative speech synthesis.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Unit selection algorithm for Japanese speech synthesis based on both phoneme unit and diphone unit.
Proceedings of the IEEE International Conference on Acoustics, 2002

Talker localization in a real acoustic environment based on DOA estimation and statistical sound source identification.
Proceedings of the IEEE International Conference on Acoustics, 2002

Bund source separation based on Multi-Stage ICA combining frequency-domain ICA and time-domain ICA.
Proceedings of the IEEE International Conference on Acoustics, 2002

Adaptive compensation of temperature fluctuation effect in sound reproduction system.
Proceedings of the 11th European Signal Processing Conference, 2002

Evaluation of fast-convergence algorithm for ICA-based blind source separation of real convolutive mixture.
Proceedings of the 11th European Signal Processing Conference, 2002

Comparison of time-domain ICA, frequency-domain ICA and multistage ICA for blind source separation.
Proceedings of the 11th European Signal Processing Conference, 2002

2001
HMM-separation-based speech recognition for a distant moving speaker.
IEEE Trans. Speech and Audio Processing, 2001

Evaluation on unsupervised speaker adaptation based on sufficient HMM statictics of selected speakers.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

High quality voice conversion based on Gaussian mixture model with dynamic frequency warping.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Blind source separation for speech based on fast-convergence algorithm with ICA and beamforming.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Automatic n-gram language model creation from web resources.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Evaluation of cross-language voice conversion based on GMM and straight.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Julius - an open source real-time large vocabulary recognition engine.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Elderly acoustic model for large vocabulary continuous speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

An Adaptive Integration Based On Product Hmm For Audio-Visual Speech Recognition.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers.
Proceedings of the IEEE International Conference on Acoustics, 2001

Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum.
Proceedings of the IEEE International Conference on Acoustics, 2001

Speech enhancement by multiple beamforming with reflection signal equalization.
Proceedings of the IEEE International Conference on Acoustics, 2001

Gaussian mixture selection using context-independent HMM.
Proceedings of the IEEE International Conference on Acoustics, 2001

A microphone array-based 3-D N-best search algorithm for the simultaneous recognition of multiple sound sources in real environments.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Model adaptation by HMM decomposition and composition in noisy reverberant environments.
Systems and Computers in Japan, 2000

IPA Japanese Dictation Free Software Project.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Investigation of analysis and synthesis parameters of straight by subjective evaluation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Straight-based voice conversion algorithm based on Gaussian mixture model.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Blind source separation based on subband ICA and beamforming.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Stream weight optimization of speech and lip image sequence for audio-visual speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Free software toolkit for Japanese large vocabulary continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Manipulating speech pitch periods according to optimal insertion/deletion position in residual signal for intonation control in speech synthesis.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Robust fundamental frequency estimation using instantaneous frequencies of harmonic components.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Speech-to-Face Movement Synthesis based on HMMS.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Speech recognition for a distant moving speaker based on HMM composition and separation.
Proceedings of the IEEE International Conference on Acoustics, 2000

Localization of multiple sound sources based on a CSP analysis with a microphone array.
Proceedings of the IEEE International Conference on Acoustics, 2000

A new phonetic tied-mixture model for efficient decoding.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Simultaneous recognition of multiple sound sources based on 3-d n-best search using microphone array.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Lip movement synthesis from speech based on Hidden Markov Models.
Speech Communication, 1998

Speech-to-lip movement synthesis maximizing audio-visual joint probability based on EM algorithm.
Proceedings of the Second IEEE Workshop on Multimedia Signal Processing, 1998

Compression algorithm of trigram language models based on maximum likelihood estimation.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Speech-to-lip movement synthesis based on the EM algorithm using audio-visual HMMs.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

An effect of adaptive beamforming on hands-free speech recognition based on 3-d viterbi search.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Evaluation of model adaptation by HMM decomposition on telephone speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Sharable software repository for Japanese large vocabulary continuous speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpus.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Hands-free speech recognition based on 3-D Viterbi search using a microphone array.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Robust speech recognition in car environments.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Efficient representation of short-time phase based on group delay.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Lip Movement Synthesis from Speech Based on Hidden Markov Models.
Proceedings of the 3rd International Conference on Face & Gesture Recognition (FG '98), 1998

Subjective Evaluation for HMM-Based Speech-To-Lip Movement Synthesis.
Proceedings of the Auditory-Visual Speech Processing, 1998

1997
A non-iterative model-adaptive e-CMN/PMC approach for speech recognition in car environments.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Room acoustics and reverberation: impact on hands-free recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Improved bimodal speech recognition using tied-mixture HMMs and 5000 word audio-visual synchronous database.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Microphone array design measures for hands-free speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Maximum likelihood successive state splitting algorithm for tied-mixture HMNET.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Model adaptation based on HMM decomposition for reverberant speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Speech to lip movement synthesis by HMM.
Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, 1997

1996
Entropy coded vector quantization with hidden Markov models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Robust speech recognition with speaker localization by a microphone array.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Noise and room acoustics distorted speech recognition by HMM composition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
A Speech Dialogue System with Multimodal Interface for Telephone Directory Assistance.
IEICE Transactions, 1995

An HMM State Duration Control Algorithm Applied to Large-Vocabulary Spontaneous Speech Recognition.
IEICE Transactions, 1995

1994
Large-vocabulary continuous speech recognition algorithm applied to a multi-modal telephone directory assistance system.
Speech Communication, 1994

Dictation Machine Based on Japanese Character Source Modeling.
IJPRAI, 1994

A Large-Vocabulary Continuous Speech Recognition Algorithm and its Application to a Multi-Modal Telephone Directory Assistance System.
Proceedings of the Human Language Technology, 1994

A multi-modal dialogue system for telephone directory assistance.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

An HMM duration control algorithm with a low computational cost.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Search algorithm that merges candidates in meaning level for very large vocabulary spontaneous speech recognition.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Very-large-vocabulary continuous speech recognition algorithm for telephone directory assistance.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Dictation system using inductively auto-generated syntax.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Recognition of noisy speech by composition of hidden Markov models.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Recent Topics in Speech Recognition Research at NTT Laboratories.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Hardware implementation of realtime 1000-word HMM-LR continuous speech recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Phoneme HMM evaluation algorithm without phoneme labeling.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Speaker adaptation by modifying mixture coefficients of speaker-independent mixture Gaussian HMMs.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Continuous speech recognition for medical diagnoses using a character trigram model.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

1991
Language processing in connection with speech translation at ATR interpreting telephony research laboratories.
Speech Communication, 1991

1990
ATR Japanese speech database as a tool of speech recognition and synthesis.
Speech Communication, 1990

Spotting Phonemes and Syllables for Continuous Speech Recognition Using Time-Delay Neural Networks.
Systems and Computers in Japan, 1990

Phoneme recognition expert system using spectrogram reading knowledge and neural networks.
Systems and Computers in Japan, 1990

Phoneme segmentation expert system using spectrogram reading knowledge.
Systems and Computers in Japan, 1990

Integration of speech recognition and language processing in spoken language translation system (SL-TRANS).
Proceedings of the First International Conference on Spoken Language Processing, 1990

On the robustness of HMM and ANN speech recognition algorithms.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Japanese phonetic typewriter using HMM phone units and syllable trigrams.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Speaker weighted training of HMM using multiple reference speakers.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Neural Network Approach To Word Category Prediction For English Texts.
Proceedings of the 13th International Conference on Computational Linguistics, 1990

1989
Modularity and scaling in large phonemic neural networks.
IEEE Trans. Acoustics, Speech, and Signal Processing, 1989

Phoneme recognition using time-delay neural networks.
IEEE Trans. Acoustics, Speech, and Signal Processing, 1989

Phoneme recognition expert system using spectrogram reading knowledge and neural networks.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

Fast back-propagation learning methods for large phonemic neural networks.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

1986
Speech recognition based on top-down and bottom-up phoneme recognition.
Systems and Computers in Japan, 1986

1985
Speaker-independent isolated word recognition based on multiple templates using split method.
Systems and Computers in Japan, 1985

1983
Isolated word recognition using phoneme-like templates.
Proceedings of the IEEE International Conference on Acoustics, 1983

1981
Acoustic processing in the conversational speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 1981

1976
Speech recognition in the question-answering system operated by conversational speech.
Proceedings of the IEEE International Conference on Acoustics, 1976


  Loading...