Shigeki Sagayama

Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Variable-length coding of ACELP gain using Entropy-Constrained VQ.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Communications and Information Technologies, 2012

Hidden Markov Convolutive Mixture Model for Pitch Contour Analysis of Speech.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Speaker-Dependent Voice Activity Detection Robust to Background Speech Noise.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Assistance for Novice Users on Creating Songs from Japanese Lyrics.

[BibT_eX]

[DOI]

Satoru Fukayama

Daisuke Saito

Proceedings of the Non-Cochlear Sound: Proceedings of the 38th International Computer Music Conference, 2012

Comparative evaluations of various harmonic/percussive sound separation algorithms based on anisotropic continuity of spectrogram.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

User-guided independent vector analysis with source activity tuning.

[BibT_eX]

[DOI]

Takuma Ono

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Explicit beat structure modeling for non-negative matrix factorization-based multipitch analysis.

[BibT_eX]

[DOI]

Kazuki Ochiai

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Constrained and regularized variants of non-negative matrix factorization incorporating music-specific constraints.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A tandem connectionist model using combination of multi-scale spectro-temporal features for acoustic event detection.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2011

Diffuse Noise Suppression Using Crystal-Shaped Microphone Arrays.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Computational auditory induction as a missing-data model-fitting problem with Bregman divergence.

[BibT_eX]

[DOI]

Speech Commun., 2011

Polyphonic Pitch Estimation and Instrument Identification by Joint Modeling of Sustained and Attack Sounds.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2011

Introduction to the Special Issue on Music Signal Processing.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2011

Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Polyhymnia: An Automatic Piano Performance System with Statistical Modeling of Polyphonic Expression and Musical Symbol Interpretation.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on New Interfaces for Musical Expression, 2011

Using Spectral Fluctuation of Speech in Multi-Feature HMM-Based Voice Activity Detection.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Concurrent Optimization of Context Clustering and GMM for Offline Handwritten Word Recognition Using HMM.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Multipitch estimation by joint modeling of harmonic and transient sounds.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Infinite-state spectrum model for music signal analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Automatic video annotation via Hierarchical Topic Trajectory Model considering cross-modal correlations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Multichannel harmonic and percussive component separation by joint modeling of spatial and spectral continuity.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Templatic features for modeling phoneme acquisition.

[BibT_eX]

[DOI]

Emmanuel Dupoux

Guillaume Beraud-Sudreau

Proceedings of the 33th Annual Meeting of the Cognitive Science Society, 2011

Musical Instrument Identification Based on New Boosting Algorithm with Probabilistic Decisions.

[BibT_eX]

[DOI]

Proceedings of the Speech, Sound and Music Processing: Embracing Research in India, 2011

2010

Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Music Information Retrieval, 2010

Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2010

SEMANTIC INDEXING AND KNOWN ITEM SEARCH BASED ON A UNIFIED MODEL WITH TOPIC TRANSITION REPRESENTATION.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Analysis on speech characteristics for robust voice activity detection.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Flexible Harmonic Temporal Structure for Modeling Musical Instrument.

[BibT_eX]

[DOI]

Proceedings of the Entertainment Computing - ICEC 2010, 9th International Conference, 2010

A Roadmap Towards Versatile MIR.

[BibT_eX]

[DOI]

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Autoregressive MFCC Models for Genre Classification Improved by Harmonic-percussion Separation.

[BibT_eX]

[DOI]

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Multiple Pitch Transcription using DBN-based Musicological Models.

[BibT_eX]

[DOI]

Frédéric Bimbot

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Monophonic Instrument Sound Segregation by Clustering NMF Components Based on Basis Similarity and Gain Disjointness.

[BibT_eX]

[DOI]

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Musical instrument identification based on harmonic temporal timbre features.

[BibT_eX]

[DOI]

Yu Kitano

Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2010

HMM-based approach for automatic chord detection using refined acoustic features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Music mood classification by rhythm and bass-line unit pattern analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Melody line estimation in homophonic music audio signals based on temporal-variability of melodic source.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

R-means localization: A simple iterative algorithm for range-difference-based source localization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

A sparse component model of source signals and its application to blind source separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Designing the Wiener post-filter for diffuse noise suppression using imaginary parts of inter-channel cross-spectra.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2010

Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2010

Crystal-MUSIC: Accurate Localization of Multiple Sources in Diffuse Noise Environments Using Crystal-Shaped Microphone Arrays.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2010

Blind Estimation of Locations and Time Offsets for Distributed Recording Devices.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2010

2009

Note detection with dynamic bayesian networks as a postanalysis step for NMF-based multiple pitch estimation techniques.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Blind alignment of asynchronously recorded signals for distributed microphone array.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Orpheus: Automatic Composition System Considering Prosody of Japanese Lyrics.

[BibT_eX]

[DOI]

Proceedings of the Entertainment Computing, 2009

Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification.

[BibT_eX]

[DOI]

Emiru Tsunoo

Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Minimum Classification Error Training to Improve Isolated Chord Recognition.

[BibT_eX]

[DOI]

Jeremy Reed

Yushi Ueda

Sabato Marco Siniscalchi

Yuuki Uchiyama

Chin-Hui Lee

Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Stereo-input speech recognition using sparseness-based time-frequency masking in a reverberant environment.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Audio genre classification using percussive pattern clustering combined with timbral features.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Rhythm map: Extraction of unit rhythmic patterns and analysis of rhythmic structure from music acoustic signals.

[BibT_eX]

[DOI]

Emiru Tsunoo

Proceedings of the IEEE International Conference on Acoustics, 2009

Complex NMF: A new sparse representation for acoustic signals.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Extending Nonnegative Matrix Factorization - A discussion in the context of multiple frequency estimation of musical signals.

[BibT_eX]

[DOI]

Proceedings of the 17th European Signal Processing Conference, 2009

2008

Specmurt Analysis of Polyphonic Music Signals.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Sound Source Localization with Front-Back Judgement by Two Microphones Asymmetrically Mounted on a Sphere.

[BibT_eX]

[DOI]

Souichiro Fukamachi

J. Multim., 2008

A Real-time Equalizer of Harmonic and Percussive Components in Music Signals.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction.

[BibT_eX]

[DOI]

Jonathan Le Roux

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Computational auditory induction by missing-data non-negative matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

On-line handwritten Kanji string recognition based on grammar description of character structures.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Modulation analysis of speech through orthogonal FIR filterbank optimization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of multi-instrument polyphonic music signals.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Auxiliary function approach to parameter estimation of constrained sinusoidal model for monaural speech separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

A blind noise decorrelation approach with crystal arrays on designing post-filters for diffuse noise suppression.

[BibT_eX]

[DOI]

Nobutaka Ito

Proceedings of the IEEE International Conference on Acoustics, 2008

Separation of a monaural audio signal into harmonic/percussive components by complementary diffusion on spectrogram.

[BibT_eX]

[DOI]

Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007

Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Sound Source Localization by Asymmetrically Arrayed 2ch Microphones on a Sphere.

[BibT_eX]

[DOI]

Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

Multipitch Analysis with Harmonic Nonnegative Matrix Approximation.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Automatic Decision of Piano Fingering Based on a Hidden Markov Models.

[BibT_eX]

[DOI]

Yuichiro Yonebayashi

Proceedings of the IJCAI 2007, 2007

Online Handwritten Kanji Recognition Based on Inter-stroke Grammar.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

Rhythm and Tempo Analysis Toward Automatic Music Transcription.

[BibT_eX]

[DOI]

Haruto Takeda

Proceedings of the IEEE International Conference on Acoustics, 2007

Harmonic-Temporal Clustering of Speech for Single and Multiple F0 Contour Estimation in Noisy Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Probabilistic Approach to Automatic Music Transcription from Audio Signals.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Speech analyzer using a joint estimation model of spectral envelope and fine structure.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Model Adaptation for Long Convolutional Distortion by Maximum Likelihood Based State Filtering Approach.

[BibT_eX]

[DOI]

Chandra Kant Raut

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Effect of Learning on Listening to Ultra-Fast Synthesized Speech.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference of the IEEE Engineering in Medicine and Biology Society, 2006

2005

Specmurt Analysis of Multi-Pitch Music Signals with Adaptive Estimation of Common Harmonic Structure .

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2005, 2005

Harmonic-Temporal Clustering via Deterministic Annealing EM Algorithm for Audio Feature Extraction.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2005, 2005

Model adaptation by state splitting of HMM for long reverberation.

[BibT_eX]

[DOI]

Chandra Kant Raut

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Audio stream segregation of multi-pitch music signal based on time-space clustering using Gaussian kernel 2-dimensional model.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents.

[BibT_eX]

Proceedings of the Life-like characters - tools, affective functions, and applications., 2004

Rhythm and Tempo Recognition of Music Performance from a Probabilistic Approach.

[BibT_eX]

[DOI]

Haruto Takeda

Proceedings of the ISMIR 2004, 2004

Complex spectrum circle centroid for microphone-array-based noisy speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Specmurt anasylis: a piano-roll-visualization of polyphonic music signal by deconvolution of log-frequency spectrum.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Model composition by lagrange polynomial approximation for robust speech recognition in noisy environment.

[BibT_eX]

[DOI]

Chandra Kant Raut

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Multi-pitch trajectory estimation of concurrent speech based on harmonic GMM and nonlinear kalman filtering.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Automatic rhythm transcription from multiphonic MIDI signals.

[BibT_eX]

[DOI]

Haruto Takeda

Proceedings of the ISMIR 2003, 2003

On-line Overlaid-Handwriting Recognition Based on Substroke HMMs.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR 2003), 2003

Generation of Hierarchical Dictionary for Stroke-order Free Kanji Handwriting Recognition Based on Substroke HMM.

[BibT_eX]

[DOI]

Mitsuru Nakai

Hiroshi Shimodaira

Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR 2003), 2003

2002

Pen Pressure Features for Writer-Independent On-Line Handwriting Recognition Based on Substroke HMM.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Pattern Recognition, 2002

Context-dependent substroke model for HMM-based on-line handwriting recognition.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition, 2002

Jacobian joint adaptation to noise, channel and vocal tract length.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

Hidden Markov model for automatic transcription of MIDI signals.

[BibT_eX]

[DOI]

Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

2001

Dynamic Time-Alignment Kernel in Support Vector Machine.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Support vector machine with dynamic time-alignment kernel for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Substroke Approach to HMM-Based On-line Kanji Handwriting Recognition.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR 2001), 2001

Multiple-regression hidden Markov model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Speaker adaptation of acoustic models using correlations of training transfer vectors.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2000

IPA Japanese Dictation Free Software Project.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Jacobian adaptation of HMM with initial model selection for noisy speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Feature-dependent allophone clustering.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Free software toolkit for Japanese large vocabulary continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Asynchronous-transition HMM.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

An address data entry system with a multimodal interface including speech recognition.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1999

1998

Two-step generation of variable-word-length language model integrating local and global constraints.

[BibT_eX]

[DOI]

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

ASR and TTS telecommunications applications in Japan.

[BibT_eX]

[DOI]

Speech Commun., 1997

Speech recognition and synthesis technology development at NTT for telecommunications services.

[BibT_eX]

[DOI]

Kazuo Hakoda

Mikio Kitai

Int. J. Speech Technol., 1997

Vector-field-smoothed Bayesian learning for fast and incremental speaker/telephone-channel adaptation.

[BibT_eX]

[DOI]

Comput. Speech Lang., 1997

Fast adaptation of acoustic models to environmental noise using jacobian adaptation algorithm.

[BibT_eX]

[DOI]

Yoshikazu Yamaguchi

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Variable-length language modeling integrating global constraints.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Discrete mixture HMM.

[BibT_eX]

[DOI]

Kiyoaki Aikawa

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Jacobian approach to fast acoustic model adaptation.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Improved estimation of supervision in unsupervised speaker adaptation.

[BibT_eX]

[DOI]

Shigeru Homma

Kiyoaki Aikawa

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

A speaker-adaptation technique for context-dependent models represented by hidden markov networks.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1996

Speaker-independent speech recognition based on tree-structured speaker clustering.

[BibT_eX]

[DOI]

Comput. Speech Lang., 1996

LR-parser-driven viterbi search with hypotheses merging mechanism using context-dependent phone models.

[BibT_eX]

[DOI]

Tomokazu Yamada

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Iterative unsupervised speaker adaptation for batch dictation.

[BibT_eX]

[DOI]

Shigeru Homma

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Minimum classification error training for a small amount of data enhanced by vector-field-smoothed Bayesian learning.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Tied-structure HMM based on parameter correlation for efficient model training.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

Interactive voice technology development for telecommunications applications.

[BibT_eX]

[DOI]

Speech Commun., 1995

Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 1995

Automatic Determination of the Number of Mixture Components for Continuous HMMs Based a Uniform Variance Criterion.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 1995

Speech Recognition Using Function-Word N-Grams and Content-Word N-Grams.

[BibT_eX]

[DOI]

Ryosuke Isotani

IEICE Trans. Inf. Syst., 1995

Fast and accurate beam search using forward heuristic functions in HMM-LR speech recognition.

[BibT_eX]

[DOI]

Yoshiaki Noda

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Syllabic duration control for vocabulary-free speech recognition.

[BibT_eX]

[DOI]

Takatoshi Jitsuhiro

Tomokazu Yamada

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Vector-field-smoothed Bayesian learning for incremental speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the 1995 International Conference on Acoustics, 1995

Four-level tied-structure for efficient representation of acoustic modeling.

[BibT_eX]

[DOI]

Proceedings of the 1995 International Conference on Acoustics, 1995

On the use of scalar quantization for fast HMM computation.

[BibT_eX]

[DOI]

Proceedings of the 1995 International Conference on Acoustics, 1995

1994

Speaker-consistent parsing for speaker-independent continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Telephone line characteristic adaptation using vector field smoothing technique.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Tree-structured speaker clustering for speaker-independent continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

All-phoneme ergodic hidden Markov network for unsupervised speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Tree-structured speaker clustering for fast speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993

Suprasegmental duration control with matrix parsing in continuous speech recognition.

[BibT_eX]

[DOI]

Harald Singer

Speech Commun., 1993

Feature extraction using a matrix coefficient filter for speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 1993

A neural fuzzy training approach for improving speech recognition.

[BibT_eX]

[DOI]

Yasuhiro Komori

Alexander H. Waibel

Syst. Comput. Jpn., 1993

ATREUS: a speech recognition front-end for a speech translation system.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

The possibility for acquisition of statistical network grammar using ergodic HMM.

[BibT_eX]

[DOI]

Jin'ichi Murakami

Hiroki Yamatomo

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

ATR's speech translation system: ASURA.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A dynamic approach to speaker adaptation of hidden Markov networks for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Speech recognition using particle n-grams and content-word n-grams.

[BibT_eX]

[DOI]

Ryosuke Isotani

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Spoken Language Translation System.

[BibT_eX]

Proceedings of the 13th International Joint Conference on Artificial Intelligence. Chambéry, France, August 28, 1993

Matrix parser and its application to HMM-based speech recognition.

[BibT_eX]

[DOI]

Harald Singer

Proceedings of the IEEE International Conference on Acoustics, 1993

ATREUS: a comparative study of continuous speech recognition systems at ATR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1993

Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1993

1992

Continuous mixture HMM-LR using the a* algorithm for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Appropriate error criterion selection for continuous speech HMM minimum error training.

[BibT_eX]

[DOI]

David Rainion

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs.

[BibT_eX]

[DOI]

Kazumi Ohkura

Masahide Sugiyama

Proceedings of the Second International Conference on Spoken Language Processing, 1992

The SSS-LR continuous speech recognition system: integrating SSS-derived allophone models and a phoneme-context-dependent LR parser.

[BibT_eX]

[DOI]

Akito Nagai

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Hardware implementation of realtime 1000-word HMM-LR continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Enhancement of ATR's spoken language translation system: SL-TRANS2.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Continuously spoken sentence recognition by HMM-LR.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Vector field smoothing principle for speaker adaptation.

[BibT_eX]

[DOI]

Hiroaki Hattori

Proceedings of the Second International Conference on Spoken Language Processing, 1992

A successive state splitting algorithm for efficient allophone modeling.

[BibT_eX]

[DOI]

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Pitch dependent phone modelling for HMM based speech recognition.

[BibT_eX]

[DOI]

Harald Singer

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991

A matrix representation of HMM-based speech recognition algorithms.

[BibT_eX]

[DOI]

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Phoneme-context-dependent LR parsing algorithms for HMM-based continuous speech recognition.

[BibT_eX]

[DOI]

Akito Nagai

Kenji Kita

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

A pairwise discriminant approach to robust phoneme recognition by time-delay neural networks.

[BibT_eX]

[DOI]

Proceedings of the 1991 International Conference on Acoustics, 1991

Phoneme recognition by phoneme filter neural networks.

[BibT_eX]

[DOI]

Masami Nakamura

Shinichi Tamura

Proceedings of the 1991 International Conference on Acoustics, 1991

1990

Phoneme recognition by pairwise discriminant TDNNs.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Isolated word recognition using pitch pattern information.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Estimation of unknown context using a phoneme environment clustering algorithm.

[BibT_eX]

[DOI]

Shigeru Honrna

Proceedings of the First International Conference on Spoken Language Processing, 1990

Sentence speech recognition using semantic dependency analysis.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Speaker weighted training of HMM using multiple reference speakers.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Line spectrum pair frequency - based distance measures for speech recognition.

[BibT_eX]

[DOI]

Fikret S. Gürgen

Sadaoki Furui

Proceedings of the First International Conference on Spoken Language Processing, 1990

Statistical study on voice individuality conversion across different languages.

[BibT_eX]

[DOI]

Masanobu Abe

Proceedings of the First International Conference on Spoken Language Processing, 1990

A continuous speech recognition system based on a two-level grammar approach.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

Phoneme environment clustering for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1989

1986

Duality theory of composite sinusoidal modeling and linear prediction.

[BibT_eX]

[DOI]