Mark Hasegawa-Johnson

Koeng-Mo Sung

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Hmm-Based and Svm-Based Recognition of the Speech of Talkers With Spastic Dysarthria.

[BibT_eX]

[DOI]

Jon R. Gunderson

Adrienne Perlman

Thomas S. Huang

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus.

[BibT_eX]

[DOI]

Speech Commun., 2005

Distinctive feature based SVM discriminant features for improvements to phone recognition on telephone band speech.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Landmark-Based Speech Recognition: Report of the 2004 Johns Hopkins Summer Workshop.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Prosodic parallelism as a cue to repetition and error correction disfluency.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop (ITRW) on Disfluency in Spontaneous Speech, 2005

2004

Model enforcement: a unified feature transformation framework for classification and recognition.

[BibT_eX]

[DOI]

IEEE Trans. Signal Process., 2004

Automatic recognition of pitch movements using multilayer perceptron and time-Delay Recursive neural network.

[BibT_eX]

[DOI]

Sung-Suk Kim

IEEE Signal Process. Lett., 2004

Semantic analysis for a speech user interface in an intelligent tutoring system.

[BibT_eX]

[DOI]

Yuexi Ren

Proceedings of the 9th International Conference on Intelligent User Interfaces, 2004

Stop consonant classification by dynamic formant trajectory.

[BibT_eX]

[DOI]

Yanli Zheng

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Intertranscriber reliability of prosodic labeling on telephone conversation using toBI.

[BibT_eX]

[DOI]

Taejin Yoon

Sandra Chavarria

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

AVICAR: audio-visual speech corpus in a car environment.

[BibT_eX]

[DOI]

Bowon Lee

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Children's emotion recognition in an intelligent tutoring scenario.

[BibT_eX]

[DOI]

Tong Zhang

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Automatic detection of contrast for speech understanding.

[BibT_eX]

[DOI]

Tong Zhang

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

A factorial HMM aproach to robust isolated digit recognition in background music.

[BibT_eX]

[DOI]

Ameya N. Deoras

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Source separation using particle filters.

[BibT_eX]

[DOI]

Mital Gandhi

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Modeling pronunciation variation using artificial neural networks for English spontaneous speech.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Modeling and recognition of phonetic and prosodic factors for improvements to acoustic speech recognition models.

[BibT_eX]

[DOI]

Aaron Cohen

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Formant tracking by mixture state particle filter.

[BibT_eX]

[DOI]

Yanli Zheng

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel.

[BibT_eX]

[DOI]

Ameya N. Deoras

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic model.

[BibT_eX]

[DOI]

Aaron Cohen

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Approximately independent factors of speech using nonlinear symplectic transformation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2003

Non-linear maximum likelihood feature transformation for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Maximum conditional mutual information projection for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Prosody dependent speech recognition with explicit duration modelling at intonational phrase boundaries.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Acoustic segmentation using switching state Kalman filter.

[BibT_eX]

[DOI]

Yanli Zheng

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

An evaluation of using mutual information for selection of acoustic-features representation of phonemes for speech recognition.

[BibT_eX]

[DOI]

Yigal Brandman

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Maximum mutual information based acoustic-features representation of phonological features for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

Auditory-modeling inspired methods of feature extraction for robust automatic speech recognition.

[BibT_eX]

[DOI]

Zhinian Jing

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

PLP coefficients can be quantized at 400 bps.

[BibT_eX]

[DOI]

Wira Gunawan

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Signal approximation in Hilbert space and its application on articulatory speech synthesis.

[BibT_eX]

[DOI]

Jun Huang

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Time-frequency distribution of partial phonetic information measured using mutual information.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Multivariate-state hidden Markov models for simultaneous transcription of phones and formants.

[BibT_eX]

[DOI]