Douglas D. O'Shaughnessy

Orcid: 0000-0002-0110-2346

Affiliations:
  • INRS-EMT, Montreal, Canada


According to our database1, Douglas D. O'Shaughnessy authored at least 256 papers between 1976 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Awards

IEEE Fellow

IEEE Fellow 2006, "For contributions to education in speech processing and communication".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Speech Enhancement - A Review of Modern Methods.
IEEE Trans. Hum. Mach. Syst., February, 2024

Trends and developments in automatic speech recognition research.
Comput. Speech Lang., January, 2024

2023
Review of methods for coding of speech signals.
EURASIP J. Audio Speech Music. Process., December, 2023

Review of analysis methods for speech applications.
Speech Commun., June, 2023

2022
The Effects of Model Capacity in Modelling Variability between Training and Testing Environments for Automatic Speech Recognition.
Proceedings of the 5th IEEE International Conference on Artificial Intelligence and Knowledge Engineering, 2022

2021
Feature Pooling of Modulation Spectrum Features for Improved Speech Emotion Recognition in the Wild.
IEEE Trans. Affect. Comput., 2021

Automatic speaker verification from affective speech using Gaussian mixture model based estimation of neutral speech characteristics.
Speech Commun., 2021

On the use of blind channel response estimation and a residual neural network to detect physical access attacks to speaker verification systems.
Comput. Speech Lang., 2021

2020
Non-intrusive speech quality prediction based on the blind estimation of clean speech and the i-vector framework.
Qual. User Exp., 2020

On the use of the i-vector speech representation for instrumental quality measurement.
Qual. User Exp., 2020

Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing.
IEEE J. Sel. Top. Signal Process., 2020

2019
Recognition and Processing of Speech Signals Using Neural Networks.
Circuits Syst. Signal Process., 2019

Intrusive Quality Measurement of Noisy and Enhanced Speech based on i-Vector Similarity.
Proceedings of the 11th International Conference on Quality of Multimedia Experience QoMEX 2019, 2019

Blind Channel Response Estimation for Replay Attack Detection.
Proceedings of the Interspeech 2019, 2019

Speech-Based Stress Classification based on Modulation Spectral Features and Convolutional Neural Networks.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition.
Proceedings of the Interspeech 2018, 2018

2017
Speech emotion recognition on mobile devices based on modulation spectral feature pooling and deep neural networks.
Proceedings of the 2017 IEEE International Symposium on Signal Processing and Information Technology, 2017

2016
Feature mapping, score-, and feature-level fusion for improved normal and whispered speech speaker verification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Evaluation of graph metrics for optimizing bin-based ontologically smoothed language models.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Regularized minimum variance distortionless response-based cepstral features for robust continuous speech recognition.
Speech Commun., 2015

Unsupervised language model adaptation using LDA-based mixture models and latent semantic marginals.
Comput. Speech Lang., 2015

Boosting speaker identification performance using a frame level based algorithm.
Proceedings of the International Conference on Communications, 2015

Document-specific context plsa language model for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
On the relevance of using rhythmic metrics and SVM to assess dysarthric severity.
Int. J. Biom., 2014

Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique.
Digit. Signal Process., 2014

Document-based Dirichlet class language model for speech recognition using document-based n-gram events.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Keynote speakers: The challenges of pattern recognition for speech signals.
Proceedings of the IEEE 12th International New Circuits and Systems Conference, 2014

Automatic Emotion Recognition from Cochlear Implant-Like Spectrally Reduced Speech.
Proceedings of the Ambient Assisted Living and Daily Activities, 2014

Improving the performance of far-field speaker verification using multi-condition training: the case of GMM-UBM and i-vector systems.
Proceedings of the INTERSPEECH 2014, 2014

Noise spectrum estimation using Gaussian mixture model-based speech presence probability for robust speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

Abin-based ontological framework for low-resourcen-gram smoothing in language modelling.
Proceedings of the IEEE International Conference on Acoustics, 2014

Novel topic n-gram count LM incorporating document-based topic distributions and n-gram counts.
Proceedings of the 22nd European Signal Processing Conference, 2014

Robust speech recognition using warped DFT-based cepstral features in clean and multistyle training.
Proceedings of the 22nd European Signal Processing Conference, 2014

Robust feature extractors for continuous speech recognition.
Proceedings of the 22nd European Signal Processing Conference, 2014

Interpolated Dirichlet Class Language Model for Speech Recognition Incorporating Long-distance N-grams.
Proceedings of the COLING 2014, 2014

2013
Multitaper MFCC and PLP features for speaker verification using i-vectors.
Speech Commun., 2013

Speech Information Processing: Theory and Applications [Scanning the Issue].
Proc. IEEE, 2013

Acoustic Analysis for Automatic Speech Recognition.
Proc. IEEE, 2013

Assessment of dysarthric speech through rhythm metrics.
J. King Saud Univ. Comput. Inf. Sci., 2013

Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems.
Cogn. Comput., 2013

Smoothed Nonlinear Energy Operator-Based Amplitude Modulation Features for Robust Speech Recognition.
Proceedings of the Advances in Nonlinear Speech Processing - 6th International Conference, 2013

Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations.
Proceedings of the INTERSPEECH 2013, 2013

Fitting long-range information using interpolated distanced n-grams and cache models into a latent dirichlet language model for speech recognition.
Proceedings of the INTERSPEECH 2013, 2013

Regularized MVDR spectrum estimation-based robust feature extractors for speech recognition.
Proceedings of the INTERSPEECH 2013, 2013

Amplitude modulation features for emotion recognition from speech.
Proceedings of the INTERSPEECH 2013, 2013

Whispered speaker verification and gender detection using weighted instantaneous frequencies.
Proceedings of the IEEE International Conference on Acoustics, 2013

Comparison of a bigram PLSA and a novel context-based PLSA language model for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Multiple windowed spectral features for emotion recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Cognitive, affective, and experience correlates of speech quality perception in complex listening conditions.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speech recognition using regularized minimum variance distortionless response spectrum estimation-based cepstral features.
Proceedings of the IEEE International Conference on Acoustics, 2013

PLSA enhanced with a long-distance bigram language model for speech recognition.
Proceedings of the 21st European Signal Processing Conference, 2013

A new approach to short-time harmonic analysis of tonal audio signals using harmonic sinusoidals.
Proceedings of the 26th IEEE Canadian Conference on Electrical and Computer Engineering CCECE 2013, 2013

2012
Fine granularity scalable speech coding using embedded tree-structured vector quantization.
Speech Commun., 2012

Bayesian on-line spectral change point detection: a soft computing approach for on-line ASR.
Int. J. Speech Technol., 2012

A segmental non-parametric-based phoneme recognition approach at the acoustical level.
Comput. Speech Lang., 2012

Codage échelonnable à granularité fine de la parole : Application au codeur G.729 (Fine granularity scalable speech coding: Application to the G.729 coder) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Topic n-gram count language model adaptation for speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

On the use of asymmetric-shaped tapers for speaker verification using i-vectors.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

A soft computing approach to improve the robustness of on-line ASR in previously unseen highly non-stationary acoustic environments.
Proceedings of the 11th International Conference on Information Science, 2012

Effects of discriminative training on the RACAD corpus of the French language spoken in the Canadian province of New-Brunswick.
Proceedings of the 11th International Conference on Information Science, 2012

Robust Feature Extraction for Speech Recognition by Enhancing Auditory Spectrum.
Proceedings of the INTERSPEECH 2012, 2012

A highly non-stationary noise tracking and compensation algorithm, with applications to speech enhancement and on-line ASR.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

LDA-based LM adaptation using latent semantic marginals and minimum discriminant information.
Proceedings of the 20th European Signal Processing Conference, 2012

Robust speech recognition under noisy environments using asymmetric tapers.
Proceedings of the 20th European Signal Processing Conference, 2012

Ontology-based pattern generator and root semantic analyser for spoken dialogue systems.
Proceedings of the 25th IEEE Canadian Conference on Electrical and Computer Engineering, 2012

2011
Perceptual improvement of Wiener filtering employing a post-filter.
Digit. Signal Process., 2011

Real-Time Bayesian Inference: A Soft Computing Approach to Environmental Learning for On-Line Robust Automatic Speech Recognition.
Proceedings of the Soft Computing Models in Industrial and Environmental Applications, 2011

Comparative Evaluation of Feature Normalization Techniques for Speaker Verification.
Proceedings of the Advances in Nonlinear Speech Processing, 2011

A Study of Low-variance Multi-taper Features for Distributed Speech Recognition.
Proceedings of the Advances in Nonlinear Speech Processing, 2011

A Rapid Adaptation Algorithm for Tracking Highly Non-Stationary Noises based on Bayesian Inference for On-Line Spectral Change Point Detection.
Proceedings of the INTERSPEECH 2011, 2011

Blind Speech Separation in Multiple Environments Using a Frequency Oriented PCA Method for Convolutive Mixtures.
Proceedings of the INTERSPEECH 2011, 2011

Real-life speech-enabled system to enhance interaction with rfid networks in noisy environments.
Proceedings of the IEEE International Conference on Acoustics, 2011

Unsupervised language model adaptation using latent Dirichlet allocation and dynamic marginals.
Proceedings of the 19th European Signal Processing Conference, 2011

Unsupervised language model adaptation using n-gram weighting.
Proceedings of the 24th Canadian Conference on Electrical and Computer Engineering, 2011

Multi-taper MFCC features for speaker verification using I-vectors.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Text-independent distributed speaker identification and verification using GMM-UBM speaker models for mobile communications.
Proceedings of the 10th International Conference on Information Sciences, 2010

Novel weighting scheme for unsupervised language model adaptation using latent dirichlet allocation.
Proceedings of the INTERSPEECH 2010, 2010

A segment-based non-parametric approach for monophone recognition.
Proceedings of the INTERSPEECH 2010, 2010

Phoneme classification and lattice rescoring based on a k-NN approach.
Proceedings of the INTERSPEECH 2010, 2010

Oriented PCA method for blind speech separation of convolutive mixtures.
Proceedings of the INTERSPEECH 2010, 2010

An efficient tree-structured codebook design for embedded vector quantization.
Proceedings of the IEEE International Conference on Acoustics, 2010

Blind speech separation for convolutive mixtures using an oriented principal components analysis method.
Proceedings of the 18th European Signal Processing Conference, 2010

Frame recursive dynamic mean bias removal technique for robust environment-aware speech recognition in real world applications.
Proceedings of the 23rd Canadian Conference on Electrical and Computer Engineering, 2010

2009
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education].
IEEE Signal Process. Mag., 2009

Developments and directions in speech recognition and understanding, Part 1 [DSP Education].
IEEE Signal Process. Mag., 2009

Alternative Speech Communication System for Persons with Severe Speech Disorders.
EURASIP J. Adv. Signal Process., 2009

A novel method for epoch extraction from speech signals.
Proceedings of the INTERSPEECH 2009, 2009

Fine-granular scalable MELP coder based on embedded vector quantization.
Proceedings of the INTERSPEECH 2009, 2009

STFT-based speech enhancement by reconstructing the harmonics.
Proceedings of the INTERSPEECH 2009, 2009

Context-independent phoneme recognition using a K-Nearest Neighbour classification approach.
Proceedings of the IEEE International Conference on Acoustics, 2009

Robust Speech Enhancement Using Two-Stage Filtered Minima Controlled Recursive Averaging.
Proceedings of the Signal Processing, Image Processing and Pattern Recognition, 2009

A Comparative Study of Blind Speech Separation Using Subspace Methods and Higher Order Statistics.
Proceedings of the Signal Processing, Image Processing and Pattern Recognition, 2009

A method utilizing window function frequency characteristics for noise-robust spectral pitch estimation.
Proceedings of the 17th European Signal Processing Conference, 2009

Low-complexity encoding of speech lsf parameters using multistage tree-structured vector quantization: Application to the MELP coder.
Proceedings of the 22nd Canadian Conference on Electrical and Computer Engineering, 2009

A study on bias-based speech signal conditioning techniques for improving the robustness of automatic speech recognition.
Proceedings of the 22nd Canadian Conference on Electrical and Computer Engineering, 2009

Distributed automatic text-independent speaker identification using GMM-UBM speaker models.
Proceedings of the 22nd Canadian Conference on Electrical and Computer Engineering, 2009

Blind speech separation using high order statistics.
Proceedings of the 22nd Canadian Conference on Electrical and Computer Engineering, 2009

Robust distributed speech recognition using two-stage Filtered Minima Controlled Recursive Averaging.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

An improved perceptual speech enhancement technique employing a psychoacoustically motivated weighting factor.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Invited paper: Automatic speech recognition: History, methods and challenges.
Pattern Recognit., 2008

Speech-Enabled Tools for Augmented Interaction in E-Learning Applications.
Int. J. Distance Educ. Technol., 2008

Experiments on Automatic Recognition of Nonnative Arabic Speech.
EURASIP J. Audio Speech Music. Process., 2008

Seed models combination and state level mappings of cross-lingual transfer for rapid HMM development: from English to Mandarin.
Proceedings of the INTERSPEECH 2008, 2008

Voice activity detection using modified Wigner-ville distribution.
Proceedings of the INTERSPEECH 2008, 2008

An intuitive class discriminability measure for feature selection in a speech recognition system.
Proceedings of the INTERSPEECH 2008, 2008

Speech enhancement using a wiener denoising technique and musical noise reduction.
Proceedings of the INTERSPEECH 2008, 2008

Speech enhancement based on novel two-step a priori SNR estimators.
Proceedings of the INTERSPEECH 2008, 2008

Likelihood-based non-uniform allocation of Gaussian kernels in scalar dimension for HMM compression.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Segmentation of a speech spectrogram using mathematical morphology.
Proceedings of the IEEE International Conference on Acoustics, 2008

Speech enhancement based on a hybrid a priori signal-to-noise ratio (SNR) estimator and a self-adaptive Lagrange multiplier.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Subspace-based speech enhancement by updating noise characteristics in the presence of speech.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
Acoustic Analysis and Detection of Hypernasality Using a Group Delay Function.
IEEE Trans. Biomed. Eng., 2007

Environmental Independent ASR Model Adaptation/Compensation by Bayesian Parametric Representation.
IEEE Trans. Speech Audio Process., 2007

Theoretical Complex Cepstrum of DCT and Warped DCT Filters.
IEEE Signal Process. Lett., 2007

A Hybrid Genetic-Neural Front-End Extension for Robust Speech Recognition over Telephone Lines.
Proceedings of the Advances in Nonlinear Speech Processing, 2007

An evaluation of cross-language adaptation and native speech training for rapid HMM construction based on very limited training data.
Proceedings of the INTERSPEECH 2007, 2007

Clustering-based two-dimensional linear discriminant analysis for speech recognition.
Proceedings of the INTERSPEECH 2007, 2007

Frame margin probability discriminative training algorithm for noisy speech recognition.
Proceedings of the INTERSPEECH 2007, 2007

A new approach for phoneme segmentation of speech signals.
Proceedings of the INTERSPEECH 2007, 2007

Effect of incomplete glottal closures on estimates of glottal waves via inverse filtering of vowel sounds.
Proceedings of the INTERSPEECH 2007, 2007

Speech enhancement using PCA and variance of the reconstruction error model identification.
Proceedings of the INTERSPEECH 2007, 2007

Voiced-Unvoiced-Silence Speech Sound Classification Based on Unsupervised Learning.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Performance Analysis of Features from Complex Cepstra of Warped DST, DCT and DHT Filters for Phoneme Recognition.
Proceedings of the 15th International Conference on Digital Signal Processing, 2007

Bias Estimation and Correction in a Classifier using Product of Likelihood-Gaussians.
Proceedings of the IEEE International Conference on Acoustics, 2007

Interpolative variable frame rate transmission of speech features for distributed speech recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Speech enhancement using PCA and variance of the reconstruction error in distributed speech recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Assessment of articulatory sub-systems of dysarthric speech using an isolated-style phoneme recognition system.
Proceedings of the INTERSPEECH 2006, 2006

Speaker adaptation using evolutionary-based linear transform.
Proceedings of the INTERSPEECH 2006, 2006

Combining multiple-sized sub-word units in a speech recognition system using baseform selection.
Proceedings of the INTERSPEECH 2006, 2006

Discriminative MLE training using a product of Gaussian likelihoods.
Proceedings of the INTERSPEECH 2006, 2006

State-level variable modeling for phoneme classification.
Proceedings of the INTERSPEECH 2006, 2006

Noise-robust speech recognition of conversational telephone speech.
Proceedings of the INTERSPEECH 2006, 2006

Obtaining LIP and Glottal Reflection Coefficients from Vowel Sounds.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Environmental compensation using ASR model adaptation by a Bayesian parametric representation method.
Proceedings of the INTERSPEECH 2005, 2005

Robust automatic speech recognition using a perceptually-based optimal spectral amplitude estimator speech enhancement algorithm in various low-SNR environments.
Proceedings of the INTERSPEECH 2005, 2005

Explicit segmentation of speech based on frequency-domain AR modeling.
Proceedings of the INTERSPEECH 2005, 2005

Statistical properties of the warped discrete cosine transform cepstrum compared with MFCC.
Proceedings of the INTERSPEECH 2005, 2005

Relevant information extraction for discriminative training applied to speaker identification.
Proceedings of the INTERSPEECH 2005, 2005

A performance investigation of noisy voice recognition over IP telephony networks.
Proceedings of the INTERSPEECH 2005, 2005

Experiments on speaker profile portability.
Proceedings of the INTERSPEECH 2005, 2005

Log-Energy Dynamic Range Normalizaton for Robust Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Subspace-based Speaker-independent Vowel Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Warped discrete cosine transform cepstrum: A new feature for speech processing.
Proceedings of the 13th European Signal Processing Conference, 2005

2004
ICASSP 2004 in Montreal.
IEEE Signal Process. Mag., 2004

Noise adaptation for robust AURORA 2 noisy digit recognition using statistical data mapping.
Proceedings of the INTERSPEECH 2004, 2004

Robust ASR model adaptation by feature-based statistical data mapping.
Proceedings of the INTERSPEECH 2004, 2004

The use of typical sequences for robust speaker identification.
Proceedings of the INTERSPEECH 2004, 2004

Robust automatic speech recognition using an optimal spectral amplitude estimator algorithm in low-SNR car environments.
Proceedings of the INTERSPEECH 2004, 2004

Robustness of speech recognition using genetic algorithms and a Mel-cepstral subspace approach.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Automatic recognition of Bluetooth speech in 802.11 interference and the effectiveness of insertion-based compensation techniques.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Loss recovery through spectral interpolation for robust speech recognition over packet voice communications.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
Interacting with computers by voice: automatic speech recognition and synthesis.
Proc. IEEE, 2003

On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions.
EURASIP J. Adv. Signal Process., 2003

Auditory-based Acoustic Distinctive Features and Spectral Cues for Robust Automatic Speech Recognition in Low-SNR Car Environments.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Improving the efficiency of automatic speech recognition by feature transformation and dimensionality reduction.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for robust automatic speech recognition in low-SNR car environments.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for automatic speech recognition using a multi-stream paradigm.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Noise-robust speech recognition in car environments using genetic algorithms and a mel-cepstral subspace approach.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

On improving the performance of analysis-by-synthesis coding using a multi-magnitude algebraic code-book excitation signal.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Auditory-based acoustic distinctive features and spectral cues for automatic speech recognition using a multi-stream paradigm.
Proceedings of the IEEE International Conference on Acoustics, 2002

A hybrid HMM/autoregressive Time-Delay Neural Network Automatic Speech Recognition system.
Proceedings of the 11th European Signal Processing Conference, 2002

2001
Combining pitch and MFCC for speaker identification systems.
Proceedings of the 2001: A Speaker Odyssey, 2001

Hybrid architectures for complex phonetic features classification: a unified approach.
Proceedings of the Sixth International Symposium on Signal Processing and its Applications, 2001

Robust automatic speech recognition in low-SNR car environments by the application of a connectionist subspace-based approach to the melbased cepstral coefficients.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Towards combining pitch and MFCC for speaker recognition systems.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Microphone array within a handset or face mask for speech enhancement.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Recognition of digit strings in noisy speech with limited resources.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Speech recognition using error spotting.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Practical language modeling: an interpolating method.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Detection of filled pauses in spontaneous conversational speech.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Towards a large-vocabulary French vocal dictation based on a size-independent language-model search using the INRS recognizer.
Proceedings of the IEEE International Conference on Acoustics, 2000

Speech signal recovery in white noise using an adaptive Kalman filter.
Proceedings of the 10th European Signal Processing Conference, 2000

Speech communications - human and machine, 2nd Edition.
IEEE, ISBN: 978-0-7803-3449-6, 2000

1999
Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition.
IEEE Trans. Speech Audio Process., 1999

Combating nonlinear telephone channel-noise using the multiband AM-FM model.
Proceedings of the IEEE-EURASIP Workshop on Nonlinear Signal and Image Processing (NSIP'99), 1999

Toward parametric representation of speech for speaker recognition systems.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Towards recognizing "non-lexical" words in spontaneous conversational speech.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Error spotting using syllabic fillers in spontaneous conversational speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

On the use of some divergence measures in speaker recognition.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Objective evaluation of grapheme to phoneme conversion for text-to-speech synthesis in French.
Comput. Speech Lang., 1998

Evaluation of grapheme-to phoneme conversion for text-to-speech synthesis in French.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Robust automatic continuous-speech recognition based on a voiced-unvoiced decision.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

On the application of the AM-FM model for the recovery of missing frequency bands of telephone speech.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Comparative experiments to evaluate a voiced-unvoiced-based pre-processing approach to robust automatic speech recognition in low-SNR environments.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Powerful syllabic fillers for general-task keyword-spotting and unlimited-vocabulary continuous-speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A new method to achieve fast acoustic matching for speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Robust automatic speech recognition by the application of a temporal-correlation-based recurrent multilayer neural network to the mel-based cepstral coefficients.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Automatic speech recognition based on cepstral coefficients and a mel-based discrete energy operator.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Specific language modelling for new-word detection in continuous-speech recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
A double Gaussian mixture modeling approach to speaker recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speech enhancement via energy separation.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Hybrid networks based on RBFN and GMM for speaker recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Clustering beyond phoneme contexts for speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Time domain technique for pitch modification and robust voice transformation.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Accurate keyword spotting using strictly lexical fillers.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male/female classification.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

New efficient fillers for unlimited word recognition and keyword spotting.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

New developments in the INRS continuous speech recognition system.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Compensated mel frequency cepstrum coefficients.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Clustering words for statistical language models based on contextual word similarity.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Using a transcription graph for large vocabulary continuous speech recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Lexical fillers for task-independent-training based keyword spotting and detection of new words.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Hybrid hidden Markov models in speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

A shared-distribution approach in a hidden Markov model-based continuous speech recognition system.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Segmental duration and HMM modeling.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Timing patterns in fluent and disfluent spontaneous speech.
Proceedings of the 1995 International Conference on Acoustics, 1995

Searching with a transcription graph.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Statistical recovery of wideband speech from narrowband speech.
IEEE Trans. Speech Audio Process., 1994

The masking of narrowband noise by broadband harmonic complex sounds and implications for the processing of speech sounds.
Speech Commun., 1994

Experiments in continuous speech recognition using books on tape.
Speech Commun., 1994

Books on tape as training data for continuous speech recognition.
Speech Commun., 1994

Correcting complex false starts in spontaneous speech.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

New graph search techniques for speech recognition.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
A*-admissible heuristics for rapid lexical access.
IEEE Trans. Speech Audio Process., 1993

On 450-600 b/s natural sounding speech coding.
IEEE Trans. Speech Audio Process., 1993

Frequency domain adaptive postfiltering for enhancement of noisy speech.
Speech Commun., 1993

Development of the INRS ATIS system.
Proceedings of the 1st International Workshop on Intelligent User Interfaces, 1993

Issues in large scale statistical language modeling.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

The inks ATIS system and its n-best interface.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Locating disfluencies in spontaneous speech: an acoustical analysis.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A psychophysical study of fourier phase and amplitude coding of speech.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A very fast method for scoring phonetic transcriptions.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Prosody and continuous speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Analysis and automatic recognition of false starts in spontaneous speech.
Proceedings of the IEEE International Conference on Acoustics, 1993

A new fast match for very large vocabulary continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
An A* algorithm for very large vocabulary continuous speech recognition.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Analysis of false starts in spontaneous speech.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Experiments in continuous speech recognition with a 60, 000 word vocabulary.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Speech enhancement using a statistically derived filter mapping.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

HMM training on unconstrained speech for large vocabulary, continuous speech recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Recognition of hesitations in spontaneous speech.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Hybrid segmental-LVQ/HMM for large vocabulary speech recognition.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Speech enhancement based conceptually on auditory evidence.
IEEE Trans. Signal Process., 1991

Short-term temporal decomposition and its properties for speech compression.
IEEE Trans. Signal Process., 1991

A Textual processor to handle ATIS queries.
Proceedings of the Speech and Natural Language, 1991

Energy, duration and Markov models.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Using phoneme duration and energy contour information to improve large vocabulary isolated-word recognition.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Spectral transitions in rule-based and diphone synthesis.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990

An 86, 000-Word Recognizer Based on Phonemic Models.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

A 450 b.p.s. vocoder with natural-sounding speech.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
Automatic and reliable estimation of glottal closure instant and period.
IEEE Trans. Acoust. Speech Signal Process., 1989

Lexical stress detection in isolated English words.
Speech Commun., 1989

Specifying Accent Marks in French Text for Teletext and Speech Synthesis.
Int. J. Man Mach. Stud., 1989

Parsing with a Small Dictionary for Applications such as Text to Speech.
Comput. Linguistics, 1989

Enhancing speech degraded by additive noise or interfering speakers.
IEEE Commun. Mag., 1989

Using syntactic information to improve large-vocabulary word recognition.
Proceedings of the IEEE International Conference on Acoustics, 1989

Parameter sensitivity and robust estimation in an ARX model with glottal excitation.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
Diphone speech synthesis.
Speech Commun., 1988

Speech enhancement using vector quantization and a formant distance measure.
Proceedings of the IEEE International Conference on Acoustics, 1988

1987
Specifying intonation in a text-to-speech system using only a small dictionary.
Proceedings of the IEEE International Conference on Acoustics, 1987

1986
The effects of speaking rate on formant transitions in French synthesis-by-rule.
Proceedings of the IEEE International Conference on Acoustics, 1986

1984
Design of a real-time French text-to-speech system.
Speech Commun., 1984

1983
Automatic speech synthesis.
IEEE Commun. Mag., 1983

1976
Modelling fundamental frequency, and its relationship to syntax, semantics, and phonetics.
PhD thesis, 1976

A comprehensive model for fundamental frequency generation.
Proceedings of the IEEE International Conference on Acoustics, 1976


  Loading...