Philip C. Woodland

According to our database1, Philip C. Woodland authored at least 229 papers between 1990 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Awards

IEEE Fellow

IEEE Fellow 2013, "For contributions to large vocabulary speech recognition".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring.
Speech Commun., February, 2023

Minimising Biasing Word Errors for Contextual ASR With the Tree-Constrained Pointer Generator.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Estimating the Uncertainty in Emotion Class Labels With Utterance-Specific Dirichlet Priors.
IEEE Trans. Affect. Comput., 2023

FastInject: Injecting Unpaired Text Data into CTC-based ASR training.
CoRR, 2023

Speech-based Slot Filling using Large Language Models.
CoRR, 2023

Conditional Diffusion Model for Target Speaker Extraction.
CoRR, 2023

It HAS to be Subjective: Human Annotator Simulation via Zero-shot Density Estimation.
CoRR, 2023

Decoupled Structure for Improved Adaptability of End-to-End Models.
CoRR, 2023

Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations.
CoRR, 2023

Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data.
CoRR, 2023

Can Contextual Biasing Remain Effective with Whisper and GPT-2?
CoRR, 2023

Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator.
CoRR, 2023

Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition.
CoRR, 2023

Self-Supervised Representations in Speech-Based Depression Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

End-to-End Spoken Language Understanding with Tree-Constrained Pointer Generator.
Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Supervised Learning-Based Source Separation for Meeting Data.
Proceedings of the IEEE International Conference on Acoustics, 2023

Spectral Clustering-Aware Learning of Embeddings for Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Adaptable End-to-End ASR Models Using Replaceable Internal LMs and Residual Softmax.
Proceedings of the IEEE International Conference on Acoustics, 2023

Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
On the similarities of representations in artificial and brain neural networks for speech recognition.
Frontiers Comput. Neurosci., 2022

Biased Self-supervised learning for ASR.
CoRR, 2022

Distribution-Based Emotion Recognition in Conversation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription.
Proceedings of the Interspeech 2022, 2022

Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Knowledge Distillation for Neural Transducers from Large Self-Supervised Pre-Trained Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Combination of deep speaker embeddings for diarisation.
Neural Networks, 2021

A distributed optimisation framework combining natural gradient with Hessian-free for discriminative sequence training.
Neural Networks, 2021

Discriminative Neural Clustering for Speaker Diarisation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Residual Energy-Based Models for End-to-End Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Emotion Recognition by Fusing Time Synchronous and Time Asynchronous Representations.
Proceedings of the IEEE International Conference on Acoustics, 2021

Content-Aware Speaker Embeddings for Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Transformer Language Models with LSTM-Based Cross-Utterance Information Representation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Tree-Constrained Pointer Generator for End-to-End Contextual Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Cross-Utterance Language Models with Acoustic Error Sampling.
CoRR, 2020

Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings.
Proceedings of the Interspeech 2020, 2020

Improved Large-Margin Softmax Loss for Speaker Diarisation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Multi-Span Acoustic Modelling Using Raw Waveform Signals.
Proceedings of the Interspeech 2019, 2019

PyHTK: Python Library and ASR Pipelines for HTK.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speaker Diarisation Using 2D Self-attentive Combination of Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2019

Integrating Source-Channel and Attention-Based Sequence-to-Sequence Models for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Semi-tied Units for Efficient Gating in LSTM and Highway Networks.
Proceedings of the Interspeech 2018, 2018

Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems.
Proceedings of the Interspeech 2018, 2018

Combining Natural Gradient with Hessian Free Methods for Sequence Training.
Proceedings of the Interspeech 2018, 2018

High Order Recurrent Neural Networks for Acoustic Modelling.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improved Tdnns Using Deep Kernels and Frequency Dependent Grid-RNNS.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem.
PLoS Comput. Biol., 2017

Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Sequence training of DNN acoustic models with natural gradient.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Very deep convolutional neural networks for robust speech recognition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems.
Proceedings of the Interspeech 2016, 2016

DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

System combination with log-linear models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improved DNN-based segmentation for multi-genre broadcast audio.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

CUED-RNNLM - An open-source toolkit for efficient training and evaluation of recurrent neural network language models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
A general artificial neural network extension for HTK.
Proceedings of the INTERSPEECH 2015, 2015

Parameterised sigmoid and reLU hidden activation functions for DNN acoustic modelling.
Proceedings of the INTERSPEECH 2015, 2015

Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages.
Proceedings of the INTERSPEECH 2015, 2015

The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation.
Proceedings of the INTERSPEECH 2015, 2015

I-vector estimation using informative priors for adaptation of deep neural networks.
Proceedings of the INTERSPEECH 2015, 2015

Recurrent neural network language model adaptation for multi-genre broadcast speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

Paraphrastic recurrent neural network language models.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Recurrent neural network language model training with noise contrastive estimation for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improving the training and evaluation efficiency of recurrent neural network language models.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Cambridge university transcription systems for the multi-genre broadcast challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The development of the cambridge university alignment systems for the multi-genre broadcast challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Speaker diarisation and longitudinal linking in multi-genre broadcast data.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Multilingual representations for low resource speech recognition and keyword search.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Investigation of back-off based interpolation between recurrent neural network and n-gram language models.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The MGB challenge: Evaluating multi-genre broadcast media recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Paraphrastic language models.
Comput. Speech Lang., 2014

Adaptation of deep neural network acoustic models using factorised i-vectors.
Proceedings of the INTERSPEECH 2014, 2014

Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch.
Proceedings of the INTERSPEECH 2014, 2014

Standalone training of context-dependent deep neural network acoustic models.
Proceedings of the IEEE International Conference on Acoustics, 2014

Direct sub-word confidence estimation with hidden-state conditional random fields.
Proceedings of the IEEE International Conference on Acoustics, 2014

Detecting deletions in ASR output.
Proceedings of the IEEE International Conference on Acoustics, 2014

Efficient lattice rescoring using recurrent neural network language models.
Proceedings of the IEEE International Conference on Acoustics, 2014

Paraphrastic neural network language models.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Language model cross adaptation for LVCSR system combination.
Comput. Speech Lang., 2013

Use of contexts in language model interpolation and adaptation.
Comput. Speech Lang., 2013

Improving lightly supervised training for broadcast transcription.
Proceedings of the INTERSPEECH 2013, 2013

Cross-domain paraphrasing for improving language modelling using out-of-domain data.
Proceedings of the INTERSPEECH 2013, 2013

Automatic Transcription of Multi-genre Media Archives.
Proceedings of the First Workshop on Speech, 2013

A confidence-based approach for improving keyword hypothesis scores.
Proceedings of the IEEE International Conference on Acoustics, 2013

System combination and score normalization for spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

Paraphrastic language models and combination with neural network language models.
Proceedings of the IEEE International Conference on Acoustics, 2013

A high-performance Cantonese keyword search system.
Proceedings of the IEEE International Conference on Acoustics, 2013

Investigation of multilingual deep neural networks for spoken term detection.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Morphological decomposition in Arabic ASR systems.
Comput. Speech Lang., 2012

Transcription of multi-genre media archives using out-of-domain data.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Using Sub-word-level Information for Confidence Estimation with Conditional Random Field Models.
Proceedings of the INTERSPEECH 2012, 2012

Complementary Phone Error Training.
Proceedings of the INTERSPEECH 2012, 2012

2011
The efficient incorporation of MLP features into automatic speech recognition systems.
Comput. Speech Lang., 2011

Combining Information Sources for Confidence Estimation with CRF Models.
Proceedings of the INTERSPEECH 2011, 2011

Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation.
Proceedings of the INTERSPEECH 2011, 2011

Graphone Model Interpolation and Arabic Pronunciation Generation.
Proceedings of the INTERSPEECH 2011, 2011

Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems.
Proceedings of the INTERSPEECH 2011, 2011

Investigation of acoustic units for LVCSR systems.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Unsupervised training and directed manual transcription for LVCSR.
Speech Commun., 2010

Improved neural network based language modelling and adaptation.
Proceedings of the INTERSPEECH 2010, 2010

Recent improvements to the Cambridge Arabic Speech-to-Text systems.
Proceedings of the IEEE International Conference on Acoustics, 2010

Language model combination and adaptation usingweighted finite state transducers.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Unsupervised Adaptation With Discriminative Mapping Transforms.
IEEE Trans. Speech Audio Process., 2009

Efficient generation and use of MLP features for Arabic speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

Exploiting Chinese character models to improve speech recognition performance.
Proceedings of the INTERSPEECH 2009, 2009

Morphological analysis and decomposition for Arabic speech-to-text systems.
Proceedings of the INTERSPEECH 2009, 2009

Training and adapting MLP features for Arabic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
MPE-based discriminative linear transforms for speaker adaptation.
Comput. Speech Lang., 2008

Context dependent language model adaptation.
Proceedings of the INTERSPEECH 2008, 2008

Unsupervised discriminative adaptation using discriminative mapping transforms.
Proceedings of the IEEE International Conference on Acoustics, 2008

Phonetic pronunciations for arabic speech-to-text systems.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio.
Proceedings of the INTERSPEECH 2007, 2007

Unsupervised Training for Mandarin Broadcast News and Conversation Transcription.
Proceedings of the IEEE International Conference on Acoustics, 2007

Improving Speech Transcription for Mandarin-English Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Consensus Network Decoding for Statistical Machine Translation System Combination.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speech Recognition System Combination for Machine Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Discriminative language model adaptation for Mandarin broadcast speech transcription and translation.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Development of a phonetic system for large vocabulary Arabic speech recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Corrections to "Automatic Transcription of Conversational Telephone Speech".
IEEE Trans. Speech Audio Process., 2006

Progress in the CU-HTK broadcast news transcription system.
IEEE Trans. Speech Audio Process., 2006

Unsupervised language model adaptation for Mandarin broadcast conversation transcription.
Proceedings of the INTERSPEECH 2006, 2006

Discriminatively Trained Gaussian Mixture Models for Sentence Boundary Detection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

The Cu-Htk Mandarin Broadcast News Transcription System.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Automatic transcription of conversational telephone speech.
IEEE Trans. Speech Audio Process., 2005

The Cambridge University March 2005 speaker diarisation system.
Proceedings of the INTERSPEECH 2005, 2005

Structural metadata research in the EARS program.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Development of the CU-HTK 2004 Broadcast News Transcription Systems.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Training LVCSR Systems on Thousands of Hours of Data.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Automatic capitalisation generation for speech input.
Comput. Speech Lang., 2004

A PLSA-based language model for conversational telephone speech.
Proceedings of the INTERSPEECH 2004, 2004

Using VTLN for broadcast news transcription.
Proceedings of the INTERSPEECH 2004, 2004

MPE-based discriminative linear transform for speaker adaptation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Development of the 2003 CU-HTK conversational telephone speech transcription system.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Improving broadcast news transcription by lightly supervised discriminative training.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
A combined punctuation generation and speech recognition system and its performance enhancement using prosody.
Speech Commun., 2003

Erratum: Language modelling for Russian and English using words and classes [Computer Speech and Language 17 (2003) 87-104].
Comput. Speech Lang., 2003

Language modelling for Russian and English using words and classes.
Comput. Speech Lang., 2003

MMI-MAP and MPE-MAP for acoustic model adaptation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Discriminative map for acoustic model adaptation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Automatic complexity control for HLDA systems.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Porting: SwitchBoard to the VoiceMail task.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
The development of the HTK Broadcast News transcription system: An overview.
Speech Commun., 2002

Large scale discriminative training of hidden Markov models for speech recognition.
Comput. Speech Lang., 2002

Cluster identification for speaker-environment tracking.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Maximum mutual information training of hidden Markov models with vector linear predictors.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Minimum Phone Error and I-smoothing for improved discriminative training.
Proceedings of the IEEE International Conference on Acoustics, 2002

Implementation of automatic capitalisation generation systems for speech input.
Proceedings of the IEEE International Conference on Acoustics, 2002

Improved cross-task recognition using MMIE training.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
The Cambridge University Multimedia Document Retrieval Demo System.
Int. J. Speech Technol., 2001

Information Retrieval from Unsegmented Broadcast News Audio.
Int. J. Speech Technol., 2001

The use of prosody in a combined system for punctuation generation and speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Efficient class-based language modelling for very large vocabularies.
Proceedings of the IEEE International Conference on Acoustics, 2001

Improvements in linear transform based speaker adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2001

Improved discriminative training techniques for large vocabulary continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001

New features in the CU-HTK system for transcription of conversational telephone speech.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Spoken document representations for probabilistic retrieval.
Speech Commun., 2000

Spoken Document Retrieval for TREC-9 at Cambridge University.
Proceedings of The Ninth Text REtrieval Conference, 2000

Effects of out of vocabulary words in spoken document retrieval.
Proceedings of the SIGIR 2000: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2000

Audio Indexing and Retrieval of Complete Broadcoast News Shows.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Particle-based language modelling.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A rule-based named entity recognition system for speech input.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Modelling sub-phone insertions and deletions in continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A method for direct audio search with applications to indexing and retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2000

Large vocabulary decoding and confidence estimation using word posterior probabilities.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Variable-length categoryn-gram language models.
Comput. Speech Lang., 1999

A hidden Markov-model-based trainable speech synthesizer.
Comput. Speech Lang., 1999

Spoken Document Retrieval for TREC-8 at Cambridge University.
Proceedings of The Eighth Text REtrieval Conference, 1999

Improving Retrieval on Imperfect Speech Transcriptions (poster abstract).
Proceedings of the SIGIR '99: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1999

Improvements in accuracy and speed in the HTK broadcast news transcription system.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

An investigation into vocal tract length normalisation.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Dynamic HMM selection for continuous speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Frame discrimination training for HMMs for large vocabulary speech recognition.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

The Cambridge University spoken document retrieval system.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

The 1998 HTK system for transcription of conversational telephone speech.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Spoken Document Retrieval For TREC-7 At Cambridge University.
Proceedings of The Seventh Text REtrieval Conference, 1998

Comparison of language modelling techniques for Russian and English.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Speaker clustering using direct maximisation of the MLLR-adapted likelihood.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Segmentation and classification of broadcast news audio.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Experiments in broadcast news transcription.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Comparison of part-of-speech and automatically derived category-based language models for speech recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

The use of accent-specific pronunciation dictionaries in acoustic model training.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
MMIE training of large vocabulary recognition systems.
Speech Commun., 1997

Multilingual large vocabulary speech recognition: the European SQALE project.
Comput. Speech Lang., 1997

Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models.
Comput. Speech Lang., 1997

Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Broadcast news transcription using HTK.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Experiments in speaker normalisation and adaptation for large vocabulary speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Modelling word-pair relations in a category-based language model.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Mean and variance adaptation within the MLLR framework.
Comput. Speech Lang., 1996

Iterative unsupervised adaptation using maximum likelihood linear regression.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Discriminative optimisation of large vocabulary recognition systems.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Combination of word-based and category-based language models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Using accent-specific pronunciation modelling for robust speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Improving environmental robustness in large vocabulary speech recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Lattice-based discriminative training for large vocabulary speech recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

A variable-length category-based n-gram language model.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models.
Comput. Speech Lang., 1995

Large vocabulary multilingual speech recognition using HTK.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Flexible speaker adaptation for large vocabulary speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Improvements in an HMM-based speech synthesiser.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

The 1994 HTK large vocabulary speech recognition system.
Proceedings of the 1995 International Conference on Acoustics, 1995

Automatic speech synthesiser parameter estimation using HMMs.
Proceedings of the 1995 International Conference on Acoustics, 1995

Rapid speaker adaptation using model prediction.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Spontaneous speech recognition for the credit card corpus using the HTK toolkit.
IEEE Trans. Speech Audio Process., 1994

State clustering in hidden Markov model-based continuous speech recognition.
Comput. Speech Lang., 1994

Tree-Based State Tying for High Accuracy Modelling.
Proceedings of the Human Language Technology, 1994

A One Pass Decoder Design For Large Vocabulary Recognition.
Proceedings of the Human Language Technology, 1994

Recognition ********* a dynamic network decoder design for large vocabulary speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Speaker adaptation of continuous density HMMs using multivariate linear regression.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Modelling syllable characteristics to improve a large vocabulary continuous speech recogniser.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Large vocabulary continuous speech recognition using HTK.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
The use of state tying in continuous speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

The HTK tied-state continuous speech recogniser.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Hidden Markov models using shared vector linear predictors.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Using relative duration in large vocabulary speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Exploiting variable-width features in large vocabulary speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993

A wave digital filter model of the entire auditory periphery.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Hidden Markov models using vector linear prediction and discriminative output distributions.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Optimising hidden Markov models using discriminative output distributions.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
An experimental comparison of connectionist and conventional classification systems on natural data.
Speech Commun., 1990


  Loading...