Stephen J. Cox

John S. Phillips

Paul W. Stackhouse Junior

IEEE J. Biomed. Health Informatics, 2021

A predictor analysis framework for surface radiation budget reprocessing using satellite data.

Patricia A. Quigley

Resit Unal

Int. J. Crit. Infrastructures, 2021

Detecting positional vertigo using an ensemble of 2D convolutional neural networks.

John S. Phillips

Biomed. Signal Process. Control., 2021

2019

Automatic nystagmus detection and quantification in long-term continuous eye-movement data.

Comput. Biol. Medicine, 2019

2016

Visual units and confusion modelling for automatic lip-reading.

Dominic Howell

Barry-John Theobald

Image Vis. Comput., 2016

Improved speaker independent lip reading using speaker adaptive training and deep neural networks.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Tennis Ball Tracking Using a Two-Layered Data Association Approach.

IEEE Trans. Multim., 2015

Improving lip-reading performance for robust audiovisual speech recognition using DNNs.

Proceedings of the Auditory-Visual Speech Processing, 2015

Speaker-independent machine lip-reading with speaker-dependent viseme classifiers.

Helen L. Bear

Richard W. Harvey

Proceedings of the Auditory-Visual Speech Processing, 2015

Detection of anomalous events in a tennis game using multimodal information.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Multimodal joint information processing in human machine interaction: recent advances.

Lei Xie

Zhigang Deng

Multim. Tools Appl., 2014

Automatic annotation of tennis games: An integration of audio, vision, and learning.

Image Vis. Comput., 2014

Unsupervised model selection for recognition of regional accented speech.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

Native accent classification via i-vectors and speaker compensation fusion.

Andrea DeMarco

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A two layered data association approach for ball tracking.

Proceedings of the IEEE International Conference on Acoustics, 2013

Confusion modelling for automated lip-reading using weighted finite-state transducers.

Dominic Howell

Barry-John Theobald

Proceedings of the Auditory-Visual Speech Processing, 2013

2012

Language Identification Using Visual Features.

IEEE Trans. Speech Audio Process., 2012

Iterative classification of regional British accents in i-vector space.

Andrea DeMarco

Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Improved audio event detection by use of contextual noise.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Detection of ball hits in a tennis game using audio and visual information.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Inferring the Structure of a Tennis Game Using Audio Information.

IEEE Trans. Speech Audio Process., 2011

Learning Score Structure from Spoken Language for a Tennis Game.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Iterative Improvement of Speaker Segmentation in a Noisy Environment Using High-Level Knowledge.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An Accurate and Robust Gender Identification Algorithm.

Andrea DeMarco

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Improved detection of ball hit events in a tennis game using multimodal information.

Proceedings of the Auditory-Visual Speech Processing, 2011

2010

Using high-level information to detect key audio events in a tennis game.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker independent visual-only language identification.

Proceedings of the IEEE International Conference on Acoustics, 2010

Hierarchical language modeling for audio events detection in a sports game.

Proceedings of the IEEE International Conference on Acoustics, 2010

Limitations of visual speech recognition.

Barry-John Theobald

Proceedings of the Auditory-Visual Speech Processing, 2010

2009

Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers.

Christopher James Watkins

EURASIP J. Adv. Signal Process., 2009

Example-based speech recognition using formulaic phrases.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

On the estimation and the use of confusion-matrices for improving ASR accuracy.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Automatic visual-only language identification: A preliminary study.

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Application of weighted finite-state transducers to improve recognition accuracy for dysarthric speech.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

The challenge of multispeaker lip-reading.

Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007

Modelling confusion matrices to improve speech recognition accuracy, with an application to dysarthric speech.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Analysis of User Interaction with Service Oriented Chatbot Systems.

Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

2006

Task-independent call-routing.

Speech Commun., 2006

2004

Mixture language models for call routing.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Improving phoneme recognition of telephone quality speech.

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Integrated pitch and MFCC extraction for speech reconstruction and speech recognition applications.

Xu Shao

Ben P. Milner

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Unit selection in concatenative TTS synthesis systems based on mel filter bank amplitudes and phonetic context.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic call-routing without transcriptions.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

The use of confidence measures in vector based call-routing.

Gavin C. Cawley

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Discriminative techniques in call routing.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Extraction of Visual Features for Lipreading.

IEEE Trans. Pattern Anal. Mach. Intell., 2002

1998

Towards speech recognizer assessment using a human reference standard.

Comput. Speech Lang., 1998

Nonlinear scale decomposition based features for visual speech recognition.

Proceedings of the 9th European Signal Processing Conference, 1998

A Comparison of Active Shape Model and Scale Decomposition Based Features for Visual Speech Recognition.

Proceedings of the Computer Vision, 1998

Lipreading Using Shape, Shading and Scale.

Proceedings of the Auditory-Visual Speech Processing, 1998

1997

Evaluating feature set performance using the f-ratio and j-measures.

Simon Nicholson

Ben P. Milner

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Lip reading from scale-space measurements.

Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

Combining noise compensation with visual information in speech recognition.

Iain A. Matthews

J. Andrew Bangham

Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, 1997

1996

Audiovisual speech recognition using multiscale nonlinear image decomposition.

Iain A. Matthews

J. Andrew Bangham

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Confidence measures for the SWITCHBOARD database.

Richard C. Rose

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1990

RecNorm: Simultaneous Normalisation and Classification Applied to Speech Recognition.

John S. Bridle

Proceedings of the Advances in Neural Information Processing Systems 3, 1990

1989

Some statistical issues in the comparison of speech recognition algorithms.

L. Gillick

Proceedings of the IEEE International Conference on Acoustics, 1989

Unsupervised speaker adaptation by probabilistic spectrum fitting.
