Jon Barker

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Modelling Turn-taking in Multispeaker Parties for Realistic Data Simulation.

[BibT_eX]

[DOI]

Jack Deadman

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

The 1st Clarity Prediction Challenge: A machine learning challenge for hearing aid intelligibility prediction.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-Modal Acoustic-Articulatory Feature Fusion For Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Auditory-Based Data Augmentation for end-to-end Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Improved Simulation of Realistically-Spatialised Simultaneous Speech Using Multi-Camera Analysis in The Chime-5 Dataset.

[BibT_eX]

[DOI]

Jack Deadman

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Teacher-Student MixIT for Unsupervised and Semi-Supervised Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Parental Spoken Scaffolding and Narrative Skills in Crowd-Sourced Storytelling Samples of Young Children.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Optimising Hearing Aid Fittings for Speech in Noise with a Differentiable Hearing Loss Model.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Clarity-2021 Challenges: Machine Learning Challenges for Advancing Hearing Aid Processing.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

DHASP: Differentiable Hearing Aid Speech Processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

The use of Voice Source Features for Sung Speech Recognition.

[BibT_eX]

[DOI]

Gerardo Roa Dabike

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings.

[BibT_eX]

[DOI]

CoRR, 2020

Autoencoder Bottleneck Features with Multi-Task Optimisation for Improved Continuous Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

Zhengjun Yue

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Simulating Realistically-Spatialised Simultaneous Speech Using Video-Driven Speaker Detection and the CHiME-5 Dataset.

[BibT_eX]

[DOI]

Jack Deadman

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Exploring Appropriate Acoustic and Language Modelling Choices for Continuous Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Source Domain Data Selection for Improved Transfer Learning Targeting Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Automatic Lyric Transcription from Karaoke Vocal Tracks: Resources and a Baseline System.

[BibT_eX]

[DOI]

Gerardo Roa Dabike

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Phonetic Analysis of Dysarthric Speech Tempo and Applications to Robust Personalised Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

Feifei Xiong

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

The impact of the Lombard effect on audio and visual speech recognition systems.

[BibT_eX]

[DOI]

Speech Commun., 2018

SDCNet: Video Prediction Using Spatially-Displaced Convolution.

[BibT_eX]

[DOI]

CoRR, 2018

On the Usefulness of the Speech Phase Spectrum for Pitch Extraction.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

DNN Driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Exploring the Use of Group Delay for Generalised VTS Based Noise Compensation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

SDC-Net: Video Prediction Using Spatially-Displaced Convolution.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Malware Detection by Eating a Whole EXE.

[BibT_eX]

[DOI]

Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Deep Learning of Articulatory-Based Representations and Applications for Improving Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

Feifei Xiong

Angel Manuel Gomez Garcia

Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017

The impact of automatic exaggeration of the visual articulatory features of a talker on the intelligibility of spectrally distorted speech.

[BibT_eX]

[DOI]

Speech Commun., 2017

Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition.

[BibT_eX]

[DOI]

José A. González

Antonio M. Peinado

Circuits Syst. Signal Process., 2017

An analysis of environment, microphone and data simulation mismatches in robust speech recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

Multi-microphone speech recognition in everyday environments.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

Binary Mask Estimation Strategies for Constrained Imputation-Based Speech Enhancement.

[BibT_eX]

[DOI]

Ricard Marxer

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Robust Source-Filter Separation of Speech Signal in the Phase Domain.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Statistical normalisation of phase-based feature representation for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multichannel Spatial Clustering Using Model-Based Source Separation.

[BibT_eX]

[DOI]

Michael I. Mandel

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

The CHiME Challenges: Robust Speech Recognition in Everyday Environments.

[BibT_eX]

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016

Misperceptions Arising from Speech-in-Babble Interactions.

[BibT_eX]

[DOI]

Máté Attila Tóth

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Multichannel Spatial Clustering for Robust Far-Field Automatic Speech Recognition in Mismatched Conditions.

[BibT_eX]

[DOI]

Michael I. Mandel

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition.

[BibT_eX]

[DOI]

María Luisa García Lecumberri

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Language Effects in Noise-Induced Word Misperceptions.

[BibT_eX]

[DOI]

Ricard Marxer

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A Data Driven Approach to Audiovisual Speech Mapping.

[BibT_eX]

[DOI]

Proceedings of the Advances in Brain Inspired Cognitive Systems, 2016

2015

Chime-home: A dataset for sound source recognition in a domestic environment.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2015

A framework for the evaluation of microscopic intelligibility models.

[BibT_eX]

[DOI]

Ricard Marxer

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Source-filter separation of speech signal in the phase domain.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The effect of cochlear implant processing on speaker intelligibility: a perceptual study and computer model.

[BibT_eX]

[DOI]

Lin Lin

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

On the role of discriminative intelligibility model for speech intelligibility enhancement.

[BibT_eX]

[DOI]

Maryam Al Dabel

Proceedings of the 18th International Congress of Phonetic Sciences, 2015

A comparison of audiovisual and auditory-only training on the perception of spectrally-distorted speech.

[BibT_eX]

[DOI]

Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Investigating the impact of artificial enhancement of lip visibility on the intelligibility of spectrally-distorted speech.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2015

Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Speech pre-enhancement using a discriminative microscopic intelligibility model.

[BibT_eX]

[DOI]

Maryam Al Dabel

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

MMSE-Based Missing-Feature Reconstruction With Temporal Modeling for Robust Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

Speech Spectral Envelope Enhancement by HMM-Based Analysis/Resynthesis.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2013

A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2013

The PASCAL CHiME speech separation and recognition challenge.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2013

Special issue on speech separation and recognition in multisource environments.

[BibT_eX]

[DOI]

Emmanuel Vincent

Comput. Speech Lang., 2013

The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Combining Speech Fragment Decoding and Adaptive Noise Floor Modeling.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

Indication of slowly moving ground targets in non-Gaussian clutter using multi-channel synthetic aperture radar.

[BibT_eX]

[DOI]

Brian Barber

IET Signal Process., 2012

Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition.

[BibT_eX]

[DOI]

José Andrés González López

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Combining missing-data reconstruction and uncertainty decoding for robust speech recognition.

[BibT_eX]

[DOI]

Antonio Miguel Peinado Herreros

Angel Manuel Gomez Garcia

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Missing-Data Techniques: Recognition with Incomplete Spectrograms.

[BibT_eX]

[DOI]

Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011

Binaural Cues for Fragment-Based Speech Recognition in Reverberant Multisource Environments.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Crowdsourcing for Word Recognition in Noise.

[BibT_eX]

[DOI]

María Luisa García Lecumberri

Krzysztof Wasilewski

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A pitch based noise estimation technique for robust speech recognition with Missing Data.

[BibT_eX]

[DOI]

Juan Andres Morales-Cordovilla

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Speech fragment decoding techniques for simultaneous speaker identification and speech recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2010

Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding.

[BibT_eX]

[DOI]

Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2010

The CHiME corpus: a resource and a challenge for computational hearing in multisource environments.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker turn tracking with mobile microphones: Combining location and pitch information.

[BibT_eX]

[DOI]

Proceedings of the 18th European Signal Processing Conference, 2010

2009

Energetic and Informational Masking Effects in an Audiovisual Speech Recognition System.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Using location cues to track speaker changes from mobile, binaural microphones.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A speech fragment approach to localising multiple speakers in reverberant environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Stream weight estimation for multistream audio-visual speech recognition in a multispeaker environment.

[BibT_eX]

[DOI]

Speech Commun., 2008

The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

2007

Exploiting correlogram structure for robust speech recognition with multiple speech sources.

[BibT_eX]

[DOI]

Speech Commun., 2007

An automatic speech recognition system based on the scene analysis account of auditory perception.

[BibT_eX]

[DOI]

Speech Commun., 2007

Modelling speaker intelligibility in noise.

[BibT_eX]

[DOI]

Speech Commun., 2007

Applying word duration constraints by using unrolled HMMs.

[BibT_eX]

[DOI]

Phil D. Green

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Integrating pitch and localisation cues at a speech fragment level.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Audio-visual speech fragment decoding.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing 2007, 2007

2006

Mask estimation for missing data speech recognition based on statistics of binaural interaction.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2006

Audio-visual speech recognition in the presence of a competing speaker.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

A multipitch tracker for monaural speech segmentation.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Recent advances in speech fragment decoding techniques.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Recognition of Reverberant Speech using Full Cepstral Features and Spectral Missing Data.

[BibT_eX]

[DOI]

Kalle J. Palomäki

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Speech Separation Based on The Statistics of Binaural Auditory Features.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Decoding speech in the presence of other sources.

[BibT_eX]

[DOI]

Martin P. Cooke

Daniel P. W. Ellis

Speech Commun., 2005

Binaural feature selection for missing data speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Soft harmonic masks for recognising speech in the presence of a competing speaker.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Mask Estimation Based on Sound Localisation for Missing Data Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Recognising Speech in the Presence of a Competing Speaker using a 'Speech Fragment Decoder'.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Tracking Facial Markers with an Adaptive Marker Collocation Model.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Techniques for handling convolutional distortion with 'missing data' automatic speech recognition.

[BibT_eX]

[DOI]

Kalle J. Palomäki

Speech Commun., 2004

2002

Missing data speech recognition in reverberant conditions.

[BibT_eX]

[DOI]

Kalle J. Palomäki

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise.

[BibT_eX]

[DOI]

Phil D. Green

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Handling Missing and Unreliable Information in Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Workshop on Artificial Intelligence and Statistics, 2001

2000

Soft decisions in missing data techniques for robust automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Decoding speech in the presence of other sound sources.

[BibT_eX]

[DOI]

Daniel P. W. Ellis

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999

Is the sine-wave speech cocktail party worth attending?

[BibT_eX]

[DOI]

Speech Commun., 1999

Estimation of speech acoustics from visual speech features: A comparison of linear and non-linear models.

[BibT_eX]

[DOI]

Frédéric Berthommier

Proceedings of the Auditory-Visual Speech Processing, 1999

1998

Acoustic confidence measures for segmenting broadcast news.

[BibT_eX]

[DOI]

Gethin Williams

Steve Renals

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Is Primitive AV Coherence An Aid To Segment The Scene?

[BibT_eX]

[DOI]

Frédéric Berthommier

Jean-Luc Schwartz

Proceedings of the Auditory-Visual Speech Processing, 1998

1997

Modelling the recognition of spectrally reduced speech.

[BibT_eX]

[DOI]