Jon Barker

Orcid: 0000-0002-1684-5660

According to our database1, Jon Barker authored at least 124 papers between 1997 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge.
CoRR, 2024

Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models.
CoRR, 2024

Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Intelligibility prediction with a pretrained noise-robust automatic speech recognition model.
CoRR, 2023

The Cadenza ICASSP 2024 Grand Challenge.
CoRR, 2023

Overview of the 2023 ICASSP SP Clarity Challenge: Speech Enhancement for Hearing Aids.
Proceedings of the IEEE International Conference on Acoustics, 2023

The 2nd Clarity Enhancement Challenge for Hearing Aid Speech Intelligibility Enhancement: Overview and Outcomes.
Proceedings of the IEEE International Conference on Acoustics, 2023

The First Cadenza Signal Processing Challenge: Improving Music for Those With a Hearing Loss.
Proceedings of the 2nd Workshop on Human-Centric Music Information Retrieval 2023 co-located with the 24th International Society for Music Information Retrieval Conference (ISMIR 2023), 2023

2022
Acoustic Modelling From Raw Source and Filter Components for Dysarthric Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Leveraging Bitstream Metadata for Fast and Accurate Video Compression Correction.
CoRR, 2022

SNuC: The Sheffield Numbers Spoken Language Corpus.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training.
Proceedings of the Interspeech 2022, 2022

Dysarthric Speech Recognition From Raw Waveform with Parametric CNNs.
Proceedings of the Interspeech 2022, 2022

Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction.
Proceedings of the Interspeech 2022, 2022

Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners.
Proceedings of the Interspeech 2022, 2022

Modelling Turn-taking in Multispeaker Parties for Realistic Data Simulation.
Proceedings of the Interspeech 2022, 2022

The 1st Clarity Prediction Challenge: A machine learning challenge for hearing aid intelligibility prediction.
Proceedings of the Interspeech 2022, 2022

Multi-Modal Acoustic-Articulatory Feature Fusion For Dysarthric Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Auditory-Based Data Augmentation for end-to-end Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improved Simulation of Realistically-Spatialised Simultaneous Speech Using Multi-Camera Analysis in The Chime-5 Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Teacher-Student MixIT for Unsupervised and Semi-Supervised Speech Separation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Parental Spoken Scaffolding and Narrative Skills in Crowd-Sourced Storytelling Samples of Young Children.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Optimising Hearing Aid Fittings for Speech in Noise with a Differentiable Hearing Loss Model.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Clarity-2021 Challenges: Machine Learning Challenges for Advancing Hearing Aid Processing.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2021

DHASP: Differentiable Hearing Aid Speech Processing.
Proceedings of the IEEE International Conference on Acoustics, 2021

The use of Voice Source Features for Sung Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings.
CoRR, 2020

Autoencoder Bottleneck Features with Multi-Task Optimisation for Improved Continuous Dysarthric Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Simulating Realistically-Spatialised Simultaneous Speech Using Video-Driven Speaker Detection and the CHiME-5 Dataset.
Proceedings of the Interspeech 2020, 2020

On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Exploring Appropriate Acoustic and Language Modelling Choices for Continuous Dysarthric Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Source Domain Data Selection for Improved Transfer Learning Targeting Dysarthric Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Automatic Lyric Transcription from Karaoke Vocal Tracks: Resources and a Baseline System.
Proceedings of the Interspeech 2019, 2019

Phonetic Analysis of Dysarthric Speech Tempo and Applications to Robust Personalised Dysarthric Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
The impact of the Lombard effect on audio and visual speech recognition systems.
Speech Commun., 2018

SDCNet: Video Prediction Using Spatially-Displaced Convolution.
CoRR, 2018

On the Usefulness of the Speech Phase Spectrum for Pitch Extraction.
Proceedings of the Interspeech 2018, 2018

DNN Driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation.
Proceedings of the Interspeech 2018, 2018

The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines.
Proceedings of the Interspeech 2018, 2018

Exploring the Use of Group Delay for Generalised VTS Based Noise Compensation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

SDC-Net: Video Prediction Using Spatially-Displaced Convolution.
Proceedings of the Computer Vision - ECCV 2018, 2018

Malware Detection by Eating a Whole EXE.
Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Deep Learning of Articulatory-Based Representations and Applications for Improving Dysarthric Speech Recognition.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
The impact of automatic exaggeration of the visual articulatory features of a talker on the intelligibility of spectrally distorted speech.
Speech Commun., 2017

Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition.
Circuits Syst. Signal Process., 2017

An analysis of environment, microphone and data simulation mismatches in robust speech recognition.
Comput. Speech Lang., 2017

The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes.
Comput. Speech Lang., 2017

Multi-microphone speech recognition in everyday environments.
Comput. Speech Lang., 2017

Binary Mask Estimation Strategies for Constrained Imputation-Based Speech Enhancement.
Proceedings of the Interspeech 2017, 2017

Robust Source-Filter Separation of Speech Signal in the Phase Domain.
Proceedings of the Interspeech 2017, 2017

Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR.
Proceedings of the Interspeech 2017, 2017

Statistical normalisation of phase-based feature representation for robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multichannel Spatial Clustering Using Model-Based Source Separation.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

The CHiME Challenges: Robust Speech Recognition in Everyday Environments.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Misperceptions Arising from Speech-in-Babble Interactions.
Proceedings of the Interspeech 2016, 2016

Multichannel Spatial Clustering for Robust Far-Field Automatic Speech Recognition in Mismatched Conditions.
Proceedings of the Interspeech 2016, 2016

Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Language Effects in Noise-Induced Word Misperceptions.
Proceedings of the Interspeech 2016, 2016

A Data Driven Approach to Audiovisual Speech Mapping.
Proceedings of the Advances in Brain Inspired Cognitive Systems, 2016

2015
Chime-home: A dataset for sound source recognition in a domestic environment.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition.
Proceedings of the Statistical Language and Speech Processing, 2015

A framework for the evaluation of microscopic intelligibility models.
Proceedings of the INTERSPEECH 2015, 2015

Source-filter separation of speech signal in the phase domain.
Proceedings of the INTERSPEECH 2015, 2015

The effect of cochlear implant processing on speaker intelligibility: a perceptual study and computer model.
Proceedings of the INTERSPEECH 2015, 2015

On the role of discriminative intelligibility model for speech intelligibility enhancement.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

A comparison of audiovisual and auditory-only training on the perception of spectrally-distorted speech.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Investigating the impact of artificial enhancement of lip visibility on the intelligibility of spectrally-distorted speech.
Proceedings of the Auditory-Visual Speech Processing, 2015

Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Speech pre-enhancement using a discriminative microscopic intelligibility model.
Proceedings of the INTERSPEECH 2014, 2014

2013
MMSE-Based Missing-Feature Reconstruction With Temporal Modeling for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2013

Speech Spectral Envelope Enhancement by HMM-Based Analysis/Resynthesis.
IEEE Signal Process. Lett., 2013

A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources.
Comput. Speech Lang., 2013

The PASCAL CHiME speech separation and recognition challenge.
Comput. Speech Lang., 2013

Special issue on speech separation and recognition in multisource environments.
Comput. Speech Lang., 2013

The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines.
Proceedings of the IEEE International Conference on Acoustics, 2013

The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Combining Speech Fragment Decoding and Adaptive Noise Floor Modeling.
IEEE Trans. Speech Audio Process., 2012

Indication of slowly moving ground targets in non-Gaussian clutter using multi-channel synthetic aperture radar.
IET Signal Process., 2012

Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition.
Proceedings of the INTERSPEECH 2012, 2012

Combining missing-data reconstruction and uncertainty decoding for robust speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Missing-Data Techniques: Recognition with Incomplete Spectrograms.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011
Binaural Cues for Fragment-Based Speech Recognition in Reverberant Multisource Environments.
Proceedings of the INTERSPEECH 2011, 2011

Crowdsourcing for Word Recognition in Noise.
Proceedings of the INTERSPEECH 2011, 2011

A pitch based noise estimation technique for robust speech recognition with Missing Data.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Speech fragment decoding techniques for simultaneous speaker identification and speech recognition.
Comput. Speech Lang., 2010

Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2010

The CHiME corpus: a resource and a challenge for computational hearing in multisource environments.
Proceedings of the INTERSPEECH 2010, 2010

Speaker turn tracking with mobile microphones: Combining location and pitch information.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Energetic and Informational Masking Effects in an Audiovisual Speech Recognition System.
IEEE Trans. Speech Audio Process., 2009

Using location cues to track speaker changes from mobile, binaural microphones.
Proceedings of the INTERSPEECH 2009, 2009

A speech fragment approach to localising multiple speakers in reverberant environments.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Stream weight estimation for multistream audio-visual speech recognition in a multispeaker environment.
Speech Commun., 2008

The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements.
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

2007
Exploiting correlogram structure for robust speech recognition with multiple speech sources.
Speech Commun., 2007

An automatic speech recognition system based on the scene analysis account of auditory perception.
Speech Commun., 2007

Modelling speaker intelligibility in noise.
Speech Commun., 2007

Applying word duration constraints by using unrolled HMMs.
Proceedings of the INTERSPEECH 2007, 2007

Integrating pitch and localisation cues at a speech fragment level.
Proceedings of the INTERSPEECH 2007, 2007

Audio-visual speech fragment decoding.
Proceedings of the Auditory-Visual Speech Processing 2007, 2007

2006
Mask estimation for missing data speech recognition based on statistics of binaural interaction.
IEEE Trans. Speech Audio Process., 2006

Audio-visual speech recognition in the presence of a competing speaker.
Proceedings of the INTERSPEECH 2006, 2006

A multipitch tracker for monaural speech segmentation.
Proceedings of the INTERSPEECH 2006, 2006

Recent advances in speech fragment decoding techniques.
Proceedings of the INTERSPEECH 2006, 2006

Recognition of Reverberant Speech using Full Cepstral Features and Spectral Missing Data.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Speech Separation Based on The Statistics of Binaural Auditory Features.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Decoding speech in the presence of other sources.
Speech Commun., 2005

Binaural feature selection for missing data speech recognition.
Proceedings of the INTERSPEECH 2005, 2005

Soft harmonic masks for recognising speech in the presence of a competing speaker.
Proceedings of the INTERSPEECH 2005, 2005

Mask Estimation Based on Sound Localisation for Missing Data Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Recognising Speech in the Presence of a Competing Speaker using a 'Speech Fragment Decoder'.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Tracking Facial Markers with an Adaptive Marker Collocation Model.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Techniques for handling convolutional distortion with 'missing data' automatic speech recognition.
Speech Commun., 2004

2002
Missing data speech recognition in reverberant conditions.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Handling Missing and Unreliable Information in Speech Recognition.
Proceedings of the Eighth International Workshop on Artificial Intelligence and Statistics, 2001

2000
Soft decisions in missing data techniques for robust automatic speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Decoding speech in the presence of other sound sources.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Is the sine-wave speech cocktail party worth attending?
Speech Commun., 1999

Estimation of speech acoustics from visual speech features: A comparison of linear and non-linear models.
Proceedings of the Auditory-Visual Speech Processing, 1999

1998
Acoustic confidence measures for segmenting broadcast news.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Is Primitive AV Coherence An Aid To Segment The Scene?
Proceedings of the Auditory-Visual Speech Processing, 1998

1997
Modelling the recognition of spectrally reduced speech.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997


  Loading...