Barry-John Theobald

According to our database1, Barry-John Theobald authored at least 64 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
CoRR, 2024

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.
CoRR, 2024

Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It's Complicated.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation.
CoRR, 2023

Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning.
CoRR, 2023

Naturalistic Head Motion Generation from Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

On the Role of LIP Articulation in Visual Speech Perception.
Proceedings of the IEEE International Conference on Acoustics, 2023

Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards.
Proceedings of the Conference on Robot Learning, 2023

2022
Understanding the Robustness of Multi-Exit Models under Common Corruptions.
CoRR, 2022

Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning.
CoRR, 2022

Contrastive Self-Supervised Learning for Skeleton Representations.
CoRR, 2022

Towards a Perceptual Model for Estimating the Quality of Visual Speech.
CoRR, 2022

FedEmbed: Personalized Private Federated Learning.
CoRR, 2022

2021
Multimodal Punctuation Prediction with Contextual Dropout.
Proceedings of the IEEE International Conference on Acoustics, 2021

On The Role of Visual Cues in Audiovisual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement.
CoRR, 2020

Modality Dropout for Improved Performance-driven Talking Faces.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

2019
Mirroring to Build Trust in Digital Assistants.
Proceedings of the Interspeech 2019, 2019

Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models.
Proceedings of the International Conference on Multimodal Interaction, 2019

2018
Learning Sharing Behaviors with Arbitrary Numbers of Agents.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

2017
Some observations on computer lip-reading: moving from the dream to the reality.
CoRR, 2017

2016
Visual units and confusion modelling for automatic lip-reading.
Image Vis. Comput., 2016

Expressive Modulation of Neutral Visual Speech.
IEEE Multim., 2016

2015
A mouth full of words: Visually consistent acoustic redubbing.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

HMM-based visual speech synthesis using dynamic visemes.
Proceedings of the Auditory-Visual Speech Processing, 2015

Improving lip-reading performance for robust audiovisual speech recognition using DNNs.
Proceedings of the Auditory-Visual Speech Processing, 2015

2014
Which Phoneme-to-Viseme Maps Best Improve Visual-Only Computer Lip-Reading?
Proceedings of the Advances in Visual Computing - 10th International Symposium, 2014

Resolution limits on visual speech recognition.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

The effect of speaking rate on audio and visual speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Transforming neutral visual speech into expressive visual speech.
Proceedings of the Auditory-Visual Speech Processing, 2013

Confusion modelling for automated lip-reading usingweighted finite-state transducers.
Proceedings of the Auditory-Visual Speech Processing, 2013

2012
Relating Objective and Subjective Performance Measures for AAM-Based Visual Speech Synthesis.
IEEE Trans. Speech Audio Process., 2012

On the Segmentation and Classification of Hand Radiographs.
Int. J. Neural Syst., 2012

Dynamic Units of Visual Speech.
Proceedings of the 2012 Eurographics/ACM SIGGRAPH Symposium on Computer Animation, 2012

Automated Bone Age Assessment Using Feature Extraction.
Proceedings of the Intelligent Data Engineering and Automated Learning - IDEAL 2012, 2012

View Independent Computer Lip-Reading.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Insights into machine lip reading.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
On the Extraction and Classification of Hand Outlines.
Proceedings of the Intelligent Data Engineering and Automated Learning - IDEAL 2011, 2011

2010
Limitations of visual speech recognition.
Proceedings of the Auditory-Visual Speech Processing, 2010

Improving visual features for lip-reading.
Proceedings of the Auditory-Visual Speech Processing, 2010

In pursuit of visemes.
Proceedings of the Auditory-Visual Speech Processing, 2010

2009
Animating Virtual Speakers or Singers from Audio: Lip-Synching Facial Animation.
EURASIP J. Audio Speech Music. Process., 2009

High-presence, low-bandwidth, apparent 3D video-conferencing with a single camera.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

Robust facial feature tracking using selected multi-resolution linear predictors.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Comparing visual features for lipreading.
Proceedings of the Auditory-Visual Speech Processing, 2009

Comparison of human and machine-based lip-reading.
Proceedings of the Auditory-Visual Speech Processing, 2009

2008
A probabilistic trajectory synthesis system for synthesising visual speech.
Proceedings of the INTERSPEECH 2008, 2008

LIPS2008: visual speech synthesis challenge.
Proceedings of the INTERSPEECH 2008, 2008

Comparing text-driven and speech-driven visual speech synthesisers.
Proceedings of the INTERSPEECH 2008, 2008

On evaluating synthesised visual speech.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

The challenge of multispeaker lip-reading.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007
Real-time expression cloning using appearance models.
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

The painful face: pain expression recognition using active appearance models.
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

A real-time speech-driven talking head using active appearance models.
Proceedings of the Auditory-Visual Speech Processing 2007, 2007

2006
Evaluating Error Functions for Robust Active Appearance Models.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

2004
Near-videorealistic synthetic talking faces: implementation and evaluation.
Speech Commun., 2004

2003
Visual speech synthesis using shape and appearance models.
PhD thesis, 2003

Towards a low bandwidth talking face using appearance models.
Image Vis. Comput., 2003

Near-videorealistic synthetic visual speech using non-rigid appearance models.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2.5D Visual Speech Synthesis Using Appearance Models.
Proceedings of the British Machine Vision Conference, 2003

Evaluation of a talking head based on appearance models.
Proceedings of the AVSP 2003, 2003

2002
Towards video realistic synthetic visual speech.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Visual speech synthesis using statistical models of shape and appearance.
Proceedings of the Auditory-Visual Speech Processing, 2001


  Loading...