Barry-John Theobald

Luca Zappella

CoRR, December, 2025

Investigating Intersectional Bias in Large Language Models using Confidence Disparities in Coreference Resolution.

[BibT_eX]

[DOI]

CoRR, August, 2025

Fairness Dynamics During Training.

[BibT_eX]

[DOI]

CoRR, June, 2025

Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs.

[BibT_eX]

[DOI]

CoRR, May, 2025

Analyze the Neurons, not the Embeddings: Understanding When and Where LLM Representations Align with Humans.

[BibT_eX]

[DOI]

CoRR, February, 2025

Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Aligning LLMs by Predicting Preferences from User Writing Samples.

[BibT_eX]

[DOI]

Stéphane Aroca-Ouellette

Natalie Mackraz

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models.

[BibT_eX]

[DOI]

Li-Wei Chen

Takuya Higuchi

He Bai

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Towards Automatic Assessment of Self-Supervised Speech Models using Rank.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels.

[BibT_eX]

[DOI]

Tatiana Likhomanenko

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Bias after Prompting: Persistent Discrimination in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Evaluating Gender Bias Transfer between Pre-trained and Prompt-Adapted Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

PREDICT: Preference Reasoning by Evaluating Decomposed preferences Inferred from Candidate Trajectories.

[BibT_eX]

[DOI]

Stephane Aroca-Ouellette

Natalie Mackraz

CoRR, 2024

Learning Spatially-Aware Language and Audio Embedding.

[BibT_eX]

[DOI]

CoRR, 2024

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.

[BibT_eX]

[DOI]

CoRR, 2024

REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Learning Spatially-Aware Language and Audio Embeddings.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It's Complicated.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Naturalistic Head Motion Generation from Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

On the Role of LIP Articulation in Visual Speech Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2023

2022

Understanding the Robustness of Multi-Exit Models under Common Corruptions.

[BibT_eX]

[DOI]

CoRR, 2022

Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning.

[BibT_eX]

[DOI]

Miguel Sarabia

CoRR, 2022

Contrastive Self-Supervised Learning for Skeleton Representations.

[BibT_eX]

[DOI]

CoRR, 2022

Towards a Perceptual Model for Estimating the Quality of Visual Speech.

[BibT_eX]

[DOI]

CoRR, 2022

FedEmbed: Personalized Private Federated Learning.

[BibT_eX]

[DOI]

CoRR, 2022

2021

Multimodal Punctuation Prediction with Contextual Dropout.

[BibT_eX]

[DOI]

Andrew Silva

Proceedings of the IEEE International Conference on Acoustics, 2021

On The Role of Visual Cues in Audiovisual Speech Enhancement.

[BibT_eX]

[DOI]

Zakaria Aldeneh

Anushree Prasanna Kumar

Proceedings of the IEEE International Conference on Acoustics, 2021

MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias.

[BibT_eX]

[DOI]

Nataniel Ruiz

Anurag Ranjan

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement.

[BibT_eX]

[DOI]

Zakaria Aldeneh

Anushree Prasanna Kumar

CoRR, 2020

Modality Dropout for Improved Performance-driven Talking Faces.

[BibT_eX]

[DOI]

Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

2019

Mirroring to Build Trust in Digital Assistants.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Multimodal Interaction, 2019

2018

Learning Sharing Behaviors with Arbitrary Numbers of Agents.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

2017

Some observations on computer lip-reading: moving from the dream to the reality.

[BibT_eX]

[DOI]

CoRR, 2017

2016

Visual units and confusion modelling for automatic lip-reading.

[BibT_eX]

[DOI]

Dominic Howell

Stephen J. Cox

Image Vis. Comput., 2016

Expressive Modulation of Neutral Visual Speech.

[BibT_eX]

[DOI]

Felix Shaw

IEEE Multim., 2016

2015

A mouth full of words: Visually consistent acoustic redubbing.

[BibT_eX]

[DOI]

Sarah L. Taylor

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

HMM-based visual speech synthesis using dynamic visemes.

[BibT_eX]

[DOI]

Ausdang Thangthai

Proceedings of the Auditory-Visual Speech Processing, 2015

Improving lip-reading performance for robust audiovisual speech recognition using DNNs.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2015

2014

Which Phoneme-to-Viseme Maps Best Improve Visual-Only Computer Lip-Reading?

[BibT_eX]

[DOI]

Proceedings of the Advances in Visual Computing - 10th International Symposium, 2014

Resolution limits on visual speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

The effect of speaking rate on audio and visual speech.

[BibT_eX]

[DOI]

Sarah L. Taylor

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Transforming neutral visual speech into expressive visual speech.

[BibT_eX]

[DOI]

Felix Shaw

Proceedings of the Auditory-Visual Speech Processing, 2013

Confusion modelling for automated lip-reading usingweighted finite-state transducers.

[BibT_eX]

[DOI]

Dominic Howell

Stephen J. Cox

Proceedings of the Auditory-Visual Speech Processing, 2013

2012

Relating Objective and Subjective Performance Measures for AAM-Based Visual Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

On the Segmentation and Classification of Hand Radiographs.

[BibT_eX]

[DOI]

Int. J. Neural Syst., 2012

Dynamic Units of Visual Speech.

[BibT_eX]

[DOI]

Proceedings of the 2012 Eurographics/ACM SIGGRAPH Symposium on Computer Animation, 2012

Automated Bone Age Assessment Using Feature Extraction.

[BibT_eX]

[DOI]

Luke M. Davis

Anthony J. Bagnall

Proceedings of the Intelligent Data Engineering and Automated Learning - IDEAL 2012, 2012

View Independent Computer Lip-Reading.

[BibT_eX]

[DOI]

Yuxuan Lan

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Insights into machine lip reading.

[BibT_eX]

[DOI]

Yuxuan Lan

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

On the Extraction and Classification of Hand Outlines.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Data Engineering and Automated Learning - IDEAL 2011, 2011

2010

Limitations of visual speech recognition.

[BibT_eX]

[DOI]

Jacob L. Newman

Stephen J. Cox

Proceedings of the Auditory-Visual Speech Processing, 2010

Improving visual features for lip-reading.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2010

In pursuit of visemes.

[BibT_eX]

[DOI]

Sarah Hilder

Proceedings of the Auditory-Visual Speech Processing, 2010

2009

Animating Virtual Speakers or Singers from Audio: Lip-Synching Facial Animation.

[BibT_eX]

[DOI]

Sascha Fagel

Gérard Bailly

EURASIP J. Audio Speech Music. Process., 2009

High-presence, low-bandwidth, apparent 3D video-conferencing with a single camera.

[BibT_eX]

[DOI]

Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

Robust facial feature tracking using selected multi-resolution linear predictors.

[BibT_eX]

[DOI]

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Comparing visual features for lipreading.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2009

Comparison of human and machine-based lip-reading.

[BibT_eX]

[DOI]

Sarah Hilder

Proceedings of the Auditory-Visual Speech Processing, 2009

2008

A probabilistic trajectory synthesis system for synthesising visual speech.

[BibT_eX]

[DOI]

Nicholas Wilkinson

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

LIPS2008: visual speech synthesis challenge.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Comparing text-driven and speech-driven visual speech synthesisers.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

On evaluating synthesised visual speech.

[BibT_eX]

[DOI]

Nicholas Wilkinson

Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

The challenge of multispeaker lip-reading.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007

Real-time expression cloning using appearance models.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

The painful face: pain expression recognition using active appearance models.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

A real-time speech-driven talking head using active appearance models.

[BibT_eX]

[DOI]

Nicholas Wilkinson

Proceedings of the Auditory-Visual Speech Processing 2007, 2007

2006

Evaluating Error Functions for Robust Active Appearance Models.

[BibT_eX]

[DOI]

Simon Baker

Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

2004

Near-videorealistic synthetic talking faces: implementation and evaluation.

[BibT_eX]

[DOI]

Speech Commun., 2004

2003

Visual speech synthesis using shape and appearance models.

[BibT_eX]

[DOI]