Peter Bell

Shree Harsha Bokkahalli Satish

Hao Tang

CoRR, January, 2026

"Walk a Mile in My Voice": Voice Conversion Shapes Trust, Attribution, and Empathy in Human-AI Speech Interactions.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the 31st International Conference on Intelligent User Interfaces, 2026

Analysing the role of lexical and temporal information in turn-taking through predictability.

[BibT_eX]

[DOI]

Sean Leishman

Shree Harsha Bokkahalli Satish

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

From Seeing it to Experiencing it: Interactive Evaluation of Intersectional Voice Bias in Human-AI Speech Interaction.

[BibT_eX]

[DOI]

Proceedings of the Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems, 2026

2025

TTSDS2: Resources and Benchmark for Evaluating Human-Quality Text to Speech Systems.

[BibT_eX]

[DOI]

CoRR, June, 2025

The role of audio-visual integration in the time course of phonetic encoding in self-supervised speech models.

[BibT_eX]

[DOI]

Yi Wang

Oli Danyi Liu

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Prosodic Structure Beyond Lexical Content: A Study of Self-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Scaling Laws for Synthetic Speech for Model Training.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

A Practitioner's Guide to Building ASR Models for Low-Resource Languages: A Case Study on Scottish Gaelic.

[BibT_eX]

[DOI]

William Lamb

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Regarding the Existence of the Internal Language Model in CTC-Based E2E ASR.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Spoken Document Retrieval for an Unwritten Language: A Case Study on Gormati.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Synthesising a Corpus of Gaelic Traditional Narrative with Cross-Lingual Text Expansion.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Can self-supervised speech models predict the perceived acceptability of prosodic variation?

[BibT_eX]

[DOI]

Adaeze Adigwe

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024

Comparison and analysis of new curriculum criteria for end-to-end ASR.

[BibT_eX]

[DOI]

Speech Commun., 2024

Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR.

[BibT_eX]

[DOI]

CoRR, 2024

Phonetic Error Analysis of Raw Waveform Acoustic Models with Parametric and Non-Parametric CNNs.

[BibT_eX]

[DOI]

CoRR, 2024

Explainable Attribute-Based Speaker Verification.

[BibT_eX]

[DOI]

CoRR, 2024

Advancing CTC Models for Better Speech Alignment: A Topological Approach.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines For Speech Recognition, Speaker Tagging, and Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Language Bias in Self-Supervised Learning For Automatic Speech Recognition.

[BibT_eX]

[DOI]

Edward Storey

Naomi Harte

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

TTSDS - Text-to-Speech Distribution Score.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Crossmodal ASR Error Correction With Discrete Speech Units.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Speech Emotion Recognition With ASR Transcripts: a Comprehensive Study on Word Error Rate and Fusion Techniques.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Characterizing code-switching: Applying Linguistic Principles for Metric Assessment and Development.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Regarding Topology and Adaptability in Differentiable WFST-Based E2E ASR.

[BibT_eX]

[DOI]

Pinzhen Chen

Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Dominant Paths in CTC-Like ASR Models: Unraveling the Effectiveness of Viterbi Decoding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition.

[BibT_eX]

[DOI]

Xiaoliang Wu

Ajitha Rajan

Proceedings of the IEEE International Conference on Acoustics, 2024

Bootstrap Predictive Coding: Investigating a Non-Contrastive Self-Supervised Learning Approach.

[BibT_eX]

[DOI]

Yumnah Mohamied

Proceedings of the IEEE International Conference on Acoustics, 2024

Analyzing the Role of Part-of-Speech in Code-Switching: A Corpus-Based Study.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

UnMute Toolkit: Speech Interactions Designed With Minoritised Language Speakers.

[BibT_eX]

[DOI]

Thomas Reitmaier

Proceedings of the ACM Conversational User Interfaces 2024, 2024

Cultivating Spoken Language Technologies for Unwritten Languages.

[BibT_eX]

[DOI]

Thomas Reitmaier

Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

2023

Multi-Stream Acoustic Modelling Using Raw Real and Imaginary Parts of the Fourier Transform.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Phonetic Error Analysis Beyond Phone Error Rate.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Cross-Attention is Not Enough: Incongruity-Aware Multimodal Sentiment Analysis and Emotion Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

Regarding Topology and Variant Frame Rates for Differentiable WFST-based End-to-End ASR.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Quantifying the perceptual value of lexical and non-lexical channels in speech.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Evaluating and reducing the distance between synthetic and real speech distributions.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Transfer Learning for Personality Perception via Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Unsupervised Code-switched Text Generation from Parallel Text.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Capturing Formality in Speech Across Domains and Languages.

[BibT_eX]

[DOI]

Debasmita Bhattacharya

Julia Hirschberg

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Explanations for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Xiaoliang Wu

Cassia Valentini-Botinhao

Ajitha Rajan

Proceedings of the IEEE International Conference on Acoustics, 2023

Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech Enhancement.

[BibT_eX]

[DOI]

Andrea Lorena Aldana Blanco

Proceedings of the IEEE International Conference on Acoustics, 2023

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Multimodal Dyadic Impression Recognition via Listener Adaptive Cross-Domain Fusion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Do dialogue representations align with perception? An empirical study.

[BibT_eX]

[DOI]

Andrea Lorena Aldana Blanco

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers.

[BibT_eX]

[DOI]

Léa-Marie Lam-Yee-Mui

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022

Exploration of a Self-Supervised Speech Model: A Study on Emotional Corpora.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

AVSE Challenge: Audio-Visual Speech Enhancement Challenge.

[BibT_eX]

[DOI]

Cassia Valentini-Botinhao

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Investigating perception of spoken dialogue acceptability through surprisal.

[BibT_eX]

[DOI]

Sarenne Carrol Wallbridge

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Investigating the contribution of speaker attributes to speaker separability using disentangled speaker representations.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Investigating Sequence-Level Normalisation For CTC-Like End-to-End ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Fusing ASR Outputs in Joint Training for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Code-switched ASR with Linguistic Information.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Opportunities and Challenges of Automatic Speech Recognition Systems for Low-Resource Language Speakers.

[BibT_eX]

[DOI]

Thomas Reitmaier

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021

Mask-combine Decoding and Classification Approach for Punctuation Prediction with real-time Inference Constraints.

[BibT_eX]

[DOI]

CoRR, 2021

On The Usefulness of Self-Attention for Automatic Speech Recognition with Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Stochastic Attention Head Removal: A Simple and Effective Method for Improving Transformer Based ASR Models.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

On the Learning Dynamics of Semi-Supervised Training for ASR.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

It's Not What You Said, it's How You Said it: Discriminative Perception of Speech as a Multichannel Communication System.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Leveraging Speaker Attribute Information Using Multi Task Learning for Speaker Verification and Diarization.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech Acoustic Modelling Using Raw Source and Filter Components.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

The CSTR System for Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Speech Acoustic Modelling from Raw Phase Spectrum.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Segmenting Subtitles for Correcting ASR Segmentation Errors.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Leveraging Linguistic Knowledge for Accent Robustness of End-to-End Models.

[BibT_eX]

[DOI]

Andrea Carmantini

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Stochastic Attention Head Removal: A Simple and Effective Method for Improving Automatic Speech Recognition with Transformers.

[BibT_eX]

[DOI]

CoRR, 2020

Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines.

[BibT_eX]

[DOI]

CoRR, 2020

Adaptation Algorithms for Speech Recognition: An Overview.

[BibT_eX]

[DOI]

CoRR, 2020

When Can Self-Attention Be Replaced by Feed Forward Layers?

[BibT_eX]

[DOI]

CoRR, 2020

DropClass and DropAdapt: Dropping classes for deep speaker representation learning.

[BibT_eX]

[DOI]

CoRR, 2020

Dropping Classes for Deep Speaker Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Subtitles to Segmentation: Improving Low-Resource Speech-to-TextTranslation Pipelines.

[BibT_eX]

[DOI]

Proceedings of the workshop on Cross-Language Search and Summarization of Text and Speech, 2020

A Deep 2D Convolutional Network for Waveform-Based Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Raw Sign and Magnitude Spectra for Multi-Head Acoustic Modelling.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

On the Robustness and Training Dynamics of Raw Waveform Models.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Scattering Power Spectrum Features for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Neural Network Driven Binaural Audio Visual Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Multi-Scale Octave Convolutions for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Channel Adversarial Training for Speaker Verification and Diarization.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Cross Lingual Transfer Learning for Zero-Resource Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Lattice-Based Unsupervised Test-Time Adaptation of Neural Network Acoustic Models.

[BibT_eX]

[DOI]

CoRR, 2019

Trainable Dynamic Subsampling for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

On Learning Interpretable CNNs with Parametric Modulated Kernel-Based Filters.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Lattice-Based Lightly-Supervised Acoustic Model Training.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Untranscribed Web Audio for Low Resource Speech Recognition.

[BibT_eX]

[DOI]

Andrea Carmantini

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Windowed Attention Mechanisms for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

On the Usefulness of Statistical Normalisation of Bottleneck Features for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Embeddings for DNN Speaker Adaptive Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Speaker Adaptive Training Using Model Agnostic Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Acoustic Model Adaptation from Raw Waveforms with Sincnet.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Few-shot learning with attention-based sequence-to-sequence models.

[BibT_eX]

[DOI]

Bertrand Higy

CoRR, 2018

Analyzing Deep CNN-Based Utterance Embeddings for Acoustic Model Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Learning to Adapt: A Meta-learning Approach for Speaker Adaptation.

[BibT_eX]

[DOI]

Joachim Fainberg

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Multitask Learning of Context-Dependent Targets in Deep Neural Network Acoustic Models.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Hierarchical Recurrent Neural Network for Story Segmentation.

[BibT_eX]

[DOI]

Emiru Tsunoo

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Factorised Representations for Neural Network Adaptation to Diverse Acoustic Environments.

[BibT_eX]

[DOI]

Joachim Fainberg

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A System for Real Time Collaborative Transcription Correction.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

The SUMMA Platform Prototype.

[BibT_eX]

[DOI]

Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Simplifying very deep convolutional neural network architectures for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

WERD: Using social text spelling variants for evaluating dialectal speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

ALISA: An automatic lightly supervised speech segmentation and alignment tool.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2016

Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

The MGB-2 challenge: Arabic multi-dialect broadcast media recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Unsupervised Adaptation of Recurrent Neural Network Language Models.

[BibT_eX]

[DOI]

Siva Reddy Gangireddy

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Improving Children's Speech Recognition Through Out-of-Domain Data Augmentation.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Dialect Detection in Arabic Broadcast Speech.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Automatic Dialect Detection in Arabic Broadcast Speech.

[BibT_eX]

[DOI]

Ahmed M. Ali

CoRR, 2015

Structured output layer with auxiliary targets for context-dependent acoustic modelling.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Towards automatic detection of reported speech in dialogue using prosodic cues.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Complementary tasks for context-dependent deep neural network acoustic models.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A system for automatic broadcast news summarisation, geolocation and translation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Regularization of context-dependent deep neural networks with context-independent multi-task training.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The MGB challenge: Evaluating multi-genre broadcast media recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Multi-reference WER for evaluating ASR for languages with no orthographic rules.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

A system for automatic alignment of broadcast media captions using weighted finite-state transducers.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

The UEDIN ASR systems for the IWSLT 2014 evaluation.

[BibT_eX]

[DOI]

Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

A semi-Markov model for speech segmentation with an utterance-break prior.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Cross-lingual adaptation with multi-task adaptive networks.

[BibT_eX]

[DOI]

Joris Driesen

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

Using adaptation to improve speech transcription alignment in noisy and reverberant environments.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Description of the UEDIN system for German ASR.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

The UEDIN English ASR system for the IWSLT 2013 evaluation.

[BibT_eX]

[DOI]

Fergus McInnes

Siva Reddy Gangireddy

Mark Sinclair

Alexandra Birch

Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Lightly supervised discriminative training of grapheme models for improved sentence-level alignment of speech and text data.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic Transcription of Multi-genre Media Archives.

[BibT_eX]

[DOI]

Matthew Stephen Seigel

Philip C. Woodland

Proceedings of the First Workshop on Speech, 2013

Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project.

[BibT_eX]

[DOI]

Proceedings of the First Workshop on Speech, 2013

A lecture transcription system combining neural network acoustic and language models.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Grapheme and multilingual posterior features for under-resourced speech recognition: A study on Scottish Gaelic.

[BibT_eX]

[DOI]

Ramya Rasipuram

Mathew Magimai-Doss

Proceedings of the IEEE International Conference on Acoustics, 2013

Multi-level adaptive networks in tandem and hybrid ASR systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

A grapheme-based method for automatic alignment of speech and text data.

[BibT_eX]

[DOI]

Adriana Stan

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Transcription of multi-genre media archives using out-of-domain data.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

The UEDIN systems for the IWSLT 2012 evaluation.

[BibT_eX]

[DOI]

Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

A tutorial dialogue system with unrestricted spoken input.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Designing a spoken language interface for a tutorial dialogue system.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Evaluating language understanding accuracy with respect to objective outcomes in a dialogue system.

[BibT_eX]

[DOI]

Johanna D. Moore

Proceedings of the EACL 2012, 2012

2011

Beetle II: an adaptable tutorial dialogue system.

[BibT_eX]

[DOI]

Johanna D. Moore

Natalie B. Steinhauser

Gwendolyn E. Campbell

Proceedings of the SIGDIAL 2011 Conference, 2011

Adaptive Intelligent Tutorial Dialogue in the BEETLE II System.

[BibT_eX]

[DOI]

Johanna D. Moore

Natalie B. Steinhauser

Gwendolyn E. Campbell

Leanne S. Taylor

Simon Caine

Charlie Scott

Proceedings of the Artificial Intelligence in Education - 15th International Conference, 2011

2010

Stochastic pronunciation modelling and soft match for out-of-vocabulary spoken term detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Term-dependent confidence for out-of-vocabulary term detection.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Diagonal priors for full covariance speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

Covariance updates for discriminative training by constrained line search.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A shrinkage estimator for speech recognition with full covariance HMMs.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Sparse Gaussian graphical models for speech recognition.

[BibT_eX]

[DOI]