Ahmed Hussen Abdelaziz

Orcid: 0000-0001-8027-4666

According to our database, Ahmed Hussen Abdelaziz authored at least 37 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
CoRR, 2024

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.
CoRR, 2024

2023
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features.
CoRR, 2023

Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models.
Proceedings of the Interspeech 2022, 2022

2021
Audiovisual Speech Synthesis using Tacotron2.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

On The Role of Visual Cues in Audiovisual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Audiovisual Speech Synthesis using Tacotron2.
CoRR, 2020

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement.
CoRR, 2020

Modality Dropout for Improved Performance-driven Talking Faces.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

2019
On Neural Phone Recognition of Mixed-Source ECoG Signals.
CoRR, 2019

Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models.
Proceedings of the International Conference on Multimodal Interaction, 2019

2018
Comparing Fusion Models for DNN-Based Audiovisual Continuous Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

2017
NTCD-TIMIT: A New Database and Baseline for Noise-Robust Audio-Visual Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Turbo Decoders for Audio-Visual Continuous Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Improving acoustic modeling using audio-visual speech.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Noise-robust HMM-based pattern recognition using multimodal features and observation uncertainties.
PhD thesis, 2016

General hybrid framework for uncertainty-decoding-based automatic speech recognition systems.
Speech Commun., 2016

Introducing the Turbo-Twin-HMM for Audio-Visual Speech Enhancement.
Proceedings of the Interspeech 2016, 2016

Blind Non-Intrusive Speech Intelligibility Prediction Using Twin-HMMs.
Proceedings of the Interspeech 2016, 2016

Dynamic Stream Weighting for Turbo-Decoding-Based Audiovisual ASR.
Proceedings of the Interspeech 2016, 2016

Twin-HMM-based non-intrusive speech intelligibility prediction.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

New Insights into Turbo-Decoding-Based AVSR with Dynamic Stream Weights.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
Learning Dynamic Stream Weights For Coupled-HMM-Based Audio-Visual Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Robust speech processing using observation uncertainty and uncertainty propagation: session and paper overview.
Proceedings of the INTERSPEECH 2015, 2015

Uncertainty propagation through deep neural networks.
Proceedings of the INTERSPEECH 2015, 2015

2014
The Tutorbot Corpus - A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Dynamic stream weight estimation in coupled-HMM-based audio-visual speech recognition using multilayer perceptrons.
Proceedings of the INTERSPEECH 2014, 2014

A new EM estimation of dynamic stream weights for coupled-HMM-based audio-visual ASR.
Proceedings of the IEEE International Conference on Acoustics, 2014

Human-robot collaborative tutoring using multiparty multimodal spoken dialogue.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2014

2013
Using twin-HMM-based audio-visual speech enhancement as a front-end for robust audio-visual speech recognition.
Proceedings of the INTERSPEECH 2013, 2013

Tutoring Robots - Multiparty Multimodal Social Dialogue with an Embodied Tutor.
Proceedings of the Innovative and Creative Developments in Multimodal Interaction Systems, 2013

GMM-based significance decoding.
Proceedings of the IEEE International Conference on Acoustics, 2013

Twin-HMM-based audio-visual speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Decoding of Uncertain Features Using the Posterior Distribution of the Clean Data for Robust Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Audio-Visual Speech Recognition for Uncertain Acoustical Observations.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

