Pingchuan Ma

IEEE Trans. Cybern., June, 2023

Omnidirectional Image Quality Assessment With Knowledge Distillation.

IEEE Signal Process. Lett., 2023

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition.

Adriana Fernandez-Lopez

CoRR, 2023

Is dataset condensation a silver bullet for healthcare data sharing?

CoRR, 2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.

Xubo Liu

Egor Lakomkin

CoRR, 2023

Jointly Learning Visual and Auditory Speech Representations from Raw Data.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Cross-Lingual Visual Speech Representations.

Proceedings of the IEEE International Conference on Acoustics, 2023

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels.

Alexandros Haliassos

Adriana Fernandez-Lopez

Honglie Chen

Proceedings of the IEEE International Conference on Acoustics, 2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.

Xubo Liu

Egor Lakomkin

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Visual speech recognition for multiple languages in the wild.

Nat. Mac. Intell., November, 2022

Streaming Audio-Visual Speech Recognition with Alignment Regularization.

CoRR, 2022

Training Strategies for Improved Lip-Reading.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Lip-reading with Densely Connected Temporal Convolutional Networks.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision.

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

End-To-End Audio-Visual Speech Recognition with Conformers.

Proceedings of the IEEE International Conference on Acoustics, 2021

Detecting Adversarial Attacks on Audiovisual Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2021

Towards Practical Lipreading with Distilled and Efficient Models.

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

End-to-end visual speech recognition for small-scale datasets.

Pattern Recognit. Lett., 2020

Visually Guided Self Supervised Learning of Speech Representations.

Abhinav Shukla

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Lipreading Using Temporal Convolutional Networks.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards Pose-Invariant Lip-Reading.

Shiyang Cheng

Georgios Tzimiropoulos

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Detecting Adversarial Attacks On Audio-Visual Speech Recognition.

CoRR, 2019

Video-Driven Speech Reconstruction Using Generative Adversarial Networks.

Proceedings of the Interspeech 2019, 2019

Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition.

Proceedings of the Interspeech 2019, 2019

2018

Audio-Visual Speech Recognition with a Hybrid CTC/Attention Architecture.

Themos Stafylakis

Georgios Tzimiropoulos

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

End-to-End Audiovisual Speech Recognition.

Georgios Tzimiropoulos