Honglie Chen

According to our database¹, Honglie Chen authored at least 17 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2025

Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation.

CoRR, October, 2025

MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition.

CoRR, October, 2025

Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment.

CoRR, January, 2025

Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Large Language Models are Strong Audio-Visual Speech Recognition Learners.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization.

Adriana Fernandez-Lopez

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.

Xubo Liu

Egor Lakomkin

Konstantinos Vougioukas

CoRR, 2023

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition.

Adriana Fernandez-Lopez

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels.

Pingchuan Ma

Alexandros Haliassos

Adriana Fernandez-Lopez

Honglie Chen

Stavros Petridis

Maja Pantic

Proceedings of the IEEE International Conference on Acoustics, 2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.

Xubo Liu

Egor Lakomkin

Konstantinos Vougioukas

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021

Localizing Visual Sounds the Hard Way.

Honglie Chen

Weidi Xie

Triantafyllos Afouras

Arsha Nagrani

Andrea Vedaldi

Andrew Zisserman

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Audio-Visual Synchronisation in the Wild.

Triantafyllos Afouras

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

VGGSound: A Large-Scale Audio-Visual Dataset.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

AutoCorrect: Deep Inductive Alignment of Noisy Geometric Annotations.

Proceedings of the 30th British Machine Vision Conference 2019, 2019
