Honglie Chen

According to our database1, Honglie Chen authored at least 15 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis.
CoRR, May, 2025

Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment.
CoRR, January, 2025

Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Large Language Models are Strong Audio-Visual Speech Recognition Learners.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.
CoRR, 2023

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels.
Proceedings of the IEEE International Conference on Acoustics, 2023

SynthVSR: Scaling Up Visual Speech RecognitionWith Synthetic Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021
Localizing Visual Sounds the Hard Way.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Audio-Visual Synchronisation in the wild.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Vggsound: A Large-Scale Audio-Visual Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
AutoCorrect: Deep Inductive Alignment of Noisy Geometric Annotations.
Proceedings of the 30th British Machine Vision Conference 2019, 2019


  Loading...