Joanna Hong

ORCID: 0000-0003-4182-1000

According to our database, Joanna Hong authored at least 18 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment.

Trans. Mach. Learn. Res., 2026

2024

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion.

CoRR, 2023

DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding.

Jeongsoo Choi, Joanna Hong, Yong Man Ro

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Lip-to-Speech Synthesis in the Wild with Multi-Task Learning.

Minsu Kim, Joanna Hong, Yong Man Ro

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model.

Joanna Hong, Se Jin Park, Yong Man Ro

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition.

IEEE Trans. Multim., 2022

Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection.

Joanna Hong, Minsu Kim, Yong Man Ro

Proceedings of the Computer Vision - ECCV 2022, 2022

SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Speech Reconstruction With Reminiscent Sound Via Visual Voice Memory.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Lip to Speech Synthesis with Visual Context Attentional GAN.

Minsu Kim, Joanna Hong, Yong Man Ro

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence Through Facial Action Units.

Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Unsupervised Disentangling of Viewpoint and Residues Variations by Substituting Representations for Robust Face Recognition.

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Learning Style Correlation for Elaborate Few-Shot Classification.

Proceedings of the IEEE International Conference on Image Processing, 2020

Comprehensive Facial Expression Synthesis Using Human-Interpretable Language.

Proceedings of the IEEE International Conference on Image Processing, 2020
