Joanna Hong

Orcid: 0000-0003-4182-1000

According to our database1, Joanna Hong authored at least 16 papers between 2020 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion.
CoRR, 2023

DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Lip-to-Speech Synthesis in the Wild with Multi-Task Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition.
IEEE Trans. Multim., 2022

Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition.
Proceedings of the Interspeech 2022, 2022

VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection.
Proceedings of the Computer Vision - ECCV 2022, 2022

SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Speech Reconstruction With Reminiscent Sound Via Visual Voice Memory.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Lip to Speech Synthesis with Visual Context Attentional GAN.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence Through Facial Action Units.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Unsupervised Disentangling of Viewpoint and Residues Variations by Substituting Representations for Robust Face Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Learning Style Correlation for Elaborate Few-Shot Classification.
Proceedings of the IEEE International Conference on Image Processing, 2020

Comprehensive Facial Expression Synthesis Using Human-Interpretable Language.
Proceedings of the IEEE International Conference on Image Processing, 2020


  Loading...