Hyeongseop Rha
Orcid: 0009-0004-9301-2760
According to our database1,
Hyeongseop Rha
authored at least 7 papers
between 2022 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages.
CoRR, 2024
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2022
Digestive Organ Recognition in Video Capsule Endoscopy Based on Temporal Segmentation Network.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022