Hyeongseop Rha
Orcid: 0009-0004-9301-2760
According to our database1,
Hyeongseop Rha authored at least 12 papers
between 2022 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Decoding Strategies for Diffusion-Based ASR: A Systematic Evaluation of Confidence-Based Thresholding.
CoRR, May, 2026
TMT: Tri-Modal Translation Between Speech, Image, and Text by Processing Different Modalities as Different Languages.
IEEE Trans. Multim., 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
Learning What to Attend First: Modality-Importance-Guided Reasoning for Reliable Multimodal Emotion Understanding.
CoRR, December, 2025
Towards Inclusive Communication: A Unified LLM-Based Framework for Sign Language, Lip Movements, and Audio Understanding.
CoRR, August, 2025
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2022
Digestive Organ Recognition in Video Capsule Endoscopy Based on Temporal Segmentation Network.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022