Hyeongseop Rha

Orcid: 0009-0004-9301-2760

According to our database1, Hyeongseop Rha authored at least 12 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of five.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Decoding Strategies for Diffusion-Based ASR: A Systematic Evaluation of Confidence-Based Thresholding.
CoRR, May, 2026

Diffusion Large Language Models for Visual Speech Recognition.
CoRR, May, 2026

TMT: Tri-Modal Translation Between Speech, Image, and Text by Processing Different Modalities as Different Languages.
IEEE Trans. Multim., 2026

Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Learning What to Attend First: Modality-Importance-Guided Reasoning for Reliable Multimodal Emotion Understanding.
CoRR, December, 2025

Towards Inclusive Communication: A Unified LLM-Based Framework for Sign Language, Lip Movements, and Audio Understanding.
CoRR, August, 2025

MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
AV-EmoDialog: Chat with Audio-Visual Users Leveraging Emotional Cues.
CoRR, 2024

Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2022
Digestive Organ Recognition in Video Capsule Endoscopy Based on Temporal Segmentation Network.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022


  Loading...