Hyeongseop Rha

ORCID: 0009-0004-9301-2760

According to our database, Hyeongseop Rha authored at least 12 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of five.

Bibliography

2026

Decoding Strategies for Diffusion-Based ASR: A Systematic Evaluation of Confidence-Based Thresholding.

CoRR, May, 2026

Diffusion Large Language Models for Visual Speech Recognition.

CoRR, May, 2026

TMT: Tri-Modal Translation Between Speech, Image, and Text by Processing Different Modalities as Different Languages.

IEEE Trans. Multim., 2026

Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Learning What to Attend First: Modality-Importance-Guided Reasoning for Reliable Multimodal Emotion Understanding.

CoRR, December, 2025

Towards Inclusive Communication: A Unified LLM-Based Framework for Sign Language, Lip Movements, and Audio Understanding.

CoRR, August, 2025

MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

AV-EmoDialog: Chat with Audio-Visual Users Leveraging Emotional Cues.

CoRR, 2024

Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2022

Digestive Organ Recognition in Video Capsule Endoscopy Based on Temporal Segmentation Network.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022
