Yerin Hwang

ORCID: 0009-0003-2445-7734

According to our database, Yerin Hwang authored at least 15 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
When Wording Steers the Evaluation: Framing Bias in LLM judges.
CoRR, January, 2026

Judging Against the Reference: Uncovering Knowledge-Driven Failures in LLM-Judges on QA Evaluation.
CoRR, January, 2026

Don't Judge Code by Its Cover: Exploring Biases in LLM Judges for Code Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025
Benchmarking LLM Causal Reasoning with Scientifically Validated Relationships.
CoRR, October, 2025

Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Fooling the LVLM Judges: Visual Biases in LVLM-Based Evaluation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Can You Trick the Grader? Adversarial Persuasion of LLM Judges.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

LLMs can be easily Confused by Instructional Distractions.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Flowlogue: A Novel Framework for Synthetic Dialogue Generation With Structured Flow From Text Passages.
IEEE Access, 2024

MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Kosmic: Korean Text Similarity Metric Reflecting Honorific Distinctions.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

2023
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Injecting Comparison Skills in Task-Oriented Dialogue Systems for Database Search Results Disambiguation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

