Guijin Son

According to our database, Guijin Son authored at least 25 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2025

Revisiting the UID Hypothesis in LLM Reasoning Traces.

CoRR, October, 2025

Revisiting the Uniform Information Density Hypothesis in LLM Reasoning Traces.

CoRR, October, 2025

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought.

Hitesh Laxmichand Patel

CoRR, October, 2025

KAIO: A Collection of More Challenging Korean Questions.

CoRR, September, 2025

Ko-PIQA: A Korean Physical Commonsense Reasoning Dataset with Cultural Context.

CoRR, September, 2025

From KMMLU-Redux to KMMLU-Pro: A Professional Korean Benchmark Suite for LLM Evaluation.

CoRR, July, 2025

BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation.

Hitesh Laxmichand Patel

CoRR, June, 2025

Controlling Language Confusion in Multilingual LLMs.

CoRR, May, 2025

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research.

Stella Biderman

CoRR, May, 2025

On the Robustness of Reward Models for Language Model Alignment.

CoRR, May, 2025

HRET: A Self-Evolving LLM Evaluation Toolkit for Korean.

Seunghyeok Hong

CoRR, March, 2025

Won: Establishing Best Practices for Korean Financial NLP.

CoRR, March, 2025

Multi-Step Reasoning in Korean and the Emergent Mirage.

CoRR, January, 2025

Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap.

CoRR, January, 2025

KMMLU: Measuring Massive Multitask Language Understanding in Korean.

Niklas Muennighoff, Stella Biderman

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.

Sheikh Shafayat, Bill Yuchen Lin

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Improving Fine-grained Visual Understanding in VLMs through Text-Only Training.

Seunghyeok Hong

CoRR, 2024

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models.

Javier Aula-Blasco, Shayekh Bin Islam, Jaume Prats-Cristià, Lucía Tormo-Bañuelos

CoRR, 2024

LLM-as-a-Judge & Reward Model: What They Can and Cannot Do.

Seunghyeok Hong

CoRR, 2024

ESG Classification by Implicit Rule Learning via GPT-4.

CoRR, 2024

HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Beyond Classification: Financial Reasoning in State-of-the-Art Language Models.

CoRR, 2023

Removing Non-Stationary Knowledge From Pre-Trained Language Models for Entity-Level Sentiment Classification in Finance.

CoRR, 2023
