Guijin Son
According to our database1,
Guijin Son authored at least 36 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training.
CoRR, May, 2026
CoRR, May, 2026
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs.
CoRR, May, 2026
CoRR, April, 2026
KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context.
CoRR, April, 2026
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math.
CoRR, February, 2026
CoRR, January, 2026
2025
CoRR, October, 2025
CoRR, October, 2025
CoRR, September, 2025
From KMMLU-Redux to KMMLU-Pro: A Professional Korean Benchmark Suite for LLM Evaluation.
CoRR, July, 2025
CoRR, June, 2025
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research.
CoRR, May, 2025
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap.
CoRR, January, 2025
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2025
2024
CoRR, 2024
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models.
CoRR, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
CoRR, 2023
Removing Non-Stationary Knowledge From Pre-Trained Language Models for Entity-Level Sentiment Classification in Finance.
CoRR, 2023