Jaekyeom Kim

Orcid: 0000-0003-4538-8398

According to our database, Jaekyeom Kim authored at least 20 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation.
CoRR, February, 2026

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation.
CoRR, January, 2026

Process Reward Models That Think.
Trans. Mach. Learn. Res., 2026

Beyond Blind Following: Evaluating Robustness of LLM Agents under Imperfect Guidance.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Towards Minimal Fine-Tuning of VLMs.
CoRR, December, 2025

Do Not Trust Licenses You See: Dataset Compliance Requires Massive-Scale AI-Powered Lifecycle Tracing.
CoRR, March, 2025

MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Interactive and Expressive Code-Augmented Planning with Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Small Language Models Need Strong Verifiers to Self-Correct Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2022
Constrained GPI for Zero-Shot Transfer in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Lipschitz-constrained Unsupervised Skill Discovery.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Unsupervised Skill Discovery with Bottleneck Option Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Model-Agnostic Boundary-Adversarial Sampling for Test-Time Generalization in Few-Shot Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
EMI: Exploration with Mutual Information.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
EMI: Exploration with Mutual Information Maximizing State and Action Embeddings.
CoRR, 2018

2015
Image quality evaluation of LCDs based on novel RGBW sub-pixel structure.
Proceedings of the Image Quality and System Performance XII, 2015

