Hyungjoo Chae

According to our database1, Hyungjoo Chae authored at least 20 papers between 2022 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL.
CoRR, June, 2025

ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions.
CoRR, May, 2025

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents.
CoRR, May, 2025

Towards Lifelong Dialogue Agents via Timeline-based Memory Management.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Can You Share Your Story? Modeling Clients' Metacognition and Openness for LLM Therapist Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Evaluating Robustness of Reward Models for Mathematical Reasoning.
CoRR, 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.
CoRR, 2024

Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback.
CoRR, 2023

Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

TUTORING: Instruction-Grounded Conversational Agent for Language Learners.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization.
Proceedings of the 29th International Conference on Computational Linguistics, 2022


  Loading...