Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics.

Seungbeen Lee

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Can You Share Your Story? Modeling Clients' Metacognition and Openness for LLM Therapist Evaluation.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Evaluating Robustness of Reward Models for Mathematical Reasoning.

CoRR, 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.

CoRR, 2024

Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback.

CoRR, 2023

Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification.

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

TUTORING: Instruction-Grounded Conversational Agent for Language Learners.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization.

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Hyungjoo Chae
