We stand with Ukraine

We stand with Ukraine

Youngsoo Jang

Orcid: 0000-0002-8372-5343

According to our database¹, Youngsoo Jang authored at least 22 papers between 2016 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

IRPO: Implicit Policy Regularized Preference Optimization.

[DOI]

,

,

Geon-Hyeong Kim

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025

SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety.

[DOI]

Geon-Hyeong Kim

,

,

,

,

,

,

CoRR, May, 2025

Online Pre-Training for Offline-to-Online Reinforcement Learning.

[DOI]

,

,

,

,

,

,

Geon-Hyeong Kim

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection.

[DOI]

,

,

,

,

CoRR, 2024

Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration.

[DOI]

,

Geon-Hyeong Kim

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking.

[DOI]

,

,

Lajanugen Logeswaran

,

Geon-Hyeong Kim

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Show, Think, and Tell: Thought-Augmented Fine-Tuning of Large Language Models for Video Captioning.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments.

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations.

[DOI]

,

Geon-Hyeong Kim

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Information-Theoretic State Space Model for Multi-View Reinforcement Learning.

[DOI]

HyeongJoo Hwang

,

,

,

,

Geon-Hyeong Kim

,

,

Proceedings of the International Conference on Machine Learning, 2023

2022

LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation.

[DOI]

Geon-Hyeong Kim

,

,

,

,

CoRR, 2022

LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation.

[DOI]

Geon-Hyeong Kim

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems.

[DOI]

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Monte-Carlo Planning and Learning with Language Action Value Estimates.

[DOI]

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Variational Inference for Sequential Data with Future Likelihood Estimates.

[DOI]

Geon-Hyeong Kim

,

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2.

[DOI]

,

,

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues.

[DOI]

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules.

[DOI]

,

,

,

,

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Trust Region Sequential Variational Inference.

[DOI]

Geon-hyeong Kim

,

,

,

,

,

Proceedings of The 11th Asian Conference on Machine Learning, 2019

2018

Cross-Language Neural Dialog State Tracker for Large Ontologies Using Hierarchical Attention.

[DOI]

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2018

2017

Constrained Bayesian Reinforcement Learning via Approximate Linear Programming.

[DOI]

,

,

,

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016

Neural dialog state tracker for large ontologies by attention mechanism.

[DOI]

,

,

,

,

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Loading...