Youngsoo Jang

Orcid: 0000-0002-8372-5343

According to our database1, Youngsoo Jang authored at least 15 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection.
CoRR, 2024

2023
SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Information-Theoretic State Space Model for Multi-View Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

2022
LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation.
CoRR, 2022

LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Monte-Carlo Planning and Learning with Language Action Value Estimates.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Variational Inference for Sequential Data with Future Likelihood Estimates.
Proceedings of the 37th International Conference on Machine Learning, 2020

End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Trust Region Sequential Variational Inference.
Proceedings of The 11th Asian Conference on Machine Learning, 2019

2018
Cross-Language Neural Dialog State Tracker for Large Ontologies Using Hierarchical Attention.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

2017
Constrained Bayesian Reinforcement Learning via Approximate Linear Programming.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016
Neural dialog state tracker for large ontologies by attention mechanism.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016


  Loading...