Lior Shani

According to our database¹, Lior Shani authored at least 19 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Latent Reasoning with Supervised Thinking States.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces.

[BibT_eX]

[DOI]

CoRR, September, 2025

Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

2024

Embedding-Aligned Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Offline Regularised Reinforcement Learning for Large Language Models Alignment.

[BibT_eX]

[DOI]

Pierre Harvey Richemond

Yunhao Tang

Daniel Guo

Daniele Calandriello

Mohammad Gheshlaghi Azar

CoRR, 2024

Multi-turn Reinforcement Learning from Preference Human Feedback.

[BibT_eX]

[DOI]

CoRR, 2024

Embedding-Aligned Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Multi-turn Reinforcement Learning with Preference Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Demystifying Embedding Spaces using Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Reinforcement Learning with History Dependent Dynamic Contexts.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Reinforcement Learning with a Terminator.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mirror Descent Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Online Apprenticeship Learning.

[BibT_eX]

[DOI]

Lior Shani

Tom Zahavy

Shie Mannor

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2020

Optimistic Policy Optimization with Bandit Feedback.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs.

[BibT_eX]

[DOI]

Lior Shani

Yonathan Efroni

Shie Mannor

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Exploration Conscious Reinforcement Learning Revisited.

[BibT_eX]

[DOI]

Lior Shani

Yonathan Efroni

Shie Mannor

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Multi Instance Learning For Unbalanced Data.

[BibT_eX]

[DOI]

CoRR, 2018

Revisiting Exploration-Conscious Reinforcement Learning.

[BibT_eX]

[DOI]

Lior Shani

Yonathan Efroni

Shie Mannor

CoRR, 2018

Lior Shani

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...