Yonathan Efroni

According to our database1, Yonathan Efroni authored at least 40 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The Bias of Harmful Label Associations in Vision-Language Models.
CoRR, 2024

2023
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models.
Trans. Mach. Learn. Res., 2023

Pearl: A Production-ready Reinforcement Learning Agent.
CoRR, 2023

PcLast: Discovering Plannable Continuous Latent States.
CoRR, 2023

Prospective Side Information for Latent MDPs.
CoRR, 2023

Reward-Mixing MDPs with Few Latent Contexts are Learnable.
Proceedings of the International Conference on Machine Learning, 2023

Principled Offline RL in the Presence of Rich Exogenous Information.
Proceedings of the International Conference on Machine Learning, 2023

2022
Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information.
CoRR, 2022

Guaranteed Discovery of Controllable Latent States with Multi-Step Inverse Models.
CoRR, 2022

Tractable Optimality in Episodic Latent MABs.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms.
Proceedings of the International Conference on Machine Learning, 2022

Sparsity in Partially Controllable Linear Systems.
Proceedings of the International Conference on Machine Learning, 2022

Provable Reinforcement Learning with a Short-Term Memory.
Proceedings of the International Conference on Machine Learning, 2022

Mirror Descent Policy Optimization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Provably Filtering Exogenous Distractors using Multistep Inverse Dynamics.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information.
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022

2021
Provable RL with Exogenous Distractors via Multistep Inverse Dynamics.
CoRR, 2021

Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits.
CoRR, 2021

Bandits with partially observable confounded data.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

RL for Latent MDPs: Regret Guarantees and a Lower Bound.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Reinforcement Learning in Reward-Mixing MDPs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Minimax Regret for Stochastic Shortest Path.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Confidence-Budget Matching for Sequential Budgeted Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Reinforcement Learning with Trajectory Feedback.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Bandits with Partially Observable Offline Data.
CoRR, 2020

Exploration-Exploitation in Constrained MDPs.
CoRR, 2020

Online Planning with Lookahead Policies.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Multi-step Greedy Reinforcement Learning Algorithms.
Proceedings of the 37th International Conference on Machine Learning, 2020

Optimistic Policy Optimization with Bandit Feedback.
Proceedings of the 37th International Conference on Machine Learning, 2020

Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning.
CoRR, 2019

Multi-Step Greedy and Approximate Real Time Dynamic Programming.
CoRR, 2019

Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Action Robust Reinforcement Learning and Applications in Continuous Control.
Proceedings of the 36th International Conference on Machine Learning, 2019

Exploration Conscious Reinforcement Learning Revisited.
Proceedings of the 36th International Conference on Machine Learning, 2019

How to Combine Tree-Search Methods in Reinforcement Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Revisiting Exploration-Conscious Reinforcement Learning.
CoRR, 2018

Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning.
CoRR, 2018

Multiple-Step Greedy Policies in Approximate and Online Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Beyond the One-Step Greedy Approach in Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018


  Loading...