Nadav Merlis

Orcid: 0000-0002-9906-0577

According to our database¹, Nadav Merlis authored at least 25 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Adaptive Bandit Algorithms for Contextual Matching Markets.

[BibT_eX]

[DOI]

CoRR, May, 2026

Reinforcement Learning with Multi-Step Lookahead Information Via Adaptive Batching.

[BibT_eX]

[DOI]

Nadav Merlis

CoRR, January, 2026

Online Linear Regression with Paid Stochastic Features.

[BibT_eX]

[DOI]

Nadav Merlis

Kyoungseok Jang

Nicolò Cesa-Bianchi

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

On the hardness of RL with Lookahead.

[BibT_eX]

[DOI]

CoRR, October, 2025

Stable Matching with Ties: Approximation Ratios and Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

On Bits and Bandits: Quantifying the Regret-Information Trade-off.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Improved Algorithms for Contextual Dynamic Pricing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

The Value of Reward Lookahead in Reinforcement Learning.

[BibT_eX]

[DOI]

Nadav Merlis

Dorian Baudry

Vianney Perchet

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Reinforcement Learning with Lookahead Information.

[BibT_eX]

[DOI]

Nadav Merlis

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Multi-armed bandits with guaranteed revenue per arm.

[BibT_eX]

[DOI]

Dorian Baudry

Nadav Merlis

Mathieu Benjamin Molina

Hugo Richard

Vianney Perchet

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics.

[BibT_eX]

[DOI]

CoRR, 2023

Reinforcement Learning with History Dependent Dynamic Contexts.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

On Preemption and Learning in Stochastic Scheduling.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022

Reinforcement Learning with a Terminator.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits.

[BibT_eX]

[DOI]

Nadav Merlis

Yonathan Efroni

Shie Mannor

CoRR, 2021

Ensemble Bootstrapping for Q-Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Confidence-Budget Matching for Sequential Budgeted Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Lenient Regret for Multi-Armed Bandits.

[BibT_eX]

[DOI]

Nadav Merlis

Shie Mannor

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Reinforcement Learning with Trajectory Feedback.

[BibT_eX]

[DOI]

Yonathan Efroni

Nadav Merlis

Shie Mannor

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Tight Lower Bounds for Combinatorial Multi-Armed Bandits.

[BibT_eX]

[DOI]

Nadav Merlis

Shie Mannor

Proceedings of the Conference on Learning Theory, 2020

2019

Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients.

[BibT_eX]

[DOI]

Chen Tessler

Nadav Merlis

Shie Mannor

CoRR, 2019

Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem.

[BibT_eX]

[DOI]

Nadav Merlis

Shie Mannor

Proceedings of the Conference on Learning Theory, 2019

2018

Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Nadav Merlis

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...