Assaf Hallak

According to our database, Assaf Hallak authored at least 23 papers between 2012 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

More Test-Time Compute Can Hurt: Overestimation Bias in LLM Beam Search.

CoRR, March, 2026

2025

Who Said Neural Networks Aren't Linear?

Nimrod Berman

Assaf Hallak

Assaf Shocher

CoRR, October, 2025

Towards Large Language Models with Self-Consistent Natural Language Explanations.

CoRR, June, 2025

"You just can't go around killing people" Explaining Agent Behavior to a Human Terminator.

Uri Menkes

Assaf Hallak

Ofra Amir

CoRR, April, 2025

Policy Gradient with Tree Expansion.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

PlaMo: Plan and Move in Rich 3D Physical Environments.

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

2023

On the Products of Stochastic and Diagonal Matrices.

Assaf Hallak

Gal Dalal

CoRR, 2023

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search.

CoRR, 2023

Planning and Learning with Adaptive Lookahead.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

SoftTreeMax: Policy Gradient with Tree Search.

CoRR, 2022

Reinforcement Learning with a Terminator.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning.

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2017

Automatic Representation for Lifetime Value Recommender Systems.

Assaf Hallak

Yishay Mansour

Elad Yom-Tov

CoRR, 2017

Consistent On-Line Off-Policy Evaluation.

Assaf Hallak

Shie Mannor

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis.

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Emphatic TD Bellman Operator is a Contraction.

Assaf Hallak

Aviv Tamar

Shie Mannor

CoRR, 2015

Off-policy evaluation for MDPs with unknown structure.

CoRR, 2015

Contextual Markov Decision Processes.

Assaf Hallak

Dotan Di Castro

Shie Mannor

CoRR, 2015

Off-policy Model-based Learning under Unknown Factored Dynamics.

Proceedings of the 32nd International Conference on Machine Learning, 2015

2013

Model selection in markovian processes.

Assaf Hallak

Dotan Di Castro

Shie Mannor

Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

2012

How to sample if you must: on optimal functional sampling.

Assaf Hallak

Shie Mannor

CoRR, 2012
