Assaf Hallak

According to our database1, Assaf Hallak authored at least 16 papers between 2012 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
On the Products of Stochastic and Diagonal Matrices.
CoRR, 2023

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search.
CoRR, 2023

Planning and Learning with Adaptive Lookahead.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
SoftTreeMax: Policy Gradient with Tree Search.
CoRR, 2022

Reinforcement Learning with a Terminator.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2017
Automatic Representation for Lifetime Value Recommender Systems.
CoRR, 2017

Consistent On-Line Off-Policy Evaluation.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Emphatic TD Bellman Operator is a Contraction.
CoRR, 2015

Off-policy evaluation for MDPs with unknown structure.
CoRR, 2015

Contextual Markov Decision Processes.
CoRR, 2015

Off-policy Model-based Learning under Unknown Factored Dynamics.
Proceedings of the 32nd International Conference on Machine Learning, 2015

2013
Model selection in markovian processes.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

2012
How to sample if you must: on optimal functional sampling
CoRR, 2012


  Loading...