We stand with Ukraine

We stand with Ukraine

Marek Petrik

According to our database¹, Marek Petrik authored at least 75 papers between 2006 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Bayesian Regret Minimization in Offline Bandits.

[BibT_eX]

[DOI]

,

Guy Tennenholtz

,

Mohammad Ghavamzadeh

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Beyond discounted returns: Robust Markov decision processes with average and Blackwell optimality.

[BibT_eX]

[DOI]

Julien Grand-Clément

,

,

Nicolas Vieille

CoRR, 2023

A Convex Relaxation Approach to Bayesian Regret Minimization in Offline Bandits.

[BibT_eX]

[DOI]

Mohammad Ghavamzadeh

,

,

Guy Tennenholtz

CoRR, 2023

On Dynamic Program Decompositions of Static Risk Measures.

[BibT_eX]

[DOI]

,

,

Mohammad Ghavamzadeh

,

CoRR, 2023

Solving multi-model MDPs by coordinate ascent and dynamic programming.

[BibT_eX]

[DOI]

,

Proceedings of the Uncertainty in Artificial Intelligence, 2023

On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes.

[BibT_eX]

[DOI]

,

,

Mohammad Ghavamzadeh

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor.

[BibT_eX]

[DOI]

Julien Grand-Clément

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Percentile Criterion Optimization in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Policy Gradient in Robust MDPs with Global Convergence Guarantee.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Machine Learning, 2023

Entropic Risk Optimization in Discounted MDPs.

[BibT_eX]

[DOI]

,

,

Mohammad Ghavamzadeh

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

On the Convergence of Policy Gradient in Robust MDPs.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

On the convex formulations of robust Markov decision processes.

[BibT_eX]

[DOI]

Julien Grand-Clément

,

CoRR, 2022

RASR: Risk-Averse Soft-Robust MDPs with EVaR and Entropic Risk.

[BibT_eX]

[DOI]

,

,

Mohammad Ghavamzadeh

,

Reazul Hasan Russel

CoRR, 2022

Data poisoning attacks on off-policy policy evaluation methods.

[BibT_eX]

[DOI]

,

Harvineet Singh

,

,

,

Himabindu Lakkaraju

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Robust $\phi$-Divergence MDPs.

[BibT_eX]

[DOI]

,

,

Wolfram Wiesemann

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Partial Policy Iteration for L1-Robust Markov Decision Processes.

[BibT_eX]

[DOI]

,

,

Wolfram Wiesemann

J. Mach. Learn. Res., 2021

Robust Maximum Entropy Behavior Cloning.

[BibT_eX]

[DOI]

Mostafa Hussein

,

,

,

CoRR, 2021

Fast Algorithms for $L_\infty$-constrained S-rectangular Robust MDPs.

[BibT_eX]

[DOI]

Bahram Behzadian

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Robust Behavior Cloning with Adversarial Demonstration Detection.

[BibT_eX]

[DOI]

Mostafa Hussein

,

,

Madison Clark-Turner

,

,

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Policy Gradient Bayesian Robust Optimization for Imitation Learning.

[BibT_eX]

[DOI]

,

Daniel S. Brown

,

,

,

Ashwin Balakrishna

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Optimizing Percentile Criterion using Robust MDPs.

[BibT_eX]

[DOI]

Bahram Behzadian

,

Reazul Hasan Russel

,

,

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020

Soft-Robust Algorithms for Handling Model Misspecification.

[BibT_eX]

[DOI]

,

Mohammad Ghavamzadeh

,

CoRR, 2020

Finite-Sample Analysis of GTD Algorithms.

[BibT_eX]

[DOI]

,

,

Mohammad Ghavamzadeh

,

Sridhar Mahadevan

,

CoRR, 2020

Entropic Risk Constrained Soft-Robust Policy Optimization.

[BibT_eX]

[DOI]

Reazul Hasan Russel

,

Bahram Behzadian

,

CoRR, 2020

Bayesian Robust Optimization for Imitation Learning.

[BibT_eX]

[DOI]

Daniel S. Brown

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Beliefs We Can Believe in: Replacing Assumptions with Data in Real-Time Search.

[BibT_eX]

[DOI]

Maximilian Fickert

,

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Optimizing Norm-Bounded Weighted Ambiguity Sets for Robust MDPs.

[BibT_eX]

[DOI]

Reazul Hasan Russel

,

Bahram Behzadian

,

CoRR, 2019

High-Confidence Policy Optimization: Reshaping Ambiguity Sets in Robust MDPs.

[BibT_eX]

[DOI]

Bahram Behzadian

,

Reazul Hasan Russel

,

CoRR, 2019

Robust Exploration with Tight Bayesian Plausibility Sets.

[BibT_eX]

[DOI]

Reazul Hasan Russel

,

,

CoRR, 2019

Beyond Confidence Regions: Tight Bayesian Ambiguity Sets for Robust MDPs.

[BibT_eX]

[DOI]

,

Reazul Hasan Russel

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Inverse Reinforcement Learning of Interaction Dynamics from Demonstrations.

[BibT_eX]

[DOI]

Mostafa Hussein

,

,

Proceedings of the International Conference on Robotics and Automation, 2019

Fast Feature Selection for Linear Value Function Approximation.

[BibT_eX]

[DOI]

Bahram Behzadian

,

Soheil Gharatappeh

,

Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, 2019

Real-Time Planning as Decision-Making under Uncertainty.

[BibT_eX]

[DOI]

Andrew Mitchell

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity.

[BibT_eX]

[DOI]

,

,

Mohammad Ghavamzadeh

,

,

Sridhar Mahadevan

,

J. Artif. Intell. Res., 2018

Tight Bayesian Ambiguity Sets for Robust MDPs.

[BibT_eX]

[DOI]

Reazul Hasan Russel

,

CoRR, 2018

Interpretable Reinforcement Learning with Ensemble Methods.

[BibT_eX]

[DOI]

Alexander Brown

,

CoRR, 2018

Policy-Conditioned Uncertainty Sets for Robust Markov Decision Processes.

[BibT_eX]

[DOI]

Andrea Tirinzoni

,

,

,

Brian D. Ziebart

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Low-rank Feature Selection for Reinforcement Learning.

[BibT_eX]

[DOI]

Bahram Behzadian

,

Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2018

Fast Bellman Updates for Robust MDPs.

[BibT_eX]

[DOI]

,

,

Wolfram Wiesemann

Proceedings of the 35th International Conference on Machine Learning, 2018

2017

A Practical Method for Solving Contextual Bandit Problems Using Decision Trees.

[BibT_eX]

[DOI]

Adam N. Elmachtoub

,

,

,

Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Value Directed Exploration in Multi-Armed Bandits with Structured Priors.

[BibT_eX]

[DOI]

,

,

Reazul Hasan Russel

,

Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Robust Partially-Compressed Least-Squares.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Building an Interpretable Recommender via Loss-Preserving Transformation.

[BibT_eX]

[DOI]

Amit Dhurandhar

,

,

CoRR, 2016

Interpretable Policies for Dynamic Product Recommendations.

[BibT_eX]

[DOI]

,

Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Safe Policy Improvement by Minimizing Robust Baseline Regret.

[BibT_eX]

[DOI]

Mohammad Ghavamzadeh

,

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Proximal Gradient Temporal Difference Learning Algorithms.

[BibT_eX]

[DOI]

,

,

Mohammad Ghavamzadeh

,

Sridhar Mahadevan

,

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015

Tight Approximations of Dynamic Risk Measures.

[BibT_eX]

[DOI]

Dan Andrei Iancu

,

,

Dharmashankar Subramanian

Math. Oper. Res., 2015

Robust Partially-Compressed Least-Squares.

[BibT_eX]

[DOI]

,

,

,

Karthikeyan Natesan Ramamurthy

CoRR, 2015

Optimal Threshold Control for Energy Arbitrage with Degradable Battery Storage.

[BibT_eX]

[DOI]

,

Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Finite-Sample Analysis of Proximal Gradient TD Algorithms.

[BibT_eX]

[DOI]

,

,

Mohammad Ghavamzadeh

,

Sridhar Mahadevan

,

Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

2014

Efficient and accurate methods for updating generalized linear models with multiple feature additions.

[BibT_eX]

[DOI]

Amit Dhurandhar

,

J. Mach. Learn. Res., 2014

Social media and customer behavior analytics for personalized customer engagements.

[BibT_eX]

[DOI]

Stephen J. Buckley

,

,

,

,

,

Rajesh Kumar Ravi

,

Chitra Venkatramani

IBM J. Res. Dev., 2014

RAAM: The Benefits of Robustness in Approximating Aggregated MDPs in Reinforcement Learning.

[BibT_eX]

[DOI]

,

Dharmashankar Subramanian

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013

Agile logistics simulation and optimization for managing disaster responses.

[BibT_eX]

[DOI]

Francisco Barahona

,

,

,

Peter M. Rimshnick

Proceedings of the Winter Simulations Conference: Simulation Making Decisions in a Complex World, 2013

Solution Methods for Constrained Markov Decision Process with Continuous Probability Modulation.

[BibT_eX]

[DOI]

,

Dharmashankar Subramanian

,

Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

2012

An Approximate Solution Method for Large Risk-Averse Markov Decision Processes.

[BibT_eX]

[DOI]

,

Dharmashankar Subramanian

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Approximate Dynamic Programming By Minimizing Distributionally Robust Bounds.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Machine Learning, 2012

Learning Feature-Based Heuristic Functions.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the Autonomous Search, 2012

2011

Robust Approximate Bilinear Programming for Value Function Approximation.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

J. Mach. Learn. Res., 2011

Linear Dynamic Programs for Resource Management.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010

Global Optimization for Value Function Approximation

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

CoRR, 2010

Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision Processes.

[BibT_eX]

[DOI]

,

,

,

Shlomo Zilberstein

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

2009

Hybrid least-squares algorithms for approximate policy evaluation.

[BibT_eX]

[DOI]

,

,

Sridhar Mahadevan

Mach. Learn., 2009

A Bilinear Programming Approach for Multiagent Planning.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

J. Artif. Intell. Res., 2009

Robust Value Function Approximation Using Bilinear Programming.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Constraint relaxation in approximate linear programs.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008

Biasing Approximate Dynamic Programming with a Lower Discount Factor.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

A Successive Approximation Algorithm for Coordination Problems.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008

Learning Heuristic Functions through Approximate Linear Programming.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the Eighteenth International Conference on Automated Planning and Scheduling, 2008

Interaction Structure and Dimensionality Reduction in Decentralized MDPs.

[BibT_eX]

[DOI]

,

,

Shlomo Zilberstein

Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007

Average-Reward Decentralized Markov Decision Processes.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the IJCAI 2007, 2007

An Analysis of Laplacian Methods for Value Function Approximation in MDPs.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2007, 2007

Anytime Coordination Using Separable Bilinear Programs.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006

Learning parallel portfolios of algorithms.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Ann. Math. Artif. Intell., 2006

Learning Static Parallel Portfolios of Algorithms.

[BibT_eX]

[DOI]

,

Shlomo Zilberstein

Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2006

Loading...