We stand with Ukraine

We stand with Ukraine

Miljan Martic

According to our database¹, Miljan Martic authored at least 10 papers between 2017 and 2021.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2021

Causal Analysis of Agent Behavior for AI Safety.

[DOI]

Grégoire Delétang

,

Jordi Grau-Moya

,

,

,

,

Vladimir Mikulik

,

,

,

Pedro A. Ortega

CoRR, 2021

2020

Algorithms for Causal Reasoning in Probability Trees.

[DOI]

,

,

Grégoire Delétang

,

Vladimir Mikulik

,

,

,

Pedro A. Ortega

CoRR, 2020

Meta-trained agents implement Bayes-optimal agents.

[DOI]

Vladimir Mikulik

,

Grégoire Delétang

,

,

,

,

,

Pedro A. Ortega

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Avoiding Side Effects By Considering Future Tasks.

[DOI]

Victoria Krakovna

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

Penalizing Side Effects using Stepwise Relative Reachability.

[DOI]

Victoria Krakovna

,

,

,

Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the 28th International Joint Conference on Artificial Intelligence, 2019

2018

Scaling shared model governance via model splitting.

[DOI]

,

,

,

,

,

CoRR, 2018

Scalable agent alignment via reward modeling: a research direction.

[DOI]

,

,

,

,

,

CoRR, 2018

Measuring and avoiding side effects using relative reachability.

[DOI]

Victoria Krakovna

,

,

,

CoRR, 2018

2017

AI Safety Gridworlds.

[DOI]

,

,

Victoria Krakovna

,

Pedro A. Ortega

,

,

Andrew Lefrancq

,

,

CoRR, 2017

Deep Reinforcement Learning from Human Preferences.

[DOI]

Paul F. Christiano

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Loading...