Stephanie Milani

Orcid: 0000-0003-1150-4418

According to our database1, Stephanie Milani authored at least 26 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023
Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning.
CoRR, 2023

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition.
CoRR, 2023

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
UniMASK: Unified Inference in Sequential Decision Problems.
CoRR, 2022

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers.
CoRR, 2022

Retrospective on the 2021 BASALT Competition on Learning from Human Feedback.
CoRR, 2022

A Survey of Explainable Reinforcement Learning.
CoRR, 2022

MAVIPER: Learning Decision Tree Policies for Interpretable Multi-agent Reinforcement Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Uni[MASK]: Unified Inference in Sequential Decision Problems.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

How Humans Perceive Human-like Behavior in Video Game Navigation.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021
The MineRL BASALT Competition on Learning from Human Feedback.
CoRR, 2021

Towards robust and domain agnostic reinforcement learning competitions.
CoRR, 2021

The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors.
CoRR, 2021

Retrospective on the 2021 MineRL BASALT Competition on Learning from Human Feedback.
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021



Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Guaranteeing Reproducibility in Deep Learning Competitions.
CoRR, 2020


Harnessing the Power of Deception in Attack Graph-Based Security Games.
Proceedings of the Decision and Game Theory for Security - 11th International Conference, 2020

Planning with Abstract Learned Models While Learning Transferable Subtasks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
The MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors.
CoRR, 2019

Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning.
Proceedings of the NeurIPS 2019 Competition and Demonstration Track, 2019

Perceptions of Domestic Robots' Normative Behavior Across Cultures.
Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019


  Loading...