Stephanie Milani

Orcid: 0000-0003-1150-4418

According to our database¹, Stephanie Milani authored at least 38 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Selecting Decision-Relevant Concepts in Reinforcement Learning.

[BibT_eX]

[DOI]

Naveen Raman

Stephanie Milani

Fei Fang

CoRR, April, 2026

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale.

[BibT_eX]

[DOI]

CoRR, March, 2026

Content Creation with Generative AI: How Do Content Creators Responsibly Use Generative AI Tools? CSCW009.

[BibT_eX]

[DOI]

Proc. ACM Hum. Comput. Interact., 2026

2025

Making Teams and Influencing Agents: Efficiently Coordinating Decision Trees for Interpretable Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

LICORICE: Label-Efficient Concept-Based Interpretable Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Explainable Reinforcement Learning: A Survey and Comparative Review.

[BibT_eX]

[DOI]

ACM Comput. Surv., July, 2024

Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels.

[BibT_eX]

[DOI]

CoRR, 2024

Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent.

[BibT_eX]

[DOI]

Mohammad Reza Samsami

CoRR, 2024

Unifying Interpretability and Explainability for Alzheimer's Disease Progression Prediction.

[BibT_eX]

[DOI]

CoRR, 2024

PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals.

[BibT_eX]

[DOI]

CoRR, 2024

When is Transfer Learning Possible?

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

PATIENT-ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023

Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition.

[BibT_eX]

[DOI]

CoRR, 2023

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games.

[BibT_eX]

[DOI]

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022

UniMASK: Unified Inference in Sequential Decision Problems.

[BibT_eX]

[DOI]

Matthew J. Hausknecht

Anca D. Dragan

Sam Devlin

CoRR, 2022

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers.

[BibT_eX]

[DOI]

Matthew J. Hausknecht

Anca D. Dragan

Sam Devlin

CoRR, 2022

Retrospective on the 2021 BASALT Competition on Learning from Human Feedback.

[BibT_eX]

[DOI]

Nicholas R. Waytowich

CoRR, 2022

A Survey of Explainable Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2022

MAVIPER: Learning Decision Tree Policies for Interpretable Multi-agent Reinforcement Learning.

[BibT_eX]

[DOI]

Evangelos E. Papalexakis

Fei Fang

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Uni[MASK]: Unified Inference in Sequential Decision Problems.

[BibT_eX]

[DOI]

Matthew J. Hausknecht

Anca D. Dragan

Sam Devlin

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

How Humans Perceive Human-like Behavior in Video Game Navigation.

[BibT_eX]

[DOI]

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021

The MineRL BASALT Competition on Learning from Human Feedback.

[BibT_eX]

[DOI]

CoRR, 2021

Towards robust and domain agnostic reinforcement learning competitions.

[BibT_eX]

[DOI]

CoRR, 2021

The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors.

[BibT_eX]

[DOI]

William H. Guss

Mario Ynocente Castro

CoRR, 2021

Retrospective on the 2021 MineRL BASALT Competition on Learning from Human Feedback.

[BibT_eX]

[DOI]

Nicholas R. Waytowich

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition.

[BibT_eX]

[DOI]

Proceedings of the NeurIPS 2022 Competition Track, 2021

MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned.

[BibT_eX]

[DOI]

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Guaranteeing Reproducibility in Deep Learning Competitions.

[BibT_eX]

[DOI]

CoRR, 2020

Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020.

[BibT_eX]

[DOI]

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

Harnessing the Power of Deception in Attack Graph-Based Security Games.

[BibT_eX]

[DOI]

Proceedings of the Decision and Game Theory for Security - 11th International Conference, 2020

Planning with Abstract Learned Models While Learning Transferable Subtasks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

The MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors.

[BibT_eX]

[DOI]

CoRR, 2019

Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the NeurIPS 2019 Competition and Demonstration Track, 2019

Perceptions of Domestic Robots' Normative Behavior Across Cultures.

[BibT_eX]

[DOI]

Huao Li

Stephanie Milani

Vigneshram Krishnamoorthy

Michael Lewis

Katia P. Sycara

Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019

Stephanie Milani

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...