Roberta Raileanu

CoRR, June, 2025

Sparks of Science: Hypothesis Generation Using Structured Paper Data.

[BibT_eX]

[DOI]

CoRR, April, 2025

MLGym: A New Framework and Benchmark for Advancing AI Research Agents.

[BibT_eX]

[DOI]

Ricardo Silveira Cabral

CoRR, February, 2025

Combining Code Generating Large Language Models and Self-Play to Iteratively Refine Strategies in Games.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

MaestroMotif: Skill Design from Artificial Intelligence Feedback.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources.

[BibT_eX]

[DOI]

CoRR, 2024

Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games.

[BibT_eX]

[DOI]

CoRR, 2024

Teaching Large Language Models to Reason with Reinforcement Learning.

[BibT_eX]

[DOI]

Alex Havrilla

Yuqing Du

CoRR, 2024

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts.

[BibT_eX]

[DOI]

Mikayel Samvelyan

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Generalization to New Sequential Decision Making Tasks with In-Context Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements.

[BibT_eX]

[DOI]

Alexander Havrilla

Proceedings of the Forty-first International Conference on Machine Learning, 2024

The Generalization Gap in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Motif: Intrinsic Motivation from Artificial Intelligence Feedback.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Understanding the Effects of RLHF on LLM Generalisation and Diversity.

[BibT_eX]

[DOI]

Robert Kirk

Ishita Mediratta

Proceedings of the Twelfth International Conference on Learning Representations, 2024

DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on the Foundations of Digital Games, 2024

TOOLVERIFIER: Generalization to New Tools via Self-Verification.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Chain-of-Verification Reduces Hallucination in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Augmented Language Models: a Survey.

[BibT_eX]

[DOI]

Grégoire Mialon

Roberto Dessì

Maria Lomeli

Trans. Mach. Learn. Res., 2023

Challenges and Applications of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Toolformer: Language Models Can Teach Themselves to Use Tools.

[BibT_eX]

[DOI]

CoRR, 2023

Toolformer: Language Models Can Teach Themselves to Use Tools.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Importance of Exploration for Generalization in Reinforcement Learning.

[BibT_eX]

[DOI]

Yiding Jiang

J. Zico Kolter

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Language Plasticity via Pretraining with Active Forgetting.

[BibT_eX]

[DOI]

Yihong Chen

Kelly Marchisio

Pontus Lars Erik Saito Stenetorp

David Ifeoluwa Adelani

Sebastian Riedel

Mikel Artetxe

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs.

[BibT_eX]

[DOI]

Mikael Henaff

Minqi Jiang

Proceedings of the International Conference on Machine Learning, 2023

Hyperparameters in Reinforcement Learning and How To Tune Them.

[BibT_eX]

[DOI]

Theresa Eimer

Marius Lindauer

Proceedings of the International Conference on Machine Learning, 2023

MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Jakob Nicolaus Foerster

Tim Rocktäschel

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Building a Subspace of Policies for Scalable Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Improving Intrinsic Exploration with Language Abstractions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploration via Elliptical Episodic Bonuses.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Dungeons and Data: A Large-Scale NetHack Dataset.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Towards More General and Adaptive Deep Reinforcement Learning Agents.

[BibT_eX]

[DOI]

Wojciech Marian Czarnecki

PhD thesis, 2021

Open-Ended Learning Leads to Generally Capable Agents.

[BibT_eX]

[DOI]

Open Ended Learning Team

Nathalie Bradley-Schmieg

CoRR, 2021

Automatic Data Augmentation for Generalization in Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Insights From the NeurIPS 2021 NetHack Challenge.

[BibT_eX]

[DOI]

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

Decoupling Value and Policy for Generalization in Reinforcement Learning.

[BibT_eX]

[DOI]

Rob Fergus

Proceedings of the 38th International Conference on Machine Learning, 2021

Learning with AMIGo: Adversarially Motivated Intrinsic Goals.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Fast Adaptation via Policy-Dynamics Value Functions.

[BibT_eX]

[DOI]

CoRR, 2020

Automatic Data Augmentation for Generalization in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

The NetHack Learning Environment.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Fast Adaptation to New Environments via Policy-Dynamics Value Functions.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments.

[BibT_eX]

[DOI]