Sam Toyer

Orcid: 0000-0002-6665-6593

According to our database, Sam Toyer authored at least 18 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

IH-Challenge: A Training Dataset to Improve Instruction Hierarchy on Frontier LLMs.

Chuan Guo

Juan Felipe Ceron Uribe

Sicheng Zhu

Christopher A. Choquette-Choo

CoRR, March, 2026

2025

Trading Inference-Time Compute for Adversarial Robustness.

Wojciech Zaremba

Evgenia Nitishinskaya

CoRR, January, 2025

2024

Deliberative Alignment: Reasoning Enables Safer Language Models.

CoRR, 2024

Human Action Anticipation: A Survey.

CoRR, 2024

Exploring and Addressing Reward Confusion in Offline Preference Learning.

Xin Chen

Sam Toyer

Florian Shkurti

CoRR, 2024

A StrongREJECT for Empty Jailbreaks.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2022

imitation: Clean Imitation Learning Implementations.

CoRR, 2022

An Empirical Investigation of Representation Learning for Imitation.

CoRR, 2022

A Primer on Maximum Causal Entropy Inverse Reinforcement Learning.

Adam Gleave

Sam Toyer

CoRR, 2022

2021

An Empirical Investigation of Representation Learning for Imitation.

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

2020

ASNets: Deep Learning for Generalised Planning.

J. Artif. Intell. Res., 2020

DERAIL: Diagnostic Environments for Reward And Imitation Learning.

CoRR, 2020

The MAGICAL Benchmark for Robust Imitation.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

Guiding Search with Generalized Policies for Probabilistic Planning.

Proceedings of the Twelfth International Symposium on Combinatorial Search, 2019

Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow.

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Action Schema Networks: Generalised Policies With Deep Learning.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Human Pose Forecasting via Deep Markov Models.

Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017
