Sam Toyer

Orcid: 0000-0002-6665-6593

According to our database1, Sam Toyer authored at least 18 papers between 2017 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
IH-Challenge: A Training Dataset to Improve Instruction Hierarchy on Frontier LLMs.
CoRR, March, 2026

2025
Trading Inference-Time Compute for Adversarial Robustness.
CoRR, January, 2025

2024
Deliberative Alignment: Reasoning Enables Safer Language Models.
CoRR, 2024

Human Action Anticipation: A Survey.
CoRR, 2024

Exploring and Addressing Reward Confusion in Offline Preference Learning.
CoRR, 2024

A StrongREJECT for Empty Jailbreaks.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2022
imitation: Clean Imitation Learning Implementations.
CoRR, 2022

An Empirical Investigation of Representation Learning for Imitation.
CoRR, 2022

A Primer on Maximum Causal Entropy Inverse Reinforcement Learning.
CoRR, 2022

2021
An Empirical Investigation of Representation Learning for Imitation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

2020
ASNets: Deep Learning for Generalised Planning.
J. Artif. Intell. Res., 2020

DERAIL: Diagnostic Environments for Reward And Imitation Learning.
CoRR, 2020

The MAGICAL Benchmark for Robust Imitation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Guiding Search with Generalized Policies for Probabilistic Planning.
Proceedings of the Twelfth International Symposium on Combinatorial Search, 2019

Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Action Schema Networks: Generalised Policies With Deep Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Human Pose Forecasting via Deep Markov Models.
Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017


  Loading...