Alexander Robey

Orcid: 0009-0003-5693-2819

According to our database¹, Alexander Robey authored at least 41 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Toward Understanding the Transferability of Adversarial Suffixes in Large Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Preventing Robotic Jailbreaking via Multimodal Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, September, 2025

Algorithms for Adversarially Robust Deep Learning.

[BibT_eX]

[DOI]

Alexander Robey

CoRR, September, 2025

Embodied AI: Emerging Risks and Opportunities for Policy Action.

[BibT_eX]

[DOI]

CoRR, September, 2025

Evaluating Language Model Reasoning about Confidential Information.

[BibT_eX]

[DOI]

CoRR, August, 2025

Command-V: Pasting LLM Behaviors via Activation Profiles.

[BibT_eX]

[DOI]

CoRR, June, 2025

Benchmarking Misuse Mitigation Against Covert Adversaries.

[BibT_eX]

[DOI]

CoRR, June, 2025

Adversarial Attacks on Robotic Vision Language Action Models.

[BibT_eX]

[DOI]

Eliot Krzysztof Jones

CoRR, June, 2025

Existing Large Language Model Unlearning Evaluations Are Inconclusive.

[BibT_eX]

[DOI]

CoRR, June, 2025

Transferable Adversarial Attacks on Black-Box Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

Safety Pretraining: Toward the Next Generation of Safe AI.

[BibT_eX]

[DOI]

CoRR, April, 2025

Antidistillation Sampling.

[BibT_eX]

[DOI]

CoRR, April, 2025

Safety Guardrails for LLM-Enabled Robots.

[BibT_eX]

[DOI]

CoRR, March, 2025

Steering Dialogue Dynamics for Robustness against Multi-turn Jailbreaking Attacks.

[BibT_eX]

[DOI]

Hanjiang Hu

Alexander Robey

Changliu Liu

CoRR, March, 2025

SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation.

[BibT_eX]

[DOI]

Joshua Nathaniel Williams

Trans. Mach. Learn. Res., 2025

Jailbreaking Black Box Large Language Models in Twenty Queries.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Secure and Trustworthy Machine Learning, 2025

Jailbreaking LLM-Controlled Robots.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

2024

A Safe Harbor for AI Evaluation and Red Teaming.

[BibT_eX]

[DOI]

CoRR, 2024

Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing.

[BibT_eX]

[DOI]

CoRR, 2024

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models.

[BibT_eX]

[DOI]

Patrick Chao

Edoardo Debenedetti

Alexander Robey

Maksym Andriushchenko

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Position: A Safe Harbor for AI Evaluation and Red Teaming.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adversarial Training Should Be Cast as a Non-Zero-Sum Game.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Provable Tradeoffs in Adversarially Robust Classification.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, December, 2023

Data-Driven Modeling and Verification of Perception-Based Autonomous Systems.

[BibT_eX]

[DOI]

CoRR, 2023

Toward Certified Robustness Against Real-World Distribution Shifts.

[BibT_eX]

[DOI]

Proceedings of the 2023 IEEE Conference on Secure and Trustworthy Machine Learning, 2023

2022

Probable Domain Generalization via Quantile Risk Minimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On the Sample Complexity of Stability Constrained Imitation Learning.

[BibT_eX]

[DOI]

Proceedings of the Learning for Dynamics and Control Conference, 2022

Probabilistically Robust Learning: Balancing Average and Worst-case Performance.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Do deep networks transfer invariances across classes?

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Chordal Sparsity for Lipschitz Constant Estimation of Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 61st IEEE Conference on Decision and Control, 2022

2021

Learning Robust Output Control Barrier Functions from Safe Expert Demonstrations.

[BibT_eX]

[DOI]

CoRR, 2021

Closing the Closed-Loop Distribution Shift in Safe Imitation Learning.

[BibT_eX]

[DOI]

Stephen Tu

Alexander Robey

Nikolai Matni

CoRR, 2021

Model-Based Domain Generalization.

[BibT_eX]

[DOI]

Alexander Robey

George J. Pappas

Hamed Hassani

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Adversarial Robustness with Semi-Infinite Constrained Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Optimal Algorithms for Submodular Maximization with Distributed Constraints.

[BibT_eX]

[DOI]

Proceedings of the 3rd Annual Conference on Learning for Dynamics and Control, 2021

Learning Robust Hybrid Control Barrier Functions for Uncertain Systems.

[BibT_eX]

[DOI]

Proceedings of the 7th IFAC Conference on Analysis and Design of Hybrid Systems, 2021

2020

Model-Based Robust Deep Learning.

[BibT_eX]

[DOI]

Alexander Robey

Hamed Hassani

George J. Pappas

CoRR, 2020

Learning Hybrid Control Barrier Functions from Data.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

Learning Control Barrier Functions from Expert Demonstrations.

[BibT_eX]

[DOI]

Proceedings of the 59th IEEE Conference on Decision and Control, 2020

2019

Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Alexander Robey

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...