Alexander Robey

Orcid: 0009-0003-5693-2819

According to our database1, Alexander Robey authored at least 36 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Command-V: Pasting LLM Behaviors via Activation Profiles.
CoRR, June, 2025

Benchmarking Misuse Mitigation Against Covert Adversaries.
CoRR, June, 2025

Adversarial Attacks on Robotic Vision Language Action Models.
CoRR, June, 2025

Existing Large Language Model Unlearning Evaluations Are Inconclusive.
CoRR, June, 2025

Transferable Adversarial Attacks on Black-Box Vision-Language Models.
CoRR, May, 2025

Safety Pretraining: Toward the Next Generation of Safe AI.
CoRR, April, 2025

Antidistillation Sampling.
CoRR, April, 2025

Safety Guardrails for LLM-Enabled Robots.
CoRR, March, 2025

Steering Dialogue Dynamics for Robustness against Multi-turn Jailbreaking Attacks.
CoRR, March, 2025

SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks.
Trans. Mach. Learn. Res., 2025

Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation.
Trans. Mach. Learn. Res., 2025

Jailbreaking Black Box Large Language Models in Twenty Queries.
Proceedings of the IEEE Conference on Secure and Trustworthy Machine Learning, 2025

2024
Jailbreaking LLM-Controlled Robots.
CoRR, 2024

A Safe Harbor for AI Evaluation and Red Teaming.
CoRR, 2024

Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing.
CoRR, 2024

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024


Adversarial Training Should Be Cast as a Non-Zero-Sum Game.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Provable Tradeoffs in Adversarially Robust Classification.
IEEE Trans. Inf. Theory, December, 2023

Data-Driven Modeling and Verification of Perception-Based Autonomous Systems.
CoRR, 2023

Toward Certified Robustness Against Real-World Distribution Shifts.
Proceedings of the 2023 IEEE Conference on Secure and Trustworthy Machine Learning, 2023

2022
Probable Domain Generalization via Quantile Risk Minimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On the Sample Complexity of Stability Constrained Imitation Learning.
Proceedings of the Learning for Dynamics and Control Conference, 2022

Probabilistically Robust Learning: Balancing Average and Worst-case Performance.
Proceedings of the International Conference on Machine Learning, 2022

Do deep networks transfer invariances across classes?
Proceedings of the Tenth International Conference on Learning Representations, 2022

Chordal Sparsity for Lipschitz Constant Estimation of Deep Neural Networks.
Proceedings of the 61st IEEE Conference on Decision and Control, 2022

2021
Learning Robust Output Control Barrier Functions from Safe Expert Demonstrations.
CoRR, 2021

Closing the Closed-Loop Distribution Shift in Safe Imitation Learning.
CoRR, 2021

Model-Based Domain Generalization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Adversarial Robustness with Semi-Infinite Constrained Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Optimal Algorithms for Submodular Maximization with Distributed Constraints.
Proceedings of the 3rd Annual Conference on Learning for Dynamics and Control, 2021

Learning Robust Hybrid Control Barrier Functions for Uncertain Systems.
Proceedings of the 7th IFAC Conference on Analysis and Design of Hybrid Systems, 2021

2020
Model-Based Robust Deep Learning.
CoRR, 2020

Learning Hybrid Control Barrier Functions from Data.
Proceedings of the 4th Conference on Robot Learning, 2020

Learning Control Barrier Functions from Expert Demonstrations.
Proceedings of the 59th IEEE Conference on Decision and Control, 2020

2019
Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019


  Loading...