Xander Davies
According to our database1,
Xander Davies authored at least 22 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition.
CoRR, March, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, October, 2025
CoRR, October, 2025
CoRR, October, 2025
Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs.
CoRR, August, 2025
Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition.
CoRR, July, 2025
CoRR, June, 2025
Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback.
Trans. Mach. Learn. Res., 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023