Derek Duenas

According to our database1, Derek Duenas authored at least 4 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents.
CoRR, 2024

Improving Alignment and Robustness with Circuit Breakers.
CoRR, 2024

Improving Alignment and Robustness with Circuit Breakers.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024


  Loading...