Canfer Akbulut

According to our database1, Canfer Akbulut authored at least 9 papers between 2024 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Evaluating Language Models for Harmful Manipulation.
CoRR, March, 2026

2025
Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models.
CoRR, February, 2025

Century: A Framework and Dataset for Evaluating Historical Contextualisation of Sensitive Images.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Towards Responsible Development of Generative AI for Education: An Evaluation-Driven Approach.
CoRR, 2024

The Ethics of Advanced AI Assistants.
CoRR, 2024

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI.
CoRR, 2024

STAR: SocioTechnical Approach to Red Teaming Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Gaps in the Safety Evaluation of Generative AI.
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024

All Too Human? Mapping and Mitigating the Risk from Anthropomorphic AI.
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024


  Loading...