Gabriel Mukobi

Orcid: 0009-0004-7715-0717

According to our database¹, Gabriel Mukobi authored at least 12 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

Open Problems in Technical AI Governance.

Trans. Mach. Learn. Res., 2025

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

AI Consciousness and Public Perceptions: Four Futures.

CoRR, 2024

Reasons to Doubt the Impact of AI Risk Evaluations.

Gabriel Mukobi

CoRR, 2024

Open Problems in Technical AI Governance.

CoRR, 2024

Societal Adaptation to Advanced AI.

CoRR, 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning.

Ann-Kathrin Dombrowski

Justin Tienken-Harder

Kallol Krishna Karmakar

Steven Basart

Stephen Fitz

Mindy Levine

Ponnurangam Kumaraguru

CoRR, 2024

Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Escalation Risks from Language Models in Military and Diplomatic Decision-Making.

Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

2023

SuperHF: Supervised Iterative Learning from Human Feedback.

CoRR, 2023

Welfare Diplomacy: Benchmarking Language Model Cooperation.

CoRR, 2023
