Gabriel Mukobi

Orcid: 0009-0004-7715-0717

According to our database1, Gabriel Mukobi authored at least 11 papers between 2023 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
AI Consciousness and Public Perceptions: Four Futures.
CoRR, 2024

Reasons to Doubt the Impact of AI Risk Evaluations.
CoRR, 2024

Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
CoRR, 2024

Open Problems in Technical AI Governance.
CoRR, 2024

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
CoRR, 2024

Societal Adaptation to Advanced AI.
CoRR, 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning.
CoRR, 2024


Escalation Risks from Language Models in Military and Diplomatic Decision-Making.
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

2023
SuperHF: Supervised Iterative Learning from Human Feedback.
CoRR, 2023

Welfare Diplomacy: Benchmarking Language Model Cooperation.
CoRR, 2023


  Loading...