Toby Shevlane

According to our database1, Toby Shevlane authored at least 5 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI.
CoRR, 2024

Evaluating Frontier Models for Dangerous Capabilities.
CoRR, 2024

2023
Model evaluation for extreme risks.
CoRR, 2023

2022
Structured access to AI capabilities: an emerging paradigm for safe AI deployment.
CoRR, 2022

2020
The Offense-Defense Balance of Scientific Knowledge: Does Publishing AI Research Reduce Misuse?
Proceedings of the AIES '20: AAAI/ACM Conference on AI, 2020


  Loading...