2023
Frontier AI Regulation: Managing Emerging Risks to Public Safety.
Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Timothy Fist, Gillian K. Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf
CoRR, 2023
Model evaluation for extreme risks.
Toby Shevlane, Sebastian Farquhar, Ben Garfinkel, Mary Phuong, Jess Whittlestone, Jade Leung, Daniel Kokotajlo, Nahema Marchal, Markus Anderljung, Noam Kolt, Lewis Ho, Divya Siddarth, Shahar Avin, Will Hawkins, Been Kim, Iason Gabriel, Vijay Bolina, Jack Clark, Yoshua Bengio, Paul F. Christiano, Allan Dafoe
CoRR, 2023
2020
Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims.
Miles Brundage, Shahar Avin, Jasmine Wang, Haydn Belfield, Gretchen Krueger, Gillian K. Hadfield, Heidy Khlaaf, Jingying Yang, Helen Toner, Ruth Fong, Tegan Maharaj, Pang Wei Koh, Sara Hooker, Jade Leung, Andrew Trask, Emma Bluemke, Jonathan Lebensold, Cullen O'Keefe, Mark Koren, Théo Ryffel, J. B. Rubinovitz, Tamay Besiroglu, Federica Carugati, Jack Clark, Peter Eckersley, Sarah de Haas, Maritza Johnson, Ben Laurie, Alex Ingerman, Igor Krawczuk, Amanda Askell, Rosario Cammarota, Andrew Lohn, David Krueger, Charlotte Stix, Peter Henderson, Logan Graham, Carina Prunkl, Bianca Martin, Elizabeth Seger, Noa Zilberman, Seán Ó hÉigeartaigh, Frens Kroeger, Girish Sastry, Rebecca Kagan, Adrian Weller, Brian Tse, Elizabeth Barnes, Allan Dafoe, Paul Scharre, Ariel Herbert-Voss, Martijn Rasser, Shagun Sodhani, Carrick Flynn, Thomas Krendl Gilbert, Lisa Dyer, Saif M. Khan, Yoshua Bengio, Markus Anderljung
CoRR, 2020
The Windfall Clause: Distributing the Benefits of AI for the Common Good.
Cullen O'Keefe, Peter Cihon, Ben Garfinkel, Carrick Flynn, Jade Leung, Allan Dafoe
Proceedings of AIES '20: AAAI/ACM Conference on AI, Ethics, and Society, 2020