Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

Miles Brundage 1 Shahar Avin 2 Jasmine Wang 3 Haydn Belfield 2 Gretchen Krueger 1 Gillian Hadfield 1 Heidy Khlaaf 4 Jingying Yang 5 Helen Toner 6 Ruth Fong 7 Tegan Maharaj 8 Wei Koh 9 Sara Hooker 10 Jade Leung 11 Andrew Trask 7 Emma Bluemke 7 Jonathan Lebensold 12 Cullen O'Keefe 1 Mark Koren 9 Théo Ryffel 13, 14 J Rubinovitz 15 Tamay Besiroglu 16 Federica Carugati 17 Jack Clark 1 Peter Eckersley 5 Sarah de Haas 18 Maritza Johnson 18 Ben Laurie 18 Alex Ingerman 18 Igor Krawczuk 19 Amanda Askell 1 Rosario Cammarota 20 Andrew Lohn 21 David Krueger 22 Charlotte Stix 23 Peter Henderson 9 Logan Graham 7 Carina Prunkl 11 Bianca Martin 1 Elizabeth Seger 16 Noa Zilberman 7 Seán Héigeartaigh 2 Frens Kroeger 24 Girish Sastry 1 Rebecca Kagan 6 Adrian Weller 16 Brian Tse 11, 5 Elizabeth Barnes 1 Allan Dafoe 7 Paul Scharre 25 Ariel Herbert-Voss 1 Martijn Rasser 25 Shagun Sodhani 22 Carrick Flynn 6 Thomas Gilbert 26 Lisa Dyer 5 Saif Khan 6 Yoshua Bengio 22 Markus Anderljung 11
Document type :
Preprints, Working Papers, ...
Complete list of metadata

https://hal.inria.fr/hal-03065927
Contributor : Théo Ryffel Connect in order to contact the contributor
Submitted on : Tuesday, December 15, 2020 - 8:40:48 AM
Last modification on : Friday, January 21, 2022 - 3:23:00 AM
Long-term archiving on: : Tuesday, March 16, 2021 - 6:32:20 PM

File

2004.07213.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03065927, version 1
  • ARXIV : 2004.07213

Collections

Citation

Miles Brundage, Shahar Avin, Jasmine Wang, Haydn Belfield, Gretchen Krueger, et al.. Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims. 2020. ⟨hal-03065927⟩

Share

Metrics

Les métriques sont temporairement indisponibles