AI safety field consensus

From Issawiki

Revision as of 04:24, 25 February 2020 by Issa (talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to: navigation, search

People in AI safety tend to disagree about many things. However, there is also wide agreement about some other things (which people outside the field often disagree about).

Orthogonality thesis
Instrumental convergence
Edge instantiation / patch resistance
Goodhart problems i.e. awareness that Goodhart's law is a thing, and general attention/wariness of it
AGI possible in principle (as in, it is virtually certain that humans can create AGI)

see also "Background AI safety intuitions" section in [1]

one operationalization might be something like: what are the things relevant to AI safety that all of Eliezer Yudkowsky, Paul Christiano, Robin Hanson, Rohin Shah, Dario Amodei, and Wei Dai agree on?

Retrieved from "https://wiki.issarice.com/index.php?title=AI_safety_field_consensus&oldid=279"