Difference between revisions of "AI safety field consensus"

From Issawiki
Jump to: navigation, search
Line 3: Line 3:
 
* [[Orthogonality thesis]]
 
* [[Orthogonality thesis]]
 
* [[Instrumental convergence]]
 
* [[Instrumental convergence]]
* Edge instantiation / patch resistance
+
* [[Edge instantiation]]
 +
* [[Patch resistance]]
 
* Goodhart problems i.e. awareness that [[Goodhart's law]] is a thing, and general attention/wariness of it
 
* Goodhart problems i.e. awareness that [[Goodhart's law]] is a thing, and general attention/wariness of it
 
* AGI possible in principle (as in, it is virtually certain that humans can create AGI)
 
* AGI possible in principle (as in, it is virtually certain that humans can create AGI)

Revision as of 04:24, 25 February 2020

People in AI safety tend to disagree about many things. However, there is also wide agreement about some other things (which people outside the field often disagree about).

see also "Background AI safety intuitions" section in [1]

one operationalization might be something like: what are the things relevant to AI safety that all of Eliezer Yudkowsky, Paul Christiano, Robin Hanson, Rohin Shah, Dario Amodei, and Wei Dai agree on?