AI safety field consensus
People in AI safety tend to disagree about many things. However, there is also wide agreement within the field on some points that people outside the field often dispute, including:
- Orthogonality thesis
- Instrumental convergence
- Edge instantiation / patch resistance
- Goodhart problems (a minimal sketch follows this list)
- AGI is possible in principle (that is, it is virtually certain that humans can create AGI)
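To give a feel for what "Goodhart problems" refers to, here is a minimal sketch of one variant (regressional Goodhart): a proxy that correlates with the true objective becomes a worse guide the harder it is optimized. The distributions, pool sizes, and variable names below are illustrative assumptions, not anything from this page.

```python
# Minimal sketch (assumed example) of a regressional Goodhart problem:
# selecting on a noisy proxy overstates true value more as selection
# pressure increases.
import random

random.seed(0)

# Each candidate action has a true value we care about and a proxy score
# we can actually measure (true value plus measurement noise).
candidates = []
for _ in range(100_000):
    true_value = random.gauss(0.0, 1.0)
    proxy_score = true_value + random.gauss(0.0, 1.0)
    candidates.append((proxy_score, true_value))

for pressure in (10, 1_000, 100_000):
    # "Optimization pressure" = picking the best-by-proxy candidate
    # from a pool of the given size.
    best_proxy, best_true = max(candidates[:pressure])
    print(f"pool size {pressure:>7}: proxy {best_proxy:5.2f}, "
          f"true value {best_true:5.2f}, overestimate {best_proxy - best_true:5.2f}")
```

Under these assumptions, the gap between the selected candidate's proxy score and its true value grows with the size of the pool: the harder the proxy is optimized, the less it tells you about the thing you actually wanted.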
One operationalization might be: what are the things relevant to AI safety that all of Eliezer Yudkowsky, Paul Christiano, Robin Hanson, Rohin Shah, Dario Amodei, and Wei Dai agree on?