Difference between revisions of "Corrigibility"

From Issawiki
Jump to: navigation, search
(Created page with "'''Corrigibility''' is a term used in AI safety with multiple/unclear meanings. I think the term was originally used by MIRI to mean something like an AI that allowed hum...")
 
Line 4: Line 4:
  
 
Then the idea was generalized by [[Paul Christiano]] to mean something like an AI assistant that is trying to be helpful to humans.
 
Then the idea was generalized by [[Paul Christiano]] to mean something like an AI assistant that is trying to be helpful to humans.
 +
 +
==See also==
 +
 +
* [[Corrigibility may be undesirable]]
  
 
==External links==
 
==External links==

Revision as of 22:18, 25 March 2021

Corrigibility is a term used in AI safety with multiple/unclear meanings.

I think the term was originally used by MIRI to mean something like an AI that allowed human programmers to shut it off.

Then the idea was generalized by Paul Christiano to mean something like an AI assistant that is trying to be helpful to humans.

See also

External links