Difference between revisions of "Iterated amplification"

Latest revision as of 03:58, 26 April 2020

Iterated amplification (also called iterated distillation and amplification, and abbreviated IDA) is the technical alignment agenda that Paul Christiano works on.

Terminology (not necessarily about IDA, but these are some terms frequently used by Paul):

@@ Line 6: / Line 6: @@
 * [[adequate oversight]]
 * [[overseer]]
-* [[bandwidth of the overseer]]
+* [[bandwidth of the overseer]], [[high bandwidth oversight]], [[low bandwidth oversight]]
 * [[reward engineering]]
 * [[HCH]], [[Strong HCH]], [[Weak HCH]], [[Humans consulting HCH]]
@@ Line 30: / Line 30: @@
 * [[universality]]
 * [[narrow value learning]] vs [[ambitious value learning]]
+* [[learning with catastrophes]], [[optimizing worst-case performance]]
 ==See also==
@@ Line 36: / Line 37: @@
 [[Category:Iterated amplification]]
+[[Category:AI safety]]

Difference between revisions of "Iterated amplification"

Latest revision as of 03:58, 26 April 2020

See also

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools