Difference between revisions of "List of technical AI alignment agendas"
(Created page with "This is a '''list of technical AI alignment agendas'''. https://aiwatch.issarice.com/#agendas") |
|||
Line 1: | Line 1: | ||
This is a '''list of technical AI alignment agendas'''. | This is a '''list of technical AI alignment agendas'''. | ||
+ | |||
+ | * [[Iterated amplification]] | ||
+ | * [[Embedded agency]] | ||
+ | * [[Comprehensive AI services]] | ||
+ | * [[Ambitious value learning]] | ||
+ | * [[Factored cognition]] | ||
+ | * [[Recursive reward modeling]] | ||
+ | * [[Debate]] | ||
+ | * [[Interpretability]] | ||
+ | * [[Inverse reinforcement learning]] | ||
+ | * [[Preference learning]] | ||
+ | * [[Cooperative inverse reinforcement learning]] | ||
+ | * [[Imitation learning]] | ||
+ | * [[Alignment for advanced machine learning systems]] | ||
+ | * [[Learning-theoretic AI alignment]] | ||
+ | * [[Counterfactual reasoning]] | ||
https://aiwatch.issarice.com/#agendas | https://aiwatch.issarice.com/#agendas | ||
+ | |||
+ | [[Category:AI safety]] |
Latest revision as of 21:29, 2 April 2021
This is a list of technical AI alignment agendas.
- Iterated amplification
- Embedded agency
- Comprehensive AI services
- Ambitious value learning
- Factored cognition
- Recursive reward modeling
- Debate
- Interpretability
- Inverse reinforcement learning
- Preference learning
- Cooperative inverse reinforcement learning
- Imitation learning
- Alignment for advanced machine learning systems
- Learning-theoretic AI alignment
- Counterfactual reasoning