Difference between revisions of "List of success criteria for HRAD work"
Line 8: | Line 8: | ||
* helps AGI programmers postdict/explain problems in early advanced AI systems | * helps AGI programmers postdict/explain problems in early advanced AI systems | ||
* ideas from HRAD will be a "useful source of inspiration" for ML/AGI work [https://eaforum.issarice.com/posts/SEL9PW8jozrvLnkb4/my-current-thoughts-on-miri-s-highly-reliable-agent-design] | * ideas from HRAD will be a "useful source of inspiration" for ML/AGI work [https://eaforum.issarice.com/posts/SEL9PW8jozrvLnkb4/my-current-thoughts-on-miri-s-highly-reliable-agent-design] | ||
+ | * when applying HRAD to actual systems, there will be "theoretically satisfying approximation methods" that make this application possible [https://eaforum.issarice.com/posts/SEL9PW8jozrvLnkb4/my-current-thoughts-on-miri-s-highly-reliable-agent-design] | ||
+ | * when applying HRAD to actual systems, the approximation methods used will preserve the important desirable properties of HRAD work [https://eaforum.issarice.com/posts/SEL9PW8jozrvLnkb4/my-current-thoughts-on-miri-s-highly-reliable-agent-design] | ||
==See also== | ==See also== |
Revision as of 02:12, 2 June 2020
This page is a list of success criteria that have been proposed for HRAD work.
- resembles the work of Turing, Shannon, Bayes, etc
- helps AGI programmers avoid mistakes analogous to the use of null-terminated strings in C
- early advanced AI systems will be understandable in terms of HRAD's formalisms [1] (need to clarify what it means to be understandable in terms of a formalism)
- helps AGI programmers fix problems in early advanced AI systems
- helps AGI programmers predict problems in early advanced AI systems
- helps AGI programmers postdict/explain problems in early advanced AI systems
- ideas from HRAD will be a "useful source of inspiration" for ML/AGI work [2]
- when applying HRAD to actual systems, there will be "theoretically satisfying approximation methods" that make this application possible [3]
- when applying HRAD to actual systems, the approximation methods used will preserve the important desirable properties of HRAD work [4]