AlphaGo

From Issawiki
Revision as of 21:54, 13 June 2020 by Issa (talk | contribs) (Eliezer's commentary)
Jump to: navigation, search

AlphaGo and its successor AlphaGo Zero are used to make various points in AI safety.

Eliezer's commentary

Date Initial segment + link Description
2016-02-08 I have one bet on at 2:3 against AlphaGo winning against Sedol in March - they get my $667 if AlphaGo wins, I get their $1000 if AlphaGo loses
2016-02-29 This suggests that AlphaGo beating Sedol in March might not be nearly as out-of-character fast progress as I thought
2016-03-09 Second match ongoing. #AlphaGo just made a move that everyone is saying nobody else would have played. #Sedol walked out of the room with his clock running, presumably to think about it.
2016-03-10 It's possible that, contrary to hopeful commentators, #AlphaGo is not actually enriching the Go game for humans
2016-03-11 (Long.) As I post this, AlphaGo seems almost sure to win the third game and the match.
2016-03-13 And then AlphaGo got confused in a way no human would and lost its 4th game
2016-03-13 Okay, look, to everyone going "Aha but of course superhuman cognition will always be bugged for deep reason blah": Please remember that machine chess *is* out of the phase where a human can analyze it psychologically without computer assistance
2017-10-19 AlphaGo Zero uses 4 TPUs, is built entirely out of neural nets with no handcrafted features, doesn't pretrain against expert games or anything else human, reaches a superhuman level after 3 days of self-play, and is the strongest version of AlphaGo yet