gary
22-10-2017, 04:23 PM
In an 18 Oct 2017 paper in Nature (https://www.nature.com/articles/nature24270.epdf?author_access_toke n=VJXbVjaSHxFoctQQ4p2k4tRgN0jAjWel9 jnR3ZoTv0PVW4gB86EEpGqTRDtpIz-2rmo8-KG06gqVobU5NSCFeHILHcVFUeMsbvwS-lxjqQGg98faovwjxeTUgZAUMnRQ), Silver et al. from Google's DeepMind Technologies Limited describe
AlphaGo Zero, their latest evolution of AlphaGo (https://en.wikipedia.org/wiki/AlphaGo), which was the
first program ever to defeat a world champion at the ancient Chinese
game of Go (https://en.wikipedia.org/wiki/Go_(game)).
Both programs use neural networks. However, whereas AlphaGo was first
trained on human expert moves and then improved through self-play,
AlphaGo Zero was given only the rules of the game and taught itself to
become an expert player entirely through self-play.
In fact, it went from a beginner to a grand master, without any human help,
in three days.
Nature paper "Mastering the game of Go without human knowledge" here :-
https://www.nature.com/articles/nature24270.epdf?author_access_toke n=VJXbVjaSHxFoctQQ4p2k4tRgN0jAjWel9 jnR3ZoTv0PVW4gB86EEpGqTRDtpIz-2rmo8-KG06gqVobU5NSCFeHILHcVFUeMsbvwS-lxjqQGg98faovwjxeTUgZAUMnRQ
Article and video at DeepMind here :-
https://deepmind.com/blog/alphago-zero-learning-scratch/