MuZero Handles Go, Chess, Shogi and Atari
Major news about MuZero from DeepMind:
This approach comes with another major benefit: MuZero can repeatedly use its learned model to improve its planning, rather than collecting new data from the environment. For example, in tests on the Atari suite, this variant – known as MuZero Reanalyze – used the learned model 90% of the time to re-plan what should have been done in past episodes.
Read more from DeepMind and Ars Technica

Post a Comment