MuZero Handles Go, Chess, Shogi and Atari

Unknown Reply 7:50 AM

Major news about MuZero from DeepMind:

This approach comes with another major benefit: MuZero can repeatedly use its learned model to improve its planning, rather than collecting new data from the environment. For example, in tests on the Atari suite, this variant – known as MuZero Reanalyze – used the learned model 90% of the time to re-plan what should have been done in past episodes.

Read more from DeepMind and Ars Technica

Post a Comment