AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning

Posted: Thu Oct 31, 2019 12:19 pm
by Rémi Coulom ... t-learning

Direct link to the paper: ... matted.pdf

The most interesting contribution seems to be their league system for approaching a Nash equilibrium. Their method for off-policy learning is very interesting, too. Both give me some inspiration for my mahjong reinforcement-learning experiments.
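
To illustrate why training against a mixture of past opponents can lead toward a Nash equilibrium, here is a minimal sketch of classic fictitious play on rock-paper-scissors. This is not the league training from the paper, which trains neural-network agents at a much larger scale. It is only a toy analogy, and the names PAYOFF, best_response, and fictitious_self_play are my own.

```python
# Toy analogy only: tabular fictitious play, not AlphaStar's league.
# PAYOFF[a][b] is the payoff to action a against action b
# (0 = rock, 1 = paper, 2 = scissors).
PAYOFF = [
    [0, -1, 1],   # rock loses to paper, beats scissors
    [1, 0, -1],   # paper beats rock, loses to scissors
    [-1, 1, 0],   # scissors loses to rock, beats paper
]


def best_response(opponent_counts):
    """Action with the highest total payoff against all past opponents."""
    values = [
        sum(PAYOFF[a][b] * opponent_counts[b] for b in range(3))
        for a in range(3)
    ]
    # Ties are broken in favour of the lowest action index.
    return max(range(3), key=lambda a: values[a])


def fictitious_self_play(iterations):
    """Repeatedly add a best response to the population of past players.

    Returns the empirical frequency of each action in the population.
    """
    counts = [1, 0, 0]  # the population starts with one pure-rock player
    for _ in range(iterations):
        counts[best_response(counts)] += 1
    total = sum(counts)
    return [c / total for c in counts]


if __name__ == "__main__":
    print(fictitious_self_play(100000))
```

Each new player exploits the current population, so the latest strategy keeps cycling through rock, paper, and scissors. The population average, however, drifts toward the uniform mixture, which is the Nash equilibrium of this game. Convergence is slow, with the error shrinking roughly like one over the square root of the number of iterations.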