How does Sampled AlphaZero work, specifically?
Hi everyone,
As the title asks: AlphaZero is a general algorithm that learns to play games such as Go through self-play. But when we design a game to be played and trained within the AlphaZero framework, and its action space is enormous (AlphaTensor, for example, has almost 5^12 actions), we cannot enumerate every action at each node and have to use Sampled AlphaZero instead.
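To make the question concrete, here is a minimal sketch of what I understand the sampling step to be. It is not taken from any reference implementation. The factorized action layout (12 entries with 5 values each, giving 5^12 actions), the names `NUM_COORDS`, `VALUES`, and `sample_actions`, and the uniform policy are all hypothetical choices for illustration. The idea is that instead of expanding every action at a node, the search draws K actions from the policy and expands only those, keeping their empirical frequencies for use as the prior.

```python
import random
from collections import Counter

# Hypothetical action layout: a vector of 12 entries, each with 5 possible values.
NUM_COORDS = 12
VALUES = (-2, -1, 0, 1, 2)


def full_action_count():
    """Size of the full action space, which is too large to enumerate per node."""
    return len(VALUES) ** NUM_COORDS


def sample_actions(coord_probs, k, rng):
    """Draw k actions from a factorized policy, one entry at a time.

    coord_probs[i] holds the probabilities of VALUES for entry i.
    Returns a dict mapping each distinct sampled action to its
    empirical frequency (count / k).
    """
    samples = [
        tuple(rng.choices(VALUES, weights=coord_probs[i])[0]
              for i in range(NUM_COORDS))
        for _ in range(k)
    ]
    counts = Counter(samples)
    return {action: count / k for action, count in counts.items()}


rng = random.Random(0)
# Assumption: a uniform policy stands in for the network's output.
uniform_policy = [[1.0 / len(VALUES)] * len(VALUES) for _ in range(NUM_COORDS)]
beta_hat = sample_actions(uniform_policy, k=32, rng=rng)

print(full_action_count())  # 244140625
print(len(beta_hat))        # number of distinct actions the node would expand
```

With K = 32, the node expands at most 32 children instead of roughly 244 million. What I am unsure about is how the search and the policy update should then correct for having seen only this sample.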