💭 Current experiment: training AlphaZero on about 100x more data to see whether that makes it improve over MCTS. If not, it's clearly broken.