💭 At this point my plan for handling arbitrary board games is:

  • Represent the game state as a heterogeneous graph, i.e. a graph with different node and edge types.
  • Use Stochastic MuZero with a graph transformer model that has type-aware attention.
  • Make the set of legal moves part of the game state, rather than a logical mask applied to a fixed-size policy vector. You just can't fit the choice spaces of complex board games into a fixed-size vector.
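
A rough sketch of what the first and third points could look like in plain Python. Everything here is an assumption for illustration: the `HeteroGraph` class, the node type names ("square", "piece", "move"), the edge type names, and the hand-written `scores` list standing in for whatever per-node outputs the graph transformer would produce. The point is only that the policy is a softmax over however many move nodes the current state contains, so no fixed-size vector or mask is needed.

```python
from __future__ import annotations

import math
from dataclasses import dataclass, field


@dataclass
class Node:
    node_type: str          # hypothetical types: "square", "piece", "move"
    features: list[float]


@dataclass
class HeteroGraph:
    nodes: list[Node] = field(default_factory=list)
    # Edges are grouped by edge type; each entry is a (src, dst) pair of node indices.
    edges: dict[str, list[tuple[int, int]]] = field(default_factory=dict)

    def add_node(self, node_type: str, features: list[float]) -> int:
        self.nodes.append(Node(node_type, list(features)))
        return len(self.nodes) - 1

    def add_edge(self, edge_type: str, src: int, dst: int) -> None:
        self.edges.setdefault(edge_type, []).append((src, dst))

    def nodes_of_type(self, node_type: str) -> list[int]:
        return [i for i, n in enumerate(self.nodes) if n.node_type == node_type]


def policy_over_moves(graph: HeteroGraph, scores: list[float]) -> dict[int, float]:
    """Softmax over the scores of the 'move' nodes only.

    `scores` holds one number per node; in a real system these would come
    from the model, here they are placeholders.
    """
    move_ids = graph.nodes_of_type("move")
    if not move_ids:
        return {}
    logits = [scores[i] for i in move_ids]
    top = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    return {i: e / total for i, e in zip(move_ids, exps)}


# Tiny example state: two squares, one piece, and two legal moves for it.
g = HeteroGraph()
sq_a = g.add_node("square", [0.0])
sq_b = g.add_node("square", [1.0])
piece = g.add_node("piece", [1.0])
g.add_edge("occupies", piece, sq_a)

m1 = g.add_node("move", [0.0])   # hypothetical move: stay-adjacent option
m2 = g.add_node("move", [0.0])   # hypothetical move: go to sq_b
g.add_edge("moves_piece", m1, piece)
g.add_edge("moves_piece", m2, piece)
g.add_edge("targets", m2, sq_b)

scores = [0.0, 0.0, 0.0, 1.0, 2.0]  # placeholder per-node scores
probs = policy_over_moves(g, scores)
```

Here `probs` has exactly one entry per move node, so a state with 3 legal moves and a state with 300 go through the same code path.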