💭 a programmed function can generate in out observations. An ml model can approximate the observations. Since we then have program ast tokens to ml weights, we can generate training data to learn the inverse, to write code describing the behavior of an ml model?