💭 or really, the bottleneck is the LLM's ability to robustly test out, debug, and iterate on its work. 'Edit, run, error message, loop' is awfully limited, at least by my debugging standards. And there's no way for a gamedev LLM to really "try out the game". Similarly, image generation and understanding are not nuanced enough for human-like specification and iteration.