You start with some description of the situation, ask it to list some number of actions, and then tell it to describe the situation after those actions have been performed.
Given this process (basically, "the model", described in the language of MDP), there are lots of techniques (including the subject of my dissertation) for evaluating and finding plans.
Comments