Generating Optimal Policies for High-level Plans with Conditional Branches and Loops
نویسندگان
چکیده
We are concerned with generating optimal policies for Markov decision processes that are represented as high-level plans with conditional branches and loops. Often complex planning processes can be broken down into elementary plan steps with associated restricted sets of actions. These plan steps can be combined to form high-level plans using a simple programming language specifying conditionals, loops, and sequences involving the plan steps as primitive statements. It is infeasible to directly generate and solve the underlying Markov decision process, since the size of the state space is exponential in the size of a high-level plan. We address the problem of eeciently computing an optimal policy by taking advantage of locality structure in the high-level plan. The main result is the speciication and analysis of an algorithm that takes as input a high-level plan and provides as output an optimal policy for the underlying Markov decision process.
منابع مشابه
Finding Plans with Branches, Loops, and Preconditions
We present a new approach for finding conditional plans with loops and branches for planning in situations with uncertainty in state properties as well as in object quantities. We use a state abstraction technique from static analysis of programs to builds such plans incrementally using generalizations of input example plans generated by classical planners. Preconditions of the resulting plans ...
متن کاملUni ed Information and Control Flow in Hierarchical Task Networks
Much recent planning research has focused on two related issues. First, there has been a strong interest in information-gathering (or \sensing", or \knowledgeproducing") actions. Second, has been an investigation of plans with sophisticated control structures, such as conditional branches and loops. But the combination of these two lines of research poses a representational problem: plans with ...
متن کاملPlanner Scheduler Executor
Much recent planning research has focused on two related issues. First, there has been a strong interest in information-gathering (or \sensing", or \knowledge-producing") actions. Second, has been an investigation of plans with sophisticated control structures, such as conditional branches and loops. But the combination of these two lines of research poses a representational problem: plans with...
متن کاملUnified Information and Control Flow in Hierarchical Task Networks
Much recent planning research has focused on two related issues. First, there has been a strong interest in information-gathering (or "sensing", or "knowledgeproducing") actions. Second, has been an investigation of plans with sophisticated control structures, such as conditional branches and loops. But the combination of these two lines of research poses a representational problem: plans with ...
متن کاملUniied Information and Control Flow in Hierarchical Task Networks
Much recent planning research has focused on two related issues. First, there has been a strong interest in information-gathering (or \sensing", or \knowledge-producing") actions. Second, has been an investigation of plans with sophisticated control structures, such as conditional branches and loops. But the combination of these two lines of research poses a represen-tational problem: plans wit...
متن کامل