11 Automated Planning
11 Automated Planning
Planning
Classical Planning
Hierarchical Planning
PRECOND:
EFFECT:
DEL() ADD()
HLA
Refinement
HLA HLA HLA HLA HLA …
𝑎𝑎 𝑎𝑎𝑎 … “Implementation” with only primitive actions
• Two refinements for the HLA to go from home to the SFO airport:
• The top HLA is often just “Act” and the agent needs to find an implementation that
achieves the goal.
• Classical Planning
• For each primitive action, provide a refinement of with steps .
• This can recursively build any sequence of actions.
• To stop the recursion, define:
• Search for primitive solutions has to refine all HLAs all the way to primitive
actions to determine if a plan is workable.
• Reachable set: the set of states reachable with a sequence of HLAs in state .
• A sequence of HLAs achieves the goal if its reachable set intersects the goal set.
• Typical implementation:
1. Use a simplified (optimistic) version of precondition-effect descriptions to find a high-
level plan that works.
2. Check if a refinement of that plan that works really exists. If not, go back to 1.
Monitoring and
Replanning
Planning and Acting in Partially Observable, Nondeterministic, and
Unknown Environments
Determinism & Observability -
Belief States
• For nondeterministic or partially observable environments we need belief
states.
• A belief state is a set of possible physical states the agent might be in given
its current knowledge.
Technical note: If we manage to keep the belief state in 1-CNF (1-conjunctive normal form,
i.e., fluents are combined with ANDs), then the complexity is reduced from being
exponential in the number of fluents to linear!
Observability -
Percept Schema
• For partially observable environments we need to be able to define what
percepts the agent can get when.
• The agent uses a percept schema to reason about percepts that it can obtains
during executing a plan.
• Example: Whenever the agent sees an object, then it will perceive its color.
PRECOND:
The agent can now reason that it needs to get an object inView to see the
color.
• Update:
represents the physical transition model which adds positive and negative literals to the
state description. The state description becomes more and more complete.
Determinism & Observability -
Contingency Planning
• We can create a conditional plan for partially observable planning
problems and non-deterministic problems.
• We already have introduced conditional plans in Chapter 4 and just need
to augment it by:
• Action schemata instead of a transition function.
• Percept schemata to reason about how to get needed percepts.
• The state has a factored representation as facts in 1-CNF.
• Issues:
• Contingency plans become very complicated with non-deterministic effects like
failures in actions or percepts. E.g., moving north fails 1 out of 100 times.
• Plan fails with incorrect model of the world. E.g., actions with missing
preconditions or missing effects, missing fluents, exogenous effects.
→ Online Planning
Execution Monitoring and
Replanning
• Online planning = replan during execution when necessary.
• Requires execution monitoring to determine the need for
replanning. The agent can perform:
• Action monitoring: Only execute action if the preconditions are met.
• Plan monitoring: Verify that the remaining plan will still succeed.
• Goal monitoring: Check if a better set of goals has become available.
Actual
path taken