PO-PDDL: Learning Symbolic POMDPs from Visual Demonstrations for Robot Planning Under Uncertainty
Summary
PO-PDDL introduces a symbolic formulation for Partially Observable Markov Decision Processes (POMDPs) designed for real-world robot task planning under stochastic action execution and partial observability. This new language preserves the relational structure and LLM-friendly syntax of PDDL, while explicitly modeling beliefs, stochasticity, and partial observability. The method includes a demonstration-driven pipeline that reconstructs latent symbolic state trajectories from real-robot execution videos. It identifies partial observability through inconsistencies between inferred states and visual observations, then learns stochastic transition and observation models. Experiments on long-horizon manipulation tasks show PO-PDDL consistently outperforms existing PDDL and POMDP model-learning approaches, enabling robust task planning with significantly lower planning cost.
Key takeaway
For robotics engineers and AI scientists building autonomous systems that operate under uncertainty, PO-PDDL offers a robust approach to constructing POMDP models. This method allows you to learn complex planning domains directly from visual demonstrations, significantly reducing the labor involved in model creation. Consider adopting PO-PDDL to enhance your robot's ability to perform long-horizon tasks reliably, even with perception and execution uncertainties, while also lowering planning costs.
Key insights
PO-PDDL learns symbolic POMDPs from visual demonstrations for robust robot planning under uncertainty.
Principles
- Symbolic POMDPs can preserve PDDL's relational structure.
- Partial observability is identifiable via state-observation inconsistencies.
- Demonstration-driven learning improves POMDP model construction.
Method
Reconstruct latent symbolic state trajectories from robot videos, identify partial observability via state-observation inconsistencies, then learn stochastic transition and observation models.
In practice
- Use PO-PDDL for robot planning under uncertainty.
- Apply visual demonstrations to learn POMDP models.
- Leverage PO-PDDL for reusable task domains.
Topics
- Robotics
- POMDPs
- PDDL
- Symbolic AI
- Machine Learning
- Visual Demonstrations
- Uncertainty Planning
Best for: Research Scientist, Robotics Engineer, AI Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.