Where Should Action Generation Begin? A Learnable Source Prior for Generative Robot Policies
Summary
LeaP, a Learnable source Prior, significantly enhances generative robot policies by replacing the conventional observation-independent standard Gaussian distribution with a proprioception-conditioned diagonal Gaussian over action chunks. Parameterized by a lightweight MLP, LeaP jointly predicts the mean and state-adaptive variance of this source distribution, enabling the downstream generator to focus on precise action refinement. On 15 RoboTwin manipulation tasks, LeaP achieved an 81.6% average success rate, outperforming four baselines by 6.5 to 25.5 percentage points. This prior consistently improves both flow-matching and diffusion-bridge generators, uses fewer parameters, converges faster, and demonstrates superior performance in real-world deployment.
Key takeaway
For Robotics Engineers developing generative robot policies, if you are seeking to improve action generation success rates and training efficiency, consider integrating a learnable source prior like LeaP. This approach, which conditions the initial action generation on proprioception, has demonstrated an 81.6% average success rate on RoboTwin tasks and faster convergence, allowing your generators to focus on refinement rather than sample transport.
Key insights
A learnable, proprioception-conditioned prior significantly improves generative robot policy performance and efficiency.
Principles
- Source distribution is a reusable design axis
- Observation-informed initialization enhances generative models
- Prior design is complementary to generative dynamics
Method
LeaP employs a lightweight MLP to predict the mean and state-adaptive variance of a proprioception-conditioned diagonal Gaussian over action chunks.
In practice
- Replace standard Gaussian with a conditioned prior
- Parameterize prior with a lightweight MLP
- Apply to flow-matching or diffusion-bridge generators
Topics
- Generative Robot Policies
- Action Generation
- Learnable Prior
- Proprioception
- Diffusion Models
- Robotics Manipulation
Best for: Research Scientist, AI Scientist, Robotics Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.