OnDeFog: Online Decision Transformer under Frame Dropping
Summary
OnDeFog, an online decision transformer, is proposed to mitigate performance degradation caused by frame dropping in real-world reinforcement learning applications. Frame dropping, often due to communication delays or sensor failures, prevents agents from receiving states and rewards. While the offline Decision Transformer under Random Frame Dropping (DeFog) previously addressed this, it struggled with generalizing to novel states not in its training data. OnDeFog integrates DeFog's mechanisms with the online decision transformer (ODT), allowing it to learn policies through direct environmental interaction. Experimental evaluation, published on 2026-06-18, demonstrates OnDeFog's superior performance over ODT in environments with high dropping frame rates and its outperformance of DeFog on datasets containing significant low-reward data.
Key takeaway
For Machine Learning Engineers designing robust reinforcement learning systems for real-world deployment, you should consider OnDeFog. This online decision transformer effectively addresses performance degradation from frame dropping, outperforming ODT in high dropping rate scenarios and DeFog on low-reward datasets. Its ability to generalize to novel states through direct environmental interaction makes it a strong candidate for applications facing communication delays or sensor failures.
Key insights
OnDeFog combines offline frame-dropping mitigation with online learning for robust decision-making in real-world, state-missing environments.
Principles
- Online learning improves generalization to novel states.
- Frame dropping degrades RL agent performance.
- Integrating offline mechanisms into online methods can enhance robustness.
Method
OnDeFog integrates DeFog's frame-dropping mitigation mechanisms into the Online Decision Transformer (ODT) architecture, enabling policy learning via direct environmental interaction.
In practice
- Apply OnDeFog in high frame-dropping rate environments.
- Use OnDeFog for datasets with low-reward data.
- Enhance real-world RL systems facing sensor failures.
Topics
- Online Decision Transformers
- Frame Dropping
- Reinforcement Learning
- Policy Generalization
- Robust AI Systems
Best for: Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.