Learning-Based Decision Making for Combustion Phasing Control in Multi-Fuel CI Engines with Latent Fuel Reactivity Estimation
Summary
A new GRU-guided Reinforcement Learning framework addresses combustion phasing control in multi-fuel compression-ignition (CI) engines, which face challenges from uncertain, time-varying fuel reactivity (cetane number, CN). This framework formulates CA50 regulation under latent CN variation as a partially observable sequential decision problem. It learns a compact GRU-based representation of fuel reactivity from combustion history, conditioning both the actor and critic on this estimated signal. Evaluation using a Gaussian-process surrogate, trained on experimental multi-fuel engine data, demonstrates that the proposed method outperforms myopic/fixed-history bandit methods and observation-only DDPG, especially when CN evolves rapidly. The policy achieves stable CA50 regulation with a mean absolute tracking error below 0.25° CA at the training setpoint, producing smooth, physically consistent SOI and glow-plug-power actuation across unseen CN trajectories.
Key takeaway
For Machine Learning Engineers designing control systems for multi-fuel engines or similar dynamic systems with latent, time-varying parameters, you should integrate state estimation directly into your Reinforcement Learning policy learning. This approach, which conditions the policy on the same imperfect state information available at deployment, significantly improves control robustness and avoids train-deploy inconsistency, leading to more stable and physically consistent actuation.
Key insights
Aligning fuel-reactivity inference with control policy learning avoids train-deploy inconsistency in dynamic systems.
Principles
- Latent, continuously evolving fuel dynamics necessitate more than standalone estimation or generic recurrence.
- Training policies on imperfect, deployable state estimates improves robustness and consistency.
Method
The proposed GRU-guided RL framework learns a compact GRU-based representation of fuel reactivity from combustion history, then conditions both actor and critic on this estimated signal.
In practice
- Integrate GRU-based networks for latent state estimation directly into RL policy architectures.
- Evaluate control policies against unseen dynamic trajectories of latent variables.
Topics
- Multi-fuel Engines
- Combustion Control
- Reinforcement Learning
- GRU Networks
- Cetane Number
- Partially Observable MDPs
Best for: AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.