Evolutionary Two-Stage Hyperparameter Optimization Strategies for Physics-Informed Neural Networks

· Source: cs.NE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Mathematics & Computational Sciences · Depth: Expert, extended

Summary

A two-stage hyperparameter optimization framework significantly improves the accuracy and robustness of Physics-Informed Neural Networks (PINNs) for solving Partial Differential Equations (PDEs). This method addresses PINNs' sensitivity to hyperparameters and unstable convergence by combining exploration and exploitation phases. The first stage uses evolutionary algorithms like JADE, LSHADE, Grey Wolf, and WOA for low-fidelity training with truncated epochs to rapidly screen candidate configurations. The second stage fully trains only the most promising candidates using standard gradient-based optimizers. Evaluated on Advection, Klein–Gordon, and Helmholtz equations, the approach consistently outperforms standard training, achieving 28% to 77% error reduction and approximately 40% average improvement over baseline error within fixed computational budgets, with JADE showing superior stability. An exploration budget of about 10% of full training epochs proved optimal.

Key takeaway

For Machine Learning Engineers or Research Scientists developing Physics-Informed Neural Networks, you should adopt a two-stage evolutionary hyperparameter optimization strategy. This approach systematically identifies superior configurations, reducing reliance on manual tuning. Specifically, allocate approximately 10% of total training epochs for an exploration phase using algorithms like JADE to rapidly identify promising configurations, then fully train only the best. This can reduce error by 28-77% and significantly improve robustness compared to traditional methods.

Key insights

Evolutionary two-stage hyperparameter optimization significantly enhances PINN accuracy and robustness under fixed computational budgets.

Principles

Method

A two-stage process: low-fidelity training with truncated epochs for rapid screening (exploration), followed by full training of promising candidates with gradient-based optimizers (exploitation).

In practice

Topics

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.NE updates on arXiv.org.