Lightweight Geometric Adaptation for Training Physics-Informed Neural Networks

2026-04-20 · Source: cs.AI updates on arXiv.org · Field: Science & Research — Artificial Intelligence & Machine Learning, Mathematics & Computational Sciences · Depth: Expert, extended

Summary

A new lightweight curvature-aware optimization framework is proposed to enhance Physics-Informed Neural Networks (PINNs) training, which often suffers from slow convergence and instability due to anisotropic loss landscapes. This framework augments existing first-order optimizers by incorporating an adaptive predictive correction based on secant information, using consecutive gradient differences as a proxy for local geometric change. A step-normalized secant curvature indicator controls the correction strength. The method is plug-and-play, computationally efficient, and compatible with optimizers like AdamW, SOAP, and Muon, without requiring explicit second-order matrix formation. Experiments on diverse PDE benchmarks, including the 10D heat equation, Gray–Scott system, Belousov–Zhabotinsky system, and 2D Kuramoto–Sivashinsky system, demonstrate consistent improvements in convergence speed, training stability, and solution accuracy.

Key takeaway

For AI Scientists and Machine Learning Engineers developing or deploying PINNs, integrating this curvature-aware optimization framework can substantially improve model training. Your PINNs will achieve faster convergence, enhanced stability, and higher solution accuracy, especially for challenging, stiff partial differential equations. Consider implementing the CA framework with your preferred first-order optimizer to mitigate common training pathologies and achieve more robust physical simulations.

Key insights

Curvature-aware optimization using secant information significantly improves PINN training stability and accuracy.

Principles

PINN training difficulties stem from anisotropic loss landscapes.
Consecutive gradient differences indicate local geometric change.
Adaptive correction strength is crucial for varying local stiffness.

Method

The framework computes a secant curvature indicator from consecutive gradient differences and parameter displacements, then uses it to dynamically modulate a predictive correction applied to the base optimizer's gradient input.

In practice

Augment AdamW, SOAP, or Muon with curvature-aware gating.
Use EMA for gradient differences to improve robustness.
Apply time-marching for stiff PDE systems like Gray-Scott.

Topics

Physics-Informed Neural Networks
Curvature-Aware Optimization
Loss Landscape Geometry
Secant-based Gating
Gradient Differences

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.