Towards Data-Efficient Cross-Device Generalization of Grad-Shafranov Equilibria via Transfer Learning Neural Operator

· Source: Machine Learning · Field: Science & Research — Artificial Intelligence & Machine Learning, Physical Sciences & Chemistry · Depth: Expert, quick

Summary

A new domain-specific neural operator framework addresses the challenge of real-time magnetohydrodynamic equilibrium reconstruction in magnetic confinement fusion. Current Grad-Shafranov calculations are device-specific and iterative, limiting their use in latency-constrained control. This framework recasts equilibrium reconstruction as a cross-device operator learning problem, mapping geometry and profile parameters directly to the poloidal flux field for amortized inference. Using the analytically tractable Solov'ev family as a testbed, researchers generated equilibria across eight distinct tokamak-like configurations. They benchmarked five neural operator architectures under four transfer-learning strategies. Multi-geometry pretraining proved crucial for data-efficient adaptation, outperforming single-geometry pretraining. The Wavelet Neural Operator achieved the strongest cross-geometry performance, reaching mean relative L2 errors below 4% with 100 labelled target equilibria and below 2% with full fine-tuning. Four architectures achieved millisecond or sub-millisecond inference, satisfying the divergence-free constraint to numerical precision. This work identifies neural operator pretraining as a path toward reusable, real-time equilibrium inference across fusion devices.

Key takeaway

For AI Scientists developing real-time control systems for magnetic confinement fusion, this research indicates that neural operator pretraining can overcome device-specific equilibrium calculation limitations. You should prioritize multi-geometry pretraining strategies to achieve data-efficient adaptation across different tokamak geometries. Specifically, consider implementing Wavelet Neural Operators, which demonstrated superior cross-geometry performance and millisecond inference speeds, enabling reusable models for plasma shaping and stability assessment.

Key insights

Neural operator pretraining enables data-efficient, real-time, cross-device generalization of Grad-Shafranov equilibria for magnetic confinement fusion.

Principles

Method

Generate equilibria across diverse geometries using an analytical testbed. Benchmark neural operator architectures with transfer learning strategies to map geometry/profile to poloidal flux.

In practice

Topics

Best for: AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.