Sobolev Approximation by Fixed-Size Neural Networks with Arbitrary Accuracy

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Mathematics & Computational Sciences · Depth: Expert, extended

Summary

This work introduces novel activation functions enabling arbitrary-accuracy Sobolev approximation by fixed-size neural networks, addressing a gap in controlling weak derivatives. The Elementary Universal Activation Function ("EUAF") allows approximation of W^2,inf((a,b)^d) functions in the W^1,inf-norm with a fixed-size network of width 4^d(5d^2+8d+3) and depth 2d+5. To extend this to higher-order Sobolev spaces, the authors present the Differentiable Universal Activation Function ("DUAF") family, including the C^inf "DUAF"_inf. These achieve W^s-1,inf-norm approximation for W^s,inf((a,b)^d) functions with fixed width N_s,d=O(4^d(s+d)C(s+d-1,d)) and depth L_s,d=O(s+d). Sigmoidal variants, "DUAF"_n, also provide fixed-size W^s-1,inf approximation for 1<=s<=n, with width S_n N_s,d and depth (3n^2+5)L_s,d. For structured Sobolev targets, "DUAF"_inf networks achieve reduced complexity, with width O(dQ s^2) and depth O(s).

Key takeaway

For research scientists designing neural networks for scientific computing or PDE-based models, this work fundamentally changes the approach to Sobolev approximation. You can now achieve arbitrary accuracy in Sobolev norms with fixed-size networks by employing specialized activation functions like "EUAF" or "DUAF"_inf. This eliminates the need for ever-growing network sizes to improve derivative control. Consider implementing these activations, especially for structured problems where network width scales linearly with channels and dimension.

Key insights

Fixed-size neural networks can achieve arbitrary-accuracy Sobolev approximation using specially designed activation functions.

Principles

Method

The method involves local Taylor approximation, encoding coefficients via staircase and point-fitting subnets, and gluing local approximants using smooth, fixed-size gates.

In practice

Topics

Best for: AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.