Neural Legendre-Fenchel transform with Hessian Preconditioning
Summary
A new method, Neural Legendre-Fenchel transform with Hessian Preconditioning, addresses challenges in approximating convex conjugates of ill-conditioned functions. Building on the Legendre-Fenchel (LF) transform's reformulation as projective polarity, this work introduces a Hessian-based preconditioning strategy. It leverages affine invariance by applying an affine deformation around a minimizer, making the second-order Taylor approximation coincide with a canonical paraboloid. A residual network then learns this simplified mapping, with the original conjugation recovered via inverse deformation. This approach incurs modest overhead: a single eigendecomposition during initialization and two matrix-vector multiplications per query. Experiments show improved convergence rates and enhanced numerical accuracy, especially for high-dimensional and ill-conditioned problems.
Key takeaway
For research scientists developing numerical methods for convex analysis, this Hessian-preconditioning technique offers a robust solution for ill-conditioned functions. You can achieve significantly improved convergence and numerical accuracy in approximating convex conjugates. Consider integrating this affine invariance-based preconditioning, noting its modest computational overhead of one eigendecomposition and two matrix-vector multiplications per query.
Key insights
Hessian-based preconditioning improves neural Legendre-Fenchel transform accuracy for ill-conditioned functions.
Principles
- Leverage affine invariance for function simplification
- Initialize residual networks near identity mapping
Method
Apply affine deformation to align a function's second-order Taylor approximation with a canonical paraboloid, then use a residual network, and inverse deform.
In practice
- Enhance conjugation accuracy for ill-conditioned problems
- Improve convergence rates in high-dimensional benchmarks
Topics
- Legendre-Fenchel Transform
- Convex Analysis
- Neural Networks
- Hessian Preconditioning
- Affine Invariance
- Numerical Accuracy
Best for: AI Scientist, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.