Convex training of Lipschitz-regularized shallow neural networks

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

A new training procedure for shallow neural networks enhances robustness against adversarial attacks. This method addresses a non-convex Lipschitz-regularized training program by introducing a convex restriction, which can be efficiently solved to global optimality. The approach can function as a post-processing step, taking a pre-trained network as an initial solution and guaranteeing that the resulting optimal network is no worse than the original. Experiments on real-world datasets for regression tasks in adversarial settings demonstrate its effectiveness. Numerically, the proposed convex program yields networks with lower objective values on the Lipschitz-regularized program compared to existing methods. Furthermore, on specific datasets, networks trained with this convex procedure exhibit improved accuracy and increased robustness against adversarial attacks.

Key takeaway

For Machine Learning Engineers developing shallow neural networks for regression tasks in adversarial environments, you should consider integrating this convex training procedure. It offers a guaranteed improvement over initial pre-trained models by efficiently solving a Lipschitz-regularized program to global optimality. Implementing this as a post-processing step can significantly enhance your network's accuracy and robustness against adversarial attacks, providing a more reliable solution.

Key insights

A convex restriction efficiently solves non-convex Lipschitz-regularized shallow neural network training, enhancing adversarial robustness.

Principles

Method

Solve a non-convex Lipschitz-regularized training program via a convex restriction, efficiently achieving global optimality. Can be a post-processing step for pre-trained networks.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.