GRAPE: Guided Parameter-Space Evolution for Compact Adversarial Robustness
Summary
GRAPE, or Guided Parameter-Space Evolution, is a novel training framework designed to enhance compact adversarial robustness in neural networks. This method challenges traditional Adversarial Training (AT) by proposing that the order in which parameters become optimizable significantly impacts the final robust solution, even with controlled architecture and computation budgets. GRAPE integrates parameter-space stabilization with progressive hidden expansion, gradually releasing new optimizable dimensions and employing an adversarial spectral utilization score to direct capacity towards high-pressure modules. On CIFAR-10 under the standard ε∞ threat model, GRAPE improved PGD-20 robust accuracy from 51.70% to 56.94% using a ResNet-18 architecture, while maintaining a FLOPs ratio of 1.009x and reducing parameter count by approximately 21.4%. A sequential grow variant achieved 56.52% PGD-20 robust accuracy, underscoring the impact of the parameter-space exposure path.
Key takeaway
For Machine Learning Engineers designing adversarially robust neural networks, consider evolving your model's parameter space rather than training a fixed architecture. This approach, exemplified by GRAPE, demonstrates that guided parameter exposure can significantly improve PGD-20 robust accuracy (e.g., to 56.94% on CIFAR-10) while simultaneously reducing parameter count by over 21%. You should explore dynamic parameter-space evolution to achieve superior robustness and model compactness without increasing computational budget.
Key insights
Guided evolution of a neural network's parameter space during adversarial training can yield more compact and robust models.
Principles
- Parameter exposure order impacts final robust solutions.
- Progressive hidden expansion stabilizes robust optimization.
- Spectral utilization guides capacity to high-pressure modules.
Method
GRAPE stabilizes robust optimization in exposed parameter space, progressively releases new optimizable dimensions, and uses an adversarial spectral utilization score to guide capacity to high-pressure modules.
In practice
- Improve PGD-20 robust accuracy on CIFAR-10.
- Reduce ResNet-18 parameter count by 21.4%.
- Achieve 56.94% robust accuracy with matched FLOPs.
Topics
- Adversarial Robustness
- Parameter-Space Evolution
- Neural Network Training
- Model Compression
- ResNet-18
- Adversarial Training
Best for: Computer Vision Engineer, Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.