STARFISH: faST Accuracy Recovery in pruned networks From Internal State Healing

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision & Pattern Recognition · Depth: Expert, quick

Summary

STARFISH is a novel healing method designed to efficiently recover accuracy in pruned neural networks. Pruning reduces network weights to speed up inference, often at the cost of accuracy, necessitating a subsequent healing process. STARFISH addresses this by optimizing the pruned network to align with the original network's internal state representations, utilizing a small calibration set of unlabeled examples. For networks with 50% of weights removed, STARFISH improves recovered accuracy by up to 22% compared to existing methods on ViT-based architectures. Its benefits are particularly evident under aggressive pruning; for instance, after eliminating 75% of weights in a DeiT-B network for ImageNet, STARFISH recovers 82% of the original dense accuracy using only 0.4% of the training images, significantly outperforming competing techniques that achieve only 40%.

Key takeaway

For Machine Learning Engineers optimizing model deployment, if you are struggling to maintain accuracy after network pruning, STARFISH offers a significant advancement. You should consider integrating this method into your pruning workflows, especially for ViT-based networks or aggressive pruning scenarios. STARFISH can recover substantially more accuracy with minimal unlabeled data, potentially allowing for much smaller, faster models without unacceptable performance degradation.

Key insights

STARFISH recovers pruned network accuracy by aligning its internal state representations with the original model using a small unlabeled calibration set.

Principles

Method

STARFISH optimizes a pruned network to match the original network's internal state representations using a tiny calibration set of unlabeled examples.

In practice

Topics

Best for: Research Scientist, AI Engineer, AI Scientist, Machine Learning Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.