Structured Neuron Pruning in Deep Neural Networks Using Multi-Armed Bandits
Summary
A new structured pruning framework utilizes multi-armed bandit (MAB) algorithms to remove complete neurons from deep neural networks, addressing the challenge of exploiting unstructured sparsity in dense implementations. This method treats each candidate neuron as an "arm," temporarily masking it to measure loss change on a mini-batch, restoring it, and updating a "safe-removal reward" estimate. The framework supports various stochastic policies, including Epsilon-Greedy, Softmax, UCB1, and Thompson Sampling, alongside multiplicative-weight policies like Hedge-style multiplicative weights and EXP3. Evaluations across tabular classification, tabular regression, and deep neural-network benchmarks (image, text, reasoning tasks) demonstrated its efficacy. Statistical analysis, using the Friedman test and Nemenyi post-hoc test, revealed UCB1 as the top performer in tabular tasks, improving on unpruned networks and competing with standard regression models by R^2. For deep-learning tasks, UCB1 and Thompson Sampling achieved the strongest ranks, significantly outperforming unpruned models and other pruning techniques.
Key takeaway
For Machine Learning Engineers optimizing deep neural network deployment, consider integrating multi-armed bandit (MAB) based neuron pruning. This method, particularly using UCB1 or Thompson Sampling, offers a computationally practical way to achieve structured model reduction, outperforming traditional pruning techniques and unpruned models across various tasks. You can significantly reduce model size and improve efficiency without sacrificing performance, especially for image, text, and reasoning applications.
Key insights
Multi-armed bandits effectively prune entire neurons in deep neural networks, improving model efficiency and performance.
Principles
- Redundant hidden units exist in deep neural networks.
- Structured pruning is more exploitable than unstructured sparsity.
- MAB algorithms can estimate neuron removal rewards efficiently.
Method
Treat each neuron as an MAB arm; pull to mask, measure loss change on a mini-batch, restore, and update a safe-removal reward estimate using policies like UCB1 or Thompson Sampling.
In practice
- Apply UCB1 for tabular classification and regression tasks.
- Use UCB1 or Thompson Sampling for deep learning model pruning.
- Reduce model size for image, text, and reasoning applications.
Topics
- Deep Neural Networks
- Neuron Pruning
- Multi-Armed Bandits
- Model Reduction
- UCB1 Algorithm
- Thompson Sampling
Best for: Research Scientist, NLP Engineer, Computer Vision Engineer, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.