Structured Neuron Pruning in Deep Neural Networks Using Multi-Armed Bandits

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

A new structured pruning framework utilizes multi-armed bandit (MAB) algorithms to remove complete neurons from deep neural networks, addressing the challenge of exploiting unstructured sparsity in dense implementations. This method treats each candidate neuron as an "arm," temporarily masking it to measure loss change on a mini-batch, restoring it, and updating a "safe-removal reward" estimate. The framework supports various stochastic policies, including Epsilon-Greedy, Softmax, UCB1, and Thompson Sampling, alongside multiplicative-weight policies like Hedge-style multiplicative weights and EXP3. Evaluations across tabular classification, tabular regression, and deep neural-network benchmarks (image, text, reasoning tasks) demonstrated its efficacy. Statistical analysis, using the Friedman test and Nemenyi post-hoc test, revealed UCB1 as the top performer in tabular tasks, improving on unpruned networks and competing with standard regression models by R^2. For deep-learning tasks, UCB1 and Thompson Sampling achieved the strongest ranks, significantly outperforming unpruned models and other pruning techniques.

Key takeaway

For Machine Learning Engineers optimizing deep neural network deployment, consider integrating multi-armed bandit (MAB) based neuron pruning. This method, particularly using UCB1 or Thompson Sampling, offers a computationally practical way to achieve structured model reduction, outperforming traditional pruning techniques and unpruned models across various tasks. You can significantly reduce model size and improve efficiency without sacrificing performance, especially for image, text, and reasoning applications.

Key insights

Multi-armed bandits effectively prune entire neurons in deep neural networks, improving model efficiency and performance.

Principles

Method

Treat each neuron as an MAB arm; pull to mask, measure loss change on a mini-batch, restore, and update a safe-removal reward estimate using policies like UCB1 or Thompson Sampling.

In practice

Topics

Best for: Research Scientist, NLP Engineer, Computer Vision Engineer, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.