KVNN: Learnable Multi-Kernel Volterra Neural Networks

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision · Depth: Expert, quick

Summary

A novel kernelized Volterra Neural Network (kVNN) is proposed, designed to enhance higher-order learning by exploiting compositional features more efficiently than conventional deep learning models. The kVNN achieves this through a learnable multi-kernel representation, where distinct polynomial-kernel components model different interaction orders with compact, learnable centers, resulting in an order-adaptive parameterization. Its architecture allows kVNN filters to directly replace standard convolutional kernels in existing deep learning frameworks. Experimental validation on video action recognition and image denoising tasks demonstrates that kVNN consistently reduces model parameters and computational complexity (GFLOPs) while maintaining competitive or improved performance. This efficiency is observed even when training from scratch, without requiring extensive pretraining.

Key takeaway

For AI Engineers and Research Scientists developing computer vision models, kVNN offers a compelling alternative to standard convolutional layers. Its ability to reduce model parameters and GFLOPs while maintaining or improving performance, even without large-scale pretraining, suggests a practical path to more efficient and expressive deep networks. Consider integrating kVNN filters into your existing architectures to optimize for both computational cost and performance.

Key insights

kVNNs use learnable multi-kernel representations for efficient higher-order feature learning in deep networks.

Principles

Method

kVNN layers consist of parallel branches of different polynomial orders, allowing direct replacement of standard convolutional kernels for order-adaptive parameterization.

In practice

Topics

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.