Two-Valued Symmetric Circulant Matrices: Applications in Deep Learning

· Source: cs.LG updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Internet of Things (IoT) & Connected Devices, Health & Medical Research · Depth: Expert, extended

Summary

A new deep learning architecture, the Two-Valued Symmetric Circulant Matrix (TVSCM), is proposed to address the high storage and computational demands of deep neural networks, particularly for resource-limited edge and Internet of Medical Things (IoMT) platforms. This architecture replaces traditional dense layers with a highly sparse structure that uses only two weights per layer, arranged in a circulant and symmetric configuration. Experimental results on the MNIST dataset show an approximately 79x reduction in parameters (from 623,290 to 7,852) and an 80.4x reduction in memory access, while maintaining comparable accuracy (97.67% to 93.54%). On the MIT-BIH arrhythmia dataset, TVSCM achieved a 26x parameter reduction (from 24,709 to 942) and a 26.2x memory access reduction, with accuracy shifting from 97.60% to 93.10%. The TVSCM model also demonstrated significant improvements in energy efficiency (50x on MNIST, 112x on MIT-BIH) and inference latency (117x on MNIST, 91.5x on MIT-BIH), making it suitable for battery-powered and real-time healthcare applications.

Key takeaway

For AI Engineers and Research Scientists developing models for edge or IoMT platforms, consider integrating Two-Valued Symmetric Circulant Matrices (TVSCM) into your network architecture. This approach can yield significant reductions in model parameters (up to 80x), memory footprint, energy consumption, and inference latency, making your models viable for resource-constrained environments without substantial accuracy loss. Prioritize TVSCM for applications requiring real-time processing and extended battery life, such as wearable health monitors or embedded diagnostic tools.

Key insights

Two-Valued Symmetric Circulant Matrices drastically reduce deep learning model parameters and computational overhead for edge devices.

Principles

Method

Replace dense neural network layers with Two-Valued Symmetric Circulant Matrices (TVSCM) that learn an entire weight matrix from just two scalar values, leveraging cyclic shifts and symmetry for compression.

In practice

Topics

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Hardware Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.LG updates on arXiv.org.