EdgeZSAD: Practical Zero-Shot Anomaly Detection on Edge Devices

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision & Pattern Recognition, Internet of Things (IoT) & Connected Devices · Depth: Expert, quick

Summary

EdgeZSAD is a compact reference system designed for practical zero-shot anomaly detection (ZSAD) on edge devices, addressing the limitations of larger ViT-L foundation backbones (~300M parameters) in embedded hardware. This system integrates a TinyViT-21M-512 backbone, an asymmetric global-local readout (EdgeGLR), and a reproducible source-side training recipe called Real-IAD-DR. Using a single checkpoint trained with a source-trained, target-unseen protocol, EdgeZSAD was evaluated across six industrial benchmarks. It achieved an average image AUROC of 91.6 on MVTec-AD and 88.2 on VisA across three independent runs. The model is directly deployable on platforms like Jetson Orin Nano Super (TensorRT FP16) and RB5 Gen2 (QNN GPU FP16), demonstrating image-AUROC drift below 0.2 points on device-rescored benchmarks, preserving host-side ranking behavior.

Key takeaway

For Computer Vision Engineers deploying zero-shot anomaly detection on edge devices, EdgeZSAD demonstrates that high performance is achievable without large backbones. You should consider compact architectures like TinyViT-21M-512 combined with specialized readouts for memory-constrained hardware. This approach allows for direct deployment on platforms such as Jetson Orin Nano, maintaining host-side ranking behavior with minimal AUROC drift, thereby optimizing resource utilization while preserving detection accuracy.

Key insights

EdgeZSAD enables high-performance zero-shot anomaly detection on edge devices using a compact TinyViT backbone and specialized readout.

Principles

Method

EdgeZSAD combines a TinyViT-21M-512 backbone with EdgeGLR and Real-IAD-DR training. It uses a single checkpoint in a source-trained, target-unseen protocol for evaluation.

In practice

Topics

Best for: AI Scientist, Research Scientist, Machine Learning Engineer, Computer Vision Engineer, AI Hardware Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.