Attention-Guided Autoencoder Fusion for Insulator Defect Detection Using UAV Transmission-Line Imaging
Summary
AE-YOLO, an Attention-Guided AutoEncoder-Enhanced YOLO framework, addresses challenges in automated defect detection for high-voltage transmission-line insulators using Unmanned Aerial Vehicle (UAV) imagery, specifically severe class imbalance, large scale variation, and small defect instances. The architecture integrates lightweight bottleneck autoencoders within a Feature Pyramid Network-Path Aggregation Network (FPN-PAN) neck to preserve anomaly-sensitive information during multi-scale feature fusion. Convolutional Block Attention Modules (CBAM) enhance feature discrimination, while a variance-maximizing autoencoder regularization strategy encourages diverse, defect-discriminative latent representations. The network trains with a unified objective combining focal loss, Complete IoU (CIoU) loss, and autoencoder regularization. During inference, Weighted Boxes Fusion (WBF) combines predictions from YOLOv8, YOLOv10, and YOLO11, augmented by an autoencoder-guided confidence boosting mechanism for rare defect categories. Experiments on the Insulator-Defect Detection dataset show AE-YOLO with an EfficientNetV2 backbone achieves 95.10% mAP at 0.5, 96.40% precision, and 93.80% recall, surpassing the strongest YOLO-family baseline by 5.0 points in mAP at 0.5 and 6.7 points in recall.
Key takeaway
For Computer Vision Engineers developing robust defect detection systems for critical infrastructure like transmission lines, AE-YOLO presents a highly effective framework. You should consider integrating lightweight bottleneck autoencoders within your FPN-PAN neck and Convolutional Block Attention Modules throughout your backbone to improve anomaly sensitivity and feature discrimination. Adopting multi-model fusion with confidence boosting, as demonstrated by AE-YOLO's 95.10% mAP at 0.5, can significantly enhance detection accuracy and recall for rare defect categories in UAV imagery.
Key insights
AE-YOLO integrates autoencoders and attention mechanisms into YOLO for robust UAV-based insulator defect detection, improving mAP and recall.
Principles
- Autoencoders preserve anomaly-sensitive information.
- Attention modules enhance feature discrimination.
- Multi-model fusion improves detection robustness.
Method
AE-YOLO integrates bottleneck autoencoders into FPN-PAN and CBAMs into the backbone. It uses variance-maximizing regularization and a unified loss (focal, CIoU, autoencoder regularization). Inference combines YOLOv8, YOLOv10, YOLO11 via WBF with autoencoder-guided confidence boosting.
In practice
- Inspect transmission lines via UAVs.
- Monitor insulator defects effectively.
- Enhance rare defect category detection.
Topics
- Insulator Defect Detection
- UAV Imaging
- Attention Mechanisms
- Autoencoders
- YOLO Frameworks
- Multi-model Fusion
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.