Intracranial Aneurysm Classification and Segmentation via Tri-Axial ROI and Multi-Task Learning

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision & Pattern Recognition, Medical Imaging Analysis · Depth: Expert, quick

Summary

A new multi-task framework addresses the limitations of existing automated methods for intracranial aneurysm detection, which typically offer only binary detection. This framework simultaneously performs multi-label classification, multi-class aneurysm segmentation, and multi-class vessel segmentation across 13 anatomical locations and four imaging modalities: CTA, MRA, T2, and T1-post. The two-stage approach integrates a fast 2D tri-axial Region of Interest (ROI) extraction method with a 3D multi-task nnU-Net backbone. Key architectural features include a dual-decoder design to mitigate extreme volume imbalance between aneurysm and vessel classes, alongside cross-attention pooling and modality-specific auxiliary heads for enhanced feature learning across heterogeneous inputs. A two-fold ensemble of this system secured 2nd place in the RSNA 2025 Intracranial Aneurysm Detection challenge. Code, model weights, and a 3D Slicer plugin are publicly available.

Key takeaway

For Computer Vision Engineers developing automated medical image analysis systems for intracranial aneurysms, this multi-task framework offers a robust blueprint. You should consider adopting a two-stage approach with 2D ROI extraction and a 3D multi-task nnU-Net, especially when dealing with diverse imaging modalities and severe class imbalance. Implementing a dual-decoder and cross-attention pooling can significantly improve the accuracy of both classification and fine-grained multi-class segmentation, moving beyond simple binary detection.

Key insights

A multi-task framework enhances intracranial aneurysm analysis by integrating classification and multi-class segmentation across diverse modalities.

Principles

Method

A two-stage process involving fast 2D tri-axial ROI extraction, followed by a 3D multi-task nnU-Net backbone with a dual-decoder, cross-attention pooling, and modality-specific auxiliary heads for classification and segmentation.

In practice

Topics

Best for: AI Scientist, Computer Vision Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.