Learning Under Extreme Data Scarcity: Subject-Level Evaluation of Lightweight CNNs for fMRI-Based Prodromal Parkinsons Detection
Summary
A study on prodromal Parkinson's disease detection using resting-state fMRI data from 40 subjects (20 prodromal cases, 20 controls) investigates deep learning under extreme data scarcity. It fine-tunes ImageNet-pretrained convolutional neural networks, including VGG19, Inception V3, Inception ResNet V2, and MobileNet V1. The research highlights that common image-level data splits lead to information leakage and inflated accuracy, whereas a strict subject-level split reveals test accuracies between 60% and 81%. MobileNet V1, a lightweight architecture, consistently outperforms deeper models like VGG19 under subject-level evaluation, suggesting that evaluation strategy and model capacity are more critical than architectural depth in low-data scenarios.
Key takeaway
For AI Scientists developing deep learning models with scarce neuroimaging data, you must enforce strict subject-level data splits to prevent information leakage and obtain realistic performance metrics. Your focus should be on model capacity and robust evaluation strategies rather than simply increasing architectural depth, as lightweight models like MobileNet V1 can offer superior generalization in these challenging low-data regimes.
Key insights
Subject-level data splitting and model capacity are crucial for reliable deep learning evaluation in extreme data scarcity.
Principles
- Avoid image-level data splits in neuroimaging.
- Model capacity impacts generalization in low-data.
- Evaluation strategy is critical for performance.
Method
Fine-tuning ImageNet-pretrained CNNs on fMRI data, comparing performance under image-level vs. strict subject-level data partitioning, and evaluating models with varying capacities.
In practice
- Use subject-level data splits for fMRI analysis.
- Consider lightweight CNNs like MobileNet V1.
- Prioritize evaluation strategy over model depth.
Topics
- Prodromal Parkinson's Detection
- fMRI Analysis
- Data Scarcity Learning
- Lightweight CNNs
- Subject-Level Evaluation
Best for: AI Scientist, Research Scientist, AI Researcher, Machine Learning Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CV updates on arXiv.org.