Seeing the imagined: a latent functional alignment in visual imagery decoding from fMRI data
Summary
Researchers Fabrizio Spera, Tommaso Boccato, and colleagues investigated adapting a state-of-the-art visual perception decoder, DynaDiff, to reconstruct imagined content from fMRI data using the Imagery-NSD benchmark. They introduced a latent functional alignment approach that maps imagery-evoked brain activity into the pretrained model's conditioning space, keeping other components frozen. To address limited matched imagery-perception supervision, a retrieval-based augmentation strategy was developed, selecting semantically related Natural Scenes Dataset (NSD) perception trials. Across four subjects, this latent functional alignment consistently improved high-level semantic reconstruction metrics compared to a frozen pretrained baseline and a voxel-space ridge alignment baseline. The method enabled above-chance decoding from multiple cortical regions, suggesting that semantic structures learned from perception can stabilize and enhance visual imagery decoding under out-of-distribution conditions.
Key takeaway
For AI Scientists and Machine Learning Engineers developing brain-computer interfaces or fMRI decoding models, consider implementing latent functional alignment and retrieval-based data augmentation. This approach significantly enhances semantic reconstruction of imagined content, particularly when working with limited imagery datasets and adapting models initially trained on visual perception. Your focus should be on high-level semantic metrics to accurately assess performance, as pixel-level fidelity may be less critical for internally generated mental images.
Key insights
Latent functional alignment significantly improves fMRI-based visual imagery decoding by leveraging perception-learned semantic structures.
Principles
- Semantic structure from perception aids imagery decoding.
- Imagery decoding benefits from latent functional alignment.
- Distributed cortical networks contribute to visual imagery.
Method
A latent functional alignment approach maps imagery-evoked fMRI activity into a pretrained model's conditioning space. A retrieval-based augmentation strategy selects semantically related perception trials to mitigate data scarcity.
In practice
- Adapt perception decoders for imagery tasks.
- Augment imagery datasets with semantically similar perception data.
- Focus on high-level semantic metrics for imagery reconstruction.
Topics
- fMRI Decoding
- Visual Imagery Reconstruction
- Latent Functional Alignment
- DynaDiff Model
- Natural Scenes Dataset
Best for: AI Scientist, Research Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.