A foundation model for electrodermal activity data

· Source: cs.LG updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Internet of Things (IoT) & Connected Devices · Depth: Advanced, extended

Summary

Researchers have developed UME, the first dedicated open-source foundation model for electrodermal activity (EDA) data, addressing a critical gap in large-scale, curated, and openly accessible physiological datasets. To train UME, they compiled EDAMAME, a collection of EDA traces from 24 public datasets, totaling over 25,000 hours from 634 users, all collected using Empatica E4 devices. UME, an EfficientNet-based model trained using contrastive learning on approximately 275 million 60-second EDA windows, demonstrates strong performance. In 8 out of 10 downstream classification tasks, UME outperforms traditional handcrafted feature baselines and matches the performance of generalist time-series foundation models like Mantis, while requiring 20x fewer computational resources. The project releases all datasets, model weights, and code to foster further research in wearable EDA modeling.

Key takeaway

For AI Scientists and Research Scientists working with physiological signals, UME and the EDAMAME dataset offer a significant advancement in electrodermal activity (EDA) modeling. You should explore integrating UME's pre-trained weights or EDAMAME into your projects to improve performance on EDA-related tasks, especially given its computational efficiency compared to generalist models. This work provides a robust foundation for developing more accurate and resource-friendly wearable health applications.

Key insights

UME is the first open-source foundation model for EDA, outperforming baselines with 20x less computation.

Principles

Method

The method involves compiling EDAMAME from 24 public datasets, pre-processing EDA signals (filtering, phasic/tonic decomposition, 60-second windowing), and training an EfficientNet architecture with contrastive learning using InfoNCE loss and EDA-specific data augmentations.

In practice

Topics

Best for: AI Scientist, Research Scientist, AI Researcher, Machine Learning Engineer, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.LG updates on arXiv.org.