Beyond Classification: A Cough Regression Benchmark for Respiratory Acoustic Foundation Models
Summary
A new multi-model, multi-target cough regression benchmark evaluates five respiratory acoustic foundation models (FMs) across six health targets on three datasets. It explores their ability to predict continuous quantities like age, BMI, and disease probability from cough audio, moving beyond classification. Findings indicate MLP-small regression heads outperform baselines in 23 of 30 cases, while full MLP overfits on small data but recovers on larger sets, revealing a dataset size x head-capacity trade-off. HeAR leads within-dataset age regression on Coswara (9.12 yr MAE). HeAR and M2D+Resp achieve near-full performance at N = 50 samples, significantly more data-efficient than OPERA models requiring N = 400. Cross-dataset transfer is strongly asymmetric, with large diverse data generalizing better to small clinical populations.
Key takeaway
For machine learning engineers developing respiratory acoustic models, prioritize MLP-small regression heads for robust performance on continuous health quantity prediction. Be aware that HeAR and M2D+Resp offer high data efficiency, reaching near-full performance with N=50 samples. When considering transfer learning, focus on pretraining with large, diverse datasets to ensure generalizability to smaller clinical populations.
Key insights
Respiratory acoustic foundation models can predict continuous health quantities from cough audio using regression, moving beyond classification.
Principles
- Optimal regression head capacity depends on dataset size.
- Large, diverse datasets enable asymmetric cross-dataset transfer.
Method
The benchmark evaluates five FMs on six targets across three datasets using linear, MLP-small, and full MLP regression heads under subject-disjoint protocols.
In practice
- HeAR achieves 9.12 yr MAE for age regression on Coswara.
- HeAR and M2D+Resp reach near-full performance at N=50 samples.
Topics
- Cough Regression
- Respiratory Acoustics
- Foundation Models
- Machine Learning Benchmarks
- Transfer Learning
- Health Monitoring
Best for: AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.