M$^2$FedAQI: Multimodal Federated Learning for Air Quality Prediction on Heterogeneous Edge Devices
Summary
M^2FedAQI is a novel multimodal federated learning framework designed for decentralized Air Quality Index (AQI) prediction on heterogeneous edge devices. Submitted on May 10, 2026, this lightweight framework addresses the scalability, privacy, and communication overhead challenges of centralized and unimodal federated learning approaches in IoT environments. It integrates visual and tabular data modalities using a feature modulation-based fusion mechanism to enable efficient cross-modal interaction with low computational overhead. Evaluated on PM25Vision and TRAQID datasets for both classification and regression, M^2FedAQI significantly outperforms existing baselines, achieving up to 11.0% higher Accuracy, 3.53% higher AUC, 12.2% higher F1-score, and 18.0% higher R^2, while reducing MAE by up to 25.4% and RMSE by up to 20.4%. The framework also incorporates TLS-based authentication for secure client participation and efficient resource utilization on edge devices.
Key takeaway
For research scientists developing environmental monitoring systems, M^2FedAQI demonstrates a robust approach to air quality prediction. You should consider adopting multimodal federated learning architectures to improve prediction accuracy and ensure data privacy on distributed edge networks, especially when dealing with diverse data sources and heterogeneous devices.
Key insights
M^2FedAQI offers a multimodal federated learning solution for accurate, private, and efficient air quality prediction on edge devices.
Principles
- Multimodal data fusion enhances environmental pattern capture.
- Federated learning improves privacy and scalability.
- Edge deployment requires efficient resource utilization.
Method
M^2FedAQI integrates visual and tabular modalities via a feature modulation fusion mechanism, enabling decentralized AQI prediction on heterogeneous edge devices with TLS-based authentication.
In practice
- Deploy multimodal models on IoT edge devices.
- Use TLS for secure federated learning communication.
- Evaluate models with PM25Vision and TRAQID datasets.
Topics
- M$^2$FedAQI
- Multimodal Federated Learning
- Air Quality Prediction
- Heterogeneous Edge Devices
- Feature Modulation
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Hardware Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.LG updates on arXiv.org.