Physics-Guided Spatiotemporal Learning for Coastal Wave Peak Period Estimation from Video

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Coastal Oceanography · Depth: Advanced, extended

Summary

A novel Physics-Guided Deep Spatiotemporal Learning Framework is proposed for direct estimation of nearshore wave peak periods ($T_{p}$) from passive coastal video streams. This framework addresses limitations of traditional monitoring systems and existing deep learning methods by integrating automated temporal-variance-based region-of-interest (ROI) detection, multi-stage Sim-to-Real transfer learning, and physics-informed regularization. Evaluations of various spatiotemporal architectures, including transformer-based (LtViViT) and recurrent-convolutional (TinyWaveNet), revealed that LtViViT achieved the lowest instantaneous prediction RMSE of 0.7692 s. Conversely, the lightweight PtAttnCNN (TinyWaveNet) with \u03bb=5.0 demonstrated superior operational oceanographic skill (WS: 0.9811, SI: 0.0892), low parameters (0.28 M), and fast inference (0.53 ms). Physics-guided regularization significantly enhanced trend-following consistency and eliminated physically implausible predictions, while automated ROI detection improved accuracy by 27%. The framework shows promise for cost-efficient, long-term coastal wave monitoring.

Key takeaway

For ML Engineers developing coastal monitoring systems or coastal managers seeking cost-effective solutions, this framework offers a robust approach to estimate wave peak periods from video. You should consider the PtAttnCNN (TinyWaveNet) with \u03bb=5.0 for operational deployments, prioritizing its high oceanographic skill (WS: 0.9811), low parameter count (0.28 M), and sub-millisecond inference (0.53 ms). This balances physical consistency with real-time feasibility, providing a reliable alternative to traditional sensors.

Key insights

A physics-guided deep spatiotemporal learning framework accurately estimates nearshore wave peak periods from video, leveraging multi-stage transfer learning and physical constraints.

Principles

Method

The framework employs temporal-variance ROI detection, synthetic pre-training (Airy wave theory), noisy "silver" data pre-training, and expert "gold" data fine-tuning with a physics-guided loss function.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.