Can AI models reliably forecast extreme weather events?

· Source: Machine learning : nature.com subject feeds · Field: Science & Research — Environmental Science & Earth Systems, Mathematics & Computational Sciences, Research Methodology & Innovation · Depth: Intermediate, quick

Summary

Artificial intelligence (AI) models are emerging as a faster alternative to traditional physics-based numerical weather prediction systems, capable of producing a 14-day global forecast two hours earlier. Unlike conventional models that solve complex physical equations, AI models map current conditions to future states using algorithms trained on historical weather data, shifting heavy computing to the training phase. However, concerns exist regarding their reliability in forecasting rare, extreme weather events, as AI systems trained on past data may falter with unprecedented events. While some agencies, like the European Centre for Medium-Range Weather Forecasts, are integrating AI, scientists emphasize the urgent need for rigorous, standardized testing protocols to objectively evaluate AI models' performance on out-of-sample extreme events before widespread adoption by public forecasting agencies.

Key takeaway

For meteorologists and climate scientists evaluating AI integration into operational forecasting, you should prioritize the development and adoption of standardized benchmarking protocols. Your focus must be on rigorously testing AI models against a consensus-driven set of "iconic" extreme weather events that were not part of their training data. This ensures that the speed advantage of AI does not compromise accuracy and reliability when forecasting critical, rare phenomena.

Key insights

AI weather models offer speed but require rigorous, standardized testing for extreme event reliability.

Principles

Method

The AI Retraining Without Iconic Events (AIRWIE) protocol proposes training AI systems while withholding a designated set of "iconic" extreme events solely for testing, ensuring evaluation against out-of-sample extremes.

In practice

Topics

Best for: AI Scientist, Research Scientist, AI Researcher, MLOps Engineer, Policy Maker

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.