
Thunderstorm nowcasting with deep learning: a multi-hazard data fusion model

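Thunderstorms pose a major threat to life and property through a range of extreme weather phenomena. Several sectors need predictions of thunderstorm-related hazards, including emergency response, infrastructure management and aviation meteorology. To meet this need, we present a deep learning model that can be adapted to different hazard types. The model can use multiple data sources; we use data from weather radar, lightning detection, satellite visible/infrared imagery, numerical weather prediction (NWP) and digital elevation models. We demonstrate the model's ability to predict lightning, hail and heavy precipitation probabilistically on a 1 km resolution grid, with a time resolution of 5 minutes and lead times of up to 60 minutes, giving the probability that a hazard occurs at a given time and place. Shapley values quantify the importance of the different data sources and show that weather radar products are the most important predictors for all three hazard types.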

1 Key points

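We present a deep learning model for nowcasting thunderstorm hazards and demonstrate it on lightning, hail and heavy precipitation
The model can provide probabilistic warnings of these hazards on a two-dimensional grid
We use explainable AI methods to analyze the importance of the different data sources used in the model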

2 Pretrained models and results

Data size: 2.4 GB

https://zenodo.org/record/7157986#.ZFGxJ6BByhA

3 Paper code

Code for paper "Thunderstorm nowcasting with deep learning: a multi-hazard data fusion model"

https://github.com/MeteoSwiss/c4dl-multi

4 Citation

Leinonen, J., Hamann, U., Sideris, I. V., & Germann, U. (2023). Thunderstorm Nowcasting With Deep Learning: A Multi‐Hazard Data Fusion Model. Geophysical Research Letters, 50(8), e2022GL101626. https://doi.org/10.1029/2022GL101626

5 Data sources

Weather radar observations were collected from the Swiss operational network (Germann et al., 2016, 2022). These data include radar-measured information about the precipitation rate and the vertical structure of the radar reflectivity, such as echo top heights and the vertically integrated liquid water content, at 1 km horizontal resolution.
Geostationary satellite imagery was obtained from the Spinning Enhanced Visible and InfraRed Imager (SEVIRI; Schmid, 2000) on the MeteoSat Second Generation 3 (MSG-3) satellite. We used the radiances and brightness temperatures from the visible and infrared (IR) bands; the native resolution of these in the study area is approximately 1 km × 2 km for the high-resolution visible (HRV) band and 3 km × 5 km for the others. The bands that consist mostly of reflected solar radiation were normalized with the function f(x) = x/cos θ, where θ is the solar zenith angle; these bands are unavailable at night. Furthermore, we used the Nowcasting Satellite Application Facility (NWCSAF) cloud top height, cloud top temperature, cloud optical thickness and cloud top phase products (Derrien & Le Gléau, 2005; Hamann et al., 2014; Le Gléau, 2016).
Lightning detection measurements were collected by the European Cooperation for Lightning Detection (EUCLID) network of lightning antennas (Poelman et al., 2016; Schulz et al., 2016) and delivered by Météorage. The original data consist of locations and various properties of lightning strikes. We aggregated these into maps of lightning density and current density, as well as binary occurrence maps used in lightning prediction.
NWP forecasts originated from the Consortium for Small Scale Modeling (COSMO) model (Baldauf et al., 2011) used operationally at MeteoSwiss. We selected various COSMO outputs relevant to thunderstorms, such as the convective available potential energy (CAPE).
Digital elevation model (DEM) data were from the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) global DEM (Abrams et al., 2020) used to model topography in COSMO.
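
Two of the preprocessing steps described above are easy to illustrate: the solar zenith normalization f(x) = x/cos θ for the reflected-solar satellite bands, and the aggregation of individual lightning strikes into gridded maps. The following NumPy sketch is not taken from the paper's code. The function names, the 85° daylight cutoff, the use of projected coordinates in kilometres, and the reading of "current density" as summed absolute peak current per cell area are all assumptions made for illustration; aggregation over time windows is left out.

```python
import numpy as np


def normalize_solar_band(x, zenith_deg, max_zenith_deg=85.0):
    """Apply f(x) = x / cos(theta) to an array from a reflected-solar band.

    Pixels where the solar zenith angle reaches `max_zenith_deg` (an assumed
    cutoff, not from the source) are set to NaN, since the band carries no
    usable signal at night.
    """
    x = np.asarray(x, dtype=float)
    zenith = np.broadcast_to(np.asarray(zenith_deg, dtype=float), x.shape)
    out = np.full(x.shape, np.nan)
    day = zenith < max_zenith_deg
    out[day] = x[day] / np.cos(np.deg2rad(zenith[day]))
    return out


def grid_lightning(x_km, y_km, current_ka, extent_km, cell_km=1.0):
    """Aggregate strike points into density, current-density and occurrence maps.

    `extent_km` is (x_min, x_max, y_min, y_max) in projected kilometres.
    Rows of the returned maps follow y, columns follow x.
    """
    x0, x1, y0, y1 = extent_km
    nx = int(round((x1 - x0) / cell_km))
    ny = int(round((y1 - y0) / cell_km))
    bins = [np.linspace(y0, y1, ny + 1), np.linspace(x0, x1, nx + 1)]
    # Number of strikes per cell
    count, _, _ = np.histogram2d(y_km, x_km, bins=bins)
    # Summed absolute peak current per cell (assumed meaning of current density)
    current, _, _ = np.histogram2d(
        y_km, x_km, bins=bins, weights=np.abs(current_ka)
    )
    area = cell_km ** 2
    return count / area, current / area, count > 0


# Three hypothetical strikes on a 3 km x 2 km domain
density, current_density, occurrence = grid_lightning(
    x_km=np.array([0.5, 0.7, 2.5]),
    y_km=np.array([0.5, 0.2, 1.5]),
    current_ka=np.array([-10.0, 20.0, 5.0]),
    extent_km=(0.0, 3.0, 0.0, 2.0),
)
```

The binary `occurrence` map corresponds to the kind of target described for lightning prediction, while the two density maps would serve as model inputs.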

6 Neural Network

LHG2022 described a recurrent-convolutional DL model for predicting lightning occurrence (see Figure S2 in Supporting Information S1), based on the model (Leinonen, 2021a, 2021b) used in the Weather4cast 2021 competition (Herruzo et al., 2021) where it outperformed competing architectures such as U-Nets and transformer architectures. We adopt this architecture for each hazard, inheriting the best-performing hyperparameters from LHG2022. The model is built slightly differently for each combination of input data sources, such that only the parts of the model necessary for those inputs are included.

The main architectural change to the DL model in this study is that the prediction of heavy precipitation only uses the last time step of the final layer, which is trained to predict the entire 1-hr accumulation. Furthermore, in contrast to LHG2022 we did not perform model ensembling (Ganaie et al., 2022) due to the required computational cost.

For lightning, we utilize focal loss (Lin et al., 2017) with focusing parameter γ = 2 as the training loss function, so that our results are comparable with LHG2022 where this loss was also adopted. The hail and precipitation targets are defined probabilistically and it is not clear how the focal loss generalizes to such cases. Thus, we use cross entropy (CE) loss, which also performed well in LHG2022 and can be straightforwardly defined for probabilistic targets as:

$$\mathrm{CE}(p, y) = -\sum_{c} y_c \log p_c$$

where p_c is the predicted probability, y_c is the target probability and the sum is over the possible classes c. In the case of hail, there are two classes (hail or no hail), while with precipitation, there are four classes as defined by Equation 1.
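
To make the two loss functions concrete, here is a minimal NumPy sketch. It is an illustration only, not the training code from the repository: the function names and the clipping constant `EPS` are assumptions, and the optional class-balancing weight α from Lin et al. (2017) is omitted. The cross-entropy function takes probabilistic ("soft") targets, with the class axis last; the focal loss takes binary targets, as in the lightning case.

```python
import numpy as np

EPS = 1e-7  # assumed clipping constant to avoid log(0)


def cross_entropy_soft(p, y):
    """Cross entropy -sum_c y_c * log(p_c), averaged over all samples.

    `p` and `y` hold predicted and target class probabilities; the last
    axis indexes the classes and each row of `y` sums to 1.
    """
    p = np.clip(np.asarray(p, dtype=float), EPS, 1.0)
    y = np.asarray(y, dtype=float)
    return float(np.mean(-np.sum(y * np.log(p), axis=-1)))


def binary_focal_loss(p, y, gamma=2.0):
    """Focal loss -(1 - p_t)^gamma * log(p_t) for binary targets.

    `p` is the predicted probability of the positive class and `y` is 0 or 1.
    With gamma = 0 this reduces to ordinary binary cross entropy.
    """
    p = np.clip(np.asarray(p, dtype=float), EPS, 1.0 - EPS)
    p_t = np.where(np.asarray(y) == 1, p, 1.0 - p)
    return float(np.mean(-((1.0 - p_t) ** gamma) * np.log(p_t)))


# A two-class soft target (e.g. 60% probability of hail) and a prediction
soft_ce = cross_entropy_soft(np.array([[0.7, 0.3]]), np.array([[0.6, 0.4]]))

# A confident, correct lightning prediction contributes very little loss
easy_focal = binary_focal_loss(np.array([0.9]), np.array([1]))
```

The factor (1 - p_t)^γ is what down-weights easy examples: for the confident prediction above, γ = 2 scales the ordinary cross-entropy term by 0.01.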

Training all 96 combinations of targets and data sources takes approximately one month on eight Nvidia V100 GPUs. (That is no small hardware requirement!) For each target and data source combination, we used the same model architecture and hyperparameters. Ideally, these should be tuned separately for each case to optimize performance, but this would require training each model many times, which would be infeasible with the available resources.
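
The figure of 96 is consistent with three hazard targets times all 2^5 = 32 subsets of the five data sources, counting the empty subset. That breakdown is an assumption, since the text above does not spell it out, but exact Shapley values do require a score for every subset. The sketch below enumerates the subsets and computes exact Shapley values from a table of per-subset scores. The source names, the additive toy scores and the choice of score are hypothetical and are not results from the paper.

```python
from itertools import combinations
from math import factorial

SOURCES = ("radar", "lightning", "satellite", "nwp", "dem")
HAZARDS = ("lightning", "hail", "precipitation")


def all_subsets(items):
    """Return every subset of `items`, including the empty set, as frozensets."""
    return [
        frozenset(combo)
        for size in range(len(items) + 1)
        for combo in combinations(items, size)
    ]


def shapley_values(score, players):
    """Exact Shapley values given score[subset] for every subset of players."""
    n = len(players)
    values = {}
    for player in players:
        others = [q for q in players if q != player]
        total = 0.0
        for subset in all_subsets(others):
            # Weight of this subset in the Shapley average over orderings
            weight = (
                factorial(len(subset)) * factorial(n - len(subset) - 1)
                / factorial(n)
            )
            # Marginal gain from adding `player` to `subset`
            total += weight * (score[subset | {player}] - score[subset])
        values[player] = total
    return values


# Invented additive toy scores, for illustration only (not paper results)
TOY_GAIN = {"radar": 0.30, "lightning": 0.10, "satellite": 0.08,
            "nwp": 0.04, "dem": 0.01}
toy_score = {s: sum(TOY_GAIN[p] for p in s) for s in all_subsets(SOURCES)}

n_models = len(HAZARDS) * len(all_subsets(SOURCES))
phi = shapley_values(toy_score, SOURCES)
```

Because the toy scores are additive, each source's Shapley value equals its own gain. With real per-subset scores from trained models, interactions between sources would make the values differ from any single-source comparison.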