Py学习  »  机器学习算法

使用深度学习的雷暴临近预报:多源灾害数据融合模型(附代码+模型数据)

气象学家 • 11 月前 • 1055 次点击  

图/Pexels提供

使用深度学习的雷暴临近预报:多源灾害数据融合模型

雷暴通过各种极端天气现象对人们的生命、财产安全都构成巨大的威胁。一些部门需要对雷暴相关的危害进行预测,包括应急救援部门、基础设施管理部门和航空气象部门。为了满足这一需求,我们提出了一个深度学习模型,可以适应不同的危险类型。该模型可以利用多种数据源;我们使用来自天气雷达、闪电探测仪、卫星可见/红外图像、数值天气预报和数字高程模型的数据。我们展示了该模型在1公里分辨率网格上对闪电、冰雹和强降水进行概率预测的能力,时间分辨率为5分钟,提前预报时效最高可达60分钟,在特定时间和地点发生危害的概率。Shapley值量化了不同数据源的重要性,表明天气雷达产品是所有三种灾害类型的最重要预测因素。

1关键点

  • 我们提出了一个用于预报雷暴灾害的深度学习模型,并对雷电、冰雹和强降水预报能力进行了演示

  • 该模型可以在二维网格上提供这些灾害的概率预警

  • 我们使用可解释的人工智能方法分析了模型中使用的不同数据源的重要性

2预训练模型和结果

数据大小2.4GB

https://zenodo.org/record/7157986#.ZFGxJ6BByhA

3论文代码

Code for paper "Thunderstorm nowcasting with deep learning: a multi-hazard data fusion model" 

https://github.com/MeteoSwiss/c4dl-multi

4文章引用

Leinonen, J., Hamann, U., Sideris, I. V., & Germann, U. (2023). Thunderstorm Nowcasting With Deep Learning: A Multi‐Hazard Data Fusion Model. Geophysical Research Letters, 50(8), e2022GL101626.https://doi.org/10.1029/2022GL101626

5数据源

  • Weather radar observations were collected from the Swiss operational network (Germann et al., 2016, 2022). These data include radar-measured information about the precipitation rate and the vertical structure of the radar reflectivity, such as echo top heights and the vertically integrated liquid water content, at 1 km horizontal resolution.

  • Geostationary satellite imagery was obtained from the Spinning Enhanced Visible and InfraRed Imager (SEVIRI; Schmid, 2000) on the MeteoSat Second Generation 3 (MSG-3) satellite. We used the radiances and brightness temperatures from the visible and infrared (IR) bands; the native resolution of these in the study area is approximately 1 km × 2 km for the high-resolution visible (HRV) band and 3 km × 5 km for the others. The bands that consist mostly of reflected solar radiation were normalized with the function f(x) = x/cos θ, where θ is the solar zenith angle; these bands are unavailable at night. Furthermore, we used the Nowcasting Satellite Application Facility (NWCSAF) cloud top height, cloud top temperature, cloud optical thickness and cloud top phase products (Derrien & Le Gléau, 2005; Hamann et al., 2014; Le Gléau, 2016).

  • Lightning detection measurements were collected by the European Cooperation for Lightning Detection (EUCLID) network of lightning antennas (Poelman et al., 2016; Schulz et al., 2016) and delivered by Météorage. The original data consist of locations and various properties of lightning strikes. We aggregated these into maps of lightning density and current density, as well as binary occurrence maps used in lightning prediction.

  • NWP forecasts originated from the Consortium for Small Scale Modeling (COSMO) model (Baldauf et al., 2011) used operationally at MeteoSwiss. We selected various COSMO outputs relevant to thunderstorms, such as the convective available potential energy (CAPE).

  • Digital elevation model (DEM) data were from the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) global DEM (Abrams et al., 2020) used to model topography in COSMO.

6Neural Network

LHG2022 described a recurrent-convolutional DL model for predicting lightning occurrence (see Figure S2 in Supporting Information S1), based on the model (Leinonen, 2021a, 2021b) used in the Weather4cast 2021 competition (Herruzo et al., 2021) where it outperformed competing architectures such as U-Nets and transformer architectures. We adopt this architecture for each hazard, inheriting the best-performing hyperparameters from LHG2022. The model is built slightly differently for each combination of input data sources, such that only the parts of the model necessary for those inputs are included.

The main architectural change to the DL model in this study is that the prediction of heavy precipitation only uses the last time step of the final layer, which is trained to predict the entire 1-hr accumulation. Furthermore, in contrast to LHG2022 we did not perform model ensembling (Ganaie et al., 2022) due to the required computational cost.

For lightning, we utilize focal loss (Lin et al., 2017) with focusing parameter γ = 2 as the training loss function, so that our results are comparable with LHG2022 where this loss was also adopted. The hail and precipitation targets are defined probabilistically and it is not clear how the focal loss generalizes to such cases. Thus, we use cross entropy (CE) loss, which also performed well in LHG2022 and can be straightforwardly defined for probabilistic targets as:

where is the predicted probability, is the target probability and the sum is over the possible classes . In the case of hail, there are two classes (hail or no hail), while with precipitation, there are four classes as defined by Equation 1.

Training all 96 combinations of targets and data sources takes approximately one month on eight Nvidia V100 GPUs.(配置要求不低啊!) For each target and data source combination, we used the same model architecture and hyperparameters. Ideally, these should be tuned separately for each case to optimize performance, but this would require training each model many times, which would be infeasible with the available resources.

7结果与讨论

Figure 1 (a) The results of lightning prediction. On the left, three input variables (rain rate, lightning, and the satellite HRV channel) are shown at two time steps. On the right, future time steps are shown, with the first row giving the CAPE, the second showing the observed lightning, and the third showing the model prediction of lightning occurrence probability. (b) As (a), but showing the observed and predicted POH instead of lightning. The HRV is not shown as the case occurs at night. (c) As (a) and (b), but showing the probability of accumulated precipitation in the next 60 min exceeding 10 mm. Consequently, only one future time step is shown.
Figure 2 The average loss in the test data set for the prediction of (a) lightning, (b) hail and (c) heavy precipitation using different data sources. In every panel, each square shows the loss for the data sources indicated by the combination of the labels for the corresponding row and column (abbreviated to “Rad” for radar, “Sat” for satellite and “Lig” for lightning). For example, the top left corner of each panel shows the model trained with all five data sources, the bottom left shows the model using only radar and satellite data, and the bottom right shows the “ignorant” model that did not receive any inputs. All loss scores are scaled such that the loss of the ignorant model is set to 1.
Figure 3 The Shapley values (normalized to a sum of 1) of the different data sources, evaluated with the test data set, for the prediction of (a) lightning, (b) hail and (c) heavy precipitation. For lightning and hail, the Shapley values as a function of lead time are also shown; for heavy precipitation, we only predict the 60-min accumulated precipitation so this cannot be displayed.



-----广告赞助商-----







声明:欢迎转载、转发本号原创内容,可留言区留言或者后台联系小编(微信:gavin7675)进行授权。气象学家公众号转载信息旨在传播交流,其内容由作者负责,不代表本号观点。文中部分图片来源于网络,如涉及作品内容、版权和其他问题,请后台联系小编处理。


往期推荐

 获取ERA5-Land陆面高分辨率再分析数据(32TB)

 1942-2022年中国地面气象站观测数据免费分享

 获取全球GPM降水数据,半小时/逐日(4TB)

 获取1998-2019 TRMM 3B42逐日降水数据

★  获取最新版本CMIP6降尺度数据集30TB

★ 获取ERA5常用变量再分析数据26TB

 EC数据商店推出Python在线处理工具箱

★ EC打造实用气象Python工具Metview

★ 机器学习简介及在短临天气预警中的应用

★ Nature-地球系统科学领域的深度学习及理解

★ 采用神经网络与深度学习来预报降水、温度

★ 灵魂拷问:ChatGPT对气象人的饭碗是福是祸?

★ 气象局是做啥的?气象局的薪水多少?

★ 一位气象学家尝试ChatGPT复现Nature子刊的研究,他真的会面临失业吗?!


   欢迎加入气象学家交流群   

请备注:姓名/昵称-单位/学校-研究方向



Python社区是高质量的Python/Django开发社区
本文地址:http://www.python88.com/topic/154410
 
1055 次点击