The Optimization of GRAPES Global Tangent Linear Model and Adjoint Model

Cite this article

Liu Yongzhu, Zhang Lin, Jin Zhiyan. The Optimization of GRAPES Global Tangent Linear Model and Adjoint Model[J]. Journal of Applied Meteorological Science, 2017, 28(1): 62-71.

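Liu Yongzhu, Zhang Lin, Jin Zhiyan

Numerical Weather Prediction Center, China Meteorological Administration, Beijing 100081

Received 2016-03-22; revised manuscript received 2016-10-12.

Funding: Special Fund for Meteorological Scientific Research in the Public Interest (GYHY201506003); National Key Technology R&D Program of the 12th Five-Year Plan (2012BAC22B02); CMA Special Program for GRAPES Numerical Weather Prediction Development (GRAPES-FZZX-2016-13)

Corresponding author: Liu Yongzhu, email: liuyzh@cma.gov.cn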

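Abstract: The adjoint technique is the best way to compute the gradient of the cost function in a four-dimensional variational data assimilation (4DVar) system, and the effectiveness and efficiency of the tangent linear and adjoint models directly affect the development of the 4DVar system. Starting from version 1.0 of the GRAPES (Global and Regional Assimilation PrEdiction System) global tangent linear and adjoint models, and drawing on the improvements in parallel framework and performance made in version 2.0 of the GRAPES global model, version 2.0 of the GRAPES global tangent linear and adjoint models was redesigned and optimized to improve both effectiveness and efficiency. The program structure of the tangent linear model was optimized so that its computing time can at best be kept within 1.2 times that of the nonlinear model. The program structure of the adjoint model was rebuilt around saving the base state in the tangent linear model, so that its computing time can at best be kept within 1.5 times that of the nonlinear model. In the design of the GRAPES global tangent linear physics, the computation of the trajectory base state for the linearized physics was decoupled from the computation of the tangent linear perturbations, which further improved the effectiveness and efficiency of the GRAPES global tangent linear and adjoint models.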

Keywords: tangent linear model; adjoint model; four-dimensional variational data assimilation; GRAPES model
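
The role of the adjoint in computing the 4DVar cost-function gradient can be written out using the standard incremental formulation. The notation below is an assumption for illustration and is not taken from the paper: \delta x_0 is the initial increment, B and R_i are the background and observation error covariances, M_{0\to i} is the tangent linear model from time 0 to time i, H_i is the linearized observation operator, and d_i is the innovation at time i.

```latex
J(\delta x_0) = \tfrac{1}{2}\,\delta x_0^{\mathrm T} B^{-1}\,\delta x_0
  + \tfrac{1}{2}\sum_{i=0}^{N}
    \bigl(H_i M_{0\to i}\,\delta x_0 - d_i\bigr)^{\mathrm T} R_i^{-1}
    \bigl(H_i M_{0\to i}\,\delta x_0 - d_i\bigr)

\nabla_{\delta x_0} J = B^{-1}\,\delta x_0
  + \sum_{i=0}^{N} M_{0\to i}^{\mathrm T} H_i^{\mathrm T} R_i^{-1}
    \bigl(H_i M_{0\to i}\,\delta x_0 - d_i\bigr)
```

Each minimization iteration therefore needs one tangent linear run (to apply M_{0\to i}) and one adjoint run (to apply M_{0\to i}^{\mathrm T}), which is why the cost of these two models relative to the nonlinear model matters so much for 4DVar.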

The Optimization of GRAPES Global Tangent Linear Model and Adjoint Model

Liu Yongzhu, Zhang Lin, Jin Zhiyan

Numerical Weather Prediction Center of CMA, Beijing 100081

Abstract: Adjoint models are widely applied in numerical weather prediction. In four-dimensional variational data assimilation (4DVar), for instance, they are the best method for efficiently determining optimal initial conditions. The minimization of the 4DVar cost function is solved with an iterative algorithm and is computationally demanding. Although the minimization is usually performed at a much lower resolution than that of the forecast model, obtaining the optimal model state requires dozens of iterations, so the models must run with high parallel efficiency. However, the parallel efficiency of version 1.0 of the GRAPES global tangent linear and adjoint models, which is based on GRAPES global non-linear model version 1.0, is so low that it seriously hampers the development of GRAPES_4DVar. To reduce the computational cost, version 2.0 of the tangent linear and adjoint models is redesigned and redeveloped on the basis of GRAPES global model version 2.0. By optimizing the program structure of the tangent linear model, its computing time can at best be kept within 1.2 times that of the GRAPES non-linear model when only the dynamical framework is run. By passing the model base state and trajectory to the adjoint model, the computing time of the GRAPES adjoint model can at best be kept within 1.5 times that of the GRAPES non-linear model. The new version 2.0 therefore achieves the computational efficiency needed to speed up the development of GRAPES_4DVar. In practical applications, the tangent linear and adjoint models are run at a lower resolution than the non-linear model. Since the dynamics are already simplified through the reduction in horizontal resolution, the linearized physics does not need to be exactly tangent to the full physics. In principle, physical parameterizations can already behave differently between the non-linear and tangent linear models because of the change in resolution. To reduce computational cost, it is often necessary to select a set of simplified linearized parameterizations that differs from the full physical processes of the GRAPES forecast model. By decoupling the base-state calculation in GRAPES from the perturbation calculation in the tangent linear and adjoint models, the computational cost of the GRAPES tangent linear and adjoint models with simplified physical parameterizations is only slightly higher than that of the versions without physical parameterizations, and their computational efficiency is higher than that of the GRAPES forecast model with full physical parameterizations.

Key words: tangent linear model; adjoint model; 4DVar; GRAPES
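
The idea of handing the base-state trajectory from the tangent linear model (TLM) to the adjoint model (ADM) can be illustrated with a minimal sketch. It is not GRAPES code: the Lorenz-63 system with a forward-Euler step stands in for the non-linear model (NLM), and every name here (`nlm_step`, `jacobian`, `run_tlm`, `run_adm`, `dot_product_test`) is hypothetical. The TLM stores the base state at each step, and the ADM sweeps backward over the stored states without recomputing the NLM. The dot-product test checks that the two are consistent.

```python
import numpy as np

# Toy parameters (assumed for illustration; unrelated to GRAPES)
SIGMA, RHO, BETA, DT = 10.0, 28.0, 8.0 / 3.0, 0.01


def nlm_step(x):
    """One forward-Euler step of the Lorenz-63 toy non-linear model."""
    tendency = np.array([
        SIGMA * (x[1] - x[0]),
        x[0] * (RHO - x[2]) - x[1],
        x[0] * x[1] - BETA * x[2],
    ])
    return x + DT * tendency


def run_nlm(x0, nsteps):
    """Integrate the non-linear model for nsteps steps."""
    x = x0.copy()
    for _ in range(nsteps):
        x = nlm_step(x)
    return x


def jacobian(x):
    """Jacobian of nlm_step evaluated at base state x."""
    return np.eye(3) + DT * np.array([
        [-SIGMA, SIGMA, 0.0],
        [RHO - x[2], -1.0, -x[0]],
        [x[1], x[0], -BETA],
    ])


def run_tlm(x0, dx0, nsteps):
    """Propagate perturbation dx0 and save the base state at every step."""
    x, dx, saved = x0.copy(), dx0.copy(), []
    for _ in range(nsteps):
        saved.append(x.copy())   # base state kept for the adjoint sweep
        dx = jacobian(x) @ dx    # perturbation update about the base state
        x = nlm_step(x)          # base-state update
    return dx, saved


def run_adm(saved, adj):
    """Backward sweep that reuses saved base states (no NLM recomputation)."""
    adj = adj.copy()
    for x in reversed(saved):
        adj = jacobian(x).T @ adj
    return adj


def dot_product_test(x0, dx0, nsteps):
    """Return <M dx, M dx> and <dx, M^T M dx>; they should agree to round-off."""
    dx_end, saved = run_tlm(x0, dx0, nsteps)
    lhs = dx_end @ dx_end
    rhs = dx0 @ run_adm(saved, dx_end)
    return lhs, rhs


if __name__ == "__main__":
    lhs, rhs = dot_product_test(np.array([1.0, 1.0, 1.0]),
                                np.array([1.0, -0.5, 0.25]), 50)
    print(lhs, rhs)
```

Storing the trajectory trades memory for time: the adjoint avoids re-running the non-linear model, at the price of keeping one base state per time step.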

Fig.1 The design of trajectory and base state in GRAPES Global 4DVar

Fig.2 The design of tangent linear physics

Fig.4 The mean absolute deviation of the potential temperature perturbation: (a) TLM without physical processes, (b) TLM with simplified tangent linear physical processes

Fig.5 Improvement in computational efficiency from increasing memory

Fig.6 The speedup efficiency of GRAPES Global NLM, TLM and ADM