大型基因组亲缘矩阵求逆算法的优化研究

引用本文

周洁, 曾维俊, 杨天瑞, 程郁斐, 龙贤达, 经佩齐, 曾仰双, 徐旭, 唐国庆. 大型基因组亲缘矩阵求逆算法的优化研究[J]. 畜牧兽医学报, 2020, 51(8): 1804-1810.

ZHOU Jie, ZENG Weijun, YANG Tianrui, CHENG Yufei, LONG Xianda, JING Peiqi, ZENG Yangshuang, XU Xu, TANG Guoqing. Optimization Study of Inverse Algorithm of Large Genomic Relationship Matrix[J]. Acta Veterinaria et Zootechnica Sinica, 2020, 51(8): 1804-1810.

大型基因组亲缘矩阵求逆算法的优化研究

周洁¹, 曾维俊¹, 杨天瑞¹, 程郁斐¹, 龙贤达¹, 经佩齐¹, 曾仰双², 徐旭², 唐国庆¹

1. 四川农业大学, 成都 611130;
2. 四川省畜牧总站, 成都 610041

收稿日期：2020-01-20

基金项目：四川省科技计划项目（20ZDYF1241）；四川生猪创新团队（SCSZTD-3-002）；国家生猪产业技术体系项目（CARS-36-01A）

作者简介：周洁(1995-), 男, 重庆丰都人, 硕士生, 主要从事猪的遗传育种研究, E-mail:1048185949@qq.com.

通信作者：唐国庆, 主要从事猪分子数量遗传学研究, E-mail:tyq003@163.com.

摘要：基因组选择常用的评估方法GBLUP和ssGBLUP都涉及到基因组亲缘矩阵的求逆，而大规模矩阵求逆运算非常耗时。本研究以提高大型基因组亲缘矩阵求逆运算的效率为目的。本研究通过真实数据和模拟数据构建基因组亲缘矩阵，引入Intel MKL矩阵函数，以减少迭代次数（方法1）和重复分块（方法2）两种方式改良分块迭代求逆算法，编程实现算法并在台式电脑和服务器上测试计算时间。结果表明，利用方法1计算4 000×4 000的基因组亲缘矩阵逆矩阵时，与MKL库函数的加速比为0.898。而16 000×16 000矩阵的计算速度为MKL库函数的1.006倍。利用方法2计算4 000×4 000矩阵的运算速度是MKL库函数的1.084倍；而在更大型的128 000×128 000基因组亲缘矩阵求逆运算时，该方法与MKL直接求逆函数的加速比为1.805倍。相比于MKL直接求逆函数，改进后的两种方法在效率上有一定程度的提升。

关键词：基因组选择矩阵求逆分块迭代求逆

Optimization Study of Inverse Algorithm of Large Genomic Relationship Matrix

ZHOU Jie¹, ZENG Weijun¹, YANG Tianrui¹, CHENG Yufei¹, LONG Xianda¹, JING Peiqi¹, ZENG Yangshuang², XU Xu², TANG Guoqing¹

1. Sichuan Agricultural University, Chengdu 611130, China;
2. Sichuan Animal Husbandry Station, Chengdu 610041, China

Corresponding author: TANG Guoqing, E-mail:tyq003@163.com.

[1]	张闻, 何永蜀, 陈元晓. 新一代DNA测序技术[J]. 国际遗传学杂志, 2009, 32(5): 341–344. ZHANG W, HE Y S, CHEN Y X. New-generation DNA sequencing technology[J]. International Journal of Genetics, 2009, 32(5): 341–344. (in Chinese)
[2]	王晨, 秦珂, 薛明, 等. 全基因组选择在猪育种中的应用[J]. 畜牧兽医学报, 2016, 47(1): 1–9. WANG C, QIN K, XUE M, et al. Application of genomic selection in swine breeding[J]. Acta Veterinaria et Zootechnica Sinica, 2016, 47(1): 1–9. (in Chinese)
[3]	赵晓铎, 张鹏, 吕昕哲, 等. 全基因组选择技术在牧场的应用[J]. 中国奶牛, 2019(7): 21–24. ZHAO X D, ZHANG P, LV X Z, et al. New-generation dna sequencing technology[J]. China Dairy Cattle, 2019(7): 21–24. (in Chinese)
[4]	MEUWISSEN T H, HAYES B J, GODDARD M E. Prediction of total genetic value using genome-wide dense marker maps[J]. Genetics, 2001, 157(4): 1819–1829.
[5]	WIGGANS G R, COLE J B, HUBBARD S M, et al. Genomic selection in dairy cattle:the USDA experience[J]. Annu Rev Anim Biosci, 2017, 5: 309–327.
[6]	ELSEN J M. Genomic selection-the final step or another step in an endless race?[J]. J Anim Breed Genet, 2018, 135(2): 95–96.
[7]	李宏伟, 王瑞军, 王志英, 等. 家畜基因组选择研究进展[J]. 遗传, 2017, 39(5): 377–387. LI H W, WANG R J, WANG Z Y, et al. The research progress of genomic selection in livestock[J]. Hereditas (Beijing), 2017, 39(5): 377–387. (in Chinese)
[8]	MISZTAL I, LEGARRA A. Invited review:efficient computation strategies in genomic selection[J]. Animal, 2017, 11(5): 731–736.
[9]	WELLER J I, EZRA E, RON M. Invited review:a perspective on the future of genomic selection in dairy cattle[J]. J Dairy Sci, 2017, 100(11): 8633–8644.
[10]	FRAGOMENI B D, LOURENCO D, TSURUTA S, et al. 0352 Genetics of heat stress in purebred and crossbred pigs from different states using BLUP or ssGBLUP[J]. J Anim Sci, 2016, 94(5): 169–170.
[11]	KOIVULA M, STRANDÉN I, PÖSÖ J, et al. Single-step genomic evaluation using multitrait random regression model and test-day data[J]. J Dairy Sci, 2015, 98(4): 2775–2784.
[12]	KOIVULA M, STRANDÉN I, AAMAND G P, et al.Comparison of ssGBLUP and ssGTBLUP using Nordic Holstein TD data[C]//Proceedings of the World Congress on Genetics Applied to Livestock Production.2018.
[13]	HOWARD J T, RATHJE T A, BRUNS C E, et al. The impact of truncating data on the predictive ability of selection candidate EBV in swine using ssgblup[J]. J Anim Sci, 2018, 96(S2): 18–19.
[14]	SOLLERO B P, HOWARD J T, SPANGLER M L. The impact of reducing the frequency of animals geno-typed at higher density on imputation and prediction accuracies using ssGBLUP[J]. J Anim Sci, 2019, 97(7): 2780–2792.
[15]	BALOCHE G, LEGARRA A, SALLÉ G, et al. Assessment of accuracy of genomic prediction for French Lacaune dairy sheep[J]. J Dairy Sci, 2014, 97(2): 1107–1116.
[16]	MASUDA Y, MISZTAL I, TSURUTA S, et al. Single-step genomic evaluations with 570K genotyped animals in US Holsteins[J]. Interbull Bulletin, 2015(49): 85–89.
[17]	潘荣杨, 张哲, 高宁, 等. 基因组选择一步法理论及应用研究进展[J]. 广东农业科学, 2016, 43(9): 124–131. PAN R Y, ZHANG Z, GAO N, et al. Research progress in Single step procedure theory and application in genomic selection[J]. Guangdong Agricultural Sciences, 2016, 43(9): 124–131. (in Chinese)
[18]	MISZTAL I, LEGARRA A, AGUILAR I. Computing procedures for genetic evaluation including phenotypic, full pedigree, and genomic information[J]. J Dairy Sci, 2009, 92(9): 4648–4655.
[19]	LEGARRA A, AGUILAR I, MISZTAL I. A relationship matrix including full pedigree and genomic information[J]. J Dairy Sci, 2009, 92(9): 4656–4663.
[20]	ZHANG X Y, LOURENCO D, AGUILAR I, et al. Weighting strategies for single-step genomic BLUP:an iterative approach for accurate calculation of GEBV and GWAS[J]. Front Genet, 2016, 7: 151.
[21]	彭潇, 尹立林, 梅全顺, 等. 猪主要经济性状的基因组选择研究[J]. 畜牧兽医学报, 2019, 50(2): 439–445. PENG X, YIN L L, MEI Q S, et al. A study of genome selection based on the porcine major economic traits[J]. Acta Veterinaria et Zootechnica Sinica, 2019, 50(2): 439–445. (in Chinese)
[22]	KINCAID D R, RESPESS J R, YOUNG D M.Itpack 2C: a FORTRAN package for solving large sparse linear systems by adaptative accelerated iterative methods[R]. Austin: University of Texas, 1997. https://dl.acm.org/doi/10.1145/356004.356009
[23]	MADSEN P, JENSEN J, LABOURIAU R, et al.DMU-a package for analyzing multivariate mixed models in quantitative genetics and genomics[C]//Proceedings of the 10th World Congress of Genetics Applied to Livestock Production.2014: 18-22.
[24]	周学勇. 计算机在养猪生产管理和育种中的应用[J]. 现代农业科技, 2011(12): 34–35. ZHOU X Y. Application of computer in pig production management and breeding[J]. Modern Agricultural Science and Technology, 2011(12): 34–35. (in Chinese)
[25]	AGUILAR I, MISZTAL I, LEGARRA A, et al. Efficient computation of the genomic relationship matrix and other matrices used in single-step evaluation[J]. J Anim Breed Genet, 2011, 128(6): 422–428.
[26]	FORNI S, AGUILAR I, MISZTAL I. Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic information[J]. Genet Sel Evol, 2011, 43(1): 1.
[27]	MISZTAL I, TSURUTA S, AGUILAR I, et al. Methods to approximate reliabilities in single-step genomic evaluation[J]. J Dairy Sci, 2013, 96(1): 647–654.
[28]	MEYER K, TIER B, GRASER H U. Technical note:updating the inverse of the genomic relationship matrix[J]. J Anim Sci, 2013, 91(6): 2583–2586.
[29]	MISZTAL I. Inexpensive computation of the inverse of the genomic relationship matrix in populations with small effective population size[J]. Genetics, 2016, 202(2): 401–409.
[30]	FRAGOMENI B O, LOURENCO D A L, TSURUTA S, et al. Use of genomic recursions and algorithm for proven and young animals for single-step genomic BLUP analyses-a simulation study[J]. J Anim Breed Genet, 2015, 132(5): 340–345.
[31]	FAUX P, GENGLER N, MISZTAL I. A recursive algorithm for decomposition and creation of the inverse of the genomic relationship matrix[J]. J Dairy Sci, 2012, 95(10): 6093–6102.
[32]	毛汉清. 可逆矩阵的分块求逆方法研究[J]. 上海铁道学院学报, 1994, 15(3): 110–117. MAO H Q. Research of methods to obtain an inverse matrix by partitioning an inversible matrix[J]. Journal of Shanghai Institute of Railway Technology, 1994, 15(3): 110–117. (in Chinese)
[33]	张国亮, 沈慧, 石峰, 等. 大型实对称矩阵分块迭代求逆算法[J]. 无线互联科技, 2015(6): 127–129. ZHANG G L, SHEN H, SHI F, et al. Block iterative inverse algorithm for a iarge-scale real matrix[J]. Wireless Internet Technology, 2015(6): 127–129. (in Chinese)
[34]	VANRADEN P M. Efficient methods to compute genomic predictions[J]. J Dairy Sci, 2008, 91(11): 4414–4423.
[35]	AGUILAR I, MISZTAL I, JOHNSON D L, et al. Hot topic:a unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score[J]. J Dairy Sci, 2010, 93(2): 743–752.
[36]	SARGOLZAEI M, SCHENKEL F S. QMSim:a large-scale genome simulator for livestock[J]. Bioinformatics, 2009, 25(5): 680–681.


畜牧兽医学报 2020, Vol. 51 Issue (8): 1804-1810. DOI: 10.11843/j.issn.0366-6964.2020.08.004	PDF