基于单倍型肉牛屠宰性状全基因组关联分析研究

引用本文

李宏伟, 徐凌洋, 王泽昭, 蔡文涛, 朱波, 陈燕, 高雪, 张路培, 高会江, 李俊雅. 基于单倍型肉牛屠宰性状全基因组关联分析研究[J]. 畜牧兽医学报, 2022, 53(12): 4232-4243.

LI Hongwei, XU Lingyang, WANG Zezhao, CAI Wentao, ZHU Bo, CHEN Yan, GAO Xue, ZHANG Lupei, GAO Huijiang, LI Junya. Genome-wide Association Study of Slaughter Traits Based on Haplotype in Beef Cattle[J]. Acta Veterinaria et Zootechnica Sinica, 2022, 53(12): 4232-4243.

李宏伟, 徐凌洋, 王泽昭, 蔡文涛, 朱波, 陈燕, 高雪, 张路培, 高会江, 李俊雅

中国农业科学院北京畜牧兽医研究所牛遗传育种创新团队, 北京 100193

收稿日期：2022-06-02

基金项目：中国农业科学院重大科研任务“优质高效肉牛新品种培育”(CAAS-ZDXT2018006)

作者简介：李宏伟(1992-)，男，内蒙古包头人，博士生，主要从事群体遗传学和统计基因组学研究，E-mail: lihongweicaas@163.com.

通信作者：徐凌洋，主要从事群体遗传学和统计基因组学研究，E-mail: xulingyang@163.com; 李俊雅，主要从事肉牛遗传育种研究，E-mail: lijunya@caas.cn.

摘要：单倍型标记与数量性状基因座(quantitative trait loci，QTL)之间具有较强的连锁不平衡(linkage disequilibrium，LD)关系，在基因定位和因果突变鉴定方面具有较高的应用价值。为了评估单倍型标记在基因组研究中的作用，本研究在华西牛资源群体中，选取该群体于2008—2021年间屠宰的共计1 478头平均月龄为24个月的个体进行研究，其中公牛1 333头，母牛145头。利用770K高密度芯片数据，基于LD阈值(r²>0.3)及固定单核苷酸多态(single nucleotide polymorphism, SNP)个数(5个连续SNP)两种方法进行单倍型构建，分别采用单位点SNP标记和两种单倍型标记共3种标记，基于GCTA的混合线性模型(mixed linear model, MLM)，开展宰前活重(LW)和屠宰率(DP)等屠宰性状的全基因组关联分析(genome-wide association study，GWAS)，定位影响屠宰性状的显著(P < 0.05) SNPs、单倍型块和候选基因，同时比较3种标记的GWAS结果，评估3种标记的优劣。结果显示，3种标记在全基因组范围内共找到16个的显著SNPs及单倍型区域，主要分布于1、5、6、14、16、17和28号染色体上，同时鉴定到FAM184B、PPM1K、LCORL、RIMS2等10个与屠宰性状相关的候选基因，其中，基于SNP标记方法鉴定到的3个候选基因，在利用基于单倍型标记的方法中也鉴定到，且单倍型鉴定到的显著性位点或区域大多位于基因内部。在两种单倍型构建方法中，与基于固定SNP个数构建单倍型进行GWAS相比，基于LD阈值的构建方法鉴定到了更多候选基因。本研究结果表明，以单倍型开展GWAS可以综合考虑SNP位点间连锁关系，能较好地揭示复杂性状的遗传结构。

关键词：肉牛单倍型单核苷酸多态性全基因组关联分析连锁不平衡

Genome-wide Association Study of Slaughter Traits Based on Haplotype in Beef Cattle

LI Hongwei, XU Lingyang, WANG Zezhao, CAI Wentao, ZHU Bo, CHEN Yan, GAO Xue, ZHANG Lupei, GAO Huijiang, LI Junya

Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China

Corresponding author: XU Lingyang, E-mail: xulingyang@163.com; LI Junya, E-mail: lijunya@caas.cn.

标记
Marker

染色体
Chromosome

单倍型ID/SNP_ID
Haplotype ID/SNP_ID

物理位置/bp
Genome location

距离/bp^*
Distance

P_adj

候选基因
Candidate gene

SNP

BovineHD0600010474

37 854 698

28 260

8.08×10^-7

PPM1K

BovineHD0600010475

37 857 369

25 589

7.42×10^-7

PPM1K

Hapmap26261-BTC-034133

37 868 743

14 215

4.70×10^-7

PPM1K

BovineHD0600010716

38 704 872

32 566

2.73×10^-7

FAM184B

5_SNP_HAP

Block2020

38 653 201~38 672 761

within

1.00×10^-6

FAM184B

Block2022

38 698 886~38 716 298

26 082

4.02×10^-7

FAM184B

Block2053

39 364 589~39 371 150

372 477

6.35×10^-7

LCORL

Block2383

46 751 517~46 763 708

7 430 801

1.31×10^-6

PMP2

Block3121

62 740 639~62 782 571

within

3.56×10^-7

RIMS2

0.3LD

Block992

37 853 910~37 857 369

20 782

6.17×10^-7

PPM1K

Block1085

43 861 858~43 925 641

within

1.30×10^-11

PEX14

BovineHD0100023864

83 229 746

within

1.13×10^-8

EPHB3

BovineHD0500036083

75 823 709

within

4.66×10^-8

MPST

Hapmap26261-BTC-034133

37 868 743

14 215

9.37×10^-7

PPM1K

BovineHD0600010716

38 704 872

32 566

7.01×10^-7

FAM184B

BovineHD1400017455

62 769 117

within

2.24×10^-12

RIMS2

BovineHD1700005100

17 660 413

21 284

3.88×10^-7

SCOC

^*表示SNP与邻近基因的物理距离，within表示落入基因区域内。下同
^* represents the physical distance between SNP and adjacent genes, within means that it falls into the gene region. The same as below

[1]	RISCH N, MERIKANGAS K. The future of genetic studies of complex human diseases[J]. Science, 1996, 273(5281): 1516-1517. DOI:10.1126/science.273.5281.1516
[2]	COCKRAM J, WHITE J, ZULUAGA D L, et al. Genome-wide association mapping to candidate poly-morphism resolution in the unsequenced barley genome[J]. Proc Natl Acad Sci U S A, 2010, 107(50): 21611-21616. DOI:10.1073/pnas.1010179107
[3]	LIU N, XUE Y D, GUO Z Y, et al. Genome-wide association study identifies candidate genes for starch content regulation in maize kernels[J]. Front Plant Sci, 2016, 7: 1046.
[4]	REMINGTON D L, THORNSBERRY J M, MATSUOKA Y, et al. Structure of linkage disequilibrium and phenotypic associations in the maize genome[J]. Proc Natl Acad Sci U S A, 2001, 98(20): 11479-11484. DOI:10.1073/pnas.201394398
[5]	THORNSBERRY J M, GOODMAN M M, DOEBLEY J, et al. Dwarf8 polymorphisms associate with variation in flowering time[J]. Nat Genet, 2001, 28(3): 286-289. DOI:10.1038/90135
[6]	常天鹏, 夏江威, 宝金山, 等. 利用两种统计模型对中国肉用西门塔尔牛屠宰性状的全基因组关联分析[J]. 畜牧兽医学报, 2018, 49(4): 833-840. CHANG T P, XIA J W, BAO J S, et al. Genome-wide association study for carcass traits using two statistic models in Chinese simmental beef cattle[J]. Acta Veterinaria et Zootechnica Sinica, 2018, 49(4): 833-840. (in Chinese)
[7]	段星海, 安炳星, 杜丽丽, 等. 中国肉用西门塔尔牛生长曲线参数的全基因组关联分析[J]. 畜牧兽医学报, 2021, 52(5): 1267-1277. DUAN X H, AN B X, DU L L, et al. Genome-wide association study of growth curve parameters in Chinese simmental beef cattle[J]. Acta Veterinaria et Zootechnica Sinica, 2021, 52(5): 1267-1277. (in Chinese)
[8]	刘晓静, 刘璐, 王杰, 等. 鸡血糖性状的全基因组关联分析[J]. 畜牧兽医学报, 2020, 51(6): 1187-1195. LIU X J, LIU L, WANG J, et al. Genome-wide association study of chicken blood glucose traits using whole genome resequencing[J]. Acta Veterinaria et Zootechnica Sinica, 2020, 51(6): 1187-1195. (in Chinese)
[9]	米布农, 张立果, 乌日汉, 等. 戴瑞奶绵羊产奶性状的全基因组关联分析[J]. 畜牧兽医学报, 2021, 52(11): 3294-3303. MI B N, ZHANG L G, BAI U, et al. Genome-wide association study of milk production traits in dairy meade sheep[J]. Acta Veterinaria et Zootechnica Sinica, 2021, 52(11): 3294-3303. DOI:10.11843/j.issn.0366-6964.2021.011.030 (in Chinese)
[10]	XIANG R D, MACLEOD I M, DAETWYLER H D, et al. Genome-wide fine-mapping identifies pleiotropic and functional variants that predict many traits across global cattle populations[J]. Nat Commun, 2021, 12(1): 860. DOI:10.1038/s41467-021-21001-0
[11]	YU J M, PRESSOIR G, BRIGGS W H, et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness[J]. Nat Genet, 2006, 38(2): 203-208. DOI:10.1038/ng1702
[12]	LIPKA A E, KANDIANIS C B, HUDSON M E, et al. From association to prediction: statistical methods for the dissection and selection of complex traits in plants[J]. Curr Opin Plant Biol, 2015, 24: 110-118. DOI:10.1016/j.pbi.2015.02.010
[13]	ZHANG X Y, YAZAKI J, SUNDARESAN A, et al. Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis[J]. Cell, 2006, 126(6): 1189-1201. DOI:10.1016/j.cell.2006.08.003
[14]	PRYCE J E, BOLORMAA S, CHAMBERLAIN A J, et al. A validated genome-wide association study in 2 dairy cattle breeds for milk production and fertility traits using variable length haplotypes[J]. J Dairy Sci, 2010, 93(7): 3331-3345. DOI:10.3168/jds.2009-2893
[15]	TRÉGOUËT D A, KÖNIG I R, ERDMANN J, et al. Genome-wide haplotype association study identifies the SLC22A3-LPAL2-LPA gene cluster as a risk locus for coronary artery disease[J]. Nat Genet, 2009, 41(3): 283-285. DOI:10.1038/ng.314
[16]	BARDEL C, DANJEAN V, HUGOT J P, et al. On the use of haplotype phylogeny to detect disease susceptibility loci[J]. BMC Genet, 2005, 6(1): 24. DOI:10.1186/1471-2156-6-24
[17]	JIANG Y, SCHMIDT R H, REIF J C. Haplotype-based genome-wide prediction models exploit local epistatic interactions among markers[J]. G3 (Bethesda), 2018, 8(5): 1687-1699. DOI:10.1534/g3.117.300548
[18]	N'DIAYE A, HAILE J K, CORY A T, et al. Single marker and haplotype-based association analysis of semolina and pasta colour in elite durum wheat breeding lines using a high-density consensus map[J]. PLoS One, 2017, 12(10): e0170941.
[19]	MEUWISSEN T H E, GODDARD M E. Fine mapping of quantitative trait loci using linkage disequilibria with closely linked marker loci[J]. Genetics, 2000, 155(1): 421-430. DOI:10.1093/genetics/155.1.421
[20]	TEMPLETON A R. A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping or DNA sequencing.V.Analysis of case/control sampling designs: Alzheimer's disease and the apoprotein E locus[J]. Genetics, 1995, 140(1): 403-409. DOI:10.1093/genetics/140.1.403
[21]	CHEN H L, HAO Z Y, ZHAO Y F, et al. A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize[J]. BMC Genomics, 2020, 21(1): 151. DOI:10.1186/s12864-020-6552-x
[22]	CHEN S L, LIU F, WU W X, et al. A SNP-based GWAS and functional haplotype-based GWAS of flag leaf-related traits and their influence on the yield of bread wheat (Triticum aestivum L.)[J]. Theor Appl Genet, 2021, 134(12): 3895-3909. DOI:10.1007/s00122-021-03935-7
[23]	ZHANG H, SHEN L Y, XU Z C, et al. Haplotype-based genome-wide association studies for carcass and growth traits in chicken[J]. Poult Sci, 2020, 99(5): 2349-2361. DOI:10.1016/j.psj.2020.01.009
[24]	ARAUJO A C, CARNEIRO P L S, ALVARENGA A B, et al. Haplotype-based single-step GWAS for yearling temperament in American Angus cattle[J]. Genes, 2022, 13(1): 17.
[25]	WU P X, WANG K, ZHOU J, et al. A combined GWAS approach reveals key loci for socially-affected traits in Yorkshire pigs[J]. Commun Biol, 2021, 4(1): 891. DOI:10.1038/s42003-021-02416-3
[26]	GILMOUR A R, THOMPSON R, CULLIS B R. Average information REML: an efficient algorithm for variance parameter estimation in linear mixed models[J]. Biometrics, 1995, 51(4): 1440-1450. DOI:10.2307/2533274
[27]	PURCELL S, NEALE B, TODD-BROWN K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses[J]. Am J Hum Genet, 2007, 81(3): 559-575. DOI:10.1086/519795
[28]	BROWNING B L, ZHOU Y, BROWNING S R. A one-penny imputed genome from next-generation reference panels[J]. Am J Hum Genet, 2018, 103(3): 338-348. DOI:10.1016/j.ajhg.2018.07.015
[29]	HILL W G, ROBERTSON A. Linkage disequilibrium in finite populations[J]. Theor Appl Genet, 1968, 38(6): 226-231. DOI:10.1007/BF01245622
[30]	HILL W G. Estimation of effective population size from data on linkage disequilibrium[J]. Genet Res, 1981, 38(3): 209-216. DOI:10.1017/S0016672300020553
[31]	MEUWISSEN T H E, ODEGARD J, ANDERSEN-RANBERG I, et al. On the distance of genetic relationships and the accuracy of genomic prediction in pig breeding[J]. Genet Sel Evol, 2014, 46(1): 49. DOI:10.1186/1297-9686-46-49
[32]	YANG J, LEE S H, GODDARD M E, et al. GCTA: a tool for genome-wide complex trait analysis[J]. Am J Hum Genet, 2011, 88(1): 76-82. DOI:10.1016/j.ajhg.2010.11.011
[33]	HAYES B. Overview of statistical methods for genome-wide association studies (GWAS)[J]. Methods Mol Biol, 2013, 1019: 149-169.
[34]	GILMOUR A, GOGEL B, CULLIS B, et al. ASReml user guide release 3.0. VSN international Ltd[J]. 2009: https://www.scienceopen.com/document?vid=eb4bebf5-3221-45b0-93ee-e90ea245a4ba.
[35]	DUNCAN L E, RATANATHARATHORN A, AIELLO A E, et al. Largest GWAS of PTSD (N=20 070) yields genetic overlap with schizophrenia and sex differences in heritability[J]. Mol Psychiatry, 2018, 23(3): 666-673. DOI:10.1038/mp.2017.77
[36]	BOUAZIZ M, AMBROISE C, GUEDJ M. Accounting for population stratification in practice: a comparison of the main strategies dedicated to genome-wide association studies[J]. PLoS One, 2011, 6(12): e28845. DOI:10.1371/journal.pone.0028845
[37]	ARMSTRONG D L, ZIDOVETZKI R, ALARCÓN-RIQUELME M E, et al. GWAS identifies novel SLE susceptibility genes and explains the association of the HLA region[J]. Genes Immun, 2014, 15(6): 347-354. DOI:10.1038/gene.2014.23
[38]	BANI-FATEMI A, GRAFF A, ZAI C, et al. GWAS analysis of suicide attempt in schizophrenia: main genetic effect and interaction with early life trauma[J]. Neurosci Lett, 2016, 622: 102-106. DOI:10.1016/j.neulet.2016.04.043
[39]	LI S P, QIAN J, YANG Y, et al. GWAS identifies novel susceptibility loci on 6p21.32 and 21q21.3 for hepatocellular carcinoma in chronic hepatitis B virus carriers[J]. PLoS Genet, 2012, 8(7): e1002791. DOI:10.1371/journal.pgen.1002791
[40]	LIU Z X, BAI C Y, SHI L L, et al. Detection of selection signatures in South African Mutton Merino sheep using whole-genome sequencing data[J]. Anim Genet, 2022, 53(2): 224-229. DOI:10.1111/age.13173
[41]	XIA J W, FAN H Z, CHANG T P, et al. Searching for new loci and candidate genes for economically important traits through gene-based association analysis of Simmental cattle[J]. Sci Rep, 2017, 7(1): 42048. DOI:10.1038/srep42048
[42]	ABO-ISMAIL M K, LANSINK N, AKANNO E, et al. Development and validation of a small SNP panel for feed efficiency in beef cattle[J]. J Anim Sci, 2018, 96(2): 375-397. DOI:10.1093/jas/sky020
[43]	ZHANG W G, XU L Y, GAO H J, et al. Detection of candidate genes for growth and carcass traits using genome-wide association strategy in Chinese Simmental beef cattle[J]. Anim Prod Sci, 2016, 58(2): 224-233.
[44]	MIAO J, WANG X, BAO J, et al. Multimarker and rare variants genomewide association studies for bone weight in Simmental cattle[J]. J Anim Breed Genet, 2018, 135(3): 159-169. DOI:10.1111/jbg.12326
[45]	WANG X Q, MIAO J, XIA J W, et al. Identifying novel genes for carcass traits by testing G×E interaction through genome-wide meta-analysis in Chinese Simmental beef cattle[J]. Livest Sci, 2018, 212: 75-82. DOI:10.1016/j.livsci.2018.04.001
[46]	LIU Y, XU L, WANG Z Z, et al. Genomic prediction and association analysis with models including dominance effects for important traits in Chinese simmental beef cattle[J]. Animals (Basel), 2019, 9(12): 1055.
[47]	ALBAGHA O M E, WANI S E, VISCONTI M R, et al. Genome-wide association identifies three new susceptibility loci for Paget's disease of bone[J]. Nat Genet, 2011, 43(7): 685-689. DOI:10.1038/ng.845
[48]	LINDHOLM-PERRY A K, SEXTEN A K, KUEHN L A, et al. Association, effects and validation of polymorphisms within the NCAPG - LCORL locus located on BTA6 with feed intake, gain, meat and carcass traits in beef cattle[J]. BMC Genet, 2011, 12(1): 103. DOI:10.1186/1471-2156-12-103
[49]	KAMATH R A D, BENSON M D. EphB3 as a potential mediator of developmental and reparative osteogenesis[J]. Cells Tissues Organs, 2021. DOI:10.1159/000520369


畜牧兽医学报 2022, Vol. 53 Issue (12): 4232-4243. DOI: 10.11843/j.issn.0366-6964.2022.12.010	PDF