Genome-wide association study and conditional analysis reveal the importance of non-additive effects and ethnicity interaction for coronary heart disease

Cite this article

Yi DING, Jun ZHU. Genome-wide association study and conditional analysis reveal the importance of non-additive effects and ethnicity interaction for coronary heart disease[J]. Journal of Zhejiang University (Agriculture & Life Sciences), 2017, 43(1): 1-14. DOI:10.3785/j.issn.1008-9209.2016.12.301

Yi DING, Jun ZHU

Institute of Bioinformatics, Zhejiang University, Hangzhou 310058, China

Foundation item: National Science Foundation of China (31301248;31371250)

Corresponding author: ZHU Jun (http://orcid.org/0000-0002-8509-8304), E-mail: jzhu@zju.edu.cn

Biography: DING Yi (http://orcid.org/0000-0002-6601-2712), E-mail: dingyi920@hotmail.com

Received: 2016-12-30; Accepted: 2017-01-13; Published online: 2017-01-22

Summary: Coronary heart disease (CHD) has a complex genetic etiology, and its incidence is also affected by life styles, such as smoking and physical activity. Genome-wide association studies have identified multiple risk loci associated with CHD. However, the genetic risk variants reported to date account for only a small fraction of heritability. Moreover, the mechanism by which life styles affect CHD progression remains unclear. In this study, we aimed to explore the missing heritability by introducing non-additive effects into the genetic analysis model and to investigate the impacts of life styles on genetic architecture among different ethnic populations. A mixed linear model (MLM) was used to identify causal single nucleotide polymorphisms (SNPs) associated with CHD using data from the Multi-Ethnic Study of Atherosclerosis (MESA). A saturated model including additive, dominance, epistasis and gene-ethnicity interaction effects was adopted to fit the complex genetic architecture of CHD. Each of the six life styles (walk, read, transportation, exercise, TV, and smoke) was set as a cofactor to explore the change in genetic architecture after removing its influence. To facilitate personalized medicine, we also predicted genotypic effects for each locus. In total, 61 quantitative trait SNPs (QTSs) and 23 pairs of epistasis were detected as significant (P < 0.05). The heritability explained ranged from 64.58% to 74.94% across different models. We observed that additive effects (including both general and ethnicity-specific additive effects) contributed only a small portion of heritability, ranging from 3.45% to 5.72%. In contrast, non-additive effects accounted for a large part of the total heritability. Genetic effects attributed to life styles were analyzed by conditional analysis, which demonstrated that life styles had significant impacts on the genetic architecture. Meanwhile, the four ethnic groups exhibited notably distinct genetic patterns under the seven models, indicating genetic heterogeneity for CHD among the four races.

Keywords: coronary heart disease; genome-wide association study; conditional analysis; life styles; personalized medicine

Cardiovascular disease (CVD) is the leading cause of mortality and disability in America. Coronary heart disease (CHD), also known as coronary artery disease (CAD), atherosclerotic heart disease, or ischemic heart disease (IHD), is a major type of CVD. According to a report from the American Heart Association (AHA), over 2 150 Americans died of CVD every day, and CHD alone caused approximately 1 of every 6 deaths in the United States in 2010^[1]. Although familial aggregation provides evidence of a genetic background for CHD, with heritability estimates ranging from 30% to 60%^[2], genetic risk factors alone do not cause the disease. Environmental risk factors, such as cigarette smoking^[3] and physical inactivity^[4], are major risks for CHD as well. The AHA report shows that Americans with CVD are much more likely to be current or former smokers than Americans without CVD, and that Americans with intermediate or poor levels of physical activity are more likely to develop CVD. Thus, genetic predisposition together with environmental factors determines the development of CVD through an intricate interactive network.

Recent genome-wide association studies (GWAS) have identified multiple genetic loci associated with CHD or related traits. Previous studies have reported over 40 genes/regions associated with CHD risk at genome-wide significance (P_EW ＜ 5×10^-8)^[5]. However, the genetic risk variants reported to date account for only a small fraction of heritability. In a recent report by the Coronary Artery Disease Genome-wide Replication and Meta-analysis plus the Coronary Artery Disease Genetics (CARDIoGRAMplusC4D) consortium, it was estimated that 15 newly discovered and 31 previously identified loci, together with a further set of 104 likely independent single nucleotide polymorphisms (SNPs), explained only 10.6% of the genetic variance of CAD, suggesting that important genetic loci remain to be discovered^[6]. Neglecting dominance, epistasis and the interaction between genetic and environmental factors, as most studies do, may impair the ability to detect multiple genuine signals.

Herein, we report a two-step GWAS of CHD using the software QTXNetwork, based on data from the Multi-Ethnic Study of Atherosclerosis (MESA). The association between SNP genotypes and the target trait, the Framingham risk score (JAMA version), was analyzed with a mixed linear model (MLM), setting ethnicity as the environment and each of six life styles, including smoking status, as an individual cofactor. The aim of the study is to identify novel CHD-associated loci and to explore the complicated network of genetic and environmental factors across different ethnic groups.

1 Materials and methods

1.1 Participants

The MESA data were obtained from dbGaP (database of Genotypes and Phenotypes, http://www.ncbi.nlm.nih.gov/gap). MESA is a prospective population-based study focusing on the characterization of subclinical CVD and the risk factors that enable prediction of the progression of CVD. The study participants, from four ethnic groups, include 6 500 men and women in nearly equal numbers, who were aged 45-84 years and free of clinical CVD at baseline, and who were initially recruited in 2000 from six US communities: Baltimore, MD; Chicago, IL; Forsyth County, NC; Los Angeles County, CA; Northern Manhattan, NY; and St. Paul, MN. Of the recruited participants, 38% are European-American (E-A), 28% African-American (A-A), 22% Hispanic-American (H-A), and 12% Asian-American, predominantly of Chinese descent (C-A). MESA's enrollment and exclusion criteria have been described previously. All participants provided written informed consent as approved by all participating Institutional Review Boards. Details of the study design and cohort characteristics have been described elsewhere.

1.2 Genotyping and quality control (QC)

Non-duplicate, unrelated participants were selected for analysis. DNA was isolated from blood samples collected from participants at the time of enrollment using the Puregene DNA kit (Puregene, Gentra Systems, Minneapolis, MN, USA). Whole-genome genotyping was conducted in 2009 using the Affymetrix Human SNP array 6.0. SNP QC filters for inclusion in the analysis were based on the following criteria: (1) SNP call rate ＞ 95%; (2) subject call rate ＞ 95%; (3) polymorphic in at least one ethnic group (i.e., no monomorphic SNP), with no filtering of any SNP based on minor allele frequency because of allele frequency differences among the MESA ethnic groups; (4) heterozygosity ＜ 53%, as the uniform heterozygosity distribution was restricted to the range of 0%-53%, with removal of ＜ 0.01% of SNPs having heterozygosity ＞ 53%. The original genotypes of 866 435 SNPs from the 22 autosomes that met the QC criteria were used for the analysis.
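
The four QC criteria above can be sketched in Python as follows. This is a minimal illustration rather than the MESA pipeline: the function name `qc_filter`, the 0/1/2 genotype coding with NaN for missing calls, and the order in which the filters are applied (subjects first, then SNPs) are assumptions made here for the example.

```python
import numpy as np

def qc_filter(geno, groups, min_call=0.95, max_het=0.53):
    """Hypothetical QC sketch.

    geno   : (n_subjects, n_snps) array, 0/1/2 copies of one allele, NaN = no call
    groups : (n_subjects,) array of ethnic-group labels
    Returns boolean masks (keep_subjects, keep_snps).
    """
    geno = np.asarray(geno, dtype=float)
    groups = np.asarray(groups)
    called = ~np.isnan(geno)

    # Criterion (2): subject call rate > 95% (applied first; assumed order)
    keep_subj = called.mean(axis=1) > min_call
    geno, groups, called = geno[keep_subj], groups[keep_subj], called[keep_subj]

    # Criterion (1): SNP call rate > 95% among retained subjects
    n_called = called.sum(axis=0)
    snp_call = n_called / max(geno.shape[0], 1)

    # Criterion (4): heterozygosity < 53% among called genotypes
    het = (geno == 1).sum(axis=0) / np.maximum(n_called, 1)

    # Criterion (3): polymorphic in at least one ethnic group
    poly = np.zeros(geno.shape[1], dtype=bool)
    for g in np.unique(groups):
        sub = geno[groups == g]
        n = (~np.isnan(sub)).sum(axis=0)
        allele_count = np.nansum(sub, axis=0)
        poly |= (allele_count > 0) & (allele_count < 2 * n)

    keep_snp = (snp_call > min_call) & poly & (het < max_het)
    return keep_subj, keep_snp
```

Note that no minor-allele-frequency filter appears in the sketch, in line with criterion (3).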

1.3 Phenotype and covariate measurements

The dependent variable in this study is the Framingham risk score calculated from the JAMA Framingham risk survival model (Frjama), which is used to predict the risk of developing hard CHD within 10 years. Frjama was developed using multiple variables, including age, gender, hypertension stage, total cholesterol, high density lipoprotein (HDL) cholesterol, fasting glucose, diabetes mellitus and current smoking status. Six covariates were included in our study: (1) moderate walking (min/wk M-Su): walking to get places such as the bus, car, work, or into the store; (2) light leisure read (MET-min/wk M-Su): read, knit, sew, visit, do nothing, non-work recreational computer; (3) light transportation (min/wk M-Su): drive or ride in a car, ride the bus/subway, including travel to work; (4) total intentional exercise (MET-min/wk); (5) light leisure TV (min/wk M-Su): sit or recline and watch TV; and (6) pack-years of cigarette smoking. Each of the six life styles was set as an individual cofactor for conditional mapping. The conditional models are (1) Frjama|Walk, (2) Frjama|Read, (3) Frjama|Trans, (4) Frjama|Exer, (5) Frjama|TV, and (6) Frjama|Smoke.
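
To give a concrete sense of what a conditional phenotype such as Frjama|Walk means, the sketch below removes the linear influence of one cofactor from the trait by ordinary least squares. This is a simplified stand-in chosen for illustration, not the conditional mixed-model procedure implemented in QTXNetwork; the function name `condition_on_cofactor` and the toy numbers in the usage note are hypothetical.

```python
import numpy as np

def condition_on_cofactor(y, cofactor):
    """Return the trait with the linear effect of one cofactor removed.

    Simplified OLS analogue of a conditional phenotype y|c (assumption):
    regress y on an intercept and the cofactor, keep the residuals, and
    add back the trait mean so the scale stays comparable.
    """
    y = np.asarray(y, dtype=float)
    c = np.asarray(cofactor, dtype=float)
    design = np.column_stack([np.ones_like(c), c])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    return y - design @ beta + y.mean()
```

For example, `condition_on_cofactor(frjama, walk_minutes)` (both arrays hypothetical) returns values that are uncorrelated with `walk_minutes`, so any genetic signal that acted only through walking would no longer be detectable in the adjusted trait.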

1.4 Statistical analysis

The genetic model for the phenotypic value of the k-th individual in the h-th ethnic population (y_hk) can be expressed by the following MLM,

$ \begin{aligned} y_{hk} = {} & \mu + s_k + \sum\limits_i a_i x_{A_{ik}} + \sum\limits_i d_i x_{D_{ik}} + \sum\limits_{i < j} aa_{ij} x_{AA_{ijk}} + \sum\limits_{i < j} ad_{ij} x_{AD_{ijk}} + \sum\limits_{i < j} da_{ij} x_{DA_{ijk}} + \sum\limits_{i < j} dd_{ij} x_{DD_{ijk}} \\ & + e_h + \sum\limits_i ae_{ih} u_{AE_{ihk}} + \sum\limits_i de_{ih} u_{DE_{ihk}} + \sum\limits_{i < j} aae_{ijh} u_{AAE_{ijhk}} + \sum\limits_{i < j} ade_{ijh} u_{ADE_{ijhk}} \\ & + \sum\limits_{i < j} dae_{ijh} u_{DAE_{ijhk}} + \sum\limits_{i < j} dde_{ijh} u_{DDE_{ijhk}} + \varepsilon_{hk}. \end{aligned} $

where μ is the population mean; s_k is the fixed gender effect of the k-th individual (0 for female, 1 for male); a_i is the additive effect of the i-th locus with coefficient x_{A_ik} (1 for QQ, 0 for Qq, －1 for qq); d_i is the dominance effect of the i-th locus with coefficient x_{D_ik} (1 for Qq, 0 for QQ and qq); aa_ij, ad_ij, da_ij and dd_ij are the digenic epistasis effects between the i-th and j-th loci with coefficients x_{AA_ijk} (1 for QQ×QQ and qq×qq, －1 for QQ×qq and qq×QQ, and 0 for others), x_{AD_ijk} (1 for QQ×Qq, －1 for qq×Qq, and 0 for others), x_{DA_ijk} (1 for Qq×QQ, －1 for Qq×qq, and 0 for others) and x_{DD_ijk} (1 for Qq×Qq, and 0 for others), respectively; e_h is the effect of the h-th ethnic population (h ＝ 1 for E-A, 2 for C-A, 3 for A-A, 4 for H-A); ae_ih is the additive × race interaction effect of the i-th locus in the h-th ethnic population with coefficient u_{AE_ihk}; de_ih is the dominance × race interaction effect of the i-th locus in the h-th ethnic population with coefficient u_{DE_ihk}; aae_ijh, ade_ijh, dae_ijh and dde_ijh are the digenic epistasis × race interaction effects in the h-th ethnic population with coefficients u_{AAE_ijhk}, u_{ADE_ijhk}, u_{DAE_ijhk} and u_{DDE_ijhk}, respectively; and ε_hk is the residual effect of the k-th individual in the h-th ethnic population. In this model, the random variables are constrained to follow normal distributions with zero mean and variances δ_v².
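
The genotype coding defined above can be written compactly: each digenic epistasis coefficient equals the product of the corresponding single-locus codes, which reproduces every case listed in the definitions. The sketch below is only an illustration of that coding; the function names and the string representation of genotypes are assumptions introduced here.

```python
# Single-locus codes taken from the model definitions above.
ADDITIVE = {"QQ": 1, "Qq": 0, "qq": -1}
DOMINANCE = {"QQ": 0, "Qq": 1, "qq": 0}

def single_locus_codes(genotype):
    """Return (x_A, x_D) for one locus; genotype is 'QQ', 'Qq' or 'qq'."""
    return ADDITIVE[genotype], DOMINANCE[genotype]

def interaction_codes(genotype_i, genotype_j):
    """Return the four digenic epistasis coefficients for loci i and j."""
    a_i, d_i = single_locus_codes(genotype_i)
    a_j, d_j = single_locus_codes(genotype_j)
    return {
        "AA": a_i * a_j,  # 1 for QQxQQ and qqxqq, -1 for QQxqq and qqxQQ
        "AD": a_i * d_j,  # 1 for QQxQq, -1 for qqxQq
        "DA": d_i * a_j,  # 1 for QqxQQ, -1 for Qqxqq
        "DD": d_i * d_j,  # 1 for QqxQq
    }
```

A double heterozygote, `interaction_codes("Qq", "Qq")`, therefore contributes only to the dd term, whereas `interaction_codes("QQ", "qq")` contributes －1 to the aa term and 0 to the others.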

To reduce the computational burden of mixed model-based GWAS, a two-step strategy was employed to dissect the genetic architecture. First, we used the GMDR (generalized multifactor dimensionality reduction) module in the QTXNetwork software to scan 866 435 SNP markers, with two years of records from 5 336 subjects, for 1D-3D significant candidate SNP markers, and obtained 304 candidate SNPs. The quantitative trait SNP (QTS) mapping module in QTXNetwork (http://ibi.zju.edu.cn/software/QTXNetwork/) was then used to dissect the genetic architecture of the base model and the 6 conditional models. SNPs significantly associated with phenotypic variation were identified using 2 000 permutation tests to calculate the critical P-value for controlling the experiment-wise type I error. The QTS effects were predicted using the Markov Chain Monte Carlo method with 20 000 Gibbs sampler iterations. The correlation coefficient (${R_{\hat Y}} $) between predicted breeding values and phenotypic values was estimated for each model.
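
The idea of a permutation-based, experiment-wise critical value can be illustrated with a generic max-statistic scheme: the phenotype is shuffled repeatedly, the largest marker statistic is recorded each time, and an upper quantile of those maxima serves as the threshold. The text above does not describe how QTXNetwork computes its critical value, so the following is a sketch under that assumption; the function name, the use of absolute correlation as the test statistic, and the default `alpha` are all illustrative choices.

```python
import numpy as np

def permutation_threshold(y, X, n_perm=2000, alpha=0.05, seed=0):
    """Experiment-wise critical value of max |correlation| under permutation.

    y : (n,) phenotype; X : (n, m) marker codes (no constant columns assumed).
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float)

    # Centre and scale so that X.T @ y gives Pearson correlations directly.
    Xc = X - X.mean(axis=0)
    Xc /= np.linalg.norm(Xc, axis=0)
    yc = y - y.mean()
    yc /= np.linalg.norm(yc)

    max_stats = np.empty(n_perm)
    for b in range(n_perm):
        # Shuffling y breaks any genotype-phenotype association.
        max_stats[b] = np.abs(Xc.T @ rng.permutation(yc)).max()

    # A marker is declared significant if its statistic exceeds this value.
    return np.quantile(max_stats, 1 - alpha)
```

Because the maximum is taken over all markers in each permutation, the resulting threshold controls the chance of any false positive across the whole scan rather than per marker.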

2 Results

2.1 Estimated heritability and predicted genetic effects

We used the full genetic model, including additive, dominance, epistasis and ethnicity-specific effects, for our GWAS on CHD^[7]. A total of 61 QTSs and 24 pairs of epistasis were found to be significantly associated with the Framingham risk score. One QTS resided within a coding sequence, causing a missense mutation; 30 QTSs were located within intronic regions of genes, and the other QTSs were located near genes. The estimated heritability explained by the identified QTSs and epistasis under the seven models is listed in Table 1. The total heritability varied across models, with the base model exhibiting a heritability of 64.68%, model Frjama|Walk exhibiting the lowest value ($\hat h_{G + GE}^2 = 64.58\% $), and model Frjama|Smoke the highest value ($\hat h_{G + GE}^2 = 74.94\% $). We also observed that many QTSs and epistasis effects were ethnicity-specific (Fig. 1). In concordance with this finding, the ethnicity-specific effects (31.18%-43.99%) contributed more variation than the main effects (26.78%-33.50%), indicating a high specificity of genetic background among the different ethnic populations.

Table 1 Estimated heritability of significant QTSs for base model of Frjama and 6 conditional models excluding cofactors

Model	h_A²(%)	h_D²(%)	h_I²(%)	h_AE²(%)	h_DE²(%)	h_IE²(%)	h_T²(%)
Frjama	3.09	17.28	13.13	1.87	25.11	4.20	64.68
Frjama|Trans	2.88	16.83	11.36	2.05	31.88	4.34	69.34
Frjama|Walk	3.04	16.37	13.13	1.89	25.97	4.18	64.58
Frjama|TV	3.27	18.18	9.50	2.40	33.09	6.79	73.23
Frjama|Read	3.48	19.32	10.39	1.41	29.62	4.63	68.85
Frjama|Exer	2.24	18.72	5.82	1.21	32.84	5.90	66.73
Frjama|Smoke	3.12	16.72	11.11	2.60	32.23	9.16	74.94
Model: Frjama＝10-year hard CHD risk per circ (Frjama) without setting a cofactor, Frjama|Trans＝Frjama with light transportation as a cofactor, Frjama|Walk＝Frjama with moderate walking status as a cofactor, Frjama|TV＝Frjama with TV watching status as a cofactor, Frjama|Read＝Frjama with reading status as a cofactor, Frjama|Exer＝Frjama with total intentional exercising status as a cofactor, Frjama|Smoke＝Frjama with smoking status as a cofactor. Heritability: h_A²＝heritability of additive effects, h_D²＝heritability of dominance effects, h_I²＝heritability of epistasis effects (AA, AD, DA, and DD), h_AE²＝heritability of environment-specific additive effects, h_DE²＝heritability of environment-specific dominance effects, h_IE²＝heritability of environment-specific epistasis effects (AAE, ADE, DAE, and DDE), h_T²＝total heritability.
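
The ranges quoted in the text (main effects, ethnicity-specific effects, and combined additive effects) follow directly from the components in Table 1. The short script below, written for this purpose and not part of the original analysis, recomputes them from the tabulated values; the names `TABLE1` and `summarize` are illustrative.

```python
# Variance components (%) transcribed from Table 1, in the order
# (h_A, h_D, h_I, h_AE, h_DE, h_IE, h_T).
TABLE1 = {
    "Frjama":       (3.09, 17.28, 13.13, 1.87, 25.11, 4.20, 64.68),
    "Frjama|Trans": (2.88, 16.83, 11.36, 2.05, 31.88, 4.34, 69.34),
    "Frjama|Walk":  (3.04, 16.37, 13.13, 1.89, 25.97, 4.18, 64.58),
    "Frjama|TV":    (3.27, 18.18, 9.50, 2.40, 33.09, 6.79, 73.23),
    "Frjama|Read":  (3.48, 19.32, 10.39, 1.41, 29.62, 4.63, 68.85),
    "Frjama|Exer":  (2.24, 18.72, 5.82, 1.21, 32.84, 5.90, 66.73),
    "Frjama|Smoke": (3.12, 16.72, 11.11, 2.60, 32.23, 9.16, 74.94),
}

def summarize(row):
    """Group the six components of one model into the sums used in the text."""
    h_a, h_d, h_i, h_ae, h_de, h_ie, h_t = row
    return {
        "main": round(h_a + h_d + h_i, 2),                  # general effects
        "ethnicity_specific": round(h_ae + h_de + h_ie, 2),
        "additive": round(h_a + h_ae, 2),                   # general + ethnic additive
        "total_from_parts": round(sum(row[:6]), 2),
        "total_reported": h_t,
    }

SUMMARY = {model: summarize(row) for model, row in TABLE1.items()}

for model, parts in SUMMARY.items():
    print(model, parts)
```

The six components sum to the reported total for every model, and the combined additive share stays below 6% throughout, which is the basis for the statement that non-additive effects account for most of the explained heritability.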

Fig. 1 GxG plot of detected QTSs for the base model of Frjama and six conditional models excluding cofactors

Model: defined as in Table 1. The left axis shows the QTS IDs (chromosome_SNP_alleles). Circle＝QTS with additive effect; square＝QTS with dominance effect; line between two QTSs＝epistasis effect. Red＝QTS with general effects for the four race groups; green＝QTS with ethnicity-specific effects; blue＝QTS with both general and ethnicity-specific effects; black＝QTS with significant epistasis effects but without detected individual effects.

Comparing the base model with the conditional models, we found that the genetic architecture of CHD changed greatly after conditioning on TV, smoke, transportation and exercise, but changed little after conditioning on walk and read. This indicates that TV, smoke, transportation and exercise could have large impacts on CHD, whereas the effects of walk and read might be limited. The effects can be further separated into two groups: the first contains QTS effects detected in the base model and in certain conditional models (Table 2); the second contains QTS effects detected only in conditional models, indicating that these effects were suppressed by life styles (Table 3).

Table 2 Predicted QTS effects with standard error, significance and heritability for Frjama in the base model and 6 conditional models (－log₁₀(P_EW-value) > 5)

Chr_SNP_Allele	Gene	Effect	Predict	Standard error	-lgP	h²(%)	Trans	Walk	TV	Read	Exer	Smoke
		a	-0.004 0	0.0008	6.6	0.13	×	×	√	×	√	√
		d	-0.034 7	0.001 3	144.2	9.62	×	×	+	+	+	+
1_rs17116652_C/T	LPAR3(intronic)	de₁	-0.029 1	0.0021	44.0	22.53	×	×	-	-	×	-
		de₃	-0.028 6	0.0023	33.5	22.53	×	×	-	×	×	-
		de₄	-0.033 1	0.0027	34.3	22.53	×	×	-	×	×	-
2_rs317258_G/C	GCFC2 (318 kb 5’)	a	-0.0053	0.0008	9.7	0.23	×	×	×	×	×	×
2_rs12621362_T/C	C2orf51 (20 kb 5’)	a	0.0040	0.0008	6.3	0.13	√	√	√	×	√	√
2_rs12621362_T/C	C2orf51 (20 kb 5’)	d	0.0082	0.001 3	9.8	0.53	×	×	×	×	×	×
2_rs905628_G/T	CNTNAP5(intronic)	a	-0.005 0	0.0009	6.8	0.20	√	×	√	√	√	√
2_rs905628_G/T	CNTNAP5(intronic)	d	0.007 6	0.0009	14.9	0.46	√	×	√	×	√	√
2_rs6706330_A/G	OSBPL6(intronic)	d	0.0064	0.0009	11.1	0.33	√	×	×	×	×	√
3_rs2455801_T/C	ANKRD28(6.9 kb 3’)	d	0.005 7	0.001 0	8.1	0.26	×	×	√	√	√	×
3_rs711689_T/A	ZNF385D(intronic)	ae₃	0.007 8	0.001 5	6.3	0.23	√	×	×	×	×	√
3_rs2675434_T/A	U6(6.4 kb 3’)	d	-0.005 4	0.001 0	7.8	0.23	√	×	√	×	×	×
5_rs10805911_T/G	WDR41(intronic)	d	0.005 7	0.001 1	7.3	0.26	×	×	√	√	√	×
6_rs16888481_T/A	RP1-182D15.2(48 kb 5’)	d	0.0069	0.001 3	6.7	0.38	√	×	√	√	×	√
7_rs16876162_A/G	6.7 kb 5’ of PER4	d	-0.015 6	0.001 0	54.6	1.96	×	×	×	×	√	√
7_rs6974603_G/A	WBSCR17(intronic)	d	0.0068	0.001 0	10.8	0.38	-	×	-	×	×	-
		d	-0.009 5	0.001 0	20.3	0.72	×	×	×	×	×	×
8_rs6996584_C/G	SAMD12(intronic)	ae₂	-0.0146	0.002 4	8.7	0.57	×	×	√	√	√	×
		de₃	0.010 9	0.002 1	6.4	0.40	√	√	√	√	√	√
9_rs10965365_G/T	DMRTA1 (63 kb 3’)	a	0.005 4	0.0009	8.9	0.23	×	×	×	×	×	×
9_rs1930368_C/T	602 kb 5’ of TLE4	a	-0.007 2	0.0009	14.1	0.42	×	×	×	×	×	×
		d	-0.010 4	0.001 0	26.1	0.86	×	×	√	√	-	×
		ae₁	0.0098	0.001 4	11.3	0.26	×	×	√	×	×	√
		de₄	-0.0140	0.002 1	11.1	0.39	×	×	√	×	×	×
10_rs11250700_G/T	ADARB2(166 kb 3’)	d	-0.0061	0.0009	9.9	0.30	×	×	×	×	√	×
		ae₁	0.010 0	0.001 2	15.6	0.38	×	×	√	×	×	√
10_rs10795470_T/A	ST8SIA6 (20 kb 3’)	ae₄	-0.009 5	0.001 6	8.1	0.38	×	×	√	×	×	√
		de₁	0.010 1	0.002 0	6.6	0.36	×	×	×	×	×	√
16_rs8048681_T/C	WWOX(intronic)	de₃	0.012 1	0.002 0	8.9	0.76	×	×	√	×	×	×
19_rs12979519_T/A	ZNF552(intronic)	a	-0.010 9	0.0008	44.5	0.96	√	×	√	×	√	√
2_rs317258_G/C×8_rs6996584_C/G	GCFC2 (318 kb 5’)×SAMD12(intronic)	aa	0.0084	0.001 1	13.2	0.56	×	×	×	×	×	×
2_rs317258_G/C×8_rs6996584_C/G	GCFC2 (318 kb 5’)×SAMD12(intronic)	aae₁	-0.011 3	0.001 9	8.4	0.70	×	×	×	×	×	×
2_rs317258_G/C×8_rs6996584_C/G	GCFC2 (318 kb 5’)×SAMD12(intronic)	aae₂	0.0149	0.002 9	6.6	0.70	×	×	√	×	√	×
5_rs10805911_T/G×9_rs1930368_C/T	WDR41(intronic)×602 kb 5’ of TLE4	dd	-0.0106	0.0015	11.6	0.91	×	×	√	×	×	×
5_rs10805911_T/G×9_rs1930368_C/T	WDR41(intronic)×602 kb 5’ of TLE4	dde₂	-0.0192	0.003 9	6.1	0.82	√	√	√	√	√	√
		ad	0.0091	0.001 1	14.8	0.66	×	×	√	×	×	+
6_rs9491965_G/A×9_rs1930368_C/T	PTPRK(intronic)×602 kb 5’ of TLE4	dd	0.0097	0.001 8	6.8	0.76	×	×	√	×	×	×
		aae₁	-0.011 1	0.001 7	9.6	0.43	×	×	√	×	×	×
		aa	0.0086	0.001 0	15.6	0.59	√	×	√	×	√	√
7_rs16876162_A/G×19_rs12979519_T/A	6.7 kb 5’ of PER4×ZNF552(intronic)	da	0.011 9	0.001 1	24.6	1.13	√	×	√	×	√	√
		dd	0.0189	0.002 0	19.9	2.85	√	×	√	×	√	√
9_rs1930368_C/T×12_rs7134443_C/T	602 kb 5’ of TLE4×RP11-15519.1 (82 kb 3’)	dd	0.0188	0.001 5	33.5	2.83	√	×	√	√	√	√
10_rs4748631_G/A×20_rs7273557_C/G	121 kb 3’ of SFMBT2×62 kb 3’ of BMP7	ad	-0.0090	0.001 5	8.0	0.60	√	×	√	×	×	√
Genetic effect: a=additive, d=dominance, de₁=E-A specific dominance, de₃=A-A specific dominance, de₄=H-A specific dominance, ae₃=A-A specific additive, ae₂=C-A specific additive, ae₁=E-A specific additive, ae₄=H-A specific additive, aa=additive × additive epistasis, aae₁=E-A specific additive × additive epistasis, aae₂=C-A specific additive × additive epistasis, dd=dominance × dominance epistasis, dde₂=C-A specific dominance × dominance epistasis, ad=additive × dominance epistasis, da=dominance × additive epistasis; -lgP＝minus log₁₀(P_EW-value); h²＝heritability. Cofactor: Trans＝light transportation minutes per week, Walk＝moderate walking minutes per week, TV＝light leisure TV watching minutes per week, Read＝light leisure reading minutes per week, Exer＝total intentional exercise per week, Smoke＝average number of cigars smoked per day. Impact on the Frjama: ×＝not affected by cofactor, √＝caused by cofactor.

Table 3 Predicted QTS effects with standard error, significance and heritability for Frjama only detected in conditioned models (－log₁₀(P_EW-value) > 6)

Chr_SNP_Allele	Gene	Effect	Predict	Standard error	-lgP	h²(%)
Frjama|Trans
1_rs6656939_G/C	LY9(intronic)	d	0.006 1	0.001 0	9.2	0.27
2_rs6754422_T/C	U7(413 kb 3’)	d	0.006 2	0.001 0	10.2	0.28
2_rs893799_G/C	U6 (3.9 kb 5’)	d	-0.006 8	0.001 0	11.6	0.33
6_rs9491965_G/A	PTPRK(intronic)	a	-0.006 6	0.000 8	16.7	0.31
		a	-0.004 5	0.000 9	6.2	0.14
7_rs16876162_A/G	6.7 kb 5’ of PER4	ae₁	-0.006 9	0.001 4	6.3	0.26
		ae₃	0.009 9	0.001 9	7.1	0.26
9_rs1930368_C/T	602 kb 5’ of TLE4	de₁	0.007 3	0.001 5	6.1	0.68
9_rs1930368_C/T	602 kb 5’ of TLE4	de₃	0.0102	0.001 9	7.0	0.68
		a	-0.005 8	0.000 8	11.7	0.24
10_rs10795470_T/A	ST8SIA6(20 kb 3’)	d	-0.007 1	0.001 1	9.8	0.36
		de₄	-0.0156	0.002 5	9.1	0.77
15_rs10220820_C/A	AKAP13 (intronic)	ae₁	-0.006 7	0.001 2	7.3	0.21
16_rs8048681_T/C	WWOX(intronic)	de₂	-0.020 9	0.003 5	8.8	1.15
19_rs9304805_A/G	ZNF552(9.8 kb 3’)	a	-0.007 8	0.000 8	23.8	0.43
19_rs9304805_A/G	ZNF552(9.8 kb 3’)	ae₃	-0.007 6	0.001 5	6.5	0.10
2_rs6754422_T/C × 16_rs8048681_T/C	U7(413 kb 3’)× WWOX(intronic)	ade₄	-0.0159	0.003 1	6.4	0.45
2_rs4675536_A/G × 10_rs1962663_A/C	PARD3B (1.1 kb 3’)× 293 kb 3’ of KLF6	dd	0.008 5	0.001 5	8.2	0.51
5_rs10805911_T/G × 9_rs1930368_C/T	WDR41 (intronic) × 602 kb 5’ of TLE4	aae₃	0.013 1	0.002 4	7.3	0.30
7_rs16876162_A/G × 19_rs9304805_A/G	6.7 kb 5’ of PER4×ZNF552 (9.8 kb 3’)	aa	0.007 7	0.001 0	13.0	0.42
7_rs16876162_A/G × 19_rs9304805_A/G	6.7 kb 5’ of PER4×ZNF552 (9.8 kb 3’)	dd	0.0125	0.002 0	9.3	1.11
7_rs17520767_G/A × 17_rs11078840_A/G	AF104455.1 (47 kb 3’) × MYH13	dd	0.0141	0.002 5	8.0	1.42
10_rs10795470_T/A × 15_rs10220820_C/A	ST8SIA6 (20 kb 3’) × AKAP13 (intronic)	ad	0.008 8	0.001 4	9.8	0.56
12_rs7315299_T/A × 15_rs10220820_C/A	TRHDE (intronic) × AKAP13 (intronic)	aa	0.007 6	0.001 0	13.9	0.41
12_rs7315299_T/A × 15_rs10220820_C/A	TRHDE (intronic) × AKAP13 (intronic)	ad	-0.007 0	0.001 4	6.5	0.35
Frjama|Walk
6_rs9491965_G/A	PTPRK (intronic)	a	-0.003 9	0.000 8	6.0	0.12
7_rs16876162_A/G × 19_rs12979519_T/A	6.7 kb 5’ of PER4 × ZNF552 (intronic)	ad	0.009 1	0.001 8	6.2	0.66
Frjama|TV
1_rs6656939_G/C	LY9 (intronic)	d	0.005 8	0.001 0	8.3	0.21
2_rs6754422_T/C	U7 (413 kb 3’)	d	0.006 5	0.001 0	11.0	0.26
3_rs2455801_T/C	ANKRD28 (6.9 kb 3’)	a	-0.006 9	0.000 9	13.7	0.30
3_rs2455801_T/C	ANKRD28 (6.9 kb 3’)	ae₂	-0.0187	0.002 5	13.1	0.70
3_rs711689_T/A	ZNF385D (intronic)	a	-0.004 8	0.000 9	7.8	0.14
3_rs711689_T/A	ZNF385D (intronic)	ae₄	-0.0105	0.001 8	7.9	0.34
4_rs1456759_T/C	401 kb 3’ of PCDH7	a	0.004 5	0.000 8	7.1	0.13
5_rs10805911_T/G	WDR41 (intronic)	ae₃	0.0100	0.001 7	8.5	0.26
6_rs16888481_T/A	55 kb 3’ of C6orf208	a	—0.004 3	0.000 8	7.7	0.12
7_rs16876162_A/G	6.7 kb 5’ of PER4	ae₁	—0.008 8	0.001 4	10.0	0.26
7_rs16876162_A/G	6.7 kb 5’ of PER4	ae₃	0.009 3	0.001 8	6.3	0.26
7_rs17520767_G/A	249 kb 3’ of ACTR3B	d	0.007 7	0.001 1	12.2	0.37
9_rs17482181_G/C	PIP5K1B (intronic)	ae₃	0.0132	0.001 7	14.2	0.40
15_rs11638820_T/C	TMOD2 (intronic)	d	0.005 9	0.000 9	9.4	0.22
19_rs9304805_A/G	ZNF552 (9.8 kb 3’)	a	-0.012 5	0.000 8	59.8	0.98
19_rs9304805_A/G	ZNF552 (9.8 kb 3’)	d	-0.008 0	0.001 3	8.6	0.40
2_rs399420_C/T×9_rs17482181_G/C	1.1 Mb 5’ of MYADML × PIP5K1B (intronic)	aa	-0.007 1	0.001 0	12.2	0.31
		aae₃	-0.022 5	0.001 8	35.6	1.05
		aae₄	0.0129	0.002 1	8.9	1.05
		dae₃	-0.025 2	0.005 1	6.2	0.99
		aa	0.005 4	0.001 0	6.7	0.19
3_rs2455801_T/C×6_rs16888481_T/A	ANKRD28 (6.9 kb 3’) × 55 kb 3’ of C6orf208	dd	0.011 1	0.001 9	8.2	0.77
		dde₃	-0.028 1	0.003 3	17.1	2.09
3_rs2166759_A/G × 7_rs6974603_G/A	ANKRD28 (intronic) × WBSCR17 (intronic)	dd	0.008 0	0.001 5	7.3	0.40
3_rs711689_T/A × 9_rs1930368_C/T	ZNF385D (intronic)×RP11-165H23.1 (166 kb 3’)	ad	0.008 0	0.001 2	10.3	0.40
5_rs10805911_T/G × 5_rs11241477_C/A	WDR41 (intronic) × 72 kb 3’ of LOC100133050	aae₃	-0.0100	0.001 8	7.6	0.31
		aa	0.008 1	0.001 0	14.4	0.41
7_rs16876162_A/G × 19_rs9304805_A/G	6.7 kb 5’ of PER4 × ZNF552 (9.8 kb 3’)	ad	0.0107	0.001 1	20.7	0.72
		dd	0.018 5	0.002 0	19.6	2.15
7_rs17520767_G/A × 17_rs11078840_A/G	AF104455.1 (47 kb 3’) × MYH13	ade	-0.008 6	0.001 6	7.5	0.12
15_rs900825_G/C × 21_rs1009207_G/A	FAM81A × KCNJ15(intronic)	ad	-0.006 6	0.001 3	6.3	0.27
Frjama|Read
1_rs10863911_C/T	21 kb 5’ of RD3	d	0.005 2	0.001 0	6.7	0.19
6_rs9491965_G/A	PTPRK (intronic)	a	-0.004 1	0.000 8	6.7	0.12
7_rs16876162_A/G	6.7 kb 5’ of PER4	a	-0.005 0	0.000 9	7.4	0.18
10_rs10795470_T/A	ST8SIA6 (20 kb 3’)	a	-0.004 4	0.000 8	6.9	0.14
12_rs2177976_G/C	PDZRN4(intronic)	a	0.006 8	0.000 8	18.2	0.33
12_rs821858_A/G	ANO4(intronic)	a	-0.004 4	0.000 9	6.1	0.14
5_rs10805911_T/G × 8_rs6996584_C/G	WDR41(intronic) × SAMD12 (intronic)	aae₄	0.012 0	0.002 3	6.6	0.60
7_rs16876162_A/G × 19_rs12979519_T/A	6.7 kb 5’ of PER4 × ZNF552 (intronic)	ad	0.009 6	0.001 8	6.8	0.65
Frjama|Exer
1_rs6656939_G/C	LY9(intronic)	d	0.006 6	0.001 0	10.3	0.33
1_rs10863911_C/T	RP11-359E8.3 (21 kb 3’)	d	0.005 0	0.001 0	6.3	0.19
2_rs6754422_T/C	U7 (413 kb 3’)	d	0.005 8	0.001 0	8.9	0.26
6_rs1539002_C/T	CDYL (intronic)	a	0.004 7	0.000 9	7.4	0.17
6_rs1539002_C/T	CDYL (intronic)	d	0.007 6	0.001 1	11.5	0.43
6_rs9491965_G/A	PTPRK (intronic)	a	-0.005 1	0.000 8	10.2	0.20
15_rs2034856_G/A	TMOD2 (intronic)	d	0.007 4	0.000 9	14.1	0.41
16_rs8048681_T/C	WWOX (intronic)	de₂	-0.020 3	0.003 5	8.2	1.25
19_rs9304805_A/G	ZNF552(9.8 kb 3’)	a	-0.004 1	0.000 8	7.1	0.13
2_rs6754422_T/C × 16_rs8048681_T/C	U7(413 kb 3’) × WWOX (intronic)	ade₄	-0.0172	0.003 2	7.2	0.84
5_rs10805911_T/G × 9_rs1930368_C/T	WDR41 (intronic) × RP11-165H23.1 (166 kb 3’)	aae₃	0.012 2	0.002 4	6.4	0.28
Frjama|Smoke
1_rs6656939_G/C	LY9(intronic)	d	0.006 4	0.001 0	10.1	0.25
2_rs317258_G/C	GCFC2(318 kb 5’)	ae₁	0.007 8	0.001 4	7.6	0.22
		a	-0.0124	0.000 9	40.1	0.90
		d	0.008 2	0.000 9	17.4	0.40
2_rs6754422_T/C	U7 (413 kb 3’)	ae₁	0.0169	0.0014	30.8	0.93
		ae₂	-0.0127	0.002 5	6.3	0.93
		ae₃	-0.0116	0.0019	8.9	0.93
3_rs2455801_T/C	ANKRD28 (6.9 kb 3’)	a	-0.007 3	0.000 9	15.1	0.32
		ae₂	-0.0186	0.002 5	13.1	0.68
		de₁	-0.008 6	0.001 5	7.9	0.40
		de₃	0.0119	0.002 0	8.9	0.40
6_rs9491965_G/A	PTPRK (intronic)	a	-0.008 0	0.000 8	24.3	0.38
9_rs1930368_C/T	RP11-165H23.1 (166 kb 3’)	de₁	0.007 8	0.001 5	6.8	0.96
9_rs1930368_C/T	RP11-165H23.1 (166 kb 3’)	de₃	0.0124	0.0019	9.9	0.96
11_rs4357719_T/C	OR52E6(missense)	a	0.005 5	0.000 9	9.1	0.18
16_rs8048681_T/C	WWOX(intronic)	de₂	-0.022 5	0.003 4	10.2	1.06
19_rs9304805_A/G	ZNF552 (9.8 kb 3’)	a	-0.004 5	0.000 8	8.6	0.12
2_rs399420_C/T × 2_rs6754422_T/C	AC012593.1× U7 (413 kb 3’)	aa	0.0123	0.001 1	30.2	0.90
		da	0.0173	0.0018	20.3	1.77
		aae₁	-0.0141	0.0018	13.8	0.49
		dae₁	-0.0145	0.002 3	9.6	0.31
2_rs6754422_T/C × 16_rs8048681_T/C	U7(413 kb 3’)× WWOX(intronic)	ade₄	-0.0199	0.003 2	9.5	1.03
2_rs4675536_A/G × 10_rs1962663_A/C	PARD3B (1.1 kb 3’)× RP11-482E14.1 (4.2 kb 3’)	dd	0.008 0	0.001 5	7.5	0.38
3_rs2455801_T/C × 6_rs16888481_T/A	ANKRD28 (6.9 kb 3’)× RP1-182D15.2 (48 kb 5’)	aa	0.006 3	0.001 0	8.6	0.23
		dd	0.0101	0.0019	6.9	0.60
		aae₂	0.0159	0.002 8	7.8	0.41
		dde₃	-0.032 1	0.003 3	22.1	2.64
5_rs10805911_T/G × 9_rs1930368_C/T	WDR41 (intronic) × RP11-165H23.1 (166 kb 3’)	da	-0.007 0	0.0014	6.0	0.29
5_rs10805911_T/G × 9_rs1930368_C/T	WDR41 (intronic) × RP11-165H23.1 (166 kb 3’)	aae₃	0.0134	0.002 4	7.6	0.32
6_rs9491965_G/A × 9_rs1930368_C/T	PTPRK (intronic) × RP11-165H23.1 (166 kb 3’)	ade₄	0.0135	0.002 3	8.4	0.56
7_rs17520767_G/A × 17_rs11078840_A/G	AF104455.1 (47 kb 3’)× MYH13	dd	0.0142	0.002 5	8.2	1.20
12_rs7315299_T/A × 15_rs10220820_C/A	TRHDE (intronic) × AKAP13 (intronic)	aa	0.007 2	0.001 0	12.4	0.30
12_rs7315299_T/A × 15_rs10220820_C/A	TRHDE (intronic) × AKAP13 (intronic)	ad	-0.006 8	0.0014	6.1	0.27
Genetic effect: d=dominance, a=additive, ae₁=E-A specific additive, ae₃=A-A specific additive, de₁=E-A specific dominance, de₃=A-A specific dominance, de₄=H-A specific dominance, de₂=C-A specific dominance, ade₄=H-A specific additive × dominance epistasis, dd=dominance × dominance epistasis, aae₃=A-A specific additive × additive epistasis, aa=additive × additive epistasis, ad=additive × dominance epistasis, ae₂=C-A specific additive, ae₄=H-A specific additive, aae₄=H-A specific additive × additive epistasis, dae₃=A-A specific dominance × additive epistasis, dde₃=A-A specific dominance × dominance epistasis, da=dominance × additive epistasis, dae₁=E-A specific dominance × additive epistasis, aae₁=E-A specific additive × additive epistasis, aae₂=C-A specific additive × additive epistasis; -lgP＝minus log₁₀(P_EW-value); h²＝heritability.

2.2 Effects detected under the base model and certain condition models

Only four single effects and two pairs of epistasis effects remained unchanged across the base model and the six conditional models. The main genetic effects unaltered by life styles were the additive effect of rs317258 (318 kb 5' of GCFC2), the dominance effect of rs12621362 (20 kb 5' of C2orf51), the dominance effect of rs6996584 (SAMD12), the additive effects of rs10965365 (63 kb 3' of DMRTA1) and rs1930368 (166 kb 3' of RP11-165H23.1), and the additive × additive effect of rs317258 (318 kb 5' of GCFC2) × rs6996584 (SAMD12). Their robustness across different models and races indicates that these loci play fundamental roles in the genetic architecture of CHD that are not affected by the six life styles.

The main dominance effect of rs6974603 (WBSCR17) and the E-A, A-A and H-A specific dominance effects of rs17116652 (LPAR3) could also be detected in all models. However, their magnitudes fluctuated across models, indicating that although these effects are not entirely attributable to the cofactors, they are still susceptible to certain cofactors to some extent. For instance, after removing the effects of transportation, TV, and smoke, the main additive effects of rs6974603 (WBSCR17) decreased, indicating that people homozygous for the major allele (G/G) at this locus can benefit from less frequent driving, TV watching or smoking. WBSCR17 was also confirmed to be associated with type 2 diabetes in African Americans by the GENNID study^[8].

2.3 Genes detected in the base model but not in certain condition models

The A-A specific dominance effect (de₃＝0.010 9) of locus rs6996584 (SAMD12) was lost in all conditional models, indicating that the effect was caused by all cofactors, and that giving up the corresponding habits could help to reduce the CHD risk for A-A individuals who are heterozygous (C/G) at this locus. The gene SAMD12 has been detected before in a GWAS of carotid artery intima-media thickness^[9], which shares a similar mechanism with CHD progression.

Other effects were lost only in some condition models, such as the dominance effect of rs2455801, located 6.9 kb 3' of ANKRD28, a gene that regulates focal adhesion and cell migration through the ANKRD28-DOCK180 interaction^[10]. This effect remained unchanged after removing the effects of transportation, walk, and smoke, but disappeared after conditioning on TV, read, and exercise. This suggests that, in terms of CHD, rs2455801 responds to TV, read, and exercise but not to the other three cofactors.

2.4 Effects detected in certain condition models but not in the base model

Some effects were not detected in the base model but appeared in the condition models. Only a few effects were detected after conditioning on walk, read, and exercise (1 locus with single effects and 1 pair of epistasis for |Walk; 6 loci with single effects and 2 pairs of epistasis for |Read; 8 loci with single effects and 2 pairs of epistasis for |Exer), whereas more effects appeared under |Trans, |TV, and |Smoke, suggesting that these three activities exert larger suppressive effects on the expression of genetic components for CHD.

Genetic effects responded differently to the corresponding cofactors. For example, the C-A specific dominance effect of rs8048681 (WWOX) was detected in only three condition models (|Smoke, |Trans, and |Exer), but not in the other three. In a GWAS of gene-smoking interactions, researchers also found WWOX to be associated with coronary artery calcification in smokers but not in non-smokers^[11], supporting our result that the expression of WWOX is susceptible to cigarette smoking.
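
The grouping of effects used in Sections 2.2 to 2.4 can be illustrated with a small sketch. It is not the authors' software: the dictionary layout, the function name `classify`, and the category labels are assumptions made for illustration, and only detection patterns stated in the text are encoded.

```python
# Models: the base model plus the six condition models named in the text.
MODELS = ["base", "|Walk", "|Read", "|Trans", "|Exer", "|TV", "|Smoke"]
CONDITION_MODELS = set(MODELS[1:])

# Detection patterns transcribed from the text; the data layout is an
# illustrative assumption, not the output format of the original analysis.
detected_in = {
    "rs10965365 additive": set(MODELS),
    "rs6996584 A-A dominance": {"base"},
    "rs2455801 dominance": {"base", "|Trans", "|Walk", "|Smoke"},
    "rs8048681 C-A dominance": {"|Smoke", "|Trans", "|Exer"},
}


def classify(models_with_effect):
    """Assign one effect to a category from the set of models detecting it."""
    in_base = "base" in models_with_effect
    in_conditions = models_with_effect & CONDITION_MODELS
    if in_base and in_conditions == CONDITION_MODELS:
        return "detected in all models"
    if in_base and not in_conditions:
        return "lost in all condition models"
    if in_base:
        return "lost in some condition models"
    if in_conditions:
        return "detected only after conditioning"
    return "not detected"


labels = {effect: classify(models) for effect, models in detected_in.items()}
for effect, label in labels.items():
    print(f"{effect}: {label}")
```

The sketch only records whether an effect is detected. It does not capture changes in effect size, which the text also uses when interpreting effects that are present in every model.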

2.5 Ethnicity-specific effects

In our study, some effects were quite stable across ethnic populations, while others exhibited strong ethnic predisposition. For example, SNPs in the 9p21.3 region have been repeatedly associated with CHD in different populations, such as European ancestry populations^[2, 12-15], South Asian populations^[15], and East Asian populations^[16-18], since the region was first identified in a GWAS for CHD in 2007^[19]. In our study, rs10965365 (63 kb 3' of DMRTA1) also tagged this region, with a main additive effect (a = 0.005 4) irrespective of ethnicity, in concordance with previous findings. It could also be detected under all models, indicating that it may play a fundamental role in CHD progression.

In contrast, the SNP rs17116652, located near LPAR3, belongs to the ethnicity-specific group, with dominance effects that varied among the four ethnic populations. LPAR3 encodes a subtype of lysophosphatidic acid (LPA) receptor. Pharmacological studies have identified LPAR3 as the primary mediator of LPA-induced platelet activation during thrombogenesis^[20-21]. In our study, we found that, in addition to the universal main dominance effect (d = 0.034 7 in the base model), rs17116652 also exhibited ethnicity-specific effects in response to distinct cofactors. Ethnicity-specific dominance effects for the E-A, A-A, and H-A populations were detected in the base model (de₁ = -0.029 1, de₃ = -0.028 6, and de₄ = -0.033 1), and these effects decreased simultaneously after removing the effects of TV and smoke. For the E-A population, a further decrease was observed after eliminating the effect of read. This suggests that carriers of the heterozygote (C/T) of rs17116652 in these three populations can reduce CHD risk by giving up watching TV or by quitting smoking, and that European Americans could obtain a further reduction by stopping leisure reading.
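
As a worked illustration of how the main and ethnicity-specific dominance effects of rs17116652 combine, the sketch below adds the two terms for each population. It assumes that the net dominance contribution for a heterozygote is simply the sum d + de; the function name and data layout are hypothetical. Under that assumption, the sums for the E-A and A-A populations (0.005 6 and 0.006 1) agree with the C/T entries listed for this locus in Table 4.

```python
# Base-model effects for rs17116652 as reported in the text.
MAIN_DOMINANCE = 0.0347  # d
ETHNIC_DOMINANCE = {     # de for each population with a detected effect
    "E-A": -0.0291,
    "A-A": -0.0286,
    "H-A": -0.0331,
}


def net_heterozygote_dominance(main_d, ethnic_d):
    """Net dominance contribution for a heterozygote in one population.

    Assumption: the main and ethnicity-specific dominance effects are
    additive on the trait scale (d + de).
    """
    return main_d + ethnic_d


net = {
    population: round(net_heterozygote_dominance(MAIN_DOMINANCE, de), 4)
    for population, de in ETHNIC_DOMINANCE.items()
}
for population, value in net.items():
    print(f"{population}: {value:.4f}")
```

The negative ethnicity-specific terms offset most of the positive main effect, which is why the net contributions are much smaller than d itself.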

2.6 Gene network of detected genes conditional on life styles

To summarize the biological pathways primarily highlighted by our research, we used Biopubinfo to examine whether the genes harboring the identified loci were involved in particular diseases, pathways, or molecular networks (Fig. 2). Genes were classified into two categories based on their response to conditional analysis: unaltered, or lost after conditioning. We found that the additive genetic effects of two genes (DMRTA1 and TLE4) remained unaltered after conditioning on the different life styles. Disorders of the cardiovascular system and type 2 diabetes mellitus were associated with both gene sets affected by life styles. In particular, acquired immunodeficiency syndrome and bipolar affective psychoses were associated with the gene sets suppressed or caused by life styles, respectively. Besides diseases relevant to the cardiovascular system, we also detected many other diseases associated with the gene sets suppressed by life styles, indicating a pleiotropic role for those genes.

Fig. 2 Network of seed genes with diseases, functions, chemicals and other genes for Framingham global CVD risk
A: additive effects of 2 genes detected in all models; B: genes suppressed by |Trans; C: genes suppressed by |Walk; D: genes suppressed by |TV; E: genes suppressed by |Read; F: genes suppressed by |Exer; G: genes suppressed by |Smoke.

3 Discussion

We performed a genome-wide association study of the 10-year hard CHD risk of individuals in the MESA cohorts, using a full model that included additive, dominance, epistasis, and ethnicity-specific genetic effects to unveil the complex genetic architecture of CHD. We also used condition models with six life styles (walk, read, transportation, exercise, TV, and smoke) as cofactors to study the influence of life styles on CHD. In contrast to previous findings that dominance contributed little to the missing heritability, we found that dominance and ethnicity-specific dominance contributed almost half of the total phenotypic variance (42.34%-51.56%) for CHD. We also found that the genetic background of CHD varied greatly across the four ethnic populations, with ethnicity-specific heritability ranging from 31.18% to 43.99%.

Missing heritability has long been a persistent problem in genome-wide association studies. To address it, we introduced epistasis effects into our full model, including four types of effects (additive × additive, additive × dominance, dominance × additive, and dominance × dominance) along with their ethnicity interactions. We observed that epistasis contributed a large portion (11.72%-20.27%) of total heritability. Our finding offers a successful example of exploring the missing heritability accounted for by gene-gene interaction (G×G), as proposed by ZUK et al.^[22].

One of the most desirable goals of GWAS is to provide patients with personalized risk prediction. In our method, the genotypic effects of individual loci and epistatic SNP pairs were predicted, based on which we can predict the optimal genotype combinations of the superior line (all loci homozygous) and the superior hybrid (loci either homozygous or heterozygous) for each population (Tables 4 and 5). We found that setting rs17116652 as C/T, rs6706330 as A/G, rs423711 as G/A, and rs11250700 as G/T could simultaneously increase or decrease CHD risk in all ethnic groups. However, some loci were effective only in specific populations. For example, the heterozygote T/C of rs8048681 was detrimental only to African Americans, owing to de₃ = 0.012 1. The heterozygotes A/G of rs16876162 and C/G of rs6996584 could decrease CHD risk exclusively in European Americans, by 0.005 3 and 0.007 9 compared with the homozygotes A/A and G/G, respectively. Our method may offer a road map for disease risk prediction.
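
The two risk reductions quoted for European Americans can be reproduced from the predicted effects in Table 5 by subtracting the superior-line value from the superior-hybrid value at each locus. The sketch below is a minimal illustration of that arithmetic; the data structure and the function name `hybrid_advantage` are assumptions, not part of the original analysis.

```python
# Predicted genotypic effects for European Americans, transcribed from
# Table 5 (negative direction). Each entry is (genotype, effect).
european_effects = {
    "rs16876162": {"SL": ("A/A", -0.0103), "SH": ("A/G", -0.0156)},
    "rs6996584": {"SL": ("G/G", -0.0016), "SH": ("C/G", -0.0095)},
}


def hybrid_advantage(entry):
    """Effect of the superior-hybrid genotype minus the superior-line genotype."""
    return round(entry["SH"][1] - entry["SL"][1], 4)


advantage = {snp: hybrid_advantage(entry) for snp, entry in european_effects.items()}
for snp, delta in advantage.items():
    line_genotype = european_effects[snp]["SL"][0]
    hybrid_genotype = european_effects[snp]["SH"][0]
    print(f"{snp}: {hybrid_genotype} vs {line_genotype} changes the effect by {delta:+.4f}")
```

A negative difference means that the heterozygote lowers the predicted CHD risk relative to the best homozygote, by 0.005 3 for rs16876162 and 0.007 9 for rs6996584.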

Table 4 Predicted different loci between superior hybrid (SH) and superior line (SL) (positive)

QTS	Superior line/hybrid	European alleles	European effect	Chinese alleles	Chinese effect	African alleles	African effect	Hispanic alleles	Hispanic effect
1_rs17116652_C/T (LPAR3)	SL (+)	T/T	0.004 0	T/T	0.004 0	T/T	0.004 0	T/T	0.008 3
1_rs17116652_C/T (LPAR3)	SH (+)	C/T	0.005 6	C/T	0.126 8	C/T	0.006 1	T/T	0.008 3
1_rs10863911_C/T	SL (+)	C/C	0.000 0	C/C	0.000 0	C/C	0.000 0	C/C	0.000 0
1_rs10863911_C/T	SH (+)	C/T	0.010 1	C/T	0.004 7	C/T	0.004 7	C/T	0.004 7
2_rs12621362_T/C (C2orf51)	SL (+)	T/T	0.004 0	T/T	0.004 0	T/T	0.004 0	T/T	0.004 0
2_rs12621362_T/C (C2orf51)	SH (+)	T/C	0.008 2	T/C	0.008 2	T/C	0.008 2	T/T	0.004 0
2_rs905628_G/T (CNTNAP5)	SL (+)	T/T	-0.000 2	T/T	0.005 0	T/T	0.005 0	T/T	0.005 0
2_rs905628_G/T (CNTNAP5)	SH (+)	G/T	0.007 6	G/T	0.007 6	G/T	0.007 6	T/T	0.005 0
2_rs6706330_A/G (OSBPL6)	SL (+)	A/A	0.000 0	A/A	0.000 0	A/A	0.000 0	A/A	0.000 0
2_rs6706330_A/G (OSBPL6)	SH (+)	A/G	0.001 9	A/G	0.006 4	A/G	0.006 4	A/G	0.011 2
3_rs2455801_T/C (ANKRD28)	SL (+)	T/T	0.000 0	C/C	0.007 5	T/T	0.000 0	T/T	0.000 0
3_rs2455801_T/C (ANKRD28)	SH (+)	T/C	0.005 7	C/C	0.007 5	T/C	0.005 7	T/C	0.005 7
3_rs711689_T/A (ZNF385D)	SL (+)	T/T	-0.002 9	T/T	-0.002 9	T/T	0.004 9	A/A	0.010 2
3_rs711689_T/A (ZNF385D)	SH (+)	T/A	0.004 2	T/A	0.004 2	T/A	0.012 9	A/A	0.010 2
5_rs10805911_T/G (WDR41)	SL (+)	T/T	-0.002 7	T/T	-0.002 7	G/G	0.002 7	T/T	-0.002 7
5_rs10805911_T/G (WDR41)	SH (+)	T/T	-0.002 7	T/G	0.005 7	G/G	0.002 7	T/G	0.005 7
6_rs16888481_T/A	SL (+)	T/T	-0.003 0	T/T	-0.003 0	T/T	-0.003 0	T/T	-0.003 0
6_rs16888481_T/A	SH (+)	T/A	0.006 9	T/A	0.006 9	T/T	-0.003 0	T/A	0.016 8
10_rs10795470_T/A (ST8SIA6)	SL (+)	T/T	0.006 3	A/A	0.003 7	T/T	-0.003 7	A/A	0.013 2
10_rs10795470_T/A (ST8SIA6)	SH (+)	T/A	0.006 8	A/A	0.003 7	T/T	-0.003 7	A/A	0.013 2
16_rs8048681_T/C (WWOX)	SL (+)	T/T	0.000 0	T/T	0.000 0	T/T	0.000 0	T/T	0.000 0
16_rs8048681_T/C (WWOX)	SH (+)	T/T	0.000 0	T/T	0.000 0	T/C	0.012 1	T/T	0.000 0
20_rs423711_G/A (RIN2)	SL (+)	G/G	0.000 0	G/G	0.000 0	G/G	0.000 0	G/G	0.000 0
20_rs423711_G/A (RIN2)	SH (+)	G/A	0.004 9	G/A	0.004 9	G/A	0.004 9	G/A	0.004 9

Table 5 Predicted different loci between superior hybrid (SH) and superior line (SL) (negative)

QTS	Superior line/hybrid	European alleles	European effect	Chinese alleles	Chinese effect	African alleles	African effect	Hispanic alleles	Hispanic effect
2_rs317258_G/C	SL (-)	C/C	0.000 2	C/C	0.005 3	C/C	0.005 3	G/G	-0.005 3
2_rs317258_G/C	SH (-)	G/C	0.000 0	C/C	0.005 3	C/C	0.005 3	G/G	-0.005 3
3_rs2675434_T/A (ANKRD28)	SL (-)	T/T	0.003 0	T/T	0.003 0	T/T	0.003 0	T/T	0.003 0
3_rs2675434_T/A (ANKRD28)	SH (-)	T/A	-0.005 4	T/A	-0.005 4	T/A	-0.005 4	T/A	-0.005 4
5_rs10805911_T/G (WDR41)	SL (-)	T/T	-0.002 7	T/T	-0.002 7	G/G	0.002 7	T/T	-0.002 7
5_rs10805911_T/G (WDR41)	SH (-)	T/G	0.005 7	T/G	0.005 7	G/G	0.002 7	T/G	0.005 7
6_rs16888481_T/A	SL (-)	T/T	-0.003 0	T/T	-0.003 0	T/T	-0.003 0	T/T	-0.003 0
6_rs16888481_T/A	SH (-)	T/T	-0.003 0	T/T	-0.003 0	T/A	-0.001 5	T/T	-0.003 0
7_rs16876162_A/G	SL (-)	A/A	-0.010 3	G/G	0.003 8	G/G	-0.001 8	G/G	0.003 8
7_rs16876162_A/G	SH (-)	A/G	-0.015 6	G/G	0.003 8	G/G	-0.001 8	A/G	-0.015 6
8_rs6996584_C/G (SAMD12)	SL (-)	G/G	-0.001 6	C/C	-0.017 1	C/C	-0.002 5	G/G	-0.005 0
8_rs6996584_C/G (SAMD12)	SH (-)	C/G	-0.009 5	C/C	-0.017 1	C/C	-0.002 5	G/G	-0.005 0
9_rs1930368_C/T	SL (-)	C/C	0.002 6	C/C	-0.007 2	C/C	-0.007 2	C/C	-0.012 9
9_rs1930368_C/T	SH (-)	C/C	0.002 6	C/T	-0.010 4	C/C	-0.007 2	C/T	-0.024 4
10_rs11250700_G/T (ADARB2)	SL (-)	G/G	0.000 0	G/G	0.000 0	G/G	0.000 0	G/G	0.000 0
10_rs11250700_G/T (ADARB2)	SH (-)	G/T	-0.006 1	G/T	-0.006 1	G/T	-0.006 1	G/T	-0.006 1
10_rs10795470_T/A (ST8SIA6)	SL (-)	A/A	-0.006 3	T/T	-0.003 7	A/A	0.003 7	T/T	-0.013 2
10_rs10795470_T/A (ST8SIA6)	SH (-)	A/A	-0.006 3	T/A	-0.003 3	T/A	-0.003 3	T/T	-0.013 2
12_rs7134443_C/T	SL (-)	C/C	0.000 0	C/C	0.000 0	C/C	0.000 0	C/C	0.000 0
12_rs7134443_C/T	SH (-)	C/T	-0.003 9	C/C	0.000 0	C/C	0.000 0	C/C	0.000 0
16_rs8048681_T/C (WWOX)	SL (-)	T/T	0.000 0	T/T	0.000 0	T/T	0.000 0	T/T	0.000 0
16_rs8048681_T/C (WWOX)	SH (-)	T/T	0.000 0	T/C	-0.015 3	T/T	0.000 0	T/T	0.000 0

Acknowledgement: The datasets used for the analyses described in this manuscript were obtained from dbGaP (accession phs000209.v11.p3, Multi-Ethnic Study of Atherosclerosis (MESA) Cohort).

References

[1]	GO A S, MOZAFFARIAN D, ROGER V L, et al. Heart disease and stroke statistics—2014 update. Circulation, 2014, 129: e28. DOI: 10.1161/01.cir.0000441139.02102.80.
[2]	SCHUNKERT H, KÖNIG I R, KATHIRESAN S, et al. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nature Genetics, 2011, 43(4): 333–338. DOI: 10.1038/ng.784.
[3]	VERHEUGT F. Passive smoking and the risk of coronary heart disease. Nederlands Tijdschrift Voor Geneeskunde, 2004, 148: 645–647.
[4]	CARNETHON M R. Physical activity and cardiovascular disease: how much is enough? American Journal of Lifestyle Medicine, 2009, 3: 44–49. DOI: 10.1177/1559827609332737.
[5]	HINDORFF L A, JUNKINS H A, HALL P, et al. A catalog of published genome-wide association studies. National Human Genome Research Institute, 2011.
[6]	DELOUKAS P, KANONI S, WILLENBORG C, et al. Large-scale association analysis identifies new risk loci for coronary artery disease. Nature Genetics, 2013, 45: 25–33.
[7]	D'AGOSTINO R B, VASAN R S, PENCINA M J, et al. General cardiovascular risk profile for use in primary care: the Framingham Heart Study. Circulation, 2008, 117: 743–753. DOI: 10.1161/CIRCULATIONAHA.107.699579.
[8]	HASSTEDT S J, HIGHLAND H M, ELBEIN S C, et al. Five linkage regions each harbor multiple type 2 diabetes genes in the African American subset of the GENNID study. Journal of Human Genetics, 2013, 58: 378–383. DOI: 10.1038/jhg.2013.21.
[9]	DONG C, DELLA-MORTE D, BEECHAM A, et al. Genetic variants in LEKR1 and GALNT10 modulate sex-difference in carotid intima-media thickness: a genome-wide interaction study. Atherosclerosis, 2015, 240: 462–467. DOI: 10.1016/j.atherosclerosis.2015.04.019.
[10]	KIYOKAWA E, MATSUDA M. Regulation of focal adhesion and cell migration by ANKRD28-DOCK180 interaction. Cell Adhesion & Migration, 2009, 3: 281–284.
[11]	POLFUS L M, SMITH J A, SHIMMIN L C, et al. Genome-wide association study of gene by smoking interactions in coronary artery calcification. PLoS ONE, 2013, 8: e74642. DOI: 10.1371/journal.pone.0074642.
[12]	BURTON P R, CLAYTON D G, CARDON L R, et al. Genome-wide association study of 14 000 cases of seven common diseases and 3 000 shared controls. Nature, 2007, 447: 661–678. DOI: 10.1038/nature05911.
[13]	SAMANI N J, ERDMANN J, HALL A S, et al. Genomewide association analysis of coronary artery disease. New England Journal of Medicine, 2007, 357: 443–453. DOI: 10.1056/NEJMoa072366.
[14]	WILD P S, ZELLER T, SCHILLERT A, et al. A genome-wide association study identifies LIPA as a susceptibility gene for coronary artery disease. Circulation: Cardiovascular Genetics, 2011.
[15]	Coronary Artery Disease (CAD) Genetics Consortium. A genomewide association study in Europeans and South Asians identifies five new loci for coronary artery disease. Nature Genetics, 2011, 43: 339–344. DOI: 10.1038/ng.782.
[16]	TAKEUCHI F, YOKOTA M, YAMAMOTO K, et al. Genome-wide association study of coronary artery disease in the Japanese. European Journal of Human Genetics, 2012, 20: 333–340. DOI: 10.1038/ejhg.2011.184.
[17]	LU X F, WANG L Y, CHEN S F, et al. Genome-wide association study in Han Chinese identifies four new susceptibility loci for coronary artery disease. Nature Genetics, 2012, 44: 890–894. DOI: 10.1038/ng.2337.
[18]	LEE J Y, LEE B S, SHIN D J, et al. A genome-wide association study of a coronary artery disease risk variant. Journal of Human Genetics, 2013, 58: 120–126. DOI: 10.1038/jhg.2012.124.
[19]	MCPHERSON R, PERTSEMLIDIS A, KAVASLAR N, et al. A common allele on chromosome 9 associated with coronary heart disease. Science, 2007, 316: 1488–1491. DOI: 10.1126/science.1142447.
[20]	GARDELL S E, DUBIN A E, CHUN J. Emerging medicinal roles for lysophospholipid signaling. Trends in Molecular Medicine, 2006, 12: 65–75. DOI: 10.1016/j.molmed.2005.12.001.
[21]	ROTHER E, BRANDL R, BAKER D L, et al. Subtype-selective antagonists of lysophosphatidic acid receptors inhibit platelet activation triggered by the lipid core of atherosclerotic plaques. Circulation, 2003, 108: 741–747. DOI: 10.1161/01.CIR.0000083715.37658.C4.
[22]	ZUK O, HECHTER E, SUNYAEV S R, et al. The mystery of missing heritability: genetic interactions create phantom heritability. Proceedings of the National Academy of Sciences, 2012, 109: 1193–1198. DOI: 10.1073/pnas.1119675109.