- Research article
- Open access
- Published:
Characterizing the polygenic overlap and shared loci between rheumatoid arthritis and cardiovascular diseases
BMC Medicine volume 22, Article number: 152 (2024)
Abstract
Background
Despite substantial research revealing that patients with rheumatoid arthritis (RA) have excessive morbidity and mortality of cardiovascular disease (CVD), the mechanism underlying this association has not been fully known. This study aims to systematically investigate the phenotypic and genetic correlation between RA and CVD.
Methods
Based on UK Biobank, we conducted two cohort studies to evaluate the phenotypic relationships between RA and CVD, including atrial fibrillation (AF), coronary artery disease (CAD), heart failure (HF), and stroke. Next, we used linkage disequilibrium score regression, Local Analysis of [co]Variant Association, and bivariate causal mixture model (MiXeR) methods to examine the genetic correlation and polygenic overlap between RA and CVD, using genome-wide association summary statistics. Furthermore, we explored specific shared genetic loci by conjunctional false discovery rate analysis and association analysis based on subsets.
Results
Compared with the general population, RA patients showed a higher incidence of CVD (hazard ratio [HR] = 1.21, 95% confidence interval [CI]: 1.15–1.28). We observed positive genetic correlations of RA with AF and stroke, and a mixture of negative and positive local genetic correlations underlying the global genetic correlation for CAD and HF, with 13 ~ 33% of shared genetic variants for these trait pairs. We further identified 23 pleiotropic loci associated with RA and at least one CVD, including one novel locus (rs7098414, TSPAN14, 10q23.1). Genes mapped to these shared loci were enriched in immune and inflammatory-related pathways, and modifiable risk factors, such as high diastolic blood pressure.
Conclusions
This study revealed the shared genetic architecture of RA and CVD, which may facilitate drug target identification and improved clinical management.
Background
Rheumatoid arthritis (RA) is a systemic autoimmune disease that affects approximately 2% of the global population [1]. Previous studies have characterized RA by excess morbidity and mortality from cardiovascular disease (CVD) [2]. Compared with the general population, patients with RA have a 1.48-fold higher incidence of CVD [3, 4]. Recent research has identified traditional risk factors for CVD comorbidity in RA, such as obesity, hypertension, and dyslipidemia [5]. However, after controlling for these established risk factors, patients with RA still showed an increased risk of developing CVD [6, 7]. Therefore, the pathology underlying the phenotypic linkage between RA and CVD requires further investigation.
As both RA and CVD are complex and heritable disorders [8,9,10], one hypothesis accounting for their phenotypic linkage is the genetic components [11, 12]. The genetic variants may either influence one trait through their effect on the other (causality) or have independent effects on both traits (pleiotropy). Several studies applying the Mendelian randomization approach, which uses genetic variants as proxies for exposures, showed that genetic liability to RA was associated with an increased risk of coronary artery disease (CAD), intracerebral hemorrhage and myocardial infarction, suggesting the potential causality of RA on CVD risk [13, 14]. Except for the potentially causal effects, the shared genetic variants may also contribute to their associations. Prior studies reported several genes, such as HLA-DRB1, TNFA, MTHFR, and CCR5, were associated with the developments of both RA and CVD [15,16,17]. However, a comprehensive assessment through genome-wide analyses has not been performed. The accumulating amount of genome-wide genetic data enable the utilization of several newly developed methods, such as conjunctional false discovery rate (conjFDR) analysis and association analysis based on subsets (ASSET), to discover novel susceptibility regions affecting multiple traits. Such design may help to examine the genetic contributions to the epidemiological associations and provide novel insights into the underlying biological mechanisms.
To quantify the shared and distinct etiology underlying RA and CVD, we comprehensively analyzed phenotypic linkage and polygenic overlap between RA and four CVD traits, including AF, CAD, HF, and stroke. The overall design of the study is shown in Fig. 1. Specifically, we first assessed the phenotypic linkage between RA and four CVD traits using longitudinal data from the UK Biobank [18,19,20,21,22]. Next, by integrating the large-scale genome-wide association studies (GWAS), we evaluated the genetic correlation and identified shared genetic loci between RA and CVD. We finally carried out functional annotations to delineate the biological impact of shared loci. By investigating the phenotypical and genetic associations between RA and CVD, along with the underlying biological mechanisms, the present study may provide insights into the clinical management of these diseases.
Methods
Data sources
The UK Biobank sataset and GWAS summary statistics
We included European participants with phenotypic data from the UK Biobank (Project ID:71,610). The diagnoses of RA, AF, CAD, HF, and stroke were defined using the International Classification of Diseases code (both Ninth Revision [ICD-9] and Tenth Revision [ICD-10]) and self-reported illness (Additional file 1: Table S1).
To assess the phenotypic linkage between RA and CVD, we performed two analyses (the RA-CVD cohort and the CVD-RA cohort). Specifically, when assessing the risk of CVD in RA patients, we excluded participants with prevalent CVD at baseline, resulting in a total of 441,096 participants (namely the RA-CVD cohort). To analyze RA risk in CVD patients, we excluded participants diagnosed with RA at baseline, the analytic sample comprised 466,285 participants ( the CVD-RA cohort). The UK Biobank study was approved by the North West Multi-Center Research Ethics Committee, and all participants provided informed consent. More details are displayed in the Methods in the Additional file 1 [23,24,25].
Summary statistics of RA and four CVD traits were obtained from publicly available GWAS, ranging from 58,324 to 1,030,836 individuals of European ancestry [18,19,20,21,22]. Detailed information is provided in Additional file 1: Table S2 and the Methods in the Additional file 1 [18,19,20,21,22, 26].
Statistical analyses
Observational analyses
To estimate the phenotypic linkage between RA and CVD, we calculated the hazard ratios (HR) with 95% confidence intervals (CI), using the multivariable Cox proportional hazards regression model. The HRs were progressively adjusted for confounders in two models. Model 1 included age and sex as covariates. Model 2 was adjusted for age, sex, body mass index [BMI], smoking status, alcohol drinking frequency, educational attainment, physical activities, hypertension, type 2 diabetes, and total cholesterol, when assessing the associations between RA and CVD risk. When evaluating the relationship between CVD and RA risk, Model 2 only contained age, sex, BMI, smoking status, alcohol drinking frequency, educational attainment and physical activities. We additionally performed propensity score analyses, and a series of sensitivity analyses, including stratifying participants by age and sex, excluding the incident cases in the first follow-up year, and excluding participants with kinship.
Analyses of genetic correlation and polygenic overlap
We calculated the global and local genetic correlation as well as the polygenic overlap between RA and CVD. We first applied linkage disequilibrium score regression (LDSC) to estimate the global genetic correlation using the 1000 Genomes Project European panel as the reference panel [27]. To investigate whether RA shares genetic overlap with CVD in specific genomic regions, we supplied the Local Analysis of [co]Variant Association (LAVA) method to estimate the local genetic correlation [28]. A bivariate test was performed to estimate these local genetic correlations based on the local genetic covariance between two traits [28]. A total of 2495 default linkage disequilibrium (LD)—independent genomic regions were applied in LAVA [28]. Additionally, we used the bivariate causal mixture model (MiXeR) to quantify the polygenic overlap beyond genetic correlations [29]. MiXeR estimated the number of trait-influencing variants (i.e., variants with non-zero additive genetic effects on the trait) for each trait and the number of shared trait-influencing single-nucleotide polymorphisms (SNPs) based on a bivariate Gaussian mixture model [29]. Model fit was evaluated by the Akaike Information Criterion (AIC) based on the likelihood maximization of signed test statistics (GWAS z-scores). A positive AIC value suggested sufficient power of input GWAS data to distinguish the estimated polygenic overlap. More details for the framework of MiXeR are provided in its original publication [29].
Discovery of shared genetic loci
To discover the shared genetic loci, we performed conjunctional false discovery rate (conjFDR) analysis on corresponding GWAS summary statistics. As an extension of the conditional FDR (condFDR) method, conjFDR is an empirical Bayesian statistical framework and re-ranks the test statistics in a primary phenotype (eg. RA) by conditioning on SNP associations with a secondary phenotype (e.g., CVD) [30, 31]. After switching the roles of the primary and secondary traits, the conjFDR is determined as the maximum of two condFDR values, which provides a conservative estimate of the FDR for the association with both phenotypes [30, 32]. In the present study, we applied a conjFDR threshold of 0.05.
As an alternative approach, we applied the association analysis based on subsets (ASSET) for genome-wide cross-trait meta-analysis to validate the results from conjFDR [33]. We used two-sided ASSET analysis, which allows for subset searches for association signals separately in positive and negative directions and then combines association signals from two directions by chi-square test [33]. The genome-wide significant variants were determined as P < 5 × 10−8.
To define independent genetic loci, we clumped the results of conjFDR and ASSET (–clump-kb 500; –clump-r2 0.1) using PLINK, separately [34]. For each shared locus found by conjFDR, we examined whether it was physically overlapped with a locus from ASSET, based on their lead SNPs using 500 kb windows. If a conjFDR locus overlapped with a locus from ASSET, we considered it as validated in ASSET, and such locus remained for subsequent analyses. We then assessed if a genetic locus was novel for RA and CVD based on a 500-kb region upstream and downstream of its lead SNP. We regarded a locus to be novel if it had not been previously reported to be associated with RA or CVD at genome-wide significance, according to the GWAS Catalog (latest accessed date on July 10th, 2021) and previously published GWAS [18,19,20,21,22].
Functional annotations and gene prioritization
We annotated the regulatory function for the lead SNPs, including enhancers and histone modification sites [35, 36]. The lead SNPs were further mapped to genes by a combination of physical position, expression quantitative trait loci (eQTL), and chromatin interaction mappings. These mapped genes were referred to as candidate-shared genes. Detailed information is displayed in the Methods in the Additional file 1 [35,36,37,38,39,40].
We prioritized these candidate shared genes by summary-data-based Mendelian randomization (SMR) analyses, which investigate the associations between the expression of shared genes and phenotypes of interest [41]. Cis-eQTLs in eQTLGen Consortium [42] and five tissues (artery aorta, artery coronary, atrial appendage, left ventricle, and whole-blood tissues) from GTEx v8 projects were used as instrumental variables to link the outcome via an exposure (i.e., gene expression) [40]. The analysis was conducted according to default SMR protocols. We further supplied the heterogeneity in dependent instruments (HEIDI) analyses and colocalization analyses to support inference by reducing the likelihood that linkage disequilibrium affected the MR findings.
Gene functional enrichment analysis
We explored the underlying biological mechanisms for those shared genes by evaluating their enrichment in gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways using KOBAS 3.0 [43]. Meanwhile, we examined the enrichment of shared genes in any other traits based on the dataset from the GWAS Catalog, using the Gene2Func function of FUMA [44]. The significance threshold of all the above enrichment analyses was set at FDR < 0.05.
Results
The phenotypic linkage between RA and CVD
The characteristics of the participants are displayed in Additional file 1: Table S3. We first assessed the association between RA and the incidence of CVD in the UK Biobank. We observed an increased incidence of CVD in patients with RA in both Model 1 (HR = 1.36, 95% CI: 1.29–1.43) and Model 2 (HR = 1.21, 95% CI: 1.15–1.28). In the fully adjusted model (Model 2), compared with non-RA controls, patients with RA had a 18% (95% CI: 9–28%) higher risk for AF, 18% (95% CI: 10–28%) higher risk for CAD, 56% (95% CI: 40–73%) higher risk for HF, and 16% (95% CI: 1–34%) higher risk for stroke, respectively (Fig. 2). The increased risk of CVD in patients with RA remained statistically significant in additional analyses, including stratification analyses by age and sex, sensitivity analyses excluding incident cases diagnosed in the first follow-up year, and excluding participants with kinship, as well as in propensity score analyses (Additional file 1: Table S4). The results for the associations between RA and four CVDs in the additional analyses can be found in the Additional file 1: Table S4.
Meanwhile, we investigated the relationship between CVD and the incidence of RA (Fig. 2 and Additional file 1: Table S5). After adjustment for age and sex, patients with CVD had a higher risk of RA (HR = 1.17, 95% CI: 1.09–1.26) (Model 1). Among them, patients with CAD had an elevated risk of RA (HR = 1.28, 95% CI: 1.18–1.40). However, in the fully adjusted Cox regression models (Model 2), only patients with CAD had a higher incidence of RA, with HR of 1.13 (95% CI: 1.04–1.24). Sensitivity analyses yielded consistent results (Additional file 1: Table S6). However, we observed a statistically significant association between CVD and RA risk among the younger participants (< 60 years old) (HR: 1.58, 95% CI: 1.38–1.80; P = 1.22 × 10−11) (Additional file 1: Table S6).
The genetic correlation and polygenic overlap between RA and CVD
LDSC analyses revealed that RA has significant positive global genetic correlations with AF (rg = 0.081, FDR-adjusted P = 0.025) and stroke (rg = 0.191, FDR-adjusted P = 0.008). Nominally significant positive global genetic correlations were found for RA with CAD (rg = 0.068, FDR-adjusted P = 0.080) and HF (rg = 0.096, FDR-adjusted P = 0.087) (Fig. 3A). In MiXeR analyses, conditional QQ plots showed cross-trait SNP enrichment for RA and CVD (Additional file 1: Fig S1), suggesting there were polygenic overlaps. Among the 0.8 k variants associated with RA, about 13 ~ 33% of them were also associated with CVD, including 0.1 k shared variants with AF, and 0.3 k with CAD (Additional file 1: Table S8). There were 45% and 46% shared genetic variants between RA and HF, RA and stroke, respectively; however, the model fit suggested the need to increase the sample size of GWAS to provide sufficient power (Fig. S1, and Table S2 in the Additional file 1).
As global genetic correlation estimates an average of genome-wide shared association, these nominal significant correlations may be led by the mixed shared associations [28, 29, 45]. Thus, we further applied LAVA to detect local genetic correlation. The LAVA analysis showed a mixture of negative and positive local genetic correlations underlying the global positive genetic correlation. Specifically, we observed that around 51% to 61% of nominally significant local genetic correlations were positive (Fig. 3B and Additional file 1: Table S7). Moreover, 7 regions, including 4 for RA and AF and 3 for RA and stroke, were significantly correlated. Taken together, the above findings suggested a shared genetic architecture between RA and CVD.
Shared genetic loci between RA and CVD
In light of the above findings, we performed further analyses to detect the loci shared between RA and CVD. First, by using conjFDR analysis, we identified 12, 19, 5, and 6 loci shared between RA and AF, CAD, HF, and stroke, respectively (Fig. 4, and Additional file 1: Table S9). Next, we assessed these identified shared loci using ASSET analysis (Additional file 1: Table S10), and validated 23 loci, including 10, 13, 5, and 5 shared between RA and AF, CAD, HF, and stroke, respectively (Table 1). A total of 27 candidate-shared genes were mapped by these identified loci (Table 1).
Among the replicated loci, one shared locus (rs7098414, 10q23.1, conjFDR = 0.003; PASSET = 3.08 × 10−9) for RA and CAD was novel for both traits. The lead SNP rs7098414 is located in the intron of TSPAN14 and overlapped with 13 histone modification markers (Additional file 1: Table S11). Moreover, eQTL analyses showed that the SNP rs7098414 was associated with the expression of TSPAN14 (P = 1.83 × 10−41) in whole blood. Notably, we also found two loci, rs10774624, and rs12126142, shared by multiple traits. rs10774624 (12q24.12), was shared between RA and CAD (conjFDR = 4.24 × 10−5; PASSET = 5.82 × 10−15), stroke (conjFDR = 3.20 × 10−5; PASSET = 4.58 × 10−17) and HF (conjFDR = 4.12 × 10−5; PASSET = 1.55 × 10−15). Within this locus, we observed five histone modification markers (Additional file 1: Table S11). The lead SNP rs10774624 resides between PHETA1 and SH2B3 and was associated with expressions of both genes in whole blood.
Moreover, SMR analysis suggested that the expression of SWAP70 was associated with the risk of RA (OR = 0.79, 95% CI = 0.68–0.91, P = 0.001) and two CVD traits (CAD: OR = 0.85, 95% CI = 0.80–0.91, P = 1.97 × 10−7; and HF: OR = 0.91, 95% CI = 0.86–0.96, P = 6.24 × 10−4), with no evidence of heterogeneity (P > 0.05, Additional file 1: Table S12). eQTL for SWAP70 in whole blood tissue from eQTLGen Consortium were also colocalized with the GWAS of RA, CAD, and HF at this locus (PPH4: 0.87 for eQTL-CAD trait pair; PPH4: 0.56 for eQTL-RA trait pair; PPH4: 0.52 for eQTL-HF trait pair, Additional file 1: Table S12).
Enriched biological pathways and shared risk factors between RA and CVD
To determine the over-represented pathways among 27 candidate-shared genes, we carried out GO and KEGG pathway enrichment analyses. As a result, we found that shared genes significantly enriched the regulation of the cell cycle and cyclin binding in the biological process. As to cellular components, they were enriched in the actin cytoskeleton, and cyclin-dependent protein serine/threonine kinase activity. For molecular function, these genes were significantly enriched in the protein homodimerization activity, and protein binding, as well as in the regulation of insulin receptor signaling pathway (Additional file 1: Table S13). There were 21 biological pathways significantly enriched, including the human cytomegalovirus infection, and GABAergic synapse (Additional file 1: Table S14).
Additionally, based on the GWAS catalog, these shared genes were mainly enriched in chronic inflammatory diseases, such as Crohn’s disease, systemic lupus erythematosus, and Type 1 diabetes (Fig. 4B). Besides, the shared genes were also enriched in diastolic blood pressure, one risk factor for CVD (Fig. 4B). Hence, we applied a factorial analysis to evaluate the associations of diastolic blood pressure with the risk of RA and CVD in the UK Biobank. As shown in Fig. 4C, participants with neither RA nor high diastolic blood pressure had the lowest risk of CVD (any of AF, CAD, HF, and stroke). We observed an incremental risk in patients with RA alone (HR = 1.05, 95% CI: 1.02–1.07), followed by patients with high diastolic blood pressure alone (HR = 1.20, 95% CI: 1.13–1.28), and the risk was the highest in patients with both RA and high diastolic blood pressure (HR = 1.22, 95% CI: 1.09–1.36) (Additional file 1: Table S15). However, the combination of high diastolic blood pressure and CVD was not significantly positively associated with a higher incidence of RA with adjustment of age, sex, BMI, smoking status, alcohol drinking frequency, education levels and physical activity (Additional file 1: Table S15).
Discussion
In the present study, we systematically investigated the phenotypic linkage and the shared genetic architecture between RA and CVD. We identified 23 shared loci, including one novel locus for RA and CVD. Genes mapped by these shared loci suggested the involvement of immune and inflammatory-related pathways as well as modifiable risk factors such as high diastolic blood pressure for their etiology.
Using data including more than 440,000 UK Biobank participants, our results revealed a higher incidence of CVD in RA patients, with HRs ranging from 1.16 to 1.56, which were consistent with previous epidemiological evidence [46, 47]. We noted that the association between RA and CAD exhibited significance among females and younger participants, whereas it diminished to insignificance among males and older individuals. Reasons for the insignificant association in the sensitivity analyses may be diverse. It is possible that the association between RA and CVD may differ by age, sex, or could be influenced by the limited sample size [48]. In this study, we also evaluated the relationship between CVD and the incidence of RA, revealing a significant association solely between CAD and RA. Further investigation may be warranted to validate our findings.
The epidemiological associations between RA and CVD may be the result of pleiotropic effects of genetics components, reflected by the global and local genetic correlations. In the present study, we observed significant positive global genetic correlations of RA with AF and stroke. However, the global genetic correlations of RA with CAD and HF were of nominal significance. Since the global genetic correlation represents the average of genome-wide shared associations, this insignificant global correlation may be due to opposing directions at different regions [45]. Consistently, LAVA analysis revealed a mixed effect direction at the regional level, by showing nearly half of the regions were positively correlated with nominal significance. The shared genetic architecture hypothesis was further supported by MiXeR, which found that RA shares 33.3% genetic variants with CAD and 44.4% with HF and reported cross-trait SNP enrichment in conditional Q-Q plots. Altogether, the above findings supported the presence of polygenic overlap between RA and CVD.
Among the shared loci, one locus on chromosome 10 has not been linked to RA or CVD in previous GWAS studies. The lead SNP of this locus is rs7098414 (10q23.1), an intronic SNP of TSPAN14. TSPAN14 encodes Tspan14 protein, which belongs to a subfamily of six related proteins TspanC8 group [49, 50]. TSPAN14 has been found to positively regulate ADAM10-dependent Notch activation [51], which was involved in the pathogenesis of RA [52]. TSPAN14 was also shown to interact with the inflammatory pathway of atherosclerosis, which has a major role in the development of CAD [53]. Further research is needed to better understand the specific mechanisms through which Tspan14 influences these diseases and to validate the functional significance of TSPAN14 in RA and CAD. Besides, the other shared loci also provided new insights into the pathogenesis of the association between RA and CVD. For example, rs10774624 (12q24.12), which was jointly associated with RA, CAD, HF, and stroke in the present study, has been demonstrated to be related to blood pressure and peripheral artery disease [54, 55]. Moreover, the identification of rs12126142 (IL6R) shared with RA, AF, and CAD may be of specific clinical interest. The IL6R gene product has been linked to immunological response [56] and chronic inflammation [57], both of which have been implicated in the pathogenesis of RA and CVD. Consistently, prior studies also reported that genetically determined lower IL-6 level was associated with a decreased risk of coronary heart disease events [58, 59]. The pathway enrichment analysis also suggested the involvement of Th17 cell differentiation, which is regulated by IL-6 [60]. Currently, several drugs, such as Tocilizumab, have been developed to inhibit IL6 signaling for RA treatment [61]. Thus, our findings of shared genetic loci between RA and CVD may shed light on the clinical implications of individualized treatment to reduce the occurrence of CVD in RA patients.
Additionally, GO enrichment analysis suggested a potential involvement of negative regulation of insulin receptor signaling pathways in the comorbidity of RA and CVD. This signaling pathway may be of special clinical interest, given the established role of insulin resistance in the development of CVD [62] and its suggested involvement in the risk of RA in patients with CVD [63]. Moreover, the shared genetic loci were also enriched in diabolic blood pressure, which can be caused by insulin resistance [64, 65]. These data suggested that the metabolic mechanisms may be relevant to the association of RA and CVD.
Some limitations should be considered when interpreting our findings. First, when assessing the associations between RA and CVD in the UK Biobank, though we have included the potential confounding factors in the our analysis, residual confounding may still exist, which may result in an overestimation of the association. Second, the MiXeR analyses for RA-HF, and RA-stroke still suggested inadequate power, though we applied the GWAS data sets with the largest sample size to date for each phenotype assessed. Further validation studies of the present findings using larger GWAS datasets are required. Additionally, due to available GWAS data and multiancestral differences in allele frequency and LD structure, we restricted our analyses to the European population, therefore, these findings may not be generalizable to other populations with different ethnicities. Future investigations should incorporate more diverse population samples, such as non-European cohorts, to validate and extend our results. Finally, several included GWAS datasets did not include the major histocompatibility complex (MHC) region, which precluded us from exploring genetic linkage for RA and CVD in the MHC region. Since the MHC region has a well-established role in immune responses and inflammation, and the present study indicated the involvement of inflammation in the shared genetic loci between RA and CVD, further studies with a focus on MHC are suggested.
Conclusions
The present study showed phenotypic linkages between RA and CVD, as well as shared genetic components, which might underlie the established epidemiological associations. The shared genes were involved in immune and inflammatory pathways. These findings progress our understanding of shared genetic mechanisms underlying RA and CVD and may provide insights into new therapies for these disorders and improve clinical management.
Availability of data and materials
The data generated or analyzed during this study are available in this published article and its supplementary information files.
Abbreviations
- AIC:
-
Akaike Information Criterion
- BMI:
-
Body mass index
- CAD:
-
Coronary artery disease
- CI:
-
Confidence intervals
- CondFDR:
-
Conditional false discovery rate
- ConjFDR:
-
Conjunctional false discovery rate
- CVD:
-
Cardiovascular disease
- eQTL:
-
Expression quantitative trait loci
- GO:
-
Gene ontology
- GWAS:
-
Genome-wide association studies
- HF:
-
Heart failure
- HR:
-
Hazard ratio
- ICD:
-
International Classification of Diseases
- KEGG:
-
Kyoto Encyclopedia of Genes and Genomes
- LAVA:
-
Local Analysis of [co]Variant Association
- LD:
-
Linkage disequilibrium
- LDSC:
-
Linkage disequilibrium score regression
- RA:
-
Rheumatoid arthritis
- SMR:
-
Summary-data-based Mendelian randomization
- SNPs:
-
Single-nucleotide polymorphisms
References
Safiri S, Kolahi AA, Hoy D, Smith E, Bettampadi D, Mansournia MA, Almasi-Hashiani A, Ashrafi-Asgarabad A, Moradi-Lakeh M, Qorbani M, et al. Global, regional and national burden of rheumatoid arthritis 1990–2017: a systematic analysis of the Global Burden of Disease study 2017. Ann Rheum Dis. 2019;78(11):1463–71.
Avina-Zubieta JA, Choi HK, Sadatsafavi M, Etminan M, Esdaile JM, Lacaille D. Risk of cardiovascular mortality in patients with rheumatoid arthritis: a meta-analysis of observational studies. Arthritis Rheum. 2008;59(12):1690–7.
Avina-Zubieta JA, Thomas J, Sadatsafavi M, Lehman AJ, Lacaille D. Risk of incident cardiovascular events in patients with rheumatoid arthritis: a meta-analysis of observational studies. Ann Rheum Dis. 2012;71(9):1524–9.
Crowson CS, Liao KP, Davis JM 3rd, Solomon DH, Matteson EL, Knutson KL, Hlatky MA, Gabriel SE. Rheumatoid arthritis and cardiovascular disease. Am Heart J. 2013;166(4):622–628e621.
England BR, Thiele GM, Anderson DR, Mikuls TR. Increased cardiovascular risk in rheumatoid arthritis: mechanisms and implications. BMJ. 2018;361:k1036.
Solomon DH, Karlson EW, Rimm EB, Cannuscio CC, Mandl LA, Manson JE, Stampfer MJ, Curhan GC. Cardiovascular morbidity and mortality in women diagnosed with rheumatoid arthritis. Circulation. 2003;107(9):1303–7.
Solomon DH, Curhan GC, Rimm EB, Cannuscio CC, Karlson EW. Cardiovascular risk factors in women with and without rheumatoid arthritis. Arthritis Rheum. 2004;50(11):3444–9.
Sack M. Tumor necrosis factor-alpha in cardiovascular biology and the potential role for anti-tumor necrosis factor-alpha therapy in heart disease. Pharmacol Ther. 2002;94(1–2):123–35.
Tada H, Fujino N, Hayashi K, Kawashiri MA, Takamura M. Human genetics and its impact on cardiovascular disease. J Cardiol. 2022;79(2):233–9.
Kathiresan S, Srivastava D. Genetics of human cardiovascular disease. Cell. 2012;148(6):1242–57.
Logstrup BB, Olesen KKW, Masic D, Gyldenkerne C, Thrane PG, Ellingsen T, Botker HE, Maeng M. Impact of rheumatoid arthritis on major cardiovascular events in patients with and without coronary artery disease. Ann Rheum Dis. 2020;79(9):1182–8.
Torkamani A, Topol EJ, Schork NJ. Pathway analysis of seven common diseases assessed by genome-wide association. Genomics. 2008;92(5):265–72.
Zhang K, Jia Y, Wang R, Guo D, Yang P, Sun L, Wang Y, Liu F, Zang Y, Shi M, et al. Rheumatoid arthritis and the risk of major cardiometabolic diseases: a Mendelian randomization study. Scand J Rheumatol. 2023;52(4):335–41.
Yuan S, Carter P, Mason AM, Yang F, Burgess S, Larsson SC. Genetic Liability to Rheumatoid Arthritis in Relation to Coronary Artery Disease and Stroke Risk. Arthritis Rheumatol. 2022;74(10):1638–47.
Rodriguez-Rodriguez L, Lopez-Mejias R, Fernandez-Gutierrez B, Balsa A, Gonzalez-Gay MA, Martin J. Rheumatoid arthritis: genetic variants as biomarkers of cardiovascular disease. Curr Pharm Des. 2015;21(2):182–201.
Skeoch S, Bruce IN. Atherosclerosis in rheumatoid arthritis: is it all about inflammation? Nat Rev Rheumatol. 2015;11(7):390–400.
Viatte S, Plant D, Han B, Fu B, Yarwood A, Thomson W, Symmons DP, Worthington J, Young A, Hyrich KL, et al. Association of HLA-DRB1 haplotypes with rheumatoid arthritis severity, mortality, and treatment response. JAMA. 2015;313(16):1645–56.
Okada Y, Wu D, Trynka G, Raj T, Terao C, Ikari K, Kochi Y, Ohmura K, Suzuki A, Yoshida S, et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature. 2014;506(7488):376–81.
Nielsen JB, Thorolfsdottir RB, Fritsche LG, Zhou W, Skov MW, Graham SE, Herron TJ, McCarthy S, Schmidt EM, Sveinbjornsson G, et al. Biobank-driven genomic discovery yields new insight into atrial fibrillation biology. Nat Genet. 2018;50(9):1234–9.
Nelson CP, Goel A, Butterworth AS, Kanoni S, Webb TR, Marouli E, Zeng L, Ntalla I, Lai FY, Hopewell JC, et al. Association analyses based on false discovery rate implicate new loci for coronary artery disease. Nat Genet. 2017;49(9):1385–91.
Shah S, Henry A, Roselli C, Lin H, Sveinbjornsson G, Fatemifar G, Hedman AK, Wilk JB, Morley MP, Chaffin MD, et al. Genome-wide association and Mendelian randomisation analysis provide insights into the pathogenesis of heart failure. Nat Commun. 2020;11(1):163.
Malik R, Chauhan G, Traylor M, Sargurupremraj M, Okada Y, Mishra A, Rutten-Jacobs L, Giese AK, van der Laan SW, Gretarsdottir S, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet. 2018;50(4):524–37.
Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, Downey P, Elliott P, Green J, Landray M, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779.
Craig CL, Marshall AL, Sjostrom M, Bauman AE, Booth ML, Ainsworth BE, Pratt M, Ekelund U, Yngve A, Sallis JF, et al. International physical activity questionnaire: 12-country reliability and validity. Med Sci Sports Exerc. 2003;35(8):1381–95.
Guo W, Key TJ, Reeves GK. Accelerometer compared with questionnaire measures of physical activity in relation to body size and composition: a large cross-sectional analysis of UK Biobank. BMJ Open. 2019;9(1):e024206.
Arnett FC, Edworthy SM, Bloch DA, McShane DJ, Fries JF, Cooper NS, Healey LA, Kaplan SR, Liang MH, Luthra HS, et al. The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis. Arthritis Rheum. 1988;31(3):315–24.
Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J. Schizophrenia Working Group of the Psychiatric Genomics C, Patterson N, Daly MJ, Price AL, Neale BM: LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47(3):291–5.
Werme J, van der Sluis S, Posthuma D, de Leeuw CA. An integrated framework for local genetic correlation analysis. Nat Genet. 2022;54(3):274–82.
Frei O, Holland D, Smeland OB, Shadrin AA, Fan CC, Maeland S, O’Connell KS, Wang Y, Djurovic S, Thompson WK, et al. Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation. Nat Commun. 2019;10(1):2417.
Andreassen OA, Djurovic S, Thompson WK, Schork AJ, Kendler KS, O’Donovan MC, Rujescu D, Werge T, van de Bunt M, Morris AP, et al. Improved detection of common variants associated with schizophrenia by leveraging pleiotropy with cardiovascular-disease risk factors. Am J Hum Genet. 2013;92(2):197–209.
Schork AJ, Wang Y, Thompson WK, Dale AM, Andreassen OA. New statistical approaches exploit the polygenic architecture of schizophrenia–implications for the underlying neurobiology. Curr Opin Neurobiol. 2016;36:89–98.
Smeland OB, Frei O, Shadrin A, O’Connell K, Fan CC, Bahrami S, Holland D, Djurovic S, Thompson WK, Dale AM, et al. Discovery of shared genomic loci using the conditional false discovery rate approach. Hum Genet. 2020;139(1):85–94.
Bhattacharjee S, Rajaraman P, Jacobs KB, Wheeler WA, Melin BS, Hartge P, GliomaScan C, Yeager M, Chung CC, Chanock SJ, et al. A subset-based approach improves power and interpretation for the combined analysis of genetic association studies of heterogeneous traits. Am J Hum Genet. 2012;90(5):821–35.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.
Wang Z, Zhang Q, Zhang W, Lin JR, Cai Y, Mitra J, Zhang ZD. HEDD: Human Enhancer Disease Database. Nucleic Acids Res. 2018;46(D1):D113–20.
Davis CA, Hitz BC, Sloan CA, Chan ET, Davidson JM, Gabdank I, Hilton JA, Jain K, Baymuradov UK, Narayanan AK, et al. The Encyclopedia of DNA elements (ENCODE): data portal update. Nucleic Acids Res. 2018;46(D1):D794–801.
Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7.
Ge X, Frank-Bertoncelj M, Klein K, McGovern A, Kuret T, Houtman M, Burja B, Micheroli R, Shi C, Marks M, et al. Functional genomics atlas of synovial fibroblasts defining rheumatoid arthritis heritability. Genome Biol. 2021;22(1):247.
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164.
Consortium GT: Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 2015;348(6235):648-660.
Zhu Z, Zhang F, Hu H, Bakshi A, Robinson MR, Powell JE, Montgomery GW, Goddard ME, Wray NR, Visscher PM, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016;48(5):481–7.
Vosa U, Claringbould A, Westra HJ, Bonder MJ, Deelen P, Zeng B, Kirsten H, Saha A, Kreuzhuber R, Yazar S, et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat Genet. 2021;53(9):1300–10.
Bu D, Luo H, Huo P, Wang Z, Zhang S, He Z, Wu Y, Zhao L, Liu J, Guo J, et al. KOBAS-i: intelligent prioritization and exploratory visualization of biological functions for gene enrichment analysis. Nucleic Acids Res. 2021;49(W1):W317–25.
Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8(1):1826.
Cheng W, Frei O, van der Meer D, Wang Y, O’Connell KS, Chu Y, Bahrami S, Shadrin AA, Alnaes D, Hindley GFL, et al. Genetic Association Between Schizophrenia and Cortical Brain Surface Area and Thickness. JAMA Psychiat. 2021;78(9):1020–30.
Khalid Y, Dasu N, Shah A, Brown K, Kaell A, Levine A, Dasu K, Raminfard A. Incidence of congestive heart failure in rheumatoid arthritis: a review of literature and meta-regression analysis. ESC Heart Fail. 2020;7(6):3745–53.
Meune C, Touze E, Trinquart L, Allanore Y. High risk of clinical cardiovascular events in rheumatoid arthritis: Levels of associations of myocardial infarction and stroke through a systematic review and meta-analysis. Arch Cardiovasc Dis. 2010;103(4):253–61.
Rohrich DC, van de Wetering EHM, Rennings AJ, Arts EE, Meek IL, den Broeder AA, Fransen J, Popa CD. Younger age and female gender are determinants of underestimated cardiovascular risk in rheumatoid arthritis patients: a prospective cohort study. Arthritis Res Ther. 2021;23(1):2.
Boucheix C, Rubinstein E. Tetraspanins. Cell Mol Life Sci. 2001;58(9):1189–205.
Dornier E, Coumailleau F, Ottavi JF, Moretti J, Boucheix C, Mauduit P, Schweisguth F, Rubinstein E. TspanC8 tetraspanins regulate ADAM10/Kuzbanian trafficking and promote Notch activation in flies and mammals. J Cell Biol. 2012;199(3):481–96.
Jouannet S, Saint-Pol J, Fernandez L, Nguyen V, Charrin S, Boucheix C, Brou C, Milhiet PE, Rubinstein E. TspanC8 tetraspanins differentially regulate the cleavage of ADAM10 substrates, Notch activation and ADAM10 membrane compartmentalization. Cell Mol Life Sci. 2016;73(9):1895–915.
Chen J, Cheng W, Li J, Wang Y, Chen J, Shen X, Su A, Gan D, Ke L, Liu G, et al. Notch-1 and Notch-3 Mediate Hypoxia-Induced Activation of Synovial Fibroblasts in Rheumatoid Arthritis. Arthritis Rheumatol. 2021;73(10):1810–9.
Noy PJ, Yang J, Reyat JS, Matthews AL, Charlton AE, Furmston J, Rogers DA, Rainger GE, Tomlinson MG. TspanC8 Tetraspanins and A Disintegrin and Metalloprotease 10 (ADAM10) Interact via Their Extracellular Regions: EVIDENCE FOR DISTINCT BINDING MECHANISMS FOR DIFFERENT TspanC8 PROTEINS. J Biol Chem. 2016;291(7):3145–57.
Giri A, Hellwege JN, Keaton JM, Park J, Qiu C, Warren HR, Torstenson ES, Kovesdy CP, Sun YV, Wilson OD, et al. Trans-ethnic association study of blood pressure determinants in over 750,000 individuals. Nat Genet. 2019;51(1):51–62.
van Zuydam NR, Stiby A, Abdalla M, Austin E, Dahlstrom EH, McLachlan S, Vlachopoulou E, Ahlqvist E, Di Liao C, Sandholm N, et al. Genome-Wide Association Study of Peripheral Artery Disease. Circ Genom Precis Med. 2021;14(5):e002862.
van der Poll T, Keogh CV, Guirao X, Buurman WA, Kopf M, Lowry SF. Interleukin-6 gene-deficient mice show impaired defense against pneumococcal pneumonia. J Infect Dis. 1997;176(2):439–44.
Landskron G, De la Fuente M, Thuwajit P, Thuwajit C, Hermoso MA. Chronic inflammation and cytokines in the tumor microenvironment. J Immunol Res. 2014;2014:149185.
Interleukin-6 Receptor Mendelian Randomisation Analysis C, Swerdlow DI, Holmes MV, Kuchenbaecker KB, Engmann JE, Shah T, Sofat R, Guo Y, Chung C, Peasey A et al: The interleukin-6 receptor as a target for prevention of coronary heart disease: a mendelian randomisation analysis. Lancet 2012;379(9822):1214-1224.
Rosa M, Chignon A, Li Z, Boulanger MC, Arsenault BJ, Bosse Y, Theriault S, Mathieu P. A Mendelian randomization study of IL6 signaling in cardiovascular diseases, immune-related disorders and longevity. NPJ Genom Med. 2019;4:23.
Kimura A, Kishimoto T. IL-6: regulator of Treg/Th17 balance. Eur J Immunol. 2010;40(7):1830–5.
Smolen JS, Beaulieu A, Rubbert-Roth A, Ramos-Remus C, Rovensky J, Alecock E, Woodworth T, Alten R, Investigators O. Effect of interleukin-6 receptor inhibition with tocilizumab in patients with rheumatoid arthritis (OPTION study): a double-blind, placebo-controlled, randomised trial. Lancet. 2008;371(9617):987–97.
Ormazabal V, Nair S, Elfeky O, Aguayo C, Salomon C, Zuniga FA. Association between insulin resistance and the development of cardiovascular disease. Cardiovasc Diabetol. 2018;17(1):122.
Dessein PH, Stanwix AE, Joffe BI. Cardiovascular risk in rheumatoid arthritis versus osteoarthritis: acute phase response related decreased insulin sensitivity and high-density lipoprotein cholesterol as well as clustering of metabolic syndrome features in rheumatoid arthritis. Arthritis Res. 2002;4(5):R5.
Kaze AD, Musani SK, Correa A, Bertoni AG, Golden SH, Abdalla M, Echouffo-Tcheugui JB. Insulin resistance, metabolic syndrome, and blood pressure progression among Blacks: the Jackson Heart Study. J Hypertens. 2021;39(11):2200–9.
Abel ED, O’Shea KM, Ramasamy R. Insulin resistance: metabolic mechanisms and consequences in the heart. Arterioscler Thromb Vasc Biol. 2012;32(9):2068–76.
Acknowledgements
The MEGASTROKE project received funding from sources specified at http://www.megastroke.org/acknowledgments.html. Data on coronary artery disease / myocardial infarction has been contributed by CARDIoGRAMplusC4D investigators and has been downloaded from www.CARDIOGRAMPLUSC4D.ORG.
Funding
This work was supported by grants from the National Natural Science Foundation of China (82174208 and 81973663), the Natural Science Foundation of Zhejiang Province (LQ21H260001 and LQ20H260008), the Foundation of Zhejiang Chinese Medical University (KC201905, 2020ZG01, 2020ZG16, and 2021JKZKTS004A), and part of convergence environment [4MENT] funded by UiO: Life Science and Scientia Fellows, European Union’s Horizon2020 Research and Innovation program (#801133 Marie Skłodowska-Curie grant agreement and The CoMorMent project grant agreement No. 847776) and Research Council of Norway (#223273, #324252).
Author information
Authors and Affiliations
Contributions
Conceptualization, S.X., Q.Y., and M.Y.; methodology, S.X., Q.Y., C.W., Y.D., W.C., O.A.A., and M.Y.; formal analysis, S.X., Q.Y., C.W., Y.D., L.B., Z.D., and M.Y.; resources, S.X., Q.Y., and M.Y.; data curation, Q.Y., Y.D., L.B., and Z.D.; writing original draft preparation, S.X. and Q.Y.; writing-review and editing, S.X., Q.Y., O.A.A., and M.Y.; supervision, S.X.; project administration, O.A.A. and M.Y.; funding acquisition, O.A.A. and M.Y. All authors reviewed, revised and approved the final version of the paper.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
OAA is a consultant to HealtLytics. The other authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1: Supplementary Fig. 1.
Polygenic overlap between rheumatoid arthritis and cardiovascular diseases: (A) RA and AF; (B) RA and stroke; (C) RA and HF; and (D) RA and CAD. Each sub-figure contained Venn Diagrams, conditional Q-Q plots, and negative log-likelihood plot, respectively. Venn diagrams showed the polygenic overlap (gray) between RA (blue) and the corresponding CVD phenotype (orange). The numbers in the Venn diagram indicate the estimated number of variants (in thousands) explained 90% of heritability in the corresponding phenotype, followed by the standard error. Conditional Q–Q plots of observed versus expected -log10 P-values in the primary trait as a function of significance of association with a secondary trait at the different threshold of P-values. The Black dotted line indicate the null hypothesis. In the negative log-likelihood plot, minus log-likelihood is calculated for the bivariate model as a function of parameter. Abbreviations: AF, atrial fibrillation; CAD, coronary artery disease; HF, heart failure; RA, rheumatoid arthritis. Table S1. The ICD-9 diagnosis codes, ICD-10 diagnosis codes, and self-reported codes in the UK Biobank for rheumatoid arthritis and cardiovascular diseases. Table S2. Detailed information of summary statistics of rheumatoid arthritis and cardiovascular diseases. Table S3. Basic characteristics of individuals included in the RA-CVD cohort. Table S4. Propensity score and stratified analyses of associations between rheumatoid arthritis and cardiovascular diseases risk. Table S5. Basic characteristics of individuals included in the CVD-RA cohort. Table S6. Propensity score and stratified analyses of associations between cardiovascular diseases and rheumatoid arthritis risk. Table S7. The regions with nominally significant genetic correlation were identified from LAVA analysis. Table S8. The results of the model fit in MeXiR analysis. Table S9. The independent pleiotropic loci based on conjFDR analyses. Table S10. The independent pleiotropic loci based on ASSET analyses. Table S11. The results of histone modification and enhancer enrichment for lead SNPs. Table S12. The results of summary-based Mendelian randomization analyses and colocalization results. Table S13. The enriched GO terms of the mapped genes. Table S14. The enriched KEGG pathway terms of the mapped genes. Table S15. The effects of different combinations of rheumatoid arthritis/cardiovascular disease status and overlapped modifiable risk factors (diastolic blood pressure).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Sun, X., Qian, Y., Cheng, W. et al. Characterizing the polygenic overlap and shared loci between rheumatoid arthritis and cardiovascular diseases. BMC Med 22, 152 (2024). https://doi.org/10.1186/s12916-024-03376-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12916-024-03376-1