[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Next Article in Journal
The N-Linked Glycosylation Asn191 and Asn199 Sites Are Controlled Differently Between PKA Signal Transduction and pEKR1/2 Activity in Equine Follicle-Stimulating Hormone Receptor
Previous Article in Journal
Adenovirus 36 Seropositivity Is Related to Inflammation and Imbalance Between Oxidative Stress and Antioxidant Status Regardless of Body Mass Index in Mexican Population
Previous Article in Special Issue
Bio-Chemoinformatics-Driven Analysis of nsp7 and nsp8 Mutations and Their Effects on Viral Replication Protein Complex Stability
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of Candidate Lung Function-Related Plasma Proteins to Pinpoint Drug Targets for Common Pulmonary Diseases: A Comprehensive Multi-Omics Integration Analysis

1
Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Wenhua West Road, Jinan 250012, China
2
Institute for Medical Dataology, Shandong University, 12550, Erhuan East Road, Jinan 250003, China
3
Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
4
Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
5
Department of Statistics, School of Mathematics, Shandong University, Shanda South Street, Jinan 250100, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Curr. Issues Mol. Biol. 2025, 47(3), 167; https://doi.org/10.3390/cimb47030167
Submission received: 30 December 2024 / Revised: 20 February 2025 / Accepted: 26 February 2025 / Published: 1 March 2025
(This article belongs to the Special Issue Predicting Drug Targets Using Bioinformatics Methods)
Figure 1
<p>Study Workflow. We conducted a comprehensive omics integration analysis under a cutting-edge analytic framework by sequential using PWAS, MR, and COLOC to identify the plasma proteins associated with lung function indices and explore the potential mediating mechanism underlying the effect of proteins on lung diseases. We first identified protein-index associations by PWAS, followed by enrichment analysis and PPI analysis to reveal the underlying biological processes and pathways. Then, we performed MR and COLOC to further identify proteins causally associated with lung function indices followed by replication analysis. Mediation analysis was further performed to investigate the mediating role of these indices regarding the association between proteins and common lung disorders. Finally, we prioritized the potential drug targets. Abbreviations: pQTL, protein quantitative trait loci; GWAS, genome-wide association studies; PWAS, proteome-wide association study; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; MR mendelian randomization; FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; COPD, chronic obstructive pulmonary disease; PPI, protein–protein interaction network.</p> ">
Figure 2
<p>Results of PWAS and subsequent enrichment analysis. We identified a total of 210 protein-index associations and subsequently performed a gene set enrichment analysis using Metascape, with the top 20 significant GO terms or top 10 KEGG pathways displayed. (<b>a</b>) Manhattan plot for PWAS analysis of four lung function indices; (<b>b</b>) Bubble chart for GO enrichment analysis; (<b>c</b>) Bubble chart for KEGG enrichment analysis; (<b>d</b>) PPI with genes enriched in significant pathways. Abbreviations: FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; PPI, Protein–protein interaction network.</p> ">
Figure 3
<p>Results of the two-sample MR analysis. We used the ARIC data and deCODE data for discovery and replication analysis, respectively. The forest plot is utilized to illustrate the β values, <span class="html-italic">p</span>-values, and confidence intervals of mendelian randomization. FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; IVW, inverse variance weighted method.</p> ">
Figure 4
<p>Mediation effects of protein on lung diseases via lung function indices. Mediation analyses to quantify the effects of three proteins on lung diseases via lung function indices. (<b>a</b>) EFEMP1 effect on COPD mediated by FEV1/FVC; (<b>b</b>) MARK3 effect on asthma mediated by FEV1/FVC; (<b>c</b>) NPNT effect on COPD mediated by FEV1/FVC. <math display="inline"><semantics> <mrow> <msub> <mrow> <mi>β</mi> </mrow> <mrow> <mi>E</mi> <mi>M</mi> </mrow> </msub> </mrow> </semantics></math> represents the effect of exposure on mediator, <math display="inline"><semantics> <mrow> <msub> <mrow> <mi>β</mi> </mrow> <mrow> <mi>M</mi> <mi>O</mi> </mrow> </msub> </mrow> </semantics></math> represents the effect of mediator on outcome, <math display="inline"><semantics> <mrow> <msub> <mrow> <mi>β</mi> </mrow> <mrow> <mi>E</mi> <mi>O</mi> </mrow> </msub> </mrow> </semantics></math> represents the total effect of exposure on outcome. IV, instrumental variable; FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; COPD, chronic obstructive pulmonary disease.</p> ">
Review Reports Versions Notes

Abstract

:
The genome-wide association studies (GWAS) of lung disease and lung function indices suffer from challenges to be transformed into clinical interventions, due to a lack of knowledge on the molecular mechanism underlying the GWAS associations. A proteome-wide association study (PWAS) was first performed to identify candidate proteins by integrating two independent largest protein quantitative trait loci datasets of plasma proteins and four large-scale GWAS summary statistics of lung function indices (forced expiratory volume in 1 s (FEV1), forced vital capacity (FVC), FEV1/FVC and peak expiratory flow (PEF)), followed by enrichment analysis to reveal the underlying biological processes and pathways. Then, with a discovery dataset, we conducted Mendelian randomization (MR) and Bayesian colocalization analyses to select potentially causal proteins, followed by a replicated MR analysis with an independent dataset. Mediation analysis was also performed to explore the possible mediating role of these indices on the association between proteins and two common lung diseases (chronic obstructive pulmonary disease, COPD and Asthma). We finally prioritized the potential drug targets. A total of 210 protein–lung function index associations were identified by PWAS, and were significantly enriched in the pulmonary fibrosis and lung tissue repair. Subsequent MR and colocalization analysis identified 59 causal protein-index pairs, among which 42 pairs were replicated. Further mediation analysis identified 3 potential pathways from proteins to COPD or asthma mediated by FEV1/FVC. The mediated proportion ranges from 68.4% to 82.7%. Notably, 24 proteins were reported as druggable targets in Drug Gene Interaction Database, among which 8 were reported to interact with drugs, including FKBP4, GM2A, COL6A3, MAPK3, SERPING1, XPNPEP1, DNER, and FER. Our study identified the crucial plasma proteins causally associated with lung functions and highlighted potential mediating mechanism underlying the effect of proteins on common lung diseases. These findings may have an important insight into pathogenesis and possible future therapies of lung disorders.

1. Introduction

Impaired lung function could promote the risk of various diseases [1,2,3,4] and has been proved as a crucial predictor of mortality [5,6]. Many lung diseases are difficult to detect in the early stages and are often under-diagnosed without spirometry [7]. Indeed, the treatment and monitoring of many chronic respiratory diseases is a gradual and time-consuming process [8,9]. Therefore, the corresponding non-invasive spirometry assessment, such as forced expiratory volume in 1 s (FEV1) and forced vital capacity (FVC), provides the opportunity to investigate lung functions and pulmonary diseases more effectively and conveniently, as these indices can reflect the status of lung function and are easily to be measured and obtained. Genome-wide association studies (GWAS) have identified thousands of variants and revealed the significant single nucleotide polymorphisms (SNPs) for lung diseases along with lung function indices [10,11]. However, it remains challenging to utilize these GWAS associations to achieve in-depth interventional design in clinical practice, due to the lack of knowledge on the underlying molecular mechanism of the GWAS associations.
One promising approach to facilitate the interpretation on how the GWAS risk variants contribute to the outcome traits is to integrate GWAS with functional genomics. Among sorts of functional molecules, proteins always undertake the major function of biological processes and are likely to be the alternative and preferable drug targets [12,13]. Indeed, plasma proteins, especially those involved in immune or inflammatory responses, have been suggested to contribute to lung functions and related disorders in several investigations [14,15,16,17]. Although these studies have reported the associations between plasma proteins and lung functions, the scales of proteins included in these analyses are limited compared to whole plasma proteome.
More proteins should be involved to facilitate the exploration of more protein–lung function associations. In addition, these studies failed to tease out the relationship between the proteins, lung function indices and common lung disease.
Several statistical genetics methods have been developed to detect the potential trait-related proteins. A proteome-wide association Study (PWAS) is proposed to identify the proteins associated with disease or disease-related traits [18]. Mendelian randomization (MR) analysis has become an efficient tool to identify the causal relationship between proteins and disease, while colocalization (COLOC) analysis is used to identify the shared causal variants between proteins and diseases [19]. Mediation analysis [20] is always carried out to figure out the potential mediating mechanism. All these analyses could, to some extent, identify the potential proteins that are causally associated with complex disease and benefit the proteomics-driven drug target discovery, thus facilitating initial drug development. Recently, more attentions have been paid on investigating the proteins associated with lung functions. However, these limited studies either lack independent replication analysis [15] or lack comprehensive analysis [21], which would make the findings not robust against different dataset or different analysis methods.
In this study, we integrated two independent, publicly available largest protein quantitative trait loci (pQTL) datasets of plasma proteins to date and four large-scale GWAS summary statistics of lung function indices (FEV1, FVC, FEV1/FVC and peak expiratory flow [PEF]), to identify lung function indices-associated plasma proteins under a cutting-edge analytic framework by sequentially using PWAS, MR, and COLOC and mediation analysis. Specifically, we first performed PWAS to identify candidate proteins, followed by enrichment analysis to reveal the underlying biological processes and pathways. Then, we conducted MR analysis as well as COLOC analysis with discovery data to further select potentially causal proteins, followed by replicated MR analysis with replication data. Mediation analysis was also performed to explore if these lung function indices can serve as the possible mediators for the association between proteins and two common lung diseases, including chronic obstructive pulmonary disease (COPD) and asthma. Finally, we prioritized the potential drug targets.

2. Materials and Methods

The overall study design and comprehensive analysis procedure are displayed in Figure 1.

2.1. GWAS Data Source

We utilized the largest publicly available GWAS summary statistics with European ancestry to date for lung function indices and diseases. Specifically, the GWASs of FEV1, FVC and FEV1/FVC were undertaken with 321,047 individuals from UK Biobank and 79,055 individuals from the SpiroMeta Consortium. The GWAS of PEF was conducted with 321,047 individuals from UK Biobank and 24,218 individuals from the SpiroMeta Consortium [10]. The GWAS of COPD [22] (21,007 cases and 179,689 controls) and asthma [23] (56,087 cases and 428,511 controls) were both from the UK Biobank. All GWAS were approved by relevant ethic committees. Additional information including diagnostic criteria for cases, covariates adjusted during quality control were provided in Table S1. Further genotyping methods and statistical analyses can be found in the original publications.

2.2. Human Plasma pQTL Data

The pQTL studies typically investigate the association between genetic variants and protein levels, which have recently been integrated with GWASs to elucidate the underlying mechanisms of complex traits [24]. In this study, two independent large-scale pQTL datasets with European ancestry were used as discovery and replication data sets, respectively. The relative concentrations of plasma proteins or protein complexes analyzed in both data were measured by Slow Off-Rate Modified Aptamers (SOMAmers) assay on the SomaLogic version-4 platform. The discovery dataset was derived from 7213 European Americans in the Atherosclerosis Risk in Communities (ARIC) study [25] including cis-pQTL data for 4657 plasma protein SOMAmers while the replication data were obtained from 35,559 Icelanders in deCODE genetics involving 27 million variants and 4907 plasma protein SOMAmers [26]. Both studies first regressed the plasma proteins on the covariates and obtain the residuals, and then performed the association analysis between the rank-inverse normalized residuals and genetic variants. Additional information including covariates adjusted during quality control were provided in Table S1. Further genotyping methods and statistical analyses can be found in the original publications.

2.3. Statistical Analysis

2.3.1. Proteome-Wide Association Study (PWAS)

PWAS aims to investigate the association between the genetically predicted proteins and complex traits through integrating the pQTL and GWAS. Typically, the z-score between protein and complex trait in PWAS was a linear sum of GWAS summary statistics multiplied by imputation weights of common variants at a certain locus, with the linkage disequilibrium (LD) accounted for. In this study, we conducted PWAS using FUSION pipeline [27] along with the predicted weights of 1348 cis-heritable proteins, which were identified to have significant nonzero cis-heritability (p < 0.01) from ARIC data. Trait specifically Bonferroni corrected p-value (0.05/1348) was used to control the multiple testing and declare the significance.

2.3.2. Enrichment Analysis and Protein–Protein Interaction Network

To explore the biological functional interpretations of the proteins that are associated with the lung function indices from PWAS, we performed a gene set enrichment analysis using Metascape. Metascape implements enrichment analysis by identifying the gene sets whose members are significantly overrepresented using hypergeometric test and can eliminate confounding data interpretation issues by absorbing the majority of redundancies into representative clusters [28]. Here, we mainly focused on the biological process in Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG). We performed the analysis with default parameter setting and kept the top 20 items for GO and top 10 for KEGG ranked by p-values. To delve further into the intricate web of interactions among the genes enriched in significant pathways from GO and KEGG, we employed protein–protein interaction (PPI) network analysis using STRING tool [29].

2.3.3. Mendelian Randomization (MR) Analysis

For the significant protein–lung function index pairs identified by PWAS, we performed MR analyses, coupled with a series of sensitivity analyses, to evaluate the potential causal relationship between protein and lung function indices. For each pair, we first carried out discovery MR analysis using ARIC pQTL [25] data, and then conducted replication MR analysis using deCODE pQTL data [26]. Both MR analyses followed the strict procedure below and conformed to the STROBE-MR Statement [30], mainly involving the selection and assessment of F-statistics, primary MR analysis, and sensitivity analysis.
For instrumental variable (IV) selection, given that trans-pQTLs are likely to produce the pleiotropy issue and violate the basic assumption of MR analysis in assessing the potential causal effect of protein on the trait, we selected variants from the cis-region (500 kb upstream and downstream of the transcription start site of the protein-coding gene) of each protein. We adopted a less strict p-value threshold (p < 1 × 10−5) to determine significant pQTLs and obtained the independent cis-pQTLs (r2 < 0.01 in the 1 Mb cis-region) using the LD clumping procedure with the European LD reference panel in the 1000 Genomes Project. We further harmonized effect alleles of IVs in both exposure and outcome data. We calculated the F-statistic to evaluate the strength of instruments with values greater than 10 as evidence against weak instruments [31]. Further MR Steiger directionality test was applied to assess whether the MR analysis was biased by reverse causation.
For primary MR analysis, taking the different number of IVs for each protein into account, we used the Wald Ratio method for the proteins with only one IV, the fixed-effect inverse-variance weighted method (IVW) for proteins with two or three IVs, and the random-effect IVW method for proteins with four or more IVs. Of note, the random-effect model is able to account for heterogeneity across IVs by allowing for over-dispersion of the regression model. Further sensitivity analyses, including MR-Egger, leave-one-out and heterogeneity test, were also conducted to assess the robustness of the results. p-values were corrected using the Benjamani–Hochberg false discovery rate (BH-FDR) method, with FDR < 0.05 being declared to be significant. Similar MR analysis were further adopted to explore the causal relationship between potential causal proteins related to lung functions and common pulmonary diseases. These analyses were implemented using packages TwoSampleMR v.0.5.6, MendelianRandomization v.0.6.0 in the R v.4.2.1.

2.3.4. Bayesian Colocalization Analysis

For the potential causal protein–trait pairs identified by MR analysis with discovery dataset, we further performed a colocalization analysis using the COLOC method [32] implemented in the R package COLOC (version 5.1.2) with the default setting. The colocalization analysis was able to investigate whether the association between protein and lung function indices was driven by common variants and help to examine the bias in MR analysis due to LD. The COLOC is a Bayesian method to assess the potential genetic patterns for each protein-trait pair with five hypotheses, of which hypothesis 4 (PP.H4) is our focus to reflect the setting that protein and trait share one causal SNP. We considered a significant colocalization signal with PP.H4 > 0.75. These analyses were implemented using packages COLOC v.5.2.0 in the R v.4.2.1.

2.3.5. Mediation Analysis

To further explore whether these lung function indices could serve as the potential mediators for the effect of proteins on two common lung diseases, including COPD and asthma, we performed a mediation analysis under the MR framework with the product of coefficients method among proteins with potential causal effects on both lung functions and lung diseases. In the product method, the indirect effect could be estimated as the effect of proteins on the outcome through the mediator and confidence intervals were derived from the bootstrap approach. In addition, the mediation proportion will be calculated if the direction of the total effect and indirect effect are concordant [33].

2.3.6. Candidate Druggable Targets

For proteins identified in the MR analysis and colocalization, we used Drug-Gene Interaction Database (DGIdb) (version 4.2.0) [34] and molecular docking [35] to explore whether they are the targets of the existing drugs or druggable gene targets, respectively. DGIdb integrates and normalizes information on drug-gene interactions from over 40 different resources. We first used it to prioritize drug target for disease. Subsequently, for the proteins that could interact with drugs from DGIdb, we further explored their molecular structure by molecular docking. The three-dimensional (3D) structures of drugs and proteins were obtained from PubChem (https://pubchem.ncbi.nlm.nih.gov/, accessd on 14 Febuary 2025), Protein Data Bank (http://www.rcsb.org/, accessed on 14 Febuary 2025), and AlphaFold (https://alphafold.com/, accessed on 14 Febuary 2025) predicted structures were used due to the lack of experimental data. Protein structures were preprocessed in PyMOL 2.4 by removing crystallographic water and adding polar hydrogens. AutoDock Tools 1.5.6 were used to prepare PDBQT files and define docking boxes. Molecular docking was performed using AutoDock Vina 1.2.2 (http://autodock.scripps.edu/, accessed on 16 Febuary 2025), with binding affinity assessed by binding free energy (ΔG, kcal/mol). Values below −7 kcal/mol indicated strong binding activity.

3. Results

3.1. PWAS Identified 210 Protein-Index Associations

By integrating the GWAS of four lung functions indices with the imputation of 1348 cis-heritable proteins from the ARIC pQTL data, we identified a total of 210 protein-index associations with Bonferroni-corrected p-value less than 0.05/1348, involving 130 unique proteins (Figure 2a with details are available in Table S2). Among these 130 unique proteins, 10 proteins are associated with asthma, while 6 proteins are linked to COPD (Table S3). Of note, 54 plasma proteins were related with two or more indices and 8 of them were simultaneously associated with FEV1, FVC, PEF and FEV1/FVC, suggesting the common pathogenesis of these four indices. For example, the Butyrophilin subfamily 3 member A3 encoded by gene BTN3A3, acting in T-cell responses in the adaptive immune response, was positively associated with these four indices (z-score > 0), while Complement C2 encoded by gene C2, as a part of pathway of the complement system, was negatively associated with them (z-score < 0). The enrichment analysis for the significant genes identified by PWAS revealed that GO terms relevant with pulmonary fibrosis and lung tissue repair were particularly prominent (Figure 2b), such as collagen-containing extracellular matrix (GO:0062023, p = 2.09 × 10−19) and morphogenesis of a branching epithelium (GO:0061138, p = 1.10 × 10−10). In addition, the proteins were mostly enriched in the pathways including PI3K-Akt signaling pathway (KEGG:hsa04151, p = 1.87× 10−5) and Wnt signaling pathway (KEGG:hsa04310, p = 7.54 × 10−4), which were well known to be related with the pathogenesis of lung function (Figure 2c). A total of 112 pathway-enriched proteins were subjected to PPI analysis. As shown in Figure 2d, we identified a significant network consisting of 66 enriched genes and 106 edges (p < 1 ×10−16), including the well-known lung function related genes IL1B [36,37], and SHH [38,39].

3.2. MR and COLOC Confirmed 59 Protein-Index Causal Relationships

For 210 protein-index pairs identified by PWAS, we first conducted MR discovery analysis using IVs derived from ARCI pQTL, to explore the potential causal relationship between the proteins and these indices. Initially, 231 instrumental variants for 130 proteins were extracted, among which 20 proteins had only one IV, 31 proteins had two or three IVs and 78 proteins had four or more IVs. Two pairs were excluded due to the lack of suitable IVs. F-statistics ranged from 19.8 to 595.8 (Table S4), indicating less weak instrument bias. After harmonizing effect alleles, 208 plasma protein-index pairs were subjected to the MR analysis and several sensitivity analyses. MR-Steiger tests suggested that there was no reverse causal relationship, and MR-Egger did not show any horizontal pleiotropy (Table S5). We finally obtained 175 significant protein-index pairs with FDR p-value < 0.05 (Figures S62–S65).
Given that the false MR findings can be produced if the association between the protein and indices was driven by distinct variants with LD, we implemented COLOC for 175 protein-function pairs identified from MR analysis to remove such potentially false signals (Table S7 and Figures S1–S61), and finally, 59 pairs showing significant evidence of colocalization remained, including 11 proteins for FEV1, 13 proteins for FVC, 14 proteins for PEF, and 21 proteins for FEV1/FVC (Table 1).

3.3. Forty-Two Protein-Index Causal Relationships Were Replicated

We further conducted replication MR analysis for the 58 protein-function pairs with deCODE pQTL data, with the remaining pair excluded due to the absence in the replication data. A total of 473 valid IVs were selected, which is 2.04 times higher than that from ARIC data, possibly due to the larger sample size. Following the same procedure above, 42 protein-function pairs were replicated with consistent effect estimates from discovery MR analysis, among which 23 pairs show positive associations and 19 pairs shows negative associations (Figure 3). Details are illustrated in Table 1, Tables S7 and S8 and Figures S66–S69.

3.4. Mediation Analysis Highlighted 3 Potential Pathways

For the 42 protein-index pairs obtained from both the discovery analysis and replication analysis, we further performed MR mediation analysis to assess the potential mediating roles of these lung function indices in the causal pathway from proteins to COPD or Asthma. Firstly, in MR analysis between proteins and pulmonary diseases, three proteins, including EFEMP1, GM2A, and NPNT, were identified that exhibit a causal relationship with COPD while five proteins were highlighted that have causal effect on asthma (Tables S9 and S10). Subsequently, with these proteins related to pulmonary diseases, we found three potential mediation pathways including indirect effect of EFEMP1 on COPD via FEV1/FVC, an indirect effect of MAPK3 on Asthma via FEV1/FVC and an indirect effect of NPNT on COPD risk via FEV1/FVC. The mediated proportions for these three pathways are 68.4%, 71.4%, and 82.7%, respectively (Figure 4).

3.5. Drug-Gene Interaction and Molecular Docking Discovery

Among the 34 unique protein-coding genes with significant evidence from both discovery and replication analysis, 24 were annotated as druggable genes (Table S11) in DGIdb and 8 were reported to interact with drugs, including FKBP4, FER, DENR, XPNPEP1, COL6A3, GM2A, MAPK3 and SERPING1 (Table S12). For the proteins encoded by above 8 genes interacting with drugs, we obtained 46 protein-drug pairs with 3D structures in molecular docking analysis. Finally, 3 proteins (MAPK3, FER, XPNPEP1) paired with 26 drugs were found with the docking criteria, where MAPK3 exhibited a strong binding affinity with 15 drugs, FER interacted with 10 drugs, and XPNPEP1 with 1 drug (Table S13).

4. Discussion

In our study, through comprehensive and systematic omics integration analysis, we highlighted the role of plasma proteins in lung function and illuminated the potential mediating mechanism underlying the effect of proteins on common lung diseases. We have identified a total of 34 candidate proteins (24 druggable targets) with putatively causal effects on lung function, which could serve as novel therapeutic targets.
Some genes identified in this study, including SFRP1, NPNT and EFEMP1, were previously reported from the mouse model or allied experiments or association analysis, while the others were biologically plausible and are more likely to be vital for lung function. In particular, soluble frizzled-related protein-1, encoded by SFRP1, was demonstrated in the adult lung, leading to the destruction of lung tissue [40], and functioned as antagonizing the Wnt signaling which has a great impact in lung development and the progression of lung diseases including COPD, IPF and emphysema [41,42]. Several studies showed that the Nephronectin encoded by NPNT gene, as an extracellular protein, was highly expressed in human lung pneumocytes and alveolar fibroblasts cells [43,44] and necessary to maintain the normal development of lung [45,46] as well as contribute to COPD risk. In addition, the mRNA for the Fibulin-3 encoded by EFEMP1 gene was discovered mostly abundant in the lung [47,48] in the mouse and associated with the incidence of restrictive physiology [49]. In addition, many significant genes were involved in several biological processes related to lung function disorder, for example, glutathione synthetase (GSS) was involved in the synthesis of glutathione (GSH), which could protect lung epithelium from damage in response to oxidants and inflammation [50]. Collagen Type VI Alpha 3 Chain (COL6A3) was confirmed to be involved in the construction of type VI collagen in most connective tissues [51,52].
Of note, plasma proteins identified by PWAS were significantly enriched in immune-related biological processes, which implied the close link between peripheral immunity and lung functions. Complex communications between immune system and respiratory disease have important etiological and clinical implications for lung disorders and have been of great interest in the last two decades. Some previous studies demonstrated that inflammation-sensitive plasma proteins were correlated with FVC or FEV1 [16,53]. Recent omics studies have provided a series of biomarkers of systemic inflammation reflecting important physiological roles in lung physiology, among which plasma C-reactive protein (CRP) may participate in the pathogenesis of various pulmonary disorders [54,55]. In addition, impaired pulmonary function resulted from immune disorders may play an important role in COPD and lung cancer [56,57].
Interestingly, we found that all significant proteins identified in the mediation analysis could serve as potential drug targets, with MAPK3-encoded protein being associated with 40 available drugs. Protein encoded by MAPK3 was found to have an indirect effect on asthma mediated by FEV1/FVC. The MAPK/ERK pathway, including MAPK3, plays a role in the proliferation of airway smooth muscle cells, contributing to airway remodeling [58,59]. Thus, MAPK3-encoded protein may improve lung function and, in turn, ameliorate asthma. Similarly, both proteins encoded by EFEMP1 and NPNT exhibited indirect effects on chronic obstructive pulmonary disease (COPD) by FEV1/FVC. These two proteins are involved in the regulating extracellular matrix (ECM), where its destruction and remodeling contribute to tissue damage and loss of airway elasticity [47,60]. Notably, the mediation proportions for these three proteins were 71.4%, 68.4%, and 82.7%, respectively, which indicated that there remained certain proportions of the protective effects of proteins not fully explained by lung function indices. These findings suggest that these proteins may exert protective effects independently of lung function, which is further supported by existing research.
One strength of our study was that we used two publicly available large-scale independent pQTL datasets of plasma protein and many GWASs of lung function indices, and implemented replication analysis to validate the finding. More importantly, other than focusing on the PWAS association, we sequentially used MR and COLOC analysis to further explore the protein-disorder causal associations to enhance the evidence of our findings. In addition, the mediation analysis further illustrated the potential mediating mechanism regarding the association between plasma protein and common lung diseases. However, our study is not without limitations. First, PWAS analysis was performed on only 1,348 proteins with significant cis-heritability due to the availability of genetic imputation model of protein expression; however, the remaining proteins may also play important roles in lung function. Second, all the analyses were based on European populations due to the data accessibility, and caution should be made when extending these findings to other populations. Third, due to limitations in accessing individual-level data of GWAS summary data, we were unable to consider the effects of medication when examining the two common chronic respiratory diseases. Fourth, all GWAS data on lung functions were derived from large populations, which include both healthy individuals and those with relevant diseases. The inclusion of disease conditions may introduce potential confounding, highlighting the importance of conducting analysis in different disease groups.

5. Conclusions

In summary, our findings provide further biological insights into the pathogenesis of impaired lung functions or common lung diseases through comprehensive analysis. We not only identified certain significant plasma proteins putatively causally associated with lung functions but also highlighted the potential mediating roles of lung function indices in the causal pathway from proteins to the common lung diseases. Meanwhile, these findings will benefit the development of preventive or therapeutic drugs for lung disorders, and explore the repurposing potential of existing drugs in the treatment of lung diseases.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cimb47030167/s1.

Author Contributions

J.J.: conceptualization, funding acquisition, writing—review and editing, supervision, project administration. W.G.: conceptualization, writing—review and editing, supervision, project administration. Y.Z.: data curation, investigation, formal analysis, visualization, writing—original draft. L.S.: data curation, investigation, formal analysis, writing—review and editing. R.Y.: methodology, visualization, writing—original draft. S.L.: visualization. Y.C.: visualization. P.G.: writing—review and editing. L.L.: writing—review and editing. Z.Y.: supervision, writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (82473738).

Institutional Review Board Statement

Ethical approval is not required.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated and/or analyzed during the current study are publicly available. GWAS summary data for FVC, FEV1, FEV1/FVC, PEF, Asthma and COPD are available from GWAS Catalog (study accession: GCST007429, GCST007432, GCST007431, GCST007430, GCST90038616, GCST90016586) at https://www.ebi.ac.uk/gwas/ (accessed on 29 December 2024). The two pQTL data are available from Atherosclerosis Risk in Communities study at http://nilanjanchatterjeelab.org/pwas/ (accessed on 29 December 2024) and deCODE genetics at https://www.decode.com/summarydata/ (accessed on 29 December 2024).

Acknowledgments

We would like to thank GWAS Catalog for providing us with the FVC, FEV1, FEV1/FVC, PEF, Asthma and COPD GWAS summary data, the Atherosclerosis Risk in Communities (ARIC) and deCODE genetics for providing the pQTL summary data.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Silvestre, O.M.; Nadruz, W., Jr.; Querejeta Roca, G.; Claggett, B.; Solomon, S.D.; Mirabelli, M.C.; London, S.J.; Loehr, L.R.; Shah, A.M. Declining Lung Function and Cardiovascular Risk: The ARIC Study. J. Am. Coll. Cardiol. 2018, 72, 1109–1122. [Google Scholar] [CrossRef] [PubMed]
  2. Lutsey, P.L.; Chen, N.; Mirabelli, M.C.; Lakshminarayan, K.; Knopman, D.S.; Vossel, K.A.; Gottesman, R.F.; Mosley, T.H.; Alonso, A. Impaired Lung Function, Lung Disease, and Risk of Incident Dementia. Am. J. Respir. Crit. Care Med. 2019, 199, 1385–1396. [Google Scholar] [CrossRef] [PubMed]
  3. Marcon, A.; Locatelli, F.; Dharmage, S.C.; Svanes, C.; Heinrich, J.; Leynaert, B.; Burney, P.; Corsico, A.; Caliskan, G.; Calciano, L.; et al. The coexistence of asthma and COPD: Risk factors, clinical history and lung function trajectories. Eur. Respir. J. 2021, 58, 33863744. [Google Scholar] [CrossRef]
  4. Dharmage, S.C.; Bui, D.S.; Walters, E.H.; Lowe, A.J.; Thompson, B.; Bowatte, G.; Thomas, P.; Garcia-Aymerich, J.; Jarvis, D.; Hamilton, G.S.; et al. Lifetime spirometry patterns of obstruction and restriction, and their risk factors and outcomes: A prospective cohort study. Lancet Respir. Med. 2023, 11, 273–282. [Google Scholar] [CrossRef]
  5. Agusti, A.; Noell, G.; Brugada, J.; Faner, R. Lung function in early adulthood and health in later life: A transgenerational cohort analysis. Lancet Respir. Med. 2017, 5, 935–945. [Google Scholar] [CrossRef]
  6. Vasquez, M.M.; Zhou, M.; Hu, C.; Martinez, F.D.; Guerra, S. Low Lung Function in Young Adult Life Is Associated with Early Mortality. Am. J. Respir. Crit. Care Med. 2017, 195, 1399–1401. [Google Scholar] [CrossRef] [PubMed]
  7. Joo, M.J.; Au, D.H.; Lee, T.A. Use of spirometry in the diagnosis of chronic obstructive pulmonary disease and efforts to improve quality of care. Transl. Res. 2009, 154, 103–110. [Google Scholar] [CrossRef]
  8. Bailey, K.L. The importance of the assessment of pulmonary function in COPD. Med. Clin. N. Am. 2012, 96, 745–752. [Google Scholar] [CrossRef]
  9. Majellano, E.C.; Clark, V.L.; Winter, N.A.; Gibson, P.G.; McDonald, V.M. Approaches to the assessment of severe asthma: Barriers and strategies. J. Asthma Allergy 2019, 12, 235–251. [Google Scholar] [CrossRef]
  10. Shrine, N.; Guyatt, A.L.; Erzurumluoglu, A.M.; Jackson, V.E.; Hobbs, B.D.; Melbourne, C.A.; Batini, C.; Fawcett, K.A.; Song, K.; Sakornsakolpat, P.; et al. New genetic signals for lung function highlight pathways and chronic obstructive pulmonary disease associations across multiple ancestries. Nat. Genet. 2019, 51, 481–493. [Google Scholar] [CrossRef]
  11. Wyss, A.B.; Sofer, T.; Lee, M.K.; Terzikhan, N.; Nguyen, J.N.; Lahousse, L.; Latourelle, J.C.; Smith, A.V.; Bartz, T.M.; Feitosa, M.F.; et al. Multiethnic meta-analysis identifies ancestry-specific and cross-ancestry loci for pulmonary function. Nat. Commun. 2018, 9, 2976. [Google Scholar] [CrossRef] [PubMed]
  12. Al-Nesf, M.A.Y.; Abdesselem, H.B.; Bensmail, I.; Ibrahim, S.; Saeed, W.A.H.; Mohammed, S.S.I.; Razok, A.; Alhussain, H.; Aly, R.M.A.; Al Maslamani, M.; et al. Prognostic tools and candidate drugs based on plasma proteomics of patients with severe COVID-19 complications. Nat. Commun. 2022, 13, 946. [Google Scholar] [CrossRef]
  13. Santos, R.; Ursu, O.; Gaulton, A.; Bento, A.P.; Donadi, R.S.; Bologa, C.G.; Karlsson, A.; Al-Lazikani, B.; Hersey, A.; Oprea, T.I.; et al. A comprehensive map of molecular drug targets. Nat. Rev. Drug Discov. 2017, 16, 19–34. [Google Scholar] [CrossRef]
  14. Sunyer, J.; Pistelli, R.; Plana, E.; Andreani, M.; Baldari, F.; Kolz, M.; Koenig, W.; Pekkanen, J.; Peters, A.; Forastiere, F. Systemic inflammation, genetic susceptibility and lung function. Eur. Respir. J. 2008, 32, 92–97. [Google Scholar] [CrossRef] [PubMed]
  15. Keefe, J.; Yao, C.; Hwang, S.J.; Courchesne, P.; Lee, G.Y.; Dupuis, J.; Mizgerd, J.P.; O’Connor, G.; Washko, G.R.; Cho, M.H.; et al. An Integrative Genomic Strategy Identifies sRAGE as a Causal and Protective Biomarker of Lung Function. Chest 2022, 161, 76–84. [Google Scholar] [CrossRef]
  16. Engstrom, G.; Lind, P.; Hedblad, B.; Wollmer, P.; Stavenow, L.; Janzon, L.; Lindgarde, F. Lung function and cardiovascular risk: Relationship with inflammation-sensitive plasma proteins. Circulation 2002, 106, 2555–2560. [Google Scholar] [CrossRef]
  17. Dahl, M.; Tybjaerg-Hansen, A.; Vestbo, J.; Lange, P.; Nordestgaard, B.G. Elevated plasma fibrinogen associated with reduced pulmonary function and increased risk of chronic obstructive pulmonary disease. Am. J. Respir. Crit. Care Med. 2001, 164, 1008–1011. [Google Scholar] [CrossRef] [PubMed]
  18. Wingo, T.S.; Liu, Y.; Gerasimov, E.S.; Gockley, J.; Logsdon, B.A.; Duong, D.M.; Dammer, E.B.; Lori, A.; Kim, P.J.; Ressler, K.J.; et al. Brain proteome-wide association study implicates novel proteins in depression pathogenesis. Nat. Neurosci. 2021, 24, 810–817. [Google Scholar] [CrossRef]
  19. Zhao, H.; Rasheed, H.; Nost, T.H.; Cho, Y.; Liu, Y.; Bhatta, L.; Bhattacharya, A.; Global Biobank Meta-analysis, I.; Hemani, G.; Davey Smith, G.; et al. Proteome-wide Mendelian randomization in global biobank meta-analysis reveals multi-ancestry drug targets for common diseases. Cell Genom. 2022, 2, 100195. [Google Scholar] [CrossRef]
  20. Chen, L.; Peters, J.E.; Prins, B.; Persyn, E.; Traylor, M.; Surendran, P.; Karthikeyan, S.; Yonova-Doing, E.; Di Angelantonio, E.; Roberts, D.J.; et al. Systematic Mendelian randomization using the human plasma proteome to discover potential therapeutic targets for stroke. Nat. Commun. 2022, 13, 6143. [Google Scholar] [CrossRef]
  21. Rydell, A.; Nowak, C.; Janson, C.; Lisspers, K.; Stallberg, B.; Iggman, D.; Leppert, J.; Hedberg, P.; Sundstrom, J.; Ingelsson, E.; et al. Plasma proteomics and lung function in four community-based cohorts. Respir. Med. 2021, 176, 106282. [Google Scholar] [CrossRef]
  22. Kim, W.; Prokopenko, D.; Sakornsakolpat, P.; Hobbs, B.D.; Lutz, S.M.; Hokanson, J.E.; Wain, L.V.; Melbourne, C.A.; Shrine, N.; Tobin, M.D.; et al. Genome-Wide Gene-by-Smoking Interaction Study of Chronic Obstructive Pulmonary Disease. Am. J. Epidemiol. 2021, 190, 875–885. [Google Scholar] [CrossRef]
  23. Donertas, H.M.; Fabian, D.K.; Valenzuela, M.F.; Partridge, L.; Thornton, J.M. Common genetic associations between age-related diseases. Nat. Aging 2021, 1, 400–412. [Google Scholar] [CrossRef] [PubMed]
  24. Pietzner, M.; Wheeler, E.; Carrasco-Zanini, J.; Cortes, A.; Koprulu, M.; Worheide, M.A.; Oerton, E.; Cook, J.; Stewart, I.D.; Kerrison, N.D.; et al. Mapping the proteo-genomic convergence of human diseases. Science 2021, 374, eabj1541. [Google Scholar] [CrossRef] [PubMed]
  25. Zhang, J.; Dutta, D.; Kottgen, A.; Tin, A.; Schlosser, P.; Grams, M.E.; Harvey, B.; Consortium, C.K.; Yu, B.; Boerwinkle, E.; et al. Plasma proteome analyses in individuals of European and African ancestry identify cis-pQTLs and models for proteome-wide association studies. Nat. Genet. 2022, 54, 593–602. [Google Scholar] [CrossRef] [PubMed]
  26. Ferkingstad, E.; Sulem, P.; Atlason, B.A.; Sveinbjornsson, G.; Magnusson, M.I.; Styrmisdottir, E.L.; Gunnarsdottir, K.; Helgason, A.; Oddsson, A.; Halldorsson, B.V.; et al. Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet. 2021, 53, 1712–1721. [Google Scholar] [CrossRef] [PubMed]
  27. Gusev, A.; Ko, A.; Shi, H.; Bhatia, G.; Chung, W.; Penninx, B.W.; Jansen, R.; de Geus, E.J.; Boomsma, D.I.; Wright, F.A.; et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 2016, 48, 245–252. [Google Scholar] [CrossRef]
  28. Zhou, Y.; Zhou, B.; Pache, L.; Chang, M.; Khodabakhshi, A.H.; Tanaseichuk, O.; Benner, C.; Chanda, S.K. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 2019, 10, 1523. [Google Scholar] [CrossRef]
  29. Szklarczyk, D.; Kirsch, R.; Koutrouli, M.; Nastou, K.; Mehryary, F.; Hachilif, R.; Gable, A.L.; Fang, T.; Doncheva, N.T.; Pyysalo, S.; et al. The STRING database in 2023: Protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023, 51, D638–D646. [Google Scholar] [CrossRef]
  30. Skrivankova, V.W.; Richmond, R.C.; Woolf, B.A.R.; Davies, N.M.; Swanson, S.A.; VanderWeele, T.J.; Timpson, N.J.; Higgins, J.P.T.; Dimou, N.; Langenberg, C.; et al. Strengthening the reporting of observational studies in epidemiology using mendelian randomisation (STROBE-MR): Explanation and elaboration. BMJ 2021, 375, n2233. [Google Scholar] [CrossRef]
  31. Stock, J.H.; Wright, J.H.; Yogo, M. A Survey of Weak Instruments and Weak Identification in Generalized Method of Moments. J. Bus. Econ. Stat. 2002, 20, 518–529. [Google Scholar] [CrossRef]
  32. Giambartolomei, C.; Vukcevic, D.; Schadt, E.E.; Franke, L.; Hingorani, A.D.; Wallace, C.; Plagnol, V. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014, 10, e1004383. [Google Scholar] [CrossRef]
  33. Carter, A.R.; Sanderson, E.; Hammerton, G.; Richmond, R.C.; Davey Smith, G.; Heron, J.; Taylor, A.E.; Davies, N.M.; Howe, L.D. Mendelian randomisation for mediation analysis: Current methods and challenges for implementation. Eur. J. Epidemiol. 2021, 36, 465–478. [Google Scholar] [CrossRef] [PubMed]
  34. Freshour, S.L.; Kiwala, S.; Cotto, K.C.; Coffman, A.C.; McMichael, J.F.; Song, J.J.; Griffith, M.; Griffith, O.L.; Wagner, A.H. Integration of the Drug-Gene Interaction Database (DGIdb 4.0) with open crowdsource efforts. Nucleic Acids Res. 2021, 49, D1144–D1151. [Google Scholar] [CrossRef]
  35. Eberhardt, J.; Santos-Martins, D.; Tillack, A.F.; Forli, S. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J. Chem. Inf. Model 2021, 61, 3891–3898. [Google Scholar] [CrossRef]
  36. Tabei, Y.; Nakajima, Y. IL-1beta-activated PI3K/AKT and MEK/ERK pathways coordinately promote induction of partial epithelial-mesenchymal transition. Cell Commun. Signal. 2024, 22, 392. [Google Scholar] [CrossRef] [PubMed]
  37. Zhang, S.; Fan, Y.; Qin, L.; Fang, X.; Zhang, C.; Yue, J.; Bai, W.; Wang, G.; Chen, Z.; Renz, H.; et al. IL-1beta augments TGF-beta inducing epithelial-mesenchymal transition of epithelial cells and associates with poor pulmonary function improvement in neutrophilic asthmatics. Respir. Res. 2021, 22, 216. [Google Scholar] [CrossRef]
  38. Zeng, L.H.; Barkat, M.Q.; Syed, S.K.; Shah, S.; Abbas, G.; Xu, C.; Mahdy, A.; Hussain, N.; Hussain, L.; Majeed, A.; et al. Hedgehog Signaling: Linking Embryonic Lung Development and Asthmatic Airway Remodeling. Cells 2022, 11, 1774. [Google Scholar] [CrossRef]
  39. Hu, Y.; Peng, L.; Zhuo, X.; Yang, C.; Zhang, Y. Hedgehog Signaling Pathway in Fibrosis and Targeted Therapies. Biomolecules 2024, 14, 1485. [Google Scholar] [CrossRef]
  40. Foronjy, R.; Imai, K.; Shiomi, T.; Mercer, B.; Sklepkiewicz, P.; Thankachen, J.; Bodine, P.; D’Armiento, J. The divergent roles of secreted frizzled related protein-1 (SFRP1) in lung morphogenesis and emphysema. Am. J. Pathol. 2010, 177, 598–607. [Google Scholar] [CrossRef]
  41. Aros, C.J.; Pantoja, C.J.; Gomperts, B.N. Wnt signaling in lung development, regeneration, and disease progression. Commun. Biol. 2021, 4, 601. [Google Scholar] [CrossRef] [PubMed]
  42. Shi, J.; Li, F.; Luo, M.; Wei, J.; Liu, X. Distinct Roles of Wnt/beta-Catenin Signaling in the Pathogenesis of Chronic Obstructive Pulmonary Disease and Idiopathic Pulmonary Fibrosis. Mediat. Inflamm. 2017, 2017, 3520581. [Google Scholar] [CrossRef] [PubMed]
  43. Tsukui, T.; Sun, K.H.; Wetter, J.B.; Wilson-Kanamori, J.R.; Hazelwood, L.A.; Henderson, N.C.; Adams, T.S.; Schupp, J.C.; Poli, S.D.; Rosas, I.O.; et al. Collagen-producing lung cell atlas identifies multiple subsets with distinct localization and relevance to fibrosis. Nat. Commun. 2020, 11, 1920. [Google Scholar] [CrossRef]
  44. Uhlen, M.; Fagerberg, L.; Hallstrom, B.M.; Lindskog, C.; Oksvold, P.; Mardinoglu, A.; Sivertsson, A.; Kampf, C.; Sjostedt, E.; Asplund, A.; et al. Proteomics. Tissue-based map of the human proteome. Science 2015, 347, 1260419. [Google Scholar] [CrossRef]
  45. Morimura, N.; Tezuka, Y.; Watanabe, N.; Yasuda, M.; Miyatani, S.; Hozumi, N.; Tezuka Ki, K. Molecular cloning of POEM: A novel adhesion molecule that interacts with alpha8beta1 integrin. J. Biol. Chem. 2001, 276, 42172–42181. [Google Scholar] [CrossRef]
  46. Brandenberger, R.; Schmidt, A.; Linton, J.; Wang, D.; Backus, C.; Denda, S.; Muller, U.; Reichardt, L.F. Identification and characterization of a novel extracellular matrix protein nephronectin that is associated with integrin alpha8beta1 in the embryonic kidney. J. Cell Biol. 2001, 154, 447–458. [Google Scholar] [CrossRef]
  47. Livingstone, I.; Uversky, V.N.; Furniss, D.; Wiberg, A. The Pathophysiological Significance of Fibulin-3. Biomolecules 2020, 10, 1294. [Google Scholar] [CrossRef] [PubMed]
  48. Giltay, R.; Timpl, R.; Kostka, G. Sequence, recombinant expression and tissue localization of two novel extracellular matrix proteins, fibulin-3 and fibulin-4. Matrix Biol. 1999, 18, 469–480. [Google Scholar] [CrossRef]
  49. McNeill, J.N.; Lee, D.H.; Hwang, S.J.; Courchesne, P.; Yao, C.; Huan, T.; Joehanes, R.; O’Connor, G.T.; Ho, J.E.; Levy, D. Association of 71 cardiovascular disease-related plasma proteins with pulmonary function in the community. PLoS ONE 2022, 17, e0266523. [Google Scholar] [CrossRef]
  50. Njalsson, R. Glutathione synthetase deficiency. Cell Mol. Life Sci. 2005, 62, 1938–1945. [Google Scholar] [CrossRef]
  51. Pan, T.C.; Zhang, R.Z.; Markova, D.; Arita, M.; Zhang, Y.; Bogdanovich, S.; Khurana, T.S.; Bonnemann, C.G.; Birk, D.E.; Chu, M.L. COL6A3 protein deficiency in mice leads to muscle and tendon defects similar to human collagen VI congenital muscular dystrophy. J. Biol. Chem. 2013, 288, 14320–14331. [Google Scholar] [CrossRef] [PubMed]
  52. Ucero, A.; Bakiri, L.; Suzuki, M.; Fustero-Torre, C.; Ximenez-Embun, P.; Hernandez, A.; Wagner, E. Collagen-producing macrophages can mediate lung fibrosis throughFra-2/AP-1 expression. Eur. Respir. J. 2018, 52 (Suppl. 62), LSC-1216. [Google Scholar] [CrossRef]
  53. Lee, J.S.; Rosengart, M.R.; Kondragunta, V.; Zhang, Y.; McMurray, J.; Branch, R.A.; Choi, A.M.; Sciurba, F.C. Inverse association of plasma IL-13 and inflammatory chemokines with lung function impairment in stable COPD: A cross-sectional cohort study. Respir. Res. 2007, 8, 64. [Google Scholar] [CrossRef]
  54. Hancox, R.J.; Gray, A.R.; Sears, M.R.; Poulton, R. Systemic inflammation and lung function: A longitudinal analysis. Respir. Med. 2016, 111, 54–59. [Google Scholar] [CrossRef] [PubMed]
  55. Agassandian, M.; Shurin, G.V.; Ma, Y.; Shurin, M.R. C-reactive protein and lung diseases. Int. J. Biochem. Cell Biol. 2014, 53, 77–88. [Google Scholar] [CrossRef] [PubMed]
  56. Kachuri, L.; Johansson, M.; Rashkin, S.R.; Graff, R.E.; Bosse, Y.; Manem, V.; Caporaso, N.E.; Landi, M.T.; Christiani, D.C.; Vineis, P.; et al. Immune-mediated genetic pathways resulting in pulmonary function impairment increase lung cancer susceptibility. Nat. Commun. 2020, 11, 27. [Google Scholar] [CrossRef]
  57. Eriksson Strom, J.; Pourazar, J.; Linder, R.; Blomberg, A.; Lindberg, A.; Bucht, A.; Behndig, A.F. Airway regulatory T cells are decreased in COPD with a rapid decline in lung function. Respir. Res. 2020, 21, 330. [Google Scholar] [CrossRef]
  58. Johnson, P.R.; Roth, M.; Tamm, M.; Hughes, M.; Ge, Q.; King, G.; Burgess, J.K.; Black, J.L. Airway smooth muscle cell proliferation is increased in asthma. Am. J. Respir. Crit. Care Med. 2001, 164, 474–477. [Google Scholar] [CrossRef]
  59. Hirst, S.J.; Martin, J.G.; Bonacci, J.V.; Chan, V.; Fixman, E.D.; Hamid, Q.A.; Herszberg, B.; Lavoie, J.P.; McVicker, C.G.; Moir, L.M.; et al. Proliferative aspects of airway smooth muscle. J. Allergy Clin. Immunol. 2004, 114, S2–S17. [Google Scholar] [CrossRef]
  60. Wilson, C.L.; Hung, C.F.; Schnapp, L.M. Endotoxin-induced acute lung injury in mice with postnatal deletion of nephronectin. PLoS ONE 2022, 17, e0268398. [Google Scholar] [CrossRef]
Figure 1. Study Workflow. We conducted a comprehensive omics integration analysis under a cutting-edge analytic framework by sequential using PWAS, MR, and COLOC to identify the plasma proteins associated with lung function indices and explore the potential mediating mechanism underlying the effect of proteins on lung diseases. We first identified protein-index associations by PWAS, followed by enrichment analysis and PPI analysis to reveal the underlying biological processes and pathways. Then, we performed MR and COLOC to further identify proteins causally associated with lung function indices followed by replication analysis. Mediation analysis was further performed to investigate the mediating role of these indices regarding the association between proteins and common lung disorders. Finally, we prioritized the potential drug targets. Abbreviations: pQTL, protein quantitative trait loci; GWAS, genome-wide association studies; PWAS, proteome-wide association study; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; MR mendelian randomization; FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; COPD, chronic obstructive pulmonary disease; PPI, protein–protein interaction network.
Figure 1. Study Workflow. We conducted a comprehensive omics integration analysis under a cutting-edge analytic framework by sequential using PWAS, MR, and COLOC to identify the plasma proteins associated with lung function indices and explore the potential mediating mechanism underlying the effect of proteins on lung diseases. We first identified protein-index associations by PWAS, followed by enrichment analysis and PPI analysis to reveal the underlying biological processes and pathways. Then, we performed MR and COLOC to further identify proteins causally associated with lung function indices followed by replication analysis. Mediation analysis was further performed to investigate the mediating role of these indices regarding the association between proteins and common lung disorders. Finally, we prioritized the potential drug targets. Abbreviations: pQTL, protein quantitative trait loci; GWAS, genome-wide association studies; PWAS, proteome-wide association study; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; MR mendelian randomization; FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; COPD, chronic obstructive pulmonary disease; PPI, protein–protein interaction network.
Cimb 47 00167 g001
Figure 2. Results of PWAS and subsequent enrichment analysis. We identified a total of 210 protein-index associations and subsequently performed a gene set enrichment analysis using Metascape, with the top 20 significant GO terms or top 10 KEGG pathways displayed. (a) Manhattan plot for PWAS analysis of four lung function indices; (b) Bubble chart for GO enrichment analysis; (c) Bubble chart for KEGG enrichment analysis; (d) PPI with genes enriched in significant pathways. Abbreviations: FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; PPI, Protein–protein interaction network.
Figure 2. Results of PWAS and subsequent enrichment analysis. We identified a total of 210 protein-index associations and subsequently performed a gene set enrichment analysis using Metascape, with the top 20 significant GO terms or top 10 KEGG pathways displayed. (a) Manhattan plot for PWAS analysis of four lung function indices; (b) Bubble chart for GO enrichment analysis; (c) Bubble chart for KEGG enrichment analysis; (d) PPI with genes enriched in significant pathways. Abbreviations: FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; PPI, Protein–protein interaction network.
Cimb 47 00167 g002
Figure 3. Results of the two-sample MR analysis. We used the ARIC data and deCODE data for discovery and replication analysis, respectively. The forest plot is utilized to illustrate the β values, p-values, and confidence intervals of mendelian randomization. FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; IVW, inverse variance weighted method.
Figure 3. Results of the two-sample MR analysis. We used the ARIC data and deCODE data for discovery and replication analysis, respectively. The forest plot is utilized to illustrate the β values, p-values, and confidence intervals of mendelian randomization. FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; PEF, peak expiratory flow; IVW, inverse variance weighted method.
Cimb 47 00167 g003
Figure 4. Mediation effects of protein on lung diseases via lung function indices. Mediation analyses to quantify the effects of three proteins on lung diseases via lung function indices. (a) EFEMP1 effect on COPD mediated by FEV1/FVC; (b) MARK3 effect on asthma mediated by FEV1/FVC; (c) NPNT effect on COPD mediated by FEV1/FVC. β E M represents the effect of exposure on mediator, β M O represents the effect of mediator on outcome, β E O represents the total effect of exposure on outcome. IV, instrumental variable; FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; COPD, chronic obstructive pulmonary disease.
Figure 4. Mediation effects of protein on lung diseases via lung function indices. Mediation analyses to quantify the effects of three proteins on lung diseases via lung function indices. (a) EFEMP1 effect on COPD mediated by FEV1/FVC; (b) MARK3 effect on asthma mediated by FEV1/FVC; (c) NPNT effect on COPD mediated by FEV1/FVC. β E M represents the effect of exposure on mediator, β M O represents the effect of mediator on outcome, β E O represents the total effect of exposure on outcome. IV, instrumental variable; FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; COPD, chronic obstructive pulmonary disease.
Cimb 47 00167 g004
Table 1. The 59 protein–lung function index pairs with high evidence of putatively causal relationship.
Table 1. The 59 protein–lung function index pairs with high evidence of putatively causal relationship.
Lung Function IndexGeneChrPWAS.PBeta (95% CI)PP.H4ReplicatedExisting Drug
FEV1ASPN91.12 × 10−6−0.020 (−0.032, −0.008)0.800NONO
FEV1EFEMP124.42 × 10−12−0.039 (−0.060, −0.018)0.976YesNO
FEV1FKBP4121.12 × 10−120.041 (0.030, 0.053)1.000YesYes
FEV1FN122.22 × 10−60.044 (0.027, 0.061)0.969NONO
FEV1GM2A56.49 × 10−10−0.023 (−0.034, −0.013)0.981NOYes
FEV1GMPR2141.64 × 10−6−0.027 (−0.037, −0.017)0.991YesNO
FEV1KYNU23.06 × 10−5−0.015 (−0.021, −0.008)0.959YesNO
FEV1OGN95.43 × 10−90.023 (0.009, 0.037)0.978YesNO
FEV1SCARF2223.60 × 10−90.047 (0.034, 0.061)0.999YesNO
FEV1SELM227.69 × 10−7−0.076 (−0.114, −0.037)0.963YesNO
FEV1SFRP181.55 × 10−5−0.038 (−0.067, −0.009)0.984YesNO
FEV1/FVCAGER69.29 × 10−173−0.186 (−0.199, −0.173)1.000NONO
FEV1/FVCARFIP141.02 × 10−60.027 (0.016, 0.038)0.976NONO
FEV1/FVCBOC36.47 × 10−80.049 (0.034, 0.063)0.988YesNO
FEV1/FVCBTN3A361.28 × 10−60.016 (0.008, 0.025)0.859YesNO
FEV1/FVCCOL6A321.39 × 10−9−0.053 (−0.096, −0.011)0.999YesYes
FEV1/FVCEFEMP121.41 × 10−160.047 (0.020, 0.074)0.891YesNO
FEV1/FVCFAM213A106.04 ×10−7−0.187 (−0.238, −0.136)0.983YesNO
FEV1/FVCGM2A54.37 × 10−11−0.027 (−0.036, −0.018)0.770YesYes
FEV1/FVCHP162.04 × 10−80.021 (0.013, 0.029)0.973NONO
FEV1/FVCIL1RL123.28 × 10−110.017 (0.007, 0.028)0.999YesNO
FEV1/FVCLTBP4191.53 × 10−28−0.197 (−0.229, −0.165)1.0000NONO
FEV1/FVCMANSC4124.84 × 10−7−0.016 (−0.022, −0.010)0.856YesNO
FEV1/FVCMAPK3163.93 × 10−100.025 (0.016, 0.033)0.999YesYes
FEV1/FVCNID2148.49 × 10−60.016 (0.008, 0.024)0.868YesNO
FEV1/FVCNOG171.80 × 10−70.064 (0.032, 0.096)0.995NONO
FEV1/FVCNPNT47.84 × 10−1160.146 (0.070, 0.221)1.000YesNO
FEV1/FVCPAPPA99.05 × 10−280.350 (0.287, 0.412)0.993NONO
FEV1/FVCSCARF2221.19 × 10−190.069 (0.055, 0.083)0.999YesNO
FEV1/FVCSERPING1112.32 × 10−100.017 (0.010, 0.023)0.996YesYes
FEV1/FVCTMEM294.67 × 10−8−0.061 (−0.082, −0.039)0.998YesNO
FEV1/FVCXPNPEP1101.32 × 10−5−0.041 (−0.058, −0.024)0.933YesYes
FVCCA383.02 × 10−7−0.046 (−0.062, −0.029)0.997NONO
FVCCCDC12671.14 × 10−6−0.026 (−0.052, −0.001)0.880YesNO
FVCDNER27.47 × 10−100.038 (0.011, 0.066)0.998YesYes
FVCEFEMP122.16 × 10−32−0.066 (−0.089, −0.044)0.999YesNO
FVCFER55.67 × 10−70.051 (0.032, 0.071)0.997YesYes
FVCFKBP4125.24 × 10−180.049 (0.038, 0.061)1.000YesYes
FVCGMPR2141.51 × 10−5−0.025 (−0.036, −0.015)0.963YesNO
FVCIL17RD34.71 × 10−60.017 (0.006, 0.029)0.820YesNO
FVCMRC2172.51 × 10−110.025 (0.013, 0.036)0.956YesNO
FVCOGN92.17 × 10−50.017 (0.003, 0.031)0.793YesNO
FVCPARK713.87 × 10−7−0.019 (−0.027, −0.012)0.978YesNO
FVCRHOC11.83 × 10−70.056 (0.036, 0.076)0.989YesNO
FVCTRIL79.55 × 10−60.045 (0.024, 0.066)0.980YesNO
PEFACAN154.95 × 10−7−0.066 (−0.088, −0.044)1.000YesNO
PEFAGER61.13 × 10−37−0.091 (−0.105, −0.078)0.999NONO
PEFCACYBP22.98 × 10−5−0.023 (−0.034, −0.013)0.944YesNO
PEFFKBP4126.54 × 10−70.030 (0.016, 0.044)0.998NOYes
PEFFN122.27 × 10−90.053 (0.035, 0.072)0.996NONO
PEFGM2A59.64 × 10−8−0.022 (−0.032, −0.013)0.951NOYes
PEFGSS201.66 × 10−90.037 (0.020, 0.055)0.886YesNO
PEFHAPLN151.44 × 10−24−0.202 (−0.232, −0.173)0.976YesNO
PEFLILRB2192.79 × 10−60.018 (0.011, 0.026)0.998YesNO
PEFLTBP4198.36 × 10−6−0.093 (−0.127, −0.059)1.000NONO
PEFNOG174.18 × 10−120.102 (0.068, 0.136)0.959NONO
PEFNPNT41.55 × 10−420.092 (0.044, 0.140)1.000NONO
PEFSCARF2224.88 × 10−70.041 (0.026, 0.056)0.995YesNO
PEFXPNPEP1101.59 × 10−5−0.043 (−0.062, −0.025)0.924YesYes
The estimate of causal effects (beta) from MR analysis and the posterior probability (PP.H4) from COLOC analysis with the discovery samples for the final 59 protein-function pairs. Pairs identified in the replication analysis are annotated as Yes in the Replicated column. Genes that are interacted with existing drugs in Drug Gene Interaction Database are annotated as Yes in the Existing drug column. Chr = Chromosome. PWAS.P = p-value from PWAS analysis. CI = Confidence interval. FEV1 = Forced expiratory volume in 1 s. FVC = Forced vital capacity. PEF = Peak expiratory flow. MR = Mendelian Randomization. PWAS = Proteome-wide association study.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, Y.; Shen, L.; Yan, R.; Liu, L.; Guo, P.; Liu, S.; Chen, Y.; Yuan, Z.; Gong, W.; Ji, J. Identification of Candidate Lung Function-Related Plasma Proteins to Pinpoint Drug Targets for Common Pulmonary Diseases: A Comprehensive Multi-Omics Integration Analysis. Curr. Issues Mol. Biol. 2025, 47, 167. https://doi.org/10.3390/cimb47030167

AMA Style

Zhao Y, Shen L, Yan R, Liu L, Guo P, Liu S, Chen Y, Yuan Z, Gong W, Ji J. Identification of Candidate Lung Function-Related Plasma Proteins to Pinpoint Drug Targets for Common Pulmonary Diseases: A Comprehensive Multi-Omics Integration Analysis. Current Issues in Molecular Biology. 2025; 47(3):167. https://doi.org/10.3390/cimb47030167

Chicago/Turabian Style

Zhao, Yansong, Lujia Shen, Ran Yan, Lu Liu, Ping Guo, Shuai Liu, Yingxuan Chen, Zhongshang Yuan, Weiming Gong, and Jiadong Ji. 2025. "Identification of Candidate Lung Function-Related Plasma Proteins to Pinpoint Drug Targets for Common Pulmonary Diseases: A Comprehensive Multi-Omics Integration Analysis" Current Issues in Molecular Biology 47, no. 3: 167. https://doi.org/10.3390/cimb47030167

APA Style

Zhao, Y., Shen, L., Yan, R., Liu, L., Guo, P., Liu, S., Chen, Y., Yuan, Z., Gong, W., & Ji, J. (2025). Identification of Candidate Lung Function-Related Plasma Proteins to Pinpoint Drug Targets for Common Pulmonary Diseases: A Comprehensive Multi-Omics Integration Analysis. Current Issues in Molecular Biology, 47(3), 167. https://doi.org/10.3390/cimb47030167

Article Metrics

Back to TopTop