Skip to main content

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 12 April 2023
Sec. Clinical and Translational Cardiovascular Medicine
Volume 10 - 2023 | https://doi.org/10.3389/fcvm.2023.1123682

An integrated signature of extracellular matrix proteins and a diastolic function imaging parameter predicts post-MI long-term outcomes

Hiromi W.L. Koh1,2 Anna P. Pilbrow3 Sock Hwee Tan1,4 Qing Zhao1 Peter I. Benke5 Bo Burla5 Federico Torta5,6 John W. Pickering3 Richard Troughton3 Christopher Pemberton3 Wern-Miin Soo1,4 Lieng Hsi Ling1,4 Robert N. Doughty7 Hyungwon Choi1 Markus R. Wenk5,6 A. Mark Richards1,3,4* Mark Y. Chan1,4*
  • 1Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
  • 2Institute of Molecular and Cell Biology (IMCB), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
  • 3Department of Medicine, Christchurch Heart Institute, University of Otago, Christchurch, New Zealand
  • 4National University Heart Centre, National University Health System, Singapore, Singapore
  • 5Singapore Lipidomics Incubator (SLING), Life Sciences Institute, National University of Singapore, Singapore, Singapore
  • 6Precision Medicine Translational Research Programme and Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
  • 7Heart Health Research Group, University of Auckland, Auckland, New Zealand

Background: Patients suffering from acute myocardial infarction (AMI) are at risk of secondary outcomes including major adverse cardiovascular events (MACE) and heart failure (HF). Comprehensive molecular phenotyping and cardiac imaging during the post-discharge time window may provide cues for risk stratification for the outcomes.

Materials and methods: In a prospective AMI cohort in New Zealand (N = 464), we measured plasma proteins and lipids 30 days after hospital discharge and inferred a unified partial correlation network with echocardiographic variables and established clinical biomarkers (creatinine, c-reactive protein, cardiac troponin I and natriuretic peptides). Using a network-based data integration approach (iOmicsPASS+), we identified predictive signatures of long-term secondary outcomes based on plasma protein, lipid, imaging markers and clinical biomarkers and assessed the prognostic potential in an independent cohort from Singapore (N = 190).

Results: The post-discharge levels of plasma proteins and lipids showed strong correlations within each molecular type, reflecting concerted homeostatic regulation after primary MI events. However, the two molecular types were largely independent with distinct correlation structures with established prognostic imaging parameters and clinical biomarkers. To deal with massively correlated predictive features, we used iOmicsPASS + to identify subnetwork signatures of 211 and 189 data features (nodes) predictive of MACE and HF events, respectively (160 overlapping). The predictive features were primarily imaging parameters, including left ventricular and atrial parameters, tissue Doppler parameters, and proteins involved in extracellular matrix (ECM) organization, cell differentiation, chemotaxis, and inflammation. The network signatures contained plasma protein pairs with area-under-the-curve (AUC) values up to 0.74 for HF prediction in the validation cohort, but the pair of NT-proBNP and fibulin-3 (EFEMP1) was the best predictor (AUC = 0.80). This suggests that there were a handful of plasma proteins with mechanistic and functional roles in predisposing patients to the secondary outcomes, although they may be weaker prognostic markers than natriuretic peptides individually. Among those, the diastolic function parameter (E/e' - an indicator of left ventricular filling pressure) and two ECM proteins, EFEMP1 and follistatin-like 3 (FSTL3) showed comparable performance to NT-proBNP and outperformed left ventricular measures as benchmark prognostic factors for post-MI HF.

Conclusion: Post-discharge levels of E/e', EFEMP1 and FSTL3 are promising complementary markers of secondary adverse outcomes in AMI patients.

Introduction

Circulating proteins and lipids are clinically useful biomarkers for the diagnosis and prognosis of cardiovascular diseases (CVD) (13). Historically, prognostic biomarker studies have favored an analytic framework focusing on one or a few prioritized candidates within a risk stratification framework to predict specific clinical events (47). Recently, omics-scale technologies such as mass spectrometry, affinity-based proteomics, and nuclear magnetic resonance spectroscopy have enabled simultaneous measurement of hundreds to thousands of analytes including proteins, metabolites, and lipids in cardiovascular research (811).

Despite the arrival of high-throughput platforms, the new discoveries have barely challenged the use of imaging parameters to aid detection of abnormalities in the heart and biomarkers of heart failure (HF) and that of traditional biomarkers such as NT-proBNP and cardiac troponin-I/T to evaluate cardiac injury, and these two modalities firmly remain as the gold standard clinical practice for long-term prognostication. In addition, although the number of assessable analytes increased with omics scale assays, few studies have carefully investigated the relationship between newly measured circulating molecules and the conventional cardiac markers and imaging parameters.

A possible explanation for this phenomenon is that the emerging plasma proteomics and metabolomics data sets are often analyzed from the perspective of the predictive capacity alone in biomarker studies, and that the natriuretic peptides and troponins often outperformed most challengers in terms of diagnostic or prognostic indicators as they represent the most tissue specific evidence of myocardial damage. As a result, most data analyses tend to overlook the massive correlation structure underlying the milieu of measured molecules and neglect the biologically interpretable, meaningful co-variations in these rich data sets.

We have previously reported an analysis of plasma protein candidates predicting post-MI heart failure events (HF) in two independent post-MI cohorts in a similar context, with unique complementary information from a single cell transcriptomics data set generated for mouse heart (12). Expanding on this work, we put together a systems-level data analysis approach to integrate plasma lipids, acylcarnitines, proteins and multi-parametric cardiac imaging data, with the express purpose of building an interpretable prognostic signature of long-term post-MI major adverse cardiovascular events (MACE) and HF. Our key approach is the use of network topology information as adjudicating evidence for biomarker prioritization, in combination with the conventional univariate prognostic potential measured by the area under the curve (AUC) of the receiver operating characteristic (ROC).

Given the highly correlated nature of multi-dimensional data across the different modalities, we improved our previously published data integration approach iOmicsPASS (13) and present the new toolbox as iOmicsPASS + . Using the new tool, we first identify an undirected graph, or a partial correlation network of all data features via estimation of a sparse precision matrix (14) and then discern subnetwork signatures predictive of clinical outcomes using a network-based scoring approach. Using the clinical outcome data collected prospectively and molecular data profiled at 30 days after hospital discharge, we searched for the prognostic network signatures of HF as well as other MACE. To isolate pure predictive signals within the intra- and inter-modality correlation structures and ensure the compatibility of information between the training and test/validation cohorts, we performed the network-level analysis without incorporating other conventional demographic risk factors such as age and gender.

Materials and methods

The coronary disease cohort study (CDCS)

The CDCS cohort consisted of 2,140 patients recruited from two tertiary hospitals (Christchurch Hospital and Auckland City Hospital) in New Zealand (NZ) for an acute coronary syndrome (ACS) event from 2002 to 2009 (ACTRN 12605000431628). Participants with angiographically-documented coronary artery disease were invited to return to the hospital 30 days after discharge for baseline biometric, echocardiographic and blood-based measurements. Patients were excluded from the study if their life expectancy was estimated to be less than 3 years. More information on the study can be found in Prickett et al. (15). Details of the nested case-control cohort (N = 741) and supervised analysis on the subcohort (N = 464) are in the Supplementary Information.

The improving outcomes in myocardial infarction through reversal of cardiac remodelling (IMMACULATE) cohort

The IMMACULATE cohort consisted of 859 patients who were admitted for MI at 3 tertiary hospitals in Singapore (National University Heart Centre, Tan Tock Seng Hospital and National Heart Centre) from 2011 to 2014. The patients were followed up for a median of 3.9 years (interquartile range IQR: 2.0–4.8 years) from their hospital discharge date, and biometric, echocardiographic, and blood-based measurements were collected at the same post-discharge time point (30 days) as the CDCS. Detailed information on the nested case-control subset 190 patients used as a validation cohort in this report can be found in the Supplementary Information.

Proteomics, lipidomics and clinical biomarker measurements

Protein abundance was measured using Slow Off-rate Modified Aptamer (SOMAmer)–based capture array, called SOMAscan® (SomaLogic, Inc, Boulder, CO, United States) (16) and reported as relative fluorescent units (RFU). Targeted lipidomics experiments were performed using an Agilent 6495A triple quadrupole (QQQ) mass spectrometer coupled to an Agilent 1,290 Infinity-II UHPLC system, with automated data processing and quality control. Acylcarnitines were also measured as part of this panel. One lipid standard was used for each lipid class and the ratio of the peak areas of endogenous lipids to their respective lipid standards was reported as the lipid measurements (without conversion to molar concentrations). Please refer to Section 2 of the Supplementary Information for the details of data acquisition and quality control.

Established cardiac markers, including natriuretic peptides (ANP, BNP, NT-proANP, NT-proBNP), high sensitivity troponin-I (hsTNI) and creatinine, were measured using clinical-grade assays. In CDCS, the concentrations of natriuretic peptides were measured in pmol/l except for NT-proANP (in nmol/L). In IMMACULATE, the concentrations of ANP, BNP and NT-proBNP were measured in pg/ml and NT-proANP was not measured. To allow for comparison between the two studies, we unified the units of measurements to pg/ml for ANP, BNP and NT-proBNP and ng/ml for NT-proANP. For ANP and NT-proANP, we converted 1 pmol/L to 3.081 pg/ml and 1 nmol/l to 12.7 ng/ml, respectively. For BNP and NT-proBNP, we converted the units using 1 pmol/l to 3.47 pg/ml and 1 pmol/l to 8.475 pg/ml, respectively. Due to the higher accuracy and sensitivity of the clinical-grade assays, we removed four corresponding proteins measured in SOMAscan® (Supplementary Information).

Transthoracic echocardiography

Standard M-mode measurements of left ventricle (LV) dimensions, wall thickness and left atrial (LA) dimensions were made according to the recommendations of the American Society of Echocardiography (ASE) (17). LV volumes and the derived left ventricular ejection fraction (LVEF) were measured by the Simpson modified biplane method (18). Pulsed-wave Doppler velocities of trans-mitral early diastolic (E) and atrial (A) filling were obtained from the apical 4-chamber view with a 5 mm sample volume placed between the tips of the mitral leaflets (19), and systolic (S), diastolic (D), and atrial reversal (AR) pulmonary vein velocities were acquired from the apical 4-chamber view with a 5 mm sample volume placed 1 cm into the right upper pulmonary vein. Early (e') and late (a') diastolic, and systolic (s') pulsed-wave tissue Doppler velocities were obtained at both septal and lateral mitral annular corners, and averaged (20). The ratio of trans-mitral E and annular e’ velocities (E/e') was derived as a measure of LV filling pressure. Only variables with at least 70% of the measurements quantified were used and missing values in those variables were imputed together with missing entries from the six cardiac markers using multiple imputation by chained equations (MICE) (21). All missing data were assumed to be missing at random (MAR) and the estimates from the multiple models are pooled using Rubin's rule (22).

Ascertainment of clinical events

The composite outcome of interest was 5-point MACE (or MACE including HF), defined as: MI, stroke, unstable angina, HF and/or a CV-related death. All patients were followed from the time of their primary hospital discharge to a future major adverse cardiac event, death or end of study, whichever was earlier. For both CDCS and IMMACULATE, we defined three phenotypic outcomes: (1) patients who remained event-free (Event-free), (2) patients who had any one of the five adverse cardiac outcomes (MACE), with further stratification by (3) patients with non-fatal and fatal HF events (HF).

Statistical analysis

For continuous variables, two-sample t-tests were used to compare the difference in means across groups, whereas for categorical variables, Chi-squared tests were applied to test for associations with groups. Fisher-exact test was used where cell-frequencies were less than five. Clinical variables with missing or unknown self-reported entries were removed before performing the statistical test for association. All p-values were adjusted for multiple-testing correction using the Benjamini-Hochberg's (BH) method to control the overall false discovery rates (FDR). The AUC of the ROC was computed for each marker to evaluate their predictive performance and presented with a 95% confidence interval. Cox proportional hazard (PH) models were used to assess the prognostic value of individual markers from hospital discharge to MACE or HF, death or end of follow-up, whichever is earlier. Hazard ratio (HR), p-value, Harrell's C-index and its 95% confidence interval were reported for each marker. Log-rank test was employed to compare the Kaplan-Meier (KM) survival curves when stratified by risk groups.

Network-level predictive analysis using iOmicsPASS+

To estimate the Gaussian graphical model (GGM) underlying the observed data (with a corresponding partial correlation network), we utilized all markers across 741 patients from CDCS and computed a covariance matrix using all pairwise complete observations as an input to the graphical LASSO algorithm (14), using a module built in iOmicsPASS + . Originally developed as iOmicsPASS for the integration of DNA, mRNA and protein-level multi-omics data over biological networks (13), iOmicsPASS + has been extended to directly infer an undirected graph representing conditional (in)dependence relationships among all continuously scaled data features from the multiple data types, through estimation of GGM (Supplementary Figure S1) (14). Following the network inference, iOmicsPASS is employed to carry out dimension reduction of the information from multiple data sets by using edge-level co-expression scores, taking the direction of the correlation into account, to identify a sparse set of subnetwork signatures that separates the phenotypic groups via a supervised approach (see Supplementary Material).

To prevent potential confounding, we removed the patients with remodelled heart, medical history of HF, stroke and previous MI events that occurred within the last five years, leaving 464 CDCS patients for the predictive analysis. We sought to identify subnetwork signatures that best (1) differentiate patients destined to incur secondary MACE from those spared such events, and (2) patients suffering secondary HF from those not, using 10-fold cross-validation (CV) for parameter optimization. Within the iOmicsPASS framework, network signatures are characterized by group-specific centroids of edge-wise co-expression scores. For every i-th edge with a non-zero partial correlation estimate, co-expression scores were calculated for individual patients. The sign of the partial correlation is reflected into the co-expression score calculation, where the scores for two positively correlated nodes and two negatively correlated nodes are calculated as products and ratios of the normalized data in each subject (sums and differences in logarithmic scale), respectively.

Once subject-level co-expression scores are calculated, test-statistics contrasting group differences, denoted by dik* (23) (k = 1 for outcome and k = 0 for no outcome), are computed for the edges. For simplicity, we refer to these network edge-wise test-statistics as d-scores hereafter, the magnitude of which indicates the combined discriminative power of a pair of predictors in separating the phenotypes. The test-statistics were penalized iteratively to select the optimal sparse network classifiers that minimize the overall misclassification error in the CV. The final d-scores are visualized in the networks, incorporating the sign and magnitude, reflected by the color and thickness of edges in the network respectively (13).

iOmicsPASS + also offers a direct prediction functionality on external data sets with the same input data types, allowing for missing data features in the new data set. The prediction module takes all available variables of the selected classifiers from a data set, standardizes the variables within the new data, computes the co-expression scores of predictive edges and derives the composite discriminant scores and classification probabilities for individual patients. These discriminant scores or class probabilities can then be used to stratify patients in the external data, or within the training data itself if necessary.

Further details of the method and the software can be found in the Supplementary Information. All analyses were carried out in R version 4.2.0 (24) using in-house R-scripts. Network visualization is done using the Cytoscape software (25).

Throughout this paper, we used the CDCS cohort as the training dataset to build a network and estimate predictive network signatures for both MACE and incident HF. Then using these signatures, we computed the class probabilities on both CDCS and another independent cohort, IMMACULATE, as a validation dataset.

Secondary analysis of individual markers in the network signatures

As these network signatures were derived without any consideration for time to event, we further assessed if the predicted class probabilities could predict the risk of future MACE and incident HF in the presence of other factors influencing long-term outcomes. We achieved this by fitting three types of Cox PH models: without any clinical adjustments (Model A), adjusted for age, gender and BMI (Model B) and further adjusted for ST-elevation status, history of hypertension and medications prescribed at discharge (Model C). We first identified the medications that were directly associated with either MACE or HF, at both discharge and admission, for adjustment in the model. In CDCS, we adjusted for beta-blockers, ACE inhibitors, aspirin, clopidogrel, calcium channel antagonists, long-acting nitrates, diuretics, statins and warfarin use at hospital discharge. In IMMACULATE, we adjusted for beta-blockers, ACE inhibitors and antagonist receptor blockers (ARB). To determine the usefulness of adding clinical variables to the model, we also calculated the continuous net reclassification index (cNRI) and integrated discrimination index (IDI) of the 2-year event prediction of Model B and Model C, relative to Model A.

Results

Characteristics of the CDCS cohort

Figure 1 illustrates the analysis workflow for the CDCS cohort. Using the implementation in iOmicsPASS+, we integrated the data for proteins, lipids, clinical biomarkers and echocardiographic measurements from 741 subjects, and derived a network of conditionally dependent data features using the graphical LASSO (26). Given the diversity of data types, we call individual variables “data features” hereafter. For this network, we calculated partial correlation for each edge of the network as a measure of association strength between the two corresponding data features, accounting for the effects of all other variables (see Supplemental Information). In the second step, we subsequently identified predictive subnetwork signatures of secondary adverse outcomes, namely MACE and HF. This supervised analysis was conducted using 464 patients meeting the criteria for outcome data (Supplemental Information), where 185 patients remained event-free, and 279 patients had a secondary MACE, including 117 patients hospitalized for HF during follow-up. In what follows, we first describe the predictive potential of individual data features and the overall network structure and present the network-driven analysis results afterwards.

FIGURE 1
www.frontiersin.org

Figure 1. Analysis workflow applied to the CDCS cohort. The background network reflecting the conditional dependence structure among the four data types was inferred using graphical LASSO. From the network, subnetwork signatures of secondary MACE and HF were obtained using the training dataset (CDCS) using 10-fold cross validation for choosing the optimal regularization parameter.

Table 1 provides the overall characteristics of the 464 patients. The mean age was 69 years (SD = 10.7 years), with more males (69.2%) than females (30.8%). The majority were of European descent from New Zealand (56.7%) and other countries (27.8%), while the rest (7.1%) were Asians, Africans, Maoris, Fijians and the Pacific Islanders. Most patients either quit smoking (54.5%) or had never smoked (38.8%). Upon admission for a primary AMI episode, 29.7% were diagnosed with STEMI and 70.3% NSTEMI. The median follow-up time from the date of hospital discharge was 4.88 years, and the time from discharge to a MACE ranged from one day to 8.8 years (median 0.74 years). Comparisons between patients remaining event-free and those incurring secondary MACE revealed significant differences in age, diagnosis of ST-segment elevation and hypertension, as well as clinical biomarkers such as serum creatinine and plasma natriuretic peptides. MACE cases were older (mean age = 70.6 years, SD = 11.1) and more frequently hypertensive (56.6%) than patients who remained event-free (36.8%). The index events were also more likely NSTEMI (74.6%) than STEMI (63.8%). Event-free patients had lower levels of serum creatinine and plasma concentrations of natriuretic peptides (ANP, BNP, NT-proANP, NT-proBNP) than those with MACE on follow-up.

TABLE 1
www.frontiersin.org

Table 1. Clinical characteristics of post-AMI patients in CDCS study.

Supplementary Table S1 shows the comparison of the echocardiographic measurements between event-free patients and MACE patients, as well as between event-free patients and HF patients. As expected, more echocardiographic variables differed between HF patients and event-free patients than between all MACE patients and event-free patients. Both adverse outcome groups (MACE and the HF subgroup) had greater baseline LA and LV dimensions but lower LVEF than event-free patients. Tissue Doppler parameters of myocardial motion differed between MACE/HF and event-free cases with e' and E/e’ differing most sharply between the subgroup with HF and event-free patients.

Single markers of MACE and HF

We next compared the post-discharge levels of individual proteins, lipids, echocardiographic parameters and clinical biomarkers in the patients with MACE and HF to event-free patients. A total of 184 data features were significantly different in mean abundance between MACE patients and event-free patients (FDR < 0.05), where 66.3% were higher in MACE patients. Comparing the HF patients with the event-free ones, 368 markers were significantly different (FDR < 0.05). Details of this analysis are reported in Supplementary Table S2.

The differential features of MACE included 166 proteins, seven lipids, five echocardiographic measurements, and all six clinical biomarkers. Although all natriuretic peptides, hsTNI and creatinine levels were higher in MACE patients, BNP and NT-proBNP attained the AUC of 0.65 (95% CI: 0.60–0.70 for BNP and 0.59–0.70 for NT-proBNP), indicating modest discrimination for MACE. Echocardiographic measurements such as LA area, LV mass, indexed left ventricle internal dimension in diastole (LVIDDi) and systole (LVIDSi), as well as indices of diastolic dysfunction (E/e'), were all higher in MACE than in event-free patients, yet the AUC values were modest at best. When considering lipids, only a handful of lipids showed statistically significant differences. Three species (phosphatidylethanolamine PE 34:1 and PE 34:2, sphingosine-1-phosphate S1P d18:0) were higher, whereas four species (phosphatidylcholine PC 38:4 and 40:8, LysoPC 20:4, sphingomyelin SM 43:1) were lower in the MACE patients. Lastly, the top three markers were all plasma proteins, including macrophage-capping protein (CAPG) with AUC of 0.68 (95% CI: 0.63–0.73), aspartate aminotransferase (GOT1) with AUC of 0.65 (95% CI: 0.60–0.70) and follistatin-related protein 3 (FSTL3) with AUC of 0.64 (95% CI: 0.59–0.69).

The 368 differential features of HF included 298 proteins, 31 lipids, nine echocardiographic variables, and all six clinical biomarkers. Similar to the differential features of MACE, all six clinical biomarkers were higher in HF than in event-free patients as expected: NT-proBNP demonstrated the highest AUC of 0.79 (95% CI: 0.74–0.84), followed by BNP with AUC of 0.78 (95% CI: 0.73–0.83). In echocardiographic parameters, all five measurements significantly altered in MACE also differed in the HF subgroup. In addition, interventricular septum (IVS), LA width, LVESVi were also significantly higher, while the early diastolic mitral annulus velocity e’ was significantly lower. Similar to the MACE comparison, the differences in the lipid levels were not pronounced in the HF group, including glycerophospholipids (8 PEs, 3 PC, 4 phosphatidylinositol PI), four lysophospholipids (3 LysoPC, 1 LysoPE), eight sphingolipids (SM 38:1, 38:2, 43:1, 44:1, 44:2), S1P d18:0, ceramide d19:1/24:0 and ganglioside GM3 d18:1/16:0, cholesteryl ester CE 20:4 and two glycerolipids (diacylglycerol DG 38:6 and triacylglycerol TG 58:10). Among those, LPC 20:4, PE 34:2 and PE 35:2 had the highest AUCs at a modest value of 0.64 (95% CI: 0.57–0.71).

Of the 298 significant proteins, 61.7% were higher in HF patients than in event-free patients. CAPG had the highest AUC of 0.77 (95% CI: 0.72–0.83), followed by FSTL3 with AUC of 0.75 (95% CI: 0.70–0.81) and Cystatin-C (CST3) with AUC of 0.74 (95% CI: 0.68–0.79). The top performing proteins such as CAPG, CST3, FSTL3, and EGF-containing fibulin-like extracellular matrix protein 1 (EFEMP1), also known as fibulin3, were equally predictive of HF as the natriuretic peptides, with overlapping 95% CIs.

Partial correlation network connecting the four modalities

Supplementary Figure S1 illustrates the network inference workflow of iOmicsPASS + . First, each data type was standardized by mean centering and pareto variance scaling. Then, outlier observations were filtered out before being concatenated into a single data matrix using all 741 samples. The network estimation module produced a network of 27,334 edges in this data set (1.9% of all possible edges) based on the regularized parameter selected at the lowest eBIC value, connecting 1,690 data features. To assign measures of strength to the selected edges, the regularized precision estimates were converted to partial correlations, denoted by r, which ranged from −0.43 to 0.66 in this data. We remark that all correlations reported in this work refer to regularized partial correlations, and for this reason, the magnitude of partial correlations may seem small.

Not surprisingly, the intra-modality partial correlations (within the same data type) were much stronger than the inter-modality partial correlations (between data types): only 5.4% of the edges were between proteins and lipids, and as few as 373 edges were between proteins and echocardiographic parameters and clinical biomarkers. This result clearly shows that the proteomic variation is largely independent of the lipidomic variation and imaging parameter differences, and both types of molecular data relate differently to the risk of future adverse outcomes.

The core segment of the network connecting echocardiographic measurements and clinical biomarkers to proteins, lipids, and acylcarnitines is shown in Figure 2. In this partial correlation network, the natriuretic peptides and hsTNI (TNI in purple color) were positively correlated with each other and BNP, NT-proBNP and hsTNI were negatively correlated with LVEF. Among the echocardiographic measurements, LVEDVi and LVESVi had the highest partial correlation (r = 0.475), followed by LVIDSi with LVIDDi (r = 0.459) and the negative correlation between LVEF and LVESVi (r = −0.375). Overall, only ten echocardiographic parameters (LA area, LA width, LVEF, LVMi, LVIDDi, LVIDSi, Peak S/D, a', s’ and E/e') were connected with clinical biomarkers in this subnetwork. Among those, the highest correlations were recorded between NT-proBNP and E/e’ (r = 0.058), between NT-proBNP and s' (r = −0.057), between BNP and LVEF (r = −0.053) and between BNP and a' (r = −0.050). This finding suggests that myocardial motion parameters captured by tissue Doppler imaging are particularly correlated with neurohormonal activation from the primary MI episodes.

FIGURE 2
www.frontiersin.org

Figure 2. Visualization of the partial correlation network connecting plasma proteins and lipids with echocardiographic imaging variables and cardiac biomarkers. Plasma proteins and lipids were drawn as nodes in light red and cyan, and echocardiographic variables and cardiac biomarkers in blue and light purple, respectively. Network edges were colored according to the signs of partial correlations, i.e. positive in red and negative in light purple.

Meanwhile, stronger correlations were observed among proteins, lipids, and acylcarnitines with clinical biomarkers than with echocardiographic parameters, suggesting that morphological and functional features provide orthogonal information to molecular markers during the early recovery post-discharge. The highest correlation among the connected proteins was between hsTNI and cardiac troponin T (TNNT2) (r = 0.206), followed by the correlation of creatinine with insulin-like growth factor binding protein 2 (IGFBP2) (r = 0.142), glycoprotein CD59 (r = 0.118), and cystatin M (CST6) (r = 0.115). Nine proteins were connected with at least three natriuretic peptides in the network, including positive correlation with angiopoietin-2 (ANGPT2), periostin (POSTN), R-spondin 4 (RSPO4), IGFBP2, thrombospondin 2 (THBS2), Spondin-1 (SPON1), vascular endothelial growth factor D (VEGFD) (also known as FIGF), bone morphogenetic protein 10 (BMP10), and negative correlations with bone morphogenetic protein 1 (BMP1) and coagulation factor×(F10). See Supplementary Table S3 for the table of regularized partial correlation estimates among the data features with the clinical biomarkers from the four modalities.

Predictive subnetwork signature of MACE

Using this estimated network as the background, we next applied the supervised analysis module of iOmicsPASS + for dimension reduction and to obtain subnetwork signatures of MACE (13). Table 2 reports the number of proteins, lipids, echocardiographic variables, and clinical biomarkers in the predictive signatures. The MACE signature included 524 edges connecting 211 nodes (mean cross-validated error 38.5%). Figure 3 visualizes the MACE signature, where the edges were colored by the sign of the d-scores for the MACE group (red if higher in MACE, blue otherwise). This subnetwork signature contains 194 proteins, five lipids, eight echo imaging parameters and all four natriuretic peptides. Although only five lipids were part of this network, four of them happen to be phosphatidylethanolamines (PE 34:1, 34:2, 35:2, 37:4).

FIGURE 3
www.frontiersin.org

Figure 3. Subnetwork signature of MACE. (A) Network visualization with plasma proteins, lipids and acylcarnitines, echocardiographic imaging variables, and clinical biomarkers in different colors. Edges were colored according to the signs of the d-scores computed by iOmicsPASS + and the thickness reflects the magnitude of the scores representing the discriminatory power. (B) Enrichment of biological functions in the constituent nodes in panel (A), accounting for tissue-specific expression of genes at the mRNA level (median-normalized TPM > 5 in each tissue). (C) Visualization of the subset of the network signature illustrating the plasma proteins connected to echocardiographic imaging variables and clinical biomarkers.

TABLE 2
www.frontiersin.org

Table 2. Summary of the predictive subnetwork signatures of lipids, proteins and clinical markers in two comparisons, separating event-free post-MI patients from those with a future major adverse cardiac outcome (MACE) and from HF patients.

The subnetwork consists of two segments. The majority (96.4%) were highly correlated proteins, lipids, imaging parameters and clinical biomarkers with higher d-scores (in red) for the MACE group. The other part was a protein-only network with lower centroids (in blue). Here, the main driver of separation between the two groups were the edges connecting one protein to another (90.6%). Twenty-two edges connected natriuretic peptides to plasma proteins, 13 connected echocardiographic variables to proteins, and six linked lipids to proteins. The edges with the highest d-scores for MACE were the joint performance of CAPG with several proteins including trefoil factor 3 (TFF3), FSTL3 and ephrin type-A receptor 2 (EPHA2). The top five highest d-scores for the event-free group were between mitogen-activated protein kinase kinase 4 (MAP2K4) with prolyl endopeptidase (FAP), superoxide dismutase in mitochondria (SOD2), histidine-rich glycoprotein (HRG) and between plasmin (PLG) and vascular endothelial growth factor receptor 2 (KDR). Six major hub proteins, CAPG, FSTL3, EPHA2, TFF3, tumor necrosis factor receptor superfamily member 19l (RELT) and beta-2-microglobulin (B2M) were densely connected to many other nodes (i.e., degree above 30). The detailed subnetwork signature is reported in Supplementary Table S4.

Incorporating tissue-specific mRNA expression levels of protein-coding genes in the heart, arteries, kidneys, liver and lungs (see Methods), we carried out biological pathway enrichment of the proteins in our signature for each tissue type, separately. MACE predictive proteins over-expressed in the heart were related to cell chemotaxis, cell development, ECM organization, involved in apoptotic signalling pathway, muscle structure development and homeostatic process (Figure 3B). Those over-expressed in the arteries largely overlapped with those expressed in the heart, with enrichment in cell projection organization, response to hormone, peptidyl-tyrosine phosphorylation and transmembrane receptor protein tyrosine kinase signalling pathway. Proteins specifically expressed in the liver were found to be enriched in inflammatory response and cytokine-mediated signalling pathways, as expected (Supplementary Table S5).

Using the echocardiographic variables and natriuretic peptides as endophenotype, the visualization of their first-degree neighbors in Figure 3C highlights plasma proteins worthy of further investigation, many of which are known biomarkers in various cardiovascular diseases. All network edges had positive d-scores (in red), suggesting that the combined co-expression scores were higher in the MACE group compared to event-free group. Among the eight echocardiographic imaging parameters in the signature, E/e' was most connected to plasma proteins, including parathyroid hormone (PTH), EFEMP1, FSTL3, RSPO4, THBS2 and SPON1. The pair with the highest discriminatory power and d-score was between the ratio E/e' with EFEMP1, followed by LV mass with FSTL3. BNP was connected to LV mass, LA area, LVEF, LVIDSi and E/e'; ANP was only connected with LV mass. NT-proBNP was connected to key proteins such as growth differentiation factor 15 (GDF15), EFEMP1, BMP10, IGFBP2 and RSPO4; NT-proANP was connected to the same proteins except RSPO4 and to additional proteins including B2M, CAPG, TFF3 and FSTL3.

Predictive subnetwork signature of HF

Next, we repeated the supervised analysis for the subgroup of patients who developed HF. iOmicsPASS + identified 566 edges in the HF signature with a lower mean cross-validated error of 26.7%. This subnetwork consists of 164 proteins, 8 lipids, 12 echo imaging parameters and 5 clinical biomarkers, all of which largely overlapped with the MACE signature (Table 2). Not surprisingly, echocardiographic variables contributed more to the signature with higher d-values in the prediction of HF. In addition to the eight variables in the MACE signature, four additional echocardiographic variables (LA width, LVEDVi, LVIDDi and a') were included in the HF signature. hsTNI, seven proteins including interleukin-1 receptor-like 1 (IL1RL1), POSTN, CD55 and CD93, two lipids (PE(O-36:4), PC(P-30:0)) and acylcarnitine C14:0 were also unique to this signature.

Similarly, the HF subnetwork signature showed two contrasting segments, one with higher centroids for HF (red edges) and the other with lower scores (blue edges) compared to event-free patients (Figure 4A). Most edges connected one protein to another (84.1%), 34 edges were between natriuretic peptides and proteins, 18 were between echo imaging parameters and proteins, and 12 were between lipids and proteins. Interestingly, the edges with the highest d-values for HF did not involve NT-proBNP. Instead, d-scores between CAPG and FSTL3, TFF3, B2M and EPHA2 (d-scores between 1.60 to 1.89) and between EFEMP1 and FSTL3, GDF15, B2M, RSPO4, BMP10 and E/e' (d-scores between 1.60 to 1.85) were higher than that between NT-proBNP and EFEMP1 (d-score = 1.32). The ratio E/e', a validated indicator of LV filling pressures, was also connected with other proteins such as FSTL3, RSPO4, THBS1, PTH, SPON1 and the four natriuretic peptides, illustrating its joint prognostic value in predicting future HF. Full results are reported in Supplementary Table S6.

FIGURE 4
www.frontiersin.org

Figure 4. Subnetwork signature of HF. (A) The edges were colored by the sign of the d-scores for the HF group (positive in red, and negative in blue, respectively). The thickness of edges reflects the magnitude of the scores. (B) Enrichment of biological functions in the constituent nodes in the panel (A) accounting for tissue-specific expression of genes at the mRNA level. (C) A subset of the HF signatures consisting of first-degree neighbors of echocardiographic variables and clinical biomarkers, with line properties of the edges set according to the signs of respective partial correlations (positive in solid, and negative in dashed).

Proteins in the signature with respective mRNA over-expression in the arteries and the heart were enriched for ECM organization, external encapsulating structure organization and an additional cell chemotaxis in the heart only. Proteins over-expressed in kidney were pro-inflammatory response, lipid response, cytokine production and chemical homeostasis, whereas the proteins of hepatic origin were related to adaptive immune response and cell-growth (Figure 4B). The results of the tissue-specific enrichment analyses are reported in Supplementary Table S7.

Focusing on the subnetwork signature involving the echocardiographic variables, natriuretic peptides and hsTNI, we visualized the important proteins connected with each marker (Figure 4C). Both ANP and NT-proANP were connected to BMP10; both BNP and NT-proBNP were connected to THBS2, RSPO4, IGFBP2, VEGFD/FIGF and SPON1. Only one protein IGFBP2 was strongly connected to all four natriuretic peptides. Among the echocardiographic measurements, other than E/e', LV mass connected with all four natriuretic peptides and two proteins (FSTL3, RELT). On the other hand, the ratio of LVEF to BNP, LV mass and TNNT2 (i.e., negative correlation) yielded higher d-scores for the HF group when compared to the event-free group.

Characterization of plasma protein markers by potential tissues of origin

To delineate the proteins directly associated with cardiac assault and tissue damage post-MI and those representing other systematic responses, we mapped the proteins to the tissue-enriched genes in the transcriptomic data provided by the Genotype-Tissue Expression (GTEx) database. Based on our definition of tissue-enriched genes (see Methods), 63.4% of the proteins in the secondary MACE signature and 66.5% of the proteins in the HF signature were enriched in at least one of the 54 tissues catalogued in the GTEx. Figure 5 shows the median normalized gene expression (in transcripts per million, TPM) of the tissue-enriched markers from the two signatures, illustrating the specificity of hub proteins to the five tissues related to the heart, arteries, kidneys, liver and skeletal muscle.

FIGURE 5
www.frontiersin.org

Figure 5. Tissue specific gene expression (median-normalized TPM values) of 36 plasma proteins in the MACE and HF signatures with specific enrichment in heart, arteries, kidneys, liver, and skeletal muscle according to the GTEx database.

In the MACE signature, 10.6% of proteins were specifically of hepatic origin, followed by lungs (9.8%) and arteries (5.7%). In the HF subnetwork signature, the majority were enriched in lungs (13.8%), followed by liver (8.3%) and kidney (5.5%) (Table 2). In both signatures, only four proteins were enriched in the heart tissues: BMP10, TNNT2, cysteine and glycine rich protein 3 (CSRP3) and hFABP (FABP3). Among the proteins enriched in the aortic and coronary arteries, EFEMP1, IGFBP2, THBS2, metalloproteinase inhibitor 1 (TIMP1), were shared between both signatures; biglycan (BGN), contactin-4 (CNTN4) and inhibin beta A chain (INHBA) were unique to the MACE signature, while POSTN was unique to HF signature. The table of tissue-enriched proteins is reported in Supplementary Table S8.

Predictive performance of subnetwork signatures in IMMACULATE cohort

Next, we assessed the predictive value of the two subnetwork signatures in 190 post-MI patients from the IMMACULATE registry, applying the same inclusion criteria. In IMMACULATE, we did not remove any patients with self-reported history of MI due to the small number of MACE cases and the information regarding how long ago the episode took place was not available. This cohort was recruited between 2011 and 2014 in Singapore (median follow-up period 4.2 years). The patients in IMMACULATE were substantially younger (mean = 53.3 ± 8 years) than those in CDCS, with a much larger proportion of current smokers (63.2%) compared to only 6.7% in CDCS and, majority of the patients were STEMI in IMMACULATE (53.5%) and NSTEMI in CDCS (70.3%). Thus, both cohorts represent post-MI patient populations from two different time periods with substantially different ethnic and genetic background and more contemporary clinical management during the follow-up in the latter. More details of the characteristics of IMMACULATE patients are provided in Table 3 and the comparison of the echocardiographic variables between event-free group and MACE group as well as HF group in IMMACULATE study is reported in Supplementary Table S9.

TABLE 3
www.frontiersin.org

Table 3. Clinical characteristics of post-AMI patients in IMMACULATE study.

In IMMACULATE, 38 patients had secondary MACE events, of which 23 were HF, representing a lower frequency of secondary MACE than CDCS, although there may be under-reporting of MACE as hospitalizations as unstable angina status was not collected. Despite these differences and in the absence of NT-proANP and ANP, the prognostic value of NT-proBNP as a single marker of MACE and HF remained exceptionally high. Supplementary Figure S2 clearly shows that NT-proBNP stratifies post-MI patients into three groups of well-separated risks (log-rank test P-values < 0.01) based on tertiles in both studies.

To see whether the incorporation of all other correlated molecular markers improves the predictive performance, we used the subnetwork signatures derived in the CDCS study to calculate the classification probability scores of MACE and HF for the IMMACULATE subjects. While we observed that the survival curves of high and low risk groups were well separated for MACE (P = 0.014) and HF (P = 0.036), the separation of survival curves showed at most a comparable separation of outcome groups to the results using NT-proBNP (Figure 6A). This observation was a direct testament to why natriuretic peptides have long overruled the discovery of new circulating biomarkers for incident cardiovascular events.

FIGURE 6
www.frontiersin.org

Figure 6. Risk stratification by the iOmicsPASS probability scores of future MACE and HF events in the cohorts. (A) Kaplan-Meier curves with stratification of patients by three categories based on the probability scores of MACE and HF in the training dataset (CDCS) and the validation dataset (IMMACULATE). (B) Boxplots of the co-expression scores of the four top scoring feature pairs predictive of HF and their respective ROC curves.

However, our predictive analysis approach to this data was to discover what other molecular and imaging parameters underlie the variable risk strata beyond NT-proBNP and what other biological processes are reflected in the circulation. Since iOmicsPASS + reports edge-wise co-expression values for pairs of data features in every patient, we were able to dissect the prognostic signals into network edges and examined their individual contributions to the prediction. In particular, we examined 41 pairs of data features with d-scores above 1.2 in absolute value (arbitrarily chosen threshold) and compared the AUC values in both CDCS and IMMACULATE. Of these, Table 4 shows that the top ten pairs of data features that had the highest AUC values above 0.73 without natriuretic peptides and up to 0.80 with NT-proBNP. All 11 proteins involved in these connections were network hub features: ECM-associated proteins EFEMP1, CAPG, FSTL3, and imaging parameter E/e'. Similar to the CDCS cohort, we discovered that the joint predictive powers of most of these pairs, as measured by the AUC value, were comparable to that of NT-proBNP in IMMACULATE. However, a few others such as the pair of FSTL3 and CAPG were not as reproducible between the two populations (Figure 6B), which explains the lack of improvement of the subnetwork signature in AUC values over the dominant NT-proBNP as a single marker.

TABLE 4
www.frontiersin.org

Table 4. The predictive performance and hazard ratios of the top ten pairs of data features with the highest AUC and largest magnitude of d-scores for HF in the CDCS and IMMACULATE cohorts.

Taken together, NT-proBNP remains the most predictive biomarker of secondary MACE and HF in this study. However, a combination of two ECM proteins (EFEMP1 and FSTL3) and E/e' proved to be powerful predictors of MACE and HF risk, and both protein markers can be explored as potential therapeutic targets in post-MI patients given the emerging evidence (2730).

Prognostic value of the network signatures by adjusting for clinical variables

Our network-based supervised method constructs a binary classifier without considering the event times and censored outcomes, and we purposefully ruled out other clinical variables in the main analysis. Nonetheless, we were able to demonstrate that the classification probability scores of MACE and incident HF using molecular data and echo imaging information showed significant differences in the Kaplan-Meier curves among the three risk groups. We next sought to determine if the prognostic value of the classification probability scores could be further improved by adjusting for clinical variables. We calculated the HRs per 10% increase in the probability scores for MACE and HF and evaluated if adding clinical variables improves the overall cNRI and IDI (Table 5).

TABLE 5
www.frontiersin.org

Table 5. The hazard ratios (HR), C-index, continuous net reclassification index (cNRI) and the integrated discrimination index (IDI) of classification probability scores from MACE and HF network signatures, adjusting for clinical variables and medication use at hospital discharge.

In CDCS, the HR per 10% increase in the probability score is associated with a HR of 1.09 (95% CI: 1.06–1.12) and 1.19 (95% CI: 1.14–1.24) times increase in risk of MACE and HF, respectively. After adjusting for age, gender and BMI, the HR becomes 1.10 (95% CI: 1.07–1.13) and 1.15 (95% CI: 1.10–1.21) for MACE and HF, respectively. The adjusted models did not improve the model's performance with low values of cNRI and IDI (p-value ≥ 0.05) for both outcomes. However, after further adjusting for ST-elevation status, hypertension and medication use at discharge, the adjusted HRs became 1.07 (95% CI: 1.04–1.10) for MACE and 1.12 (95% CI: 1.06–1.18) for HF, with significant improvement in reclassification for both outcomes (p-value < 0.05). Similarly for IMMACULATE, the fully adjusted HRs were 1.13 (95% CI: 1.04–1.22) for MACE and 1.11 (95% CI: 1.01–1.22) for HF and only the fully adjusted model showed significant improvement from the unadjusted model with significance for IDI (p-value < 0.05) and marginal significance for cNRI (p-value < 0.10). The results are summarized in Table 5.

Discussion

We used interpretable machine learning to connect the global landscape of plasma proteins and lipids with echocardiographic imaging variables and established circulating biomarkers in two independent patient cohorts hospitalized for AMI with heterogeneous and contrasting ethnic and genetic background. Plasma proteins carried stronger prognostic signals than lipids in both cohorts, and communities of plasma proteins were associated with increased risks of long-term secondary MACE and HF. However, we strongly suspect that the lower prognostic potential of lipids is largely affected by the nearly universal prescription of statins at hospital discharge. Notably, we identified new prognostic plasma proteins and echocardiographic variables with equivalent classification performance to existing biomarkers, with different biological and mechanical interpretation. The diastolic parameter E/e' outperformed LVEF as a prognostic variable, while two network hub proteins EFEMP1 and FSTL3 maintained their prognostic significance similar to NT-proBNP and other established circulating biomarkers. As blood is frequently the only available reporter tissue in CVD biomarker discovery, we also annotated individual proteins in terms of their potential tissue(s) of origin. Interestingly, the number of prognostic proteins of liver, lung and kidney origin outperformed those specific to the heart and arteries.

The network hub proteins with reproducible signals in the predictive subnetworks were EFEMP1 and FSTL3. EFEMP1, also known as fibulin-3, is an ECM glycoprotein implicated in vascular endothelium remodeling (31). It plays an important role in reducing vascular calcification and inhibiting metalloproteinases in oxidative stress (3234). FSTL3, an extracellular regulator of TGF-β family cytokines such as activin A, is involved in various biological functions including cell proliferation and inflammation, and altered transcriptomic regulation of the gene with another follistatin family member FSTL1 in myocardium has been associated with HF severity (30). FSTL3 has also been characterized as a stress-induced regulator of cardiac hypertrophy through Smad signalling pathway modulation in a mouse model (35), opening an opportunity for investigation of the follistatin family of proteins as a therapeutic target for the protection against HF.

Despite the lower prognostic power of lipids, both subnetwork signatures included four lipid transport proteins, namely APOL1, APOE, APOA1 and FABP3. FABP3 encodes hFABP, a marker of myocardial injury. hFABP was connected to a multitude of acylcarnitine species in both subnetwork signatures. It has previously been implicated in HF, where reduced fatty acid utilization in the heart leads to the progression of chronic HF, left ventricular hypertrophy and remodelling (36, 37). Intriguingly, here we found that circulating levels of hFABP and acylcarnitines (C12:0, C14:0, C14:1, C14:2, C16:1) were jointly increased in MACE and HF patients, suggesting that the circulating levels may have a dual role in both ischemic and HF events.

One of the most challenging issues in analyzing plasma molecular profile, especially among patients with predisposed conditions, is the use of medications and combination of drugs prescribed during the treatment of the primary episode, which could potentially confound the outcomes of interest and/or affect the protein or lipid levels in the blood. In both our studies, all patients were prescribed at least one type of medications at hospital discharge for the treatment of the primary AMI and most patients were on beta-blockers, aspirins and statins in both CDCS and IMMACULATE study. Thus, it is difficult to truly evaluate if the medication use to blood collection at baseline could potentially mediate the outcomes or affect the molecular profile. To this end, we investigated the medication use by the patients during hospital admission and discharge in both cohorts.

We first identified the types of medications directly associated with secondary MACE and HF. In CDCS, the use of beta-blockers, antiplatelet therapy (clopidogrel), calcium channel antagonists, long-acting nitrates, diuretics, statins and warfarin at hospital discharge were associated with both MACE and HF. Similarly, the same medications, except clopidogrel, and two additional medications (ACE inhibitors and aspirin use) at admission were associated with both outcomes. In IMMACULATE, medication information was collected for fewer drugs, but none were associated with MACE or HF. This is likely due to the small number of secondary outcomes and the resulting lack of statistical power. Then, we fitted separate Cox PH models on individual markers to obtain the unadjusted HR and HR adjusted for the medication use at discharge. Supplementary Figure S3 shows the scatter plot of the unadjusted and adjusted HR and data features that were part of the network signatures for MACE and HF are colored as blue (protein), green (lipid), salmon (clinical biomarker) and purple (echo imaging). The plots show that the HRs before and after adjusting for medication use remain unchanged and this observation suggests that the associations we reported are unlikely to be altered by medication use. The full statistical summary can be found in Supplementary Table S10.

Overall, our study had several limitations. First, the IMMACULATE cohort had a smaller size and fewer MACE and HF events compared to the CDCS cohort, leading to borderline statistical significance of several associations in the IMMACULATE cohort. Second, the acute phase of AMI and various interventions including pharmacotherapies and revascularization may have confounded the association between data features and events; we deliberately performed the proteomic and lipidomic analysis using blood samples collected at the 30-day post-MI timepoint instead of close to the time index AMI event in order to reflect a more “steady-state” after initial treatment and stabilization, yet we acknowledge that this sampling scheme may have introduced bias from heterogeneous treatment courses of the primary AMI events across the patients. Third, the number of proteomic targets measured was larger than that of the lipid targets by almost four-fold; this might partially explain the greater prognostic signals from proteins than lipids in our study, in addition to the effects from the medications with immediate impact on circulating lipid levels. Fourth, higher resolution imaging tools including cardiac magnetic resonance could have contributed a much richer set of imaging features to the networks. Last but not least, the proteomic assay platform could have suffered from the potential cross-reactivity of the SOMAmer reagents (e.g., across multiple member proteins of the same family such as follistatin and fibulin) as cautioned by an increasing number of publications (3840).

These shortcomings notwithstanding, our work is to the best of our knowledge the first network-based multivariate analysis of large-scale cohorts of post-MI patients to explore the interplay between gold-standard natriuretic peptides with echocardiographic imaging variables and circulating plasma proteins, lipids and acylcarnitine. With potential caveats in the choice of sampling points and the proteomic assay in mind, our analysis highlighted that there exist a plethora of under-recognized data features that not only jointly modulate the risk of post-MI HF but also open the potential for therapeutic intervention through modulation of ECM. In this study we highlighted two ECM proteins FSTL3 and EFEMP1, both elevated in response to the primary AMI event. According to our inference of tissue gene expression data, it is highly likely that they are secreted from the myocardium and the microvasculature and the aorta and they participate in key cell fate decisions and damaged myocardial tissue repair during the response. Overall, with the increasing throughput of proteomic assays and the power of interpretable machine learning techniques, we are confident that more promising cues for improved risk stratification and therapeutic intervention can be discovered to mitigate HF after AMI.

Data availability statement

The original contributions presented in the study are publicly available. This data can be found here: iOmicsPASS+ on open-source R package in the GitHub repository (https://github.com/cssblab/iOmicsPASSplus). The proteomic and lipidomic data in support of the findings of this study are available at https://github.com/Hiromikwl/Data_iOP. Patient-level clinical records, including echocardiographic imaging variables and clinical biomarkers can only be shared on reasonable request due to data privacy concerns.

Ethics statement

The studies involving human participants were reviewed and approved by New Zealand Multi-region Ethics Committee (CTY/02/02/018) Singapore's National Healthcare Group Domain Specific Review Board (DSRB 2013/00248 and 2013/00635). The patients/participants provided their written informed consent to participate in this study.

Author contributions

HK and HC developed iOmicsPASS + workflow and performed data analysis. QZ developed software for tissue specificity and enrichment analysis. SHT coordinated sample preparation and proteomic and lipidomic analysis. PI, BB, FT, and MRW performed lipidomics analysis. AP and RND performed project management and data collection for CDCS. JWP performed statistical analysis for CDCS, RT performed core laboratory echo analysis for CDCS. CP performed biomarker analysis for CDCS. WS and LH performed core laboratory echo analysis for IMMACULATE. TSH performed data collection for IMMACULATE. HK, HC, FT, AMR, and MC wrote the draft of the paper with contributions from all other authors. MC, MRW, AMR supervised the project and acquired funding. All authors contributed to the article and approved the submitted version.

Funding

This work was supported in part by grants from National Medical Research Council of Singapore (CIRG17may014 and MOH-000280 to MYC), from New Zealand: The Coronary Disease Cohort Study was funded by the Health Research Council of New Zealand (Program Grants 02/152, 08/070, 11/1070); National Heart Foundation of New Zealand; New Zealand Lotteries Grant Board; Foundation for Research, Science and Technology, and the Christchurch Heart Institute Trust, and National Medical Research Council of Singapore (to AMR). AMR holds the NZ Heart Foundation Chair of Cardiovascular Studies; the National University of Singapore via the Life Sciences Institute (LSI) (to MRW), National Research Foundation (NRFI2015-05, NRFSBP-P4 to MRW) and A*STAR (I1901E0040 to MRW); Singapore Ministry of Education (MOE2016/T2/1/001, NMRC/CG/M009/2017, MOE2018/T2/2/058, MOE-T2EP20121-0018 to HC). APP holds the Heart Foundation of New Zealand Foundation100 Fellowship. MYC receives salary support and research grant support from a clinician scientist award --senior investigator (NMRC MOH-000280-00), centre grant (NMRC CG21APR1008), Collaborative Centre Grant scheme (NMRC/CG21APRC006), MOH Health Services Research Program (MOH MH70:70/1-2017). MYC also received funding from an Externally Sponsored Clinical Research (ESCR) grant supported by Astra Zeneca but the funding has no involvement in the studies aforementioned in this article.

Acknowledgments

HK and HC developed iOmicsPASS+ workflow and performed data analysis. QZ developed software for tissue specificity and enrichment analysis. SHT coordinated sample preparation and proteomic and lipidomic analysis. PB, BB, FT, and MRW performed lipidomics analysis. APP and RND performed project management and data collection for CDCS. JWP performed statistical analysis for CDCS, RT performed core laboratory echo analysis for CDCS. CP performed biomarker analysis for CDCS. WMS and LHL performed core laboratory echo analysis for IMMACULATE. SHT performed data collection for IMMACULATE. HK, HC, FT, AMR, and MYC wrote the draft of the paper with contributions from all other authors. MYC, MRW, AMR supervised the project and acquired funding. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2023.1123682/full#supplementary-material.

References

1. Ngo D, Sinha S, Shen D, Kuhn EW, Keyes MJ, Shi X, et al. Aptamer-Based proteomic profiling reveals novel candidate biomarkers and pathways in cardiovascular disease. Circulation. (2016) 134(4):270–85. doi: 10.1161/CIRCULATIONAHA.116.021803

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Ganz P, Heidecker B, Hveem K, Jonasson C, Kato S, Segal MR, et al. Development and validation of a protein-based risk score for cardiovascular outcomes among patients with stable coronary heart disease. JAMA. (2016) 315(23):2532–41. doi: 10.1001/jama.2016.5951

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Meijers WC, Bayes-Genis A, Mebazaa A, Bauersachs J, Cleland JGF, Coats AJS, et al. Circulating heart failure biomarkers beyond natriuretic peptides: review from the biomarker study group of the heart failure association (HFA), European society of cardiology (ESC). Eur J Heart Fail. (2021) 23(10):1610–32. doi: 10.1002/ejhf.2346

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Lindsey ML, Mayr M, Gomes AV, Delles C, Arrell DK, Murphy AM, et al. Transformative impact of proteomics on cardiovascular health and disease: a scientific statement from the American heart association. Circulation. (2015) 132(9):852–72. doi: 10.1161/CIR.0000000000000226

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Chan D, Ng LL. Biomarkers in acute myocardial infarction. BMC Med. (2010) 8:34. doi: 10.1186/1741-7015-8-34

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Ibrahim NE, Januzzi JL Jr. Established and emerging roles of biomarkers in heart failure. Circ Res. (2018) 123(5):614–29. doi: 10.1161/CIRCRESAHA.118.312706

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Vidula MK, Orlenko A, Zhao L, Salvador L, Small AM, Horton E, et al. Plasma biomarkers associated with adverse outcomes in patients with calcific aortic stenosis. Eur J Heart Fail. (2021) 23(12):2021–32. doi: 10.1002/ejhf.2361

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Nikpay M, Soubeyrand S, Tahmasbi R, McPherson R. Multiomics screening identifies molecular biomarkers causally associated with the risk of coronary artery disease. Circ Genom Precis Med. (2020) 13(6):e002876. doi: 10.1161/CIRCGEN.119.002876

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Kornej J, Hanger VA, Trinquart L, Ko D, Preis SR, Benjamin EJ, et al. New biomarkers from multiomics approaches: improving risk prediction of atrial fibrillation. Cardiovasc Res. (2021) 117(7):1632–44. doi: 10.1093/cvr/cvab073

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Leon-Mimila P, Wang J, Huertas-Vazquez A. Relevance of multi-omics studies in cardiovascular diseases. Front Cardiovasc Med. (2019) 6:91. doi: 10.3389/fcvm.2019.00091

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Joshi A, Rienks M, Theofilatos K, Mayr M. Systems biology in cardiovascular disease: a multiomics approach. Nat Rev Cardiol. (2021) 18(5):313–30. doi: 10.1038/s41569-020-00477-1

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Chan MY, Efthymios M, Tan SH, Pickering JW, Troughton R, Pemberton C, et al. Prioritizing candidates of post-myocardial infarction heart failure using plasma proteomics and single-cell transcriptomics. Circulation. (2020) 142(15):1408–21. doi: 10.1161/CIRCULATIONAHA.119.045158

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Koh HWL, Fermin D, Vogel C, Choi KP, Ewing RM, Choi H. iOmicsPASS: network-based integration of multiomics data for predictive subnetwork discovery. NPJ Syst Biol Appl. (2019) 5:22. doi: 10.1038/s41540-019-0099-y

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics. (2008) 9(3):432–41. doi: 10.1093/biostatistics/kxm045

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Prickett TC, Doughty RN, Troughton RW, Frampton CM, Whalley GA, Ellis CJ, et al. C-Type natriuretic peptides in coronary disease. Clin Chem. (2017) 63(1):316–24. doi: 10.1373/clinchem.2016.257816

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Gold L, Ayers D, Bertino J, Bock C, Bock A, Brody EN, et al. Aptamer-based multiplexed proteomic technology for biomarker discovery. PLoS One. (2010) 5(12):e15004. doi: 10.1371/journal.pone.0015004

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Sahn DJ, DeMaria A, Kisslo J, Weyman A. Recommendations regarding quantitation in M-mode echocardiography: results of a survey of echocardiographic measurements. Circulation. (1978) 58(6):1072–83. doi: 10.1161/01.CIR.58.6.1072

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Schiller NB, Shah PM, Crawford M, DeMaria A, Devereux R, Feigenbaum H, et al. Recommendations for quantitation of the left ventricle by two-dimensional echocardiography. American society of echocardiography committee on standards, subcommittee on quantitation of two-dimensional echocardiograms. J Am Soc Echocardiogr. (1989) 2(5):358–67. doi: 10.1016/S0894-7317(89)80014-8

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Oh JK, Appleton CP, Hatle LK, Nishimura RA, Seward JB, Tajik AJ. The noninvasive assessment of left ventricular diastolic function with two-dimensional and Doppler echocardiography. J Am Soc Echocardiogr. (1997) 10(3):246–70. doi: 10.1016/S0894-7317(97)70062-2

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Quiñones MA, Otto CM, Stoddard M, Waggoner A, Zoghbi WA; Doppler Quantification Task Force of the Nomenclature and Standards Committee of the American Society of Echocardiography. Recommendations for quantification of Doppler echocardiography: a report from the Doppler quantification task force of the Nomenclature and standards committee of the American society of echocardiography. J Am Soc Echocardiogr. (2002) 15(2):167–84. doi: 10.1067/mje.2002.120202

CrossRef Full Text | Google Scholar

21. Zhang Z. Multiple imputation with multivariate imputation by chained equation (MICE) package. Ann Transl Med. (2016) 4(2):30. doi: 10.3978/j.issn.2305-5839.2015.12.63

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Rubin DB. Multiple imputation for nonresponse in surveys. New York: John Wiley & Sons, Inc (1987).

23. Abbate A, Toldo S, Marchetti C, Kron J, Van Tassell BW, Dinarello CA. Interleukin-1 and the inflammasome as therapeutic targets in cardiovascular disease. Circ Res. (2020) 126(9):1260–80. doi: 10.1161/CIRCRESAHA.120.315937

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Team RC. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing (2022).

25. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. (2003) 13(11):2498–504. doi: 10.1101/gr.1239303

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Mazumder R, Hastie T. The graphical lasso: new insights and alternatives. Electron J Stat. (2012) 6:2125–49. doi: 10.1214/12-EJS740

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Sharma P, Beck D, Murtha LA, Figtree G, Boyle A, Gentile C. Fibulin-3 deficiency protects against myocardial injury following ischaemia/reperfusion in in vitro cardiac spheroids. Front Cardiovasc Med. (2022) 9:913156. doi: 10.3389/fcvm.2022.913156

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Oshima Y, Ouchi N, Shimano M, Pimentel DR, Papanicolaou KN, Panse KD, et al. Activin A and follistatin-like 3 determine the susceptibility of heart to ischemic injury. Circulation. (2009) 120(16):1606–15. doi: 10.1161/CIRCULATIONAHA.109.872200

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Liu M, Mao C, Li J, Han F, Yang P. Effects of the activin A-follistatin system on myocardial cell apoptosis through the endoplasmic Reticulum stress pathway in heart failure. Int J Mol Sci. (2017) 18(2):374. doi: 10.3390/ijms18020374

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Lara-Pezzi E, Felkin LE, Birks EJ, Sarathchandra P, Panse KD, George R, et al. Expression of follistatin-related genes is altered in heart failure. Endocrinology. (2008) 149(11):5822–7. doi: 10.1210/en.2008-0151

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Zhang C, Yu C, Li W, Zhu Y, Ye Y, Wang Z, et al. Fibulin-3 affects vascular endothelial function and is regulated by angiotensin II. Microvasc Res. (2020) 132:104043. doi: 10.1016/j.mvr.2020.104043

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Lin Z, Wang Z, Li G, Li B, Xie W, Xiang D. Fibulin-3 may improve vascular health through inhibition of MMP-2/9 and oxidative stress in spontaneously hypertensive rats. Mol Med Rep. (2016) 13(5):3805–12. doi: 10.3892/mmr.2016.5036

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Chen JM, Guo RR, Gao Y, Wang CX, Yuan PN. Mechanism of action of profilin-1 and fibulin-3 in vascular remodeling in hypertensive rats. Eur Rev Med Pharmacol Sci. (2019) 23(18):8101–8. doi: 10.26355/eurrev_201909_19028

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Luong TTD, Schelski N, Boehme B, Makridakis M, Vlahou A, Lang F, et al. Fibulin-3 attenuates phosphate-induced vascular smooth muscle cell calcification by inhibition of oxidative stress. Cell Physiol Biochem. (2018) 46(4):1305–16. doi: 10.1159/000489144

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Shimano M, Ouchi N, Nakamura K, Oshima Y, Higuchi A, Pimentel DR, et al. Cardiac myocyte-specific ablation of follistatin-like 3 attenuates stress-induced myocardial hypertrophy. J Biol Chem. (2011) 286(11):9840–8. doi: 10.1074/jbc.M110.197079

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Okizaki A, Shuke N, Sato J, Sasaki T, Hasebe N, Kikuchi K, et al. A compartment model analysis for investigation of myocardial fatty acid metabolism in patients with hypertrophic cardiomyopathy. Nucl Med Commun. (2007) 28(9):726–35. doi: 10.1097/MNM.0b013e32828da1c7

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Neglia D, De Caterina A, Marraccini P, Natali A, Ciardetti M, Vecoli C, et al. Impaired myocardial metabolic reserve and substrate selection flexibility during stress in patients with idiopathic dilated cardiomyopathy. Am J Physiol Heart Circ Physiol. (2007) 293(6):H3270–8. doi: 10.1152/ajpheart.00887.2007

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Raffield LM, Dang H, Pratte KA, Jacobson S, Gillenwater LA, Ampleford E, et al. Comparison of proteomic assessment methods in multiple cohort studies. Proteomics. (2020) 20(12):e1900278. doi: 10.1002/pmic.201900278

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Candia J, Daya GN, Tanaka T, Ferrucci L, Walker KA. Assessment of variability in the plasma 7k SomaScan proteomics assay. Sci Rep. (2022) 12(1):17147. doi: 10.1038/s41598-022-22116-0

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Joshi A, Mayr M. In aptamers they trust: the caveats of the SOMAscan biomarker discovery platform from SomaLogic. Circulation. (2018) 138(22):2482–5. doi: 10.1161/CIRCULATIONAHA.118.036823

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: integrative analysis, multi-omics, echocardiogaphy, major adverse cardiac events (MACE), heart failure hospitalization

Citation: Koh HWL, Pilbrow AP, Tan SH, Zhao Q, Benke PI, Burla B, Torta F, Pickering JW, Troughton R, Pemberton C, Soo W-M, Ling LH, Doughty RN, Choi H, Wenk MR, Richards AM and Chan MY (2023) An integrated signature of extracellular matrix proteins and a diastolic function imaging parameter predicts post-MI long-term outcomes. Front. Cardiovasc. Med. 10:1123682. doi: 10.3389/fcvm.2023.1123682

Received: 14 December 2022; Accepted: 20 March 2023;
Published: 12 April 2023.

Edited by:

Julia Ramírez, Queen Mary University of London, United Kingdom

Reviewed by:

Chang Liu, Emory University, United States
Simon Kraler, University of Zurich, Switzerland
Christoph Sinning, University Medical Center Hamburg-Eppendorf, Germany

© 2023 Koh, Pilbrow, Tan, Zhao, Benke, Burla, Torta, Pickering, Troughton, Pemberton, Soo, Ling, Doughty, Choi, Wenk, Richards and Chan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mark Richards mark.richards@nus.edu.sg Mark Chan mark.chan@nus.edu.sg

Specialty Section: This article was submitted to Clinical and Translational Cardiovascular Medicine, a section of the journal Frontiers in Cardiovascular Medicine

Download