ORIGINAL RESEARCH article
Sec. Respiratory Physiology and Pathophysiology
Volume 14 - 2023 | https://doi.org/10.3389/fphys.2023.1144192
Machine learning for screening of at-risk, mild and moderate COPD patients at risk of FEV1 decline: results from COPDGene and SPIROMICS
- 1Division of Pulmonary and Critical Care Medicine, University of Michigan, Ann Arbor, MI, United States
- 2Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, United States
- 3Weill Cornell Medical College, New York, NY, United States
- 4Medical Service, VA Ann Arbor Healthcare System, Ann Arbor, MI, United States
- 5Department of Radiology, University of Iowa, Iowa City, IA, United States
- 6Department of Radiology, University of Michigan, Ann Arbor, MI, United States
- 7Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI, United States
- 8Imbio Inc., Minneapolis, MN, United States
Purpose: The purpose of this study was to train and validate machine learning models for predicting rapid decline of forced expiratory volume in 1 s (FEV1) in individuals with a smoking history at-risk-for chronic obstructive pulmonary disease (COPD), Global Initiative for Chronic Obstructive Lung Disease (GOLD 0), or with mild-to-moderate (GOLD 1–2) COPD. We trained multiple models to predict rapid FEV1 decline using demographic, clinical and radiologic biomarker data. Training and internal validation data were obtained from the COPDGene study and prediction models were validated against the SPIROMICS cohort.
Methods: We used GOLD 0–2 participants (n = 3,821) from COPDGene (60.0 ± 8.8 years, 49.9% male) for variable selection and model training. Accelerated lung function decline was defined as a mean drop in FEV1% predicted of > 1.5%/year at 5-year follow-up. We built logistic regression models predicting accelerated decline based on 22 chest CT imaging biomarker, pulmonary function, symptom, and demographic features. Models were validated using n = 885 SPIROMICS subjects (63.6 ± 8.6 years, 47.8% male).
Results: The most important variables for predicting FEV1 decline in GOLD 0 participants were bronchodilator responsiveness (BDR), post bronchodilator FEV1% predicted (FEV1.pp.post), and CT-derived expiratory lung volume; among GOLD 1 and 2 subjects, they were BDR, age, and PRMlower lobes fSAD. In the validation cohort, GOLD 0 and GOLD 1–2 full variable models had significant predictive performance with AUCs of 0.620 ± 0.081 (p = 0.041) and 0.640 ± 0.059 (p < 0.001). Subjects with higher model-derived risk scores had significantly greater odds of FEV1 decline than those with lower scores.
Conclusion: Predicting FEV1 decline in at-risk patients remains challenging but a combination of clinical, physiologic and imaging variables provided the best performance across two COPD cohorts.
Chronic obstructive pulmonary disease (COPD) is characterized by a variety of clinical phenotypes and disease courses that are often difficult to predict (Miravitlles et al., 2013; Mathioudakis et al., 2020). There is a pressing need to create actionable tools to assist clinicians and researchers in identifying patients who are at higher risk for accelerated lung function decline so that early, directed therapies can be appropriately initiated. In addition to conventional markers of lung function decline using pulmonary function testing (PFT) metrics, novel advanced chest imaging analytic techniques are currently being explored to identify high risk patients. Among these techniques are Parametric Response Mapping (PRM), which co-registers inspiratory and expiratory images to distinguish between normal lung, emphysema, and non-emphysematous air trapping (functional small airways disease, fSAD). Prior studies using PRM have demonstrated the association between fSAD and 5-year forced expiratory volume in 1 s (FEV1) decline, progression of emphysema and exacerbation risk (Bhatt et al., 2016; Han et al., 2017; Labaki et al., 2019).
Vascular remodeling is also prevalent among COPD patients and believed to be part of the pathogenesis of this disease. CT scans have been used to visualize changes in distal pruning of blood vessels (Rahaghi et al., 2019). The ratio of blood volume in vessels with a cross-sectional area < 5 mm2 to total blood vessel volume (TBV) has been proposed as an imaging biomarker and, consistent with this theory, has been shown to decrease as COPD progresses (Estepar et al., 2013). Increased airway wall thickness (AWT) has also been associated with more frequent COPD exacerbations (Han et al., 2011) and with greater FEV1 decline and development of airflow limitation in smokers (Mohamed et al., 2015). AWT is often measured using Pi10, the square root of an airway wall area with a 10 mm lumen perimeter (Nakano et al., 2005).
Predictive models have been widely used for prediction of clinically meaningful outcomes in subjects with COPD. These models have been used to identify factors that place patients at-risk-for exacerbations and hospital admissions and readmissions and to demonstrate the effect of smoking reduction on FEV1 decline (Simmons et al., 2005; Bahadori et al., 2009; Bertens et al., 2013). Predictive models have also been combined with deep learning methods to assist in staging COPD severity and predict disease progression using automated CT staging to quantify the degree of emphysema and air trapping visualized on images (Hasenstab et al., 2021). Predicting risk of mortality, both in patients admitted to the ICU with exacerbations (Jain et al., 2018) and those in the outpatient primary care setting using a variety of predictive modeling methods (Kiddle et al., 2020) has been extensively studied. However, prediction of FEV1 decline has remained challenging with currently available risk models.
The purpose of this study was to train and evaluate logistic regression prediction models for rapid FEV1 decline over a 5-year time span in ever-smoking participants of the Genetic Epidemiology of Chronic Obstructive Pulmonary Disease (COPDGene) study who were either at-risk-for COPD (smoking history but normal spirometry, GOLD 0) or with mild-moderate COPD (GOLD 1–2). We used a variable importance methodology to select and rank a subset of variables that were important for prediction of rapid FEV1 decline (defined as a drop in FEV1% predicted of > 1.5%/year). Models using only imaging biomarkers and data readily available from the Digital Imaging and Communications in Medicine (DICOM) header (i.e., age, sex) were also trained and compared to full data models to determine if data captured only in a CT scan has adequate predictive value. Finally, we externally validated these prediction models in the large Subpopulations and Intermediate Outcome Measures in COPD (SPIROMICS) cohort.
Materials and methods
This study is a retrospective analysis of prospectively acquired data obtained from two large North American cohorts. Both studies were IRB-approved at all clinical centers, elicited written informed consent from all participants, and were compliant with the Health Insurance Portability and Accountability Act (HIPAA).
COPDGene (ClinicalTrials.gov Identifier: NCT 00608764) is an ongoing NIH-sponsored, prospective, multicenter (n = 21), observational cohort study starting in November 2007 and consisting of more than 10,000 individuals who were current or former smokers at the time of enrollment. COPDGene aims to understand the etiology, progression, and heterogeneity of COPD (Regan et al., 2010). Inclusion criteria were age 45–80 years old at baseline visit, > 10 pack-years cigarette smoking history, and non-Hispanic white or African American race. Exclusion criteria were other lung diseases, pregnancy, cancer other than skin cancer in the 5 years prior to study entry, receiving antibiotics for a COPD exacerbation in the month prior to study enrollment, and relative of a previously enrolled participant.
SPIROMICS (ClinicalTrials.gov Identifier: NCT 01969344) is an ongoing NIH-sponsored prospective, multicenter (n = 12), observational cohort study starting in November 2010 and consisting of 2,981 current, former, and never-smokers at the time of enrollment (Couper et al., 2014). Inclusion criteria were age 40–80 years old at baseline visit, >20 pack-year cigarette smoking history for current or former smokers, and meeting lung function criteria based on spirometry without bronchodilators. An extensive list of exclusion criteria can be found on the ClinicalTrials.gov website (https://clinicaltrials.gov/ct2/show/NCT01969344).
Analysis utilized inspiratory and expiratory chest CT scans from both COPDGene (Regan et al., 2010; Han et al., 2011) and SPIROMICS cohorts (Sieren et al., 2016). In COPDGene, CT scanning occurred at three phases between 2007 and 2022. The intervals between phases were Phase 1–2 5.68 ± 0.89 (mean ± standard deviation, SD) years, Phase 2-3 4.60 ± 0.63 years, and Phase 1–3 10.09 ± 0.40 years. In SPIROMICS, CT scanning occurred at five timepoints between 2010 and 2022, with the baseline and fifth timepoint used in this analysis occurring 6.13 ± 1.05 years apart.
For both studies, COPD was defined by a post-bronchodilator FEV1/FVC (forced vital capacity) < 0.7 at the baseline visit, as specified in the GOLD guidelines at the time of study inclusion (Rabe et al., 2007). GOLD grades 1–4, based on post-bronchodilator spirometry, were used to define disease severity (GOLD 1, FEV1 ≥ 80% predicted; GOLD 2, FEV1 50%–79% predicted; GOLD 3, FEV1 30%–49% predicted; and GOLD 4, FEV1 < 30% predicted), with GOLD 0 classification defined by a post-bronchodilator FEV1/FVC ≥ 0.7 and FEV1% predicted ≥ 80%. We performed risk modeling based on GOLD category on the following numbers of participants: in COPDGene, GOLD 0 (n = 2,298) and GOLD 1–2 (n = 1,523), and in SPIROMICS, GOLD 0 (n = 385) and GOLD 1–2 (n = 500) (Figure 1; Tables 1, 2).
FIGURE 1. Consolidated Standards of Reporting Trials (CONSORT) diagram for model training. (COPDGene Phase 1), 5-year internal validation (COPDGene Phase 2–3), 10-year internal validation (COPDGene Phase 1–3), and external validation (SPIROMICS) datasets.
TABLE 1. Summary statistics for COPDGene Phase 1, COPDGene Phase 2, and SPIROMICS cohorts for GOLD 0 subjects.
TABLE 2. Summary statistics for COPDGene Phase 1, COPDGene Phase 2, and SPIROMICS cohorts for GOLD 1–2 subjects.
For all scans, Imbio Inc. Lung Density Analysis version 3.1 (Minneapolis, MN, United States) was used to perform PRM, lung volume measurements, TBV, and tBV5 (the ratio of vascular tree length in vessels with a cross-sectional area less than 5 mm2 to total vascular tree volume). PRMemphysema is defined as the percentage of lung voxels less than −950 Hounsfield units (HU) on the inspiratory CT scan and less than −856 HU on the expiratory CT scan following deformable registration; PRMfSAD is defined as the percentage of lung voxels greater than or equal to −950 HU on the inspiratory CT scan but less than −856 HU on the expiratory CT. PRM metrics were binarized into clinically significant (PRMemphysema > 2.0%, PRMfSAD >10.0%) and non-clinically significant categories. The 2% and 10% thresholds were chosen by finding the thresholds that had the highest combination of sensitivity and specificity for univariate prediction of rapid decline in GOLD 0–2 subjects. Bone Mineral Density Analysis v.0.1 (Imbio Inc.) was used to compute the average HU value within the trabecular region of the T12 vertebral body (BMDT12). Airway analysis metrics such as mean segmental AWT and AWT for all airways with an internal perimeter of 10 mm (Pi10) were computed and analyzed in the feature selection step, but could not be used because different software vendors processed data for COPDGene (Thirona) and SPIROMICS (Vida Diagnostics), and thus, the high inter-software variability associated with airway measurements precluded comparisons of models using this data.
At the time this study was conducted, co-author CH, who performed the following statistical analysis and developed the machine learning models, was employed by Imbio Inc. Neither Imbio nor CH had influence over participant inclusion.
Definition of accelerated FEV1 decline
There is no universally accepted definition of accelerated FEV1 decline. Martinez et al. defined it as 60 mL/year, roughly double the normal rate of decline in non-smokers (Martinez et al., 2018). However, in the COPDGene cohort, significantly more men than women fit this categorization. To develop sex-agnostic models, we instead categorized rapid progression as a drop of more than 1.5% FEV1 percent predicted per year (
We initially considered a set of K = 29 features consisting of symptoms, spirometry, demographic, and CT imaging biomarker data (Figure 2). To simplify the models and to increase reliability of the inputs, we chose to not use self-reported data. A 29 × 29 table of Spearman correlations between each feature was generated to assess feature redundancy (Parr et al., 2020) using data from GOLD 0–2 subjects. Features that had a Spearman correlation > 0.80 with any other variables were considered for removal due to information redundancy. For example, we retained weight and height but removed body mass index (BMI) because weight and height were independent from each other, but weight was highly correlated to BMI. Forced mid-expiratory flow (FEF25.75) was removed because it correlated strongly with FEV1/FVC. Finally, of the PRM imaging biomarker variables, PRMupper lobes emphysema and PRMlower lobes fSAD had the lowest correlation and thus were retained.
FIGURE 2. Spearman correlation matrix for the initial 29 variables. Spearman correlations > 0.80 are highlighted in orange. Strong correlations exist between PRM imaging biomarker variables, FEV1/FVC and FEF25-75, and BMI and weight. Abbreviations: bdr, bronchodilator responsiveness; sgrq, St. George’s Respiratory Questionnaire; BMI, body mass index; FEV1.pp.post, post bronchodilator FEV1% predicted; FVC.pp.post, post bronchodilator FVC % predicted; FEV1.FVC.post, post bronchodilator FEV1/FVC; FEF.25.75.post, post bronchodilator forced mid-expiratory flow; walk.dist, 6-min walk distance; pi10.thirona, square root of an airway wall area with a 10 mm lumen perimeter, measured by Thirona software; awt.thirona, segmental airway wall thickness, measured by Thirona software; prm.total.fsad, PRM total fSAD > 10%; prm.total.emphysema, PRM total emphysema > 2%; prm.upper.fsad, PRM upper lobes fSAD > 10%; prm.lower.fsad, PRM lower lobes fSAD > 10%; prm.upper.emphysema, PRM upper lobes emphysema > 2%; prm.lower.emphysema, PRM lower lobes emphysema > 2%; insp.ct.vol, CT-derived inspiratory lung volume; exp.ct.vol, CT-derived expiratory lung volume; tbv, total blood vessel volume; t.bv5, ratio of vascular tree length in vessels with a cross-sectional area less than 5 mm2 to total vascular tree volume; trabecular.T12, average Hounsfield unit (HU) value within the trabecular region of the T12 vertebral body; cortical.T12, average Hounsfield unit (HU) value within the cortical region of the T12 vertebral body.
Following manual feature pruning, we conducted a “drop-column” feature-importance ranking and data-driven pruning procedure for the remaining 22 parameters. Classification models were trained using COPDGene Phase 1 data, with the outcome
In addition to a full variable model considering all variables that survived feature selection, we also created limited models using only imaging biomarkers (i.e., lung volumes, PRM, vascular, and bone density measurements) and data readily available from the DICOM header (i.e., age, sex), referred to hereafter as “CT-limited model.” The purpose of this analysis was to determine the predictive value of data that could only be obtained from a CT scan file within the electronic medical record.
Machine learning methods and parameters
A wide variety of machine-learning techniques can be used to predict biological outcomes from multiple sources of data (Garavand et al., 2022). In this study, models were developed and validated using logistic regression with an L1 regularization penalty (a.k.a. LASSO) due to simplicity of implementation (i.e., few hyper-parameters) compared to other popular machine learning frameworks such as Random Forests or XGBoost. The L1 regularization penalty was employed to learn models that have a sparse set of significant predictor variables. We used the scikit-learn software package v1.0.2 within Python version 3.8.10 for all model development and validation. Model hyper-parameters were optimized using a Randomized Cross Validation search algorithm (sklearn.model_selection.RandomizedSearchCV). The cross validation hyper-parameter search resulted in an optimal inverse regularization penalty weight of C = 0.3.
Internal and external validation
Internal validation was performed in two ways. First, we applied the models developed on the COPDGene Phase 1 training data to the COPDGene Phase 2 data, where the output variable was the
Relative risk based on model output probabilities
Following model generation, we recorded the logistic regression output probabilities associated with the lower 25th (p25) and upper 75th (p75) percentiles of the training data. We computed the relative risk between participants with FEV1%pred decline risk > p75 and those with risk < p25 for the training dataset (COPDGene Phase 1-Phase 2), the internal validation dataset (COPDGene Phase 2-Phase 3), and the external validation dataset. Additionally, we looked at the relative risk of rapid decline over 10 years by examining subjects that had
Results are separated into 3 broad sections: 1) feature selection to create the training model, 2) within the COPDGene cohort cross validation and 3) external SPIROMICS cohort validation, followed by a section on relative risk.
Within cohort analysis
Figure 3 depicts all features that survived the variable selection procedure during model training in models developed on COPDGene Phase 1-Phase 2 data. In the full-variable model for the GOLD 0 cohort, higher post bronchodilator FEV1% predicted (FEV1pp.post), bronchodilator responsiveness (BDR), greater expiratory CT volume, and a higher St. George’s Respiratory Questionnaire (SGRQ) score were the most important variables for predicting rapid FEV1 decline. Less significant variables included a lower TBV, male sex, and current smoking status.
FIGURE 3. Features that survived the drop-column variable selection procedure for each GOLD group and model type (top: GOLD 0; bottom: GOLD 1–2). The mean ±1.96 × SD decrease in AUC related to dropping each feature is plotted only for features that significantly affected the model. Red bars indicate a higher risk of rapid FEV1 decline with increasing value, having bronchodilator reversibility, African American race, and/or being a current smoker. Blue bars indicate a higher risk of rapid FEV1 decline with decreasing value and male sex. Abbreviations: fev1.pp.post, post bronchodilator FEV1% predicted; bdr, bronchodilator responsiveness; exp.ct.vol, CT-derived expiratory lung volume; sgrq, St. George’s Respiratory Questionnaire; tbv, total blood vessel volume; fev1.fvc.post, post bronchodilator FEV1/FVC; walk.dist, 6-min walk distance; prm.lower.fsad, PRM lower lobes fSAD > 10%; cortical.T12, average HU value in the cortical region of the T12 vertebra; prm.upper.emphysema, PRM upper lobes emphysema > 2%; t.bv5, ratio of vascular tree length in vessels with a cross-sectional area less than 5 mm2 to total vascular tree volume.
In the full-variable model for the GOLD 1–2 cohort, BDR, younger age, PRMlower lobes fSAD > 10%, and higher blood vessel volume were the most important variables predicting rapid FEV1 decline. Less significant variables included higher SGRQ score, lower weight, higher FEV1pp.post, current smoking status, higher expiratory CT volume, shorter height and 6-min walk distance (6MWD), African American race, and PRMupper lobes emphysema > 2%.
Within cohort validation
Following model training, we validated the CT-limited COPDGene models using the same training variables within the COPDGene cohort. This process used data from both the 5-year (P2–P3) and 10-year (P1–P3) follow-up periods. For the GOLD 0 models, AUCs were similar to those from the training dataset, whereas the GOLD 1–2 models were still significant but had much lower AUCs during the 5-year follow up (P2–P3) than during training (Table 3).
Surprisingly, AUCs increased for all models in the 10-year validation (P1–P3). This result is atypical in machine learning, as model AUCs tend to be highest in training data. A possible explanation is that we employed the same pool of independent variables used for training to predict the same signal, but over a longer period of time. This process might decrease the noise inherent in serial FEV1 measurements, and thus increase the prediction accuracy.
External cohort validation
External validation on SPIROMICS data resulted in trends similar to COPDGene P2–P3 data, showing slight predictive drops in performance. The full variable models were significant (p < 0.05) for both the GOLD 0 and GOLD 1–2 groups; however, the performance of the CT-limited models declined and were no longer significant for GOLD 0 participants. A possible explanation is that there were only roughly half as many SPIROMICS participants as in the COPDGene P2–P3 analysis.
In general, GOLD 1–2 models consistently performed better than GOLD 0 models. This finding suggests that accelerated lung function decline can be predicted with greater accuracy in those who already have spirometric evidence of airflow obstruction. Additionally, the prediction accuracy of the CT-limited models was much lower than the full variable models across all training and validation groups.
We next computed relative risk between two groups: those with rapid decline risk greater than the upper 75th (p75) versus those with decline less than the lower 25th (p25) (Table 4). For this purpose, we used the logistic regression output probabilities of the training data for all four datasets. The highest relative risk was consistently seen in the 10-year internal validation group (COPDGene P1–P3), suggesting that over longer follow-up, these predictive models can effectively discriminate those at highest risk of accelerated lung function decline.
TABLE 4. Relative risk of rapid FEV1 decline associated with risk score greater than the upper 75th percentile (p75) versus risk less than the lower 25th (p25).
Focusing only on GOLD 1–2 participants in the COPDGene P1–P3 group, the full model can predict an over 8-fold increased risk of accelerated FEV1 decline among p75 versus p25 participants. Relative risk was lower in all the tested cohorts when using the CT-limited model, consistent with prior results in this analysis.
Overall, these relative risk profiles suggest that the full variable models could effectively identify those at highest risk of lung function decline with sufficient accuracy to provide clinical relevancy and utility in decision making. By contrast, the CT-limited models lack significantly increased relative risk in the SPIROMICS data, despite promising results in the COPDGene 10-year analysis. This finding highlights the need for further external validation using 10-year follow-up data to determine more conclusively whether screening for rapid FEV1 decline will be practically feasible using only data from DICOM files.
We studied the important clinical question of whether machine learning techniques can predict which ever-smokers, either at-risk-for COPD or with mild-moderate disease, are at highest risk for accelerated FEV1 decline. In an ideal world, one may imagine that identifying individuals at-risk-for disease progression only from data available in a chest CT would allow for increased availability of data to clinicians in a potentially automated fashion. Using logistic regression models built based on a combination of radiographic and clinical features, we show that this approach is feasible.
We were able to identify the most relevant imaging biomarker features predicting accelerated lung function decline, which importantly differed between those with established disease and those at-risk. Key variables in GOLD 1–2 participants included PRMlower lobes fSAD and demographic information such as age, expiratory lung volume in GOLD 0 participants, and pulmonary function results, particularly BDR, in both groups. Our models showed only modest decreases in predictive strength, as indicated by AUC analysis, in the external validation cohort, and improved at longer intervals in the training set (COPDGene), for which longer follow-up data were available. In the latter group, the full variable models were able to predict an 8-fold difference in relative risk between those in the highest versus lowest quartile of lung function decline. These findings support the use of combined radiographic and clinical data models to select participants for therapeutic trials of potentially disease-modifying agents in COPD.
A strength of this analysis is that the input variables in the full models are objective, and most are readily available, due to the use of spirometry rather than more complex pulmonary function testing and to the fact that many individuals also have recent CT scans. In the CT-limited model, additional variables, including TBV, were also found to be significant contributors to prediction of accelerated FEV1 decline, though with interesting differences based on COPD status. Thus, FEV1 decline was associated with lower TBV in GOLD 0, but with higher TBV in GOLD 1–2. One possible explanation for this disparity is the known association of distal pruning of pulmonary vessels with early COPD progression (Estepar et al., 2013; Synn et al., 2019; Weatherald et al., 2019). However, further studies are needed to understand why this relationship is reversed in later disease stages, where there may be additional anatomic changes associated with development of pulmonary hypertension (Elwing and Panos, 2008).
Our finding of the strong predictive impact of BDR is also noteworthy, as BDR has recently been associated with thicker airway walls in COPD subjects and decline in lung function (Donaldson et al., 2005; Kim et al., 2014). This may be due to active inflammatory disease at the level of the distal, smaller airways, leading to loss of lung function over time (Barjaktarevic et al., 2019).
Our results agree with a recent study that used similar machine learning techniques to predict progression of FEV1 in COPDGene (Boueiz et al., 2022) and that also utilized P1–P2 and P1–P3 data for internal validation. Their machine learning models, both logistic regression and random forest, were able to predict an absolute cross-sectional FEV1 at follow-up visits with excellent results (ROC-AUCs > 0.9). However, similar to our work, they had greater difficulty predicting change in FEV1 over time, with ROC-AUCs around 0.7 for both types of their models. This may be due to the fact that the relative changes in FEV1 over a few years are small and result in low signal-to-noise ratios. We extend those findings via use of novel PRM and vascular biomarkers in our prediction models, as well as by external validation in a separate cohort to improve their performance despite these inherent limitations.
The overall modest AUCs across our models, ranging roughly between 0.5–0.7, imply that FEV1 is an imprecise (i.e., noisy) dependent variable. Models trained on noisy binary classification outcomes can still converge upon an optimal solution, as long as noise is symmetric, i.e., the same odds of false positive and false negative measurements, and there are sufficient observations (Lugosi, 1992). That AUCs were highest for GOLD 1-2 participants was expected, and in agreement with the analysis of Boueiz et al. (2022), as FEV1 decline in COPD is most rapid in this group, making the signal-to-noise ratio highest (D'Amato et al., 2016). Also as expected, we found that pulmonary function, symptoms, and more extensive demographic data improved prediction accuracy over using imaging variables alone. In particular, BDR was consistently the most important non-imaging variable, with SGRQ score also providing significant information.
That some of our findings were no longer statistically significant in the CT-limited model for GOLD 0 indicates that screening this group using imaging data alone is inadequate. These findings further emphasize the challenge of predicting FEV1 decline in ever-smokers without airflow obstruction. While currently available clinical parameters alone do not adequately predict risk in this group, it is promising that when combined with radiographic variables, these novel machine learning techniques can be harnessed to assist in clinical decision making in a relatively short period of time. There remains great potential to harness these techniques for a variety of clinical applications to fill patient care gaps. For example, the ability to better phenotype patients will allow researchers to identify suitable subjects for clinical trials, target subjects at greatest risk for accelerated lung function decline, and more rapidly assess response to new therapies.
We also acknowledge several limitations to our work and have identified key goals moving forward. We only tested logistic regression models using machine learning techniques for feature selection. Although we did not test other algorithms, notably decision tree models, we replicated findings of a recent study that did. Prior studies have attempted to phenotype subjects at risk of lung function decline using machine learning methods such as decision tree models in other cohorts, but did not adjust for decline associated with sex, which was the greatest predictor of decline (Nikolaou et al., 2021). As stated above, prediction of FEV1 remains challenging, and continued refinement and optimization of our models will be required before these techniques can be applied clinically to account for lung function decline that may be fairly subtle over a few years. A wealth of biomarker data correlates to risk of COPD progression, exacerbations, and mortality (Leitao Filho et al., 2020; Tanimura et al., 2020; Pratte et al., 2021; Serban et al., 2021; Singh et al., 2021) that was not included in our current models, but in future iterations would be important to consider. Survivorship bias is another limitation in longitudinal observational cohort studies, especially in subjects with advanced COPD and increased short term mortality risk, that may influence our results. Finally, most of the longitudinal follow-up in this study occurred over a 5-year period, with some additional analysis based on 10-year follow-up data. However, a longer follow-up period may further enhance our model’s performance and predictive ability.
In summary, we determined that a combination of radiographic and clinical variables can help predict which individuals are at highest risk for rapid FEV1 decline. While logistic regression models trained on a limited but more easily obtainable dataset captured from CT scans also had significant predictive value, these models were not as accurate. Predicting FEV1 decline continues to be challenging and there remains a strong unmet clinical need for further refinement of these models.
Data availability statement
The data analyzed in this study is subject to the following licenses/restrictions: The datasets presented in this study are not readily available because they are part of NIH sponsored clinical trials and require a data-use agreement to be signed. For access to COPDGene data visit https://www.copdgene.org/phase-1-study-documents.htm for instructions. For access to SPIROMICS data, visit https://www.spiromics.org/spiromics/obtaining-data for instructions. Requests to access these datasets should be directed to COPDGene: https://www.copdgene.org/phase-1-study-documents.htm; SPIROMICS: https://www.spiromics.org/spiromics/obtaining-data.
CH processed the imaging data with quantitative CT (QCT) biomarker software from Imbio. CH designed the analysis and CH and JW analyzed the data. JW drafted the manuscript with editing from CH, WL, and MH. All authors reviewed and approved the final version of the manuscript for publication.
This work was supported by NHLBI Grant R01 HL150023 and by NHLBI Grants U01 HL089897 and U01 HL089856, which support the COPDGene study. The COPDGene study (NCT00608764) is also supported by the COPD Foundation through contributions made to an Industry Advisory Committee comprised of AstraZeneca, Bayer Pharmaceuticals, Boehringer-Ingelheim, Genentech, GlaxoSmithKline, Novartis, Pfizer and Sunovion. SPIROMICS was supported by contracts from the NIH/NHLBI (HHSN 268200900013C, HHSN 268200900014C, HHSN 268200900015C, HHSN 268200900016C, HHSN 268200900017C, HHSN 268200900018C, HHSN 268200900019C, HHSN 268200900020C), grants from the NIH/ NHLBI (U01 HL137880 and U24 HL141762), and supplemented by contributions made through the Foundation for the NIH and the COPD Foundation from AstraZeneca/MedImmune; Bayer; Bellerophon Therapeutics; Boehringer-Ingelheim Pharmaceuticals, Inc.; Chiesi Farmaceutici S.p.A.; Forest Research Institute, Inc.; GlaxoSmithKline; Grifols Therapeutics, Inc.; Ikaria, Inc.; Novartis Pharmaceuticals Corporation; Nycomed GmbH; ProterixBio; Regeneron Pharmaceuticals, Inc.; Sanofi; Sunovion; Takeda Pharmaceutical Company; and Theravance Biopharma and Mylan. The authors declare that support for the COPDGene and SPIROMICS studies include commercial funding, as described above. These commercial funders were not involved in the design, collection, analysis, or interpretation of data for the present study, the writing of this article, or the decision to submit it for publication.
The authors wish to thank Lee Olsen for assisting with manuscript preparation and editing. The authors thank the SPIROMICS participants and participating physicians, investigators and staff for making this research possible. More information about the study and how to access SPIROMICS data is available at www.spiromics.org. The authors would like to acknowledge the University of North Carolina at Chapel Hill BioSpecimen Processing Facility for sample processing, storage, and sample disbursements (http://bsp.web.unc.edu/). We would like to acknowledge the following current and former investigators of the SPIROMICS sites and reading centers: Neil E Alexis, MD; Wayne H Anderson, PhD; Mehrdad Arjomandi, MD; Igor Barjaktarevic, MD, PhD; R Graham Barr, MD, DrPH; Patricia Basta, PhD; Lori A Bateman, MSc; Surya P Bhatt, MD; Eugene R Bleecker, MD; Richard C Boucher, MD; Russell P Bowler, MD, PhD; Stephanie A Christenson, MD; Alejandro P Comellas, MD; Christopher B Cooper, MD, PhD; David J Couper, PhD; Gerard J Criner, MD; Ronald G Crystal, MD; JC; Claire M Doerschuk, MD; Mark T Dransfield, MD; Brad Drummond, MD; Christine M Freeman, PhD; CG, PhD; MH; Nadia N Hansel, MD, MPH; Annette T Hastie, PhD; Eric A EH, PhD; Yvonne Huang, MD; Robert J Kaner, MD; Richard E Kanner, MD; Eric C Kleerup, MD; Jerry A Krishnan, MD, PhD; Lisa M LaVange, PhD; Stephen C Lazarus, MD; FM; Deborah A Meyers, PhD; Wendy C Moore, MD; John D Newell Jr, MD; Robert Paine, III, MD; Laura Paulin, MD, MHS; Stephen P Peters, MD, PhD; Cheryl Pirozzi, MD; Nirupama Putcha, MD, MHS; Elizabeth C Oelsner, MD, MPH; Wanda K O’Neal, PhD; Victor E Ortega, MD, PhD; Sanjeev Raman, MBBS, MD; Stephen I. Rennard, MD; Donald P Tashkin, MD; J Michael Wells, MD; Robert A Wise, MD; and Prescott G Woodruff, MD, MPH. The project officers from the Lung Division of the National Heart, Lung, and Blood Institute were Lisa Postow, PhD, and Lisa Viviano, BSN.
Conflict of interest
WL reports personal fees from Konica Minolta and Continuing Education Alliance. JC reports grants from NHLBI and the COPD Foundation, during performance of this study; grants from NHLBI, NIAID, the Department of Veterans Affairs, and the Department of Defense, outside this study; consulting fees (paid to his institution) from AstraZeneca PLC and CLS Behring, LLC, outside this study; and personal fees from AstraZeneca Global, outside this study. CG is co-inventor of Parametric Response Mapping, which the University of Michigan has licensed to Imbio, LLC., and has a financial interest in Imbio, LLC. MH reports personal fees from GlaxoSmithKline, AstraZeneca, Boehringer Ingelheim, Cipla, Chiesi, Novartis, Pulmonx, Teva, Verona, Merck, Mylan, Sanofi, DevPro, Aerogen, Polarian, Regeneron, Amgen, UpToDate, Altesa Biopharma, Medscape, NACE, MDBriefcase and Integrity. She has received either in kind research support or funds paid to the institution from the NIH, Novartis, Sunovion, Nuvaira, Sanofi, AstraZeneca, Boehringer Ingelheim, Gala Therapeutics, Biodesix, the COPD Foundation and the American Lung Association. She has participated in Data Safety Monitoring Boards for Novartis and Medtronic with funds paid to the institution. She has received stock options from Meissa Vaccines and Altesa Biopharma. CH is employed by and has stock options in Imbio, Inc.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Bahadori, K., Fitzgerald, J. M., Levy, R. D., Fera, T., and Swiston, J. (2009). Risk factors and outcomes associated with chronic obstructive pulmonary disease exacerbations requiring hospitalization. Can. Respir. J. 16, e43–e49. doi:10.1155/2009/179263
Barjaktarevic, I. Z., Buhr, R. G., Wang, X., Hu, S., Couper, D., Anderson, W., et al. (2019). Clinical significance of bronchodilator responsiveness evaluated by forced vital capacity in COPD: SPIROMICS cohort analysis. Int. J. Chron. Obstruct Pulmon Dis. 14, 2927–2938. doi:10.2147/COPD.S220164
Bertens, L. C., Reitsma, J. B., Moons, K. G., van Mourik, Y., Lammers, J. W., Broekhuizen, B. D., et al. (2013). Development and validation of a model to predict the risk of exacerbations in chronic obstructive pulmonary disease. Int. J. Chron. Obstruct Pulmon Dis. 8, 493–499. doi:10.2147/COPD.S49609
Bhatt, S. P., Soler, X., Wang, X., Murray, S., Anzueto, A. R., Beaty, T. H., et al. (2016). Association between functional small airway disease and FEV1 decline in chronic obstructive pulmonary disease. Am. J. Respir. Crit. Care Med. 194, 178–184. doi:10.1164/rccm.201511-2219OC
Boueiz, A., Xu, Z., Chang, Y., Masoomi, A., Gregory, A., Lutz, S. M., et al. (2022). Machine learning prediction of progression in forced expiratory volume in 1 second in the COPDGene® study. Chronic Obstr. Pulm. Dis. 9, 349–365. doi:10.15326/jcopdf.2021.0275
Couper, D., Lavange, L. M., Han, M., Barr, R. G., Bleecker, E., Hoffman, E. A., et al. (2014). Design of the Subpopulations and intermediate outcomes in COPD study (SPIROMICS). Thorax 69, 491–494. doi:10.1136/thoraxjnl-2013-203897
Donaldson, G. C., Seemungal, T. A., Patel, I. S., Bhowmik, A., Wilkinson, T. M., Hurst, J. R., et al. (2005). Airway and systemic inflammation and decline in lung function in patients with COPD. Chest 128, 1995–2004. doi:10.1378/chest.128.4.1995
Estepar, R. S., Kinney, G. L., Black-Shinn, J. L., Bowler, R. P., Kindlmann, G. L., Ross, J. C., et al. (2013). Computed tomographic measures of pulmonary vascular morphology in smokers and their clinical implications. Am. J. Respir. Crit. Care Med. 188, 231–239. doi:10.1164/rccm.201301-0162OC
Garavand, A., Salehnasab, C., Behmanesh, A., Aslani, N., Zadeh, A. H., and Ghaderzadeh, M. (2022). Efficient model for coronary artery disease diagnosis: A comparative study of several machine learning algorithms. J. Healthc. Eng. 2022, 5359540. doi:10.1155/2022/5359540
Han, M. K., Kazerooni, E. A., Lynch, D. A., Liu, L. X., Murray, S., Curtis, J. L., et al. (2011). Chronic obstructive pulmonary disease exacerbations in the COPDGene study: Associated radiologic phenotypes. Radiology 261, 274–282. doi:10.1148/radiol.11110173
Han, M. K., Quibrera, P. M., Carretta, E. E., Barr, R. G., Bleecker, E. R., Bowler, R. P., et al. (2017). Frequency of exacerbations in patients with chronic obstructive pulmonary disease: An analysis of the SPIROMICS cohort. Lancet Respir. Med. 5, 619–626. doi:10.1016/S2213-2600(17)30207-2
Hasenstab, K. A., Yuan, N., Retson, T., Conrad, D. J., Kligerman, S., Lynch, D. A., et al. (2021). Automated CT staging of chronic obstructive pulmonary disease severity for predicting disease progression and mortality with a deep learning convolutional neural network. Radiol. Cardiothorac. Imaging 3, e200477. doi:10.1148/ryct.2021200477
Jain, S. S., Sarkar, I. N., Stey, P. C., Anand, R. S., Biron, D. R., and Chen, E. S. (2018). Using demographic factors and comorbidities to develop a predictive model for ICU mortality in patients with acute exacerbation COPD. AMIA Annu. Symp. Proc. 2018, 1319–1328.
Kiddle, S. J., Whittaker, H. R., Seaman, S. R., and Quint, J. K. (2020). Prediction of five-year mortality after COPD diagnosis using primary care records. PLoS One 15, e0236011. doi:10.1371/journal.pone.0236011
Kim, V., Desai, P., Newell, J. D., Make, B. J., Washko, G. R., Silverman, E. K., et al. (2014). Airway wall thickness is increased in COPD patients with bronchodilator responsiveness. Respir. Res. 15, 84. doi:10.1186/s12931-014-0084-3
Labaki, W. W., Gu, T., Murray, S., Hatt, C. R., Galban, C. J., Ross, B. D., et al. (2019). Voxel-wise longitudinal parametric response mapping analysis of chest computed tomography in smokers. Acad. Radiol. 26, 217–223. doi:10.1016/j.acra.2018.05.024
Leitao Filho, F. S., Mattman, A., Schellenberg, R., Criner, G. J., Woodruff, P., Lazarus, S. C., et al. (2020). Serum IgG levels and risk of COPD hospitalization: A pooled meta-analysis. Chest 158, 1420–1430. doi:10.1016/j.chest.2020.04.058
Martinez, F. J., Han, M. K., Allinson, J. P., Barr, R. G., Boucher, R. C., Calverley, P. M. A., et al. (2018). At the root: Defining and halting progression of early chronic obstructive pulmonary disease. Am. J. Respir. Crit. Care Med. 197, 1540–1551. doi:10.1164/rccm.201710-2028PP
Mathioudakis, A. G., Janssens, W., Sivapalan, P., Singanayagam, A., Dransfield, M. T., Jensen, J. S., et al. (2020). Acute exacerbations of chronic obstructive pulmonary disease: In search of diagnostic biomarkers and treatable traits. Thorax 75, 520–527. doi:10.1136/thoraxjnl-2019-214484
Miravitlles, M., Soler-Cataluña, J. J., Calle, M., and Soriano, J. B. (2013). Treatment of COPD by clinical phenotypes: Putting old evidence into clinical practice. Eur. Respir. J. 41, 1252–1256. doi:10.1183/09031936.00118912
Mohamed, H., de Jong, P. A., Lammers, J. W., Mali, W. P., Schmidt, M., de Koning, H. J., et al. (2015). Airway wall thickness associated with forced expiratory volume in 1 second decline and development of airflow limitation. Eur. Respir. J. 45, 644–651. doi:10.1183/09031936.00020714
Nakano, Y., Wong, J. C., de Jong, P. A., Buzatu, L., Nagao, T., Coxson, H. O., et al. (2005). The prediction of small airway dimensions using computed tomography. Am. J. Respir. Crit. Care Med. 171, 142–146. doi:10.1164/rccm.200407-874OC
Nikolaou, V., Massaro, S., Garn, W., Fakhimi, M., Stergioulas, L., and Price, D. B. (2021). Fast decliner phenotype of chronic obstructive pulmonary disease (COPD): Applying machine learning for predicting lung function loss. BMJ Open Respir. Res. 8, e000980. doi:10.1136/bmjresp-2021-000980
Pratte, K. A., Curtis, J. L., Kechris, K., Couper, D., Cho, M. H., Silverman, E. K., et al. (2021). Soluble receptor for advanced glycation end products (sRAGE) as a biomarker of COPD. Respir. Res. 22, 127. doi:10.1186/s12931-021-01686-z
Rabe, K. F., Hurd, S., Anzueto, A., Barnes, P. J., Buist, S. A., Calverley, P., et al. (2007). Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease: GOLD executive summary. Am. J. Respir. Crit. Care Med. 176, 532–555. doi:10.1164/rccm.200703-456SO
Rahaghi, F. N., Argemi, G., Nardelli, P., Dominguez-Fandos, D., Arguis, P., Peinado, V. I., et al. (2019). Pulmonary vascular density: Comparison of findings on computed tomography imaging with histology. Eur. Respir. J. 54, 1900370. doi:10.1183/13993003.00370-2019
Sieren, J. P., Newell, J. D., Barr, R. G., Bleecker, E. R., Burnette, N., Carretta, E. E., et al. (2016). SPIROMICS protocol for multicenter quantitative computed tomography to phenotype the lungs. Am. J. Respir. Crit. Care Med. 194, 794–806. doi:10.1164/rccm.201506-1208PP
Simmons, M. S., Connett, J. E., Nides, M. A., Lindgren, P. G., Kleerup, E. C., Murray, R. P., et al. (2005). Smoking reduction and the rate of decline in FEV(1): Results from the lung Health study. Eur. Respir. J. 25, 1011–1017. doi:10.1183/09031936.05.00086804
Singh, D., Criner, G. J., Dransfield, M. T., Halpin, D. M. G., Han, M. K., Lange, P., et al. (2021). InforMing the PAthway of COPD treatment (IMPACT) trial: Fibrinogen levels predict risk of moderate or severe exacerbations. Respir. Res. 22, 130. doi:10.1186/s12931-021-01706-y
Synn, A. J., Li, W., San Jose Estepar, R., Zhang, C., Washko, G. R., O'Connor, G. T., et al. (2019). Radiographic pulmonary vessel volume, lung function and airways disease in the Framingham Heart Study. Eur. Respir. J. 54, 1900408. doi:10.1183/13993003.00408-2019
Tanimura, K., Sato, S., Sato, A., Tanabe, N., Hasegawa, K., Uemasu, K., et al. (2020). Low serum free light chain is associated with risk of COPD exacerbation. ERJ Open Res. 6, 00288–02019. doi:10.1183/23120541.00288-2019
Weatherald, J., Montani, D., and Humbert, M. (2019). Seeing the forest for the (arterial) tree: Vascular pruning and the chronic obstructive pulmonary disease pulmonary vascular phenotype. Am. J. Respir. Crit. Care Med. 200, 406–408. doi:10.1164/rccm.201901-0248ED
Keywords: chronic obstructive pulmonary disease, machine learning, computed tomography, lung function decline, quantitative imaging
Citation: Wang JM, Labaki WW, Murray S, Martinez FJ, Curtis JL, Hoffman EA, Ram S, Bell AJ, Galban CJ, Han MK and Hatt C (2023) Machine learning for screening of at-risk, mild and moderate COPD patients at risk of FEV1 decline: results from COPDGene and SPIROMICS. Front. Physiol. 14:1144192. doi: 10.3389/fphys.2023.1144192
Received: 13 January 2023; Accepted: 03 April 2023;
Published: 21 April 2023.
Edited by:Sam Bayat, Université Grenoble Alpes, France
Reviewed by:Sandeep Bodduluri, University of Alabama at Birmingham, United States
Mustafa Ghaderzadeh, Shahid Beheshti University of Medical Sciences, Iran
Copyright © 2023 Wang, Labaki, Murray, Martinez, Curtis, Hoffman, Ram, Bell, Galban, Han and Hatt. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jennifer M. Wang, firstname.lastname@example.org