Application of Machine Learning in Pulmonary Function Assessment Where Are We Now and Where Are We Going?

Giri, Paresh C.; Chowdhury, Anand M.; Bedoya, Armando; Chen, Hengji; Lee, Hyun Suk; Lee, Patty; Henriquez, Craig; MacIntyre, Neil R.; Huang, Yuh-Chin T.

doi:10.3389/fphys.2021.678540

PERSPECTIVE article

Front. Physiol., 24 June 2021

Sec. Computational Physiology and Medicine

Volume 12 - 2021 | https://doi.org/10.3389/fphys.2021.678540

This article is part of the Research TopicArtificial Intelligence in Human PhysiologyView all 9 articles

Application of Machine Learning in Pulmonary Function Assessment Where Are We Now and Where Are We Going?

Hyun Suk Lee⁴

Neil R. MacIntyre²

¹Division of Pulmonary and Critical Care Medicine, Loma Linda University Medical Center, Loma Linda, CA, United States
²Division of Pulmonary, Allergy and Critical Care Medicine, Duke University Medical Center, Durham, NC, United States
³Department of Mechanical Engineering and Materials Science, Pratt School of Engineering, Duke University Medical Center, Durham, NC, United States
⁴Hartford HealthCare, Hartford, CT, United States

Analysis of pulmonary function tests (PFTs) is an area where machine learning (ML) may benefit clinicians, researchers, and the patients. PFT measures spirometry, lung volumes, and carbon monoxide diffusion capacity of the lung (DLCO). The results are usually interpreted by the clinicians using discrete numeric data according to published guidelines. PFT interpretations by clinicians, however, are known to have inter-rater variability and the inaccuracy can impact patient care. This variability may be caused by unfamiliarity of the guidelines, lack of training, inadequate understanding of lung physiology, or simply mental lapses. A rules-based automated interpretation system can recapitulate expert’s pattern recognition capability and decrease errors. ML can also be used to analyze continuous data or the graphics, including the flow-volume loop, the DLCO and the nitrogen washout curves. These analyses can discover novel physiological biomarkers. In the era of wearables and telehealth, particularly with the COVID-19 pandemic restricting PFTs to be done in the clinical laboratories, ML can also be used to combine mobile spirometry results with an individual’s clinical profile to deliver precision medicine. There are, however, hurdles in the development and commercialization of the ML-assisted PFT interpretation programs, including the need for high quality representative data, the existence of different formats for data acquisition and sharing in PFT software by different vendors, and the need for collaboration amongst clinicians, biomedical engineers, and information technologists. Hurdles notwithstanding, the new developments would represent significant advances that could be the future of PFT, the oldest test still in use in clinical medicine.

Introduction

The application of machine learning (ML) in medicine is burgeoning and its use in healthcare is potentially transformative (Deo, 2015; Rajkomar et al., 2019). ML is a sub-discipline of computer science in which computers “learn” from large quantities of data in order to find patterns without being explicitly programmed to do so (Sidey-Gibbons and Sidey-Gibbons, 2019). There are two main types of ML—supervised and unsupervised learning. Supervised learning occurs when inputs are chosen with the goal of predicting a known output or target. Typical examples are the development of predictive models that can identify suspicious pulmonary nodules on chest imaging (Uthoff et al., 2019), predict Framingham Risk Score (Chen et al., 2020), and interpret EKGs (Chang et al., 2020). In unsupervised learning, there are no outputs to predict. Instead, the goal is to find naturally occurring patterns or groupings within data. This method has been employed, for example, to identify different phenotypes of sepsis (Seymour et al., 2019). Analysis of pulmonary function test (PFT) is an area where ML, supervised and unsupervised, may benefit the clinicians and the patients (Mlodzinski et al., 2020). The ML methods that may be useful in the analysis of numeric and graphic data of PFTs are shown in the Table 1. To understand how, it is helpful to first examine the current pitfalls in PFT interpretation.

TABLE 1

Table 1. Machine learning methods that may be useful in the analysis of the numeric and graphic data of the pulmonary function tests.

PFT Interpretations Today Use Discrete Data Points and Are Guided by Rules-Based Algorithms

The most common pulmonary function tests (PFTs) are spirometry, lung volume determinations, and diffusing capacity assessments. These tests have two fundamental goals: (1) describe/categorize physiologic abnormalities in a subject and (2) quantify the magnitude of the abnormality. Physiologic abnormalities are conventionally grouped as obstructive ventilatory defects, restrictive ventilatory defects, and gas transfer defects, though multiple defects can be found in a single test. These are currently defined by whether certain discrete measurements are within a pre-determined normal range of values (“rules based” interpretations). The magnitude of the defect is quantified by either reporting a percent predicted of a reference value or else describing the degree of statistical deviation from the mean predicted value (i.e., z-scores). Importantly, PFT interpretation alone cannot diagnose disease states—this can only be accomplished by incorporating PFT results into the overall clinical picture.

Guidelines have been published to assist in interpretating PFTs (Pellegrino et al., 2005). PFT interpretations by clinicians, however, are known to have significant inter-rater variability (Miller et al., 2011; Holt et al., 2019; Topalovic et al., 2019b) and the inconsistency can potentially impact patient care (Enright, 2006; Holt et al., 2019). This variability is due to multiple factors: (1) disagreement among interpreters regarding the choice of reference values and cut-points, (2) actual quality of test performance (and the recognition of this by interpreters), (3) inherent uncertainty about the significance of borderline values, and (4) human errors from lack of training, inadequate understanding of lung physiology, or simply mental lapses.

A rules-based expert computerized interpretation system can help standardize interpretation criteria and address human factors thereby decreasing inter-rater variability and improving the quality and consistency of PFT interpretation. This is not a new concept. Indeed, automated algorithms developed as early as the 1980s sought to reproduce the assessments of an expert physician. For example, a rules-based expert system, PUFF, was developed for local use. It aimed to capture expertise knowledge in PFT interpretation and reduce the tedious work for the clinicians (Aikins et al., 1983). The agreement between PUFF and the physicians was excellent (89–96%), but the continued use of PUFF was hampered by incompatibility when it was modified for other network systems. Another interpretation program was developed using a least mean squares method to rapidly analyze the patient’s PFT data. It was able to select the “best interpretation statements” that were acceptable to a pulmonary physician in 90% of the patients (Krumpe et al., 1982). In one study, the pattern recognition of PFTs by pulmonologists matched the guidelines in about 75% of the cases (Topalovic et al., 2019a). An AI-based software perfectly matched the PFT pattern interpretations (100%) and assigned a correct diagnosis in 82% of all case (Topalovic et al., 2019a). Today, modern PFT systems often offer automated interpretive features based on recommendations such as those proposed by the ATS/ERS in 2005 (Pellegrino et al., 2005), but the features vary significantly and are not validated. Importantly, any rules-based expert computerized interpretation systems should not be considered a replacement for human input because there are still patient and technical factors that may affect the data’s suitability for interpretation by an algorithm.

Using ML to Go Beyond Simply Automating Interpretation Rules

The potential for using ML in PFT interpretation is expanding in several directions. First, ML is being used to better detect technical deficiencies and poor-quality data to avert algorithm misclassifications and alert the interpreters. Second, attempts are being made to combine PFT data with the clinical picture to better diagnose specific disease states. Third, and perhaps most exciting, ML is being used to analyze continuous data, not just discrete data points, to define new patterns of physiologic dysfunction and links to disease states. Finally, ML can be used to integrate PFT data into the realm of telehealth. These are discussed in detail below and outlined in the Figure 1.

FIGURE 1

Figure 1. The diagram shows how machine learning may be used in pulmonary function testing. The two main areas are to assist in the interpretation and to discover novel physiological biomarkers.

Improving Test Quality

Standard PFTs require properly calibrated equipment, standardized testing procedures, and cooperative patients. Current ERS/ATS standards define test quality using checklists filled out by the technologists. Checklists, however, are poor at assessing many of the nuances associated with good patient effort and subtle machine performance characteristics. ML techniques could aid in assessing the quality of the forced expiratory flow pattern, inert gas washout pattern, panting maneuvers, and breath-holds required during standard PFTs. These possibilities have yet to be developed in any practical fashion.

Incorporating PFT Patterns With Clinical Data to Formulate Diagnostic Possibilities

Although ML algorithms can be developed to learn to simply recapitulate the multitude of patterns already known to and used by experts, an exciting challenge is using ML to explore existing PFTs in conjunction with the available healthcare data to potentially uncover completely novel associations between PFT patterns and diseases. A simple example available today is a decision tree model that incorporates lung function and clinical variables to improve the accuracy for detecting common lung diseases including COPD, asthma, interstitial lung disease, and neuromuscular disorder, compared to using PFT data alone (Topalovic et al., 2017). This study included 968 new patients seen in a pulmonary practice. The pulmonary diagnoses of these patients were labeled based on the combination of PFT results and the physician’s assessment. It was found the ATS/ERS algorithm resulted in a correct diagnostic label in 38% of the patients. COPD had the highest positive predictive value (74%), whereas all other diseases were poorly identified. The decision tree algorithm improved the overall accuracy by ~ 2-fold (68%) with an improved positive predictive value for COPD (83%), asthma (66%), interstitial lung disease (52%), and neuromuscular disorder (100%). Another study that used a neuro-fuzzy system incorporating spirometric parameters (FEV₁, FVC, and FEV₁/FVC) and clinical symptoms was able to classify asthma and COPD with >99% accuracy (Badnjevic et al., 2015).

Analyzing Continuous Data

ML, supervised and unsupervised, can also be used to go beyond identifying patterns described by discrete data points and analyze continuous data or graphics. A prime example would be an analysis of the entire expiratory flow-volume loop (FVL). The traditional approach to assess the forced expiratory portion of the FVL uses a small number of discrete values, such as FEV₁, FVC, FEF₂₅_–₇₅_%, and the FEV₁/FVC ratio. FEV₁ is the most reproducible spirometric parameter reflecting bronchial caliber, yet it is unable to quantify ventilation inhomogeneity (Ross et al., 1992). FEF₂₅_–₇₅_% is quite variable and is not reliable for the diagnosis of small airway disease (the so-called “quiet zone”) (Quanjer et al., 2014; Malerba et al., 2016). Since the expiratory flow signals in a FVL reflect the sequential emptying properties of multiple lung units having different time constants as the lung volume decreases, analysis of the entire curve would be a better method to explore regional lung properties such as small airway function and mechanical (i.e., compliance and resistance) heterogeneity (Topalovic et al., 2014).

Numerous approaches to analyzing the FVL have been reported. An artificial neural network model that combined traditional spirometric measurements and area under the expiratory flow-volume curve differentiated well between normal, obstruction, restriction, and mixed impairments (Ioachimescu and Stoller, 2020). The square root of the area under the expiratory flow-volume curve plus FEV₁, FVC, and FEV₁/FVC z-scores could categorize ventilatory impairments with very low rates of misclassification (<9%) compared with standard classifications based on FVC, FEV₁/FVC, and TLC (Ioachimescu and Stoller, 2020). A fully convolutional neural network of flow-volume curves with CT scan as the output was more accurate in discriminating predominant emphysema/airway phenotypes in COPD (area under the curve [AUC] = 0.80) compared with traditional spirometry parameters (FEV₁ [area under the curve, or AUC = 0.71] and FEV₁/FVC [AUC = 0.70]) and random forest classifier (AUC = 0.78). The neural network was also better in discriminating predominant emphysema/small airway phenotypes (AUC 0.91) compared with FEV₁/FVC (AUC 0.80), FEV₁% predicted (AUC 0.83), and had similar accuracy to random forest classifier (AUC 0.90) (Bodduluri et al., 2020). Geometric analysis of the expiratory FVL demonstrated the concavity of the forced expiratory curve quantified by a slope-ratio was more prominent in asthmatic patients (1.35 ± 0.03) than normal subjects (0.90 ± 0.11) (Dominelli et al., 2016). The shape factors at 50 and 75% FVC and the slope ratio at 75% FVC improved after inhaled corticosteroid treatment (Kraan et al., 1989). The concavity of the expiratory limb of the spontaneous FVL measured by a rectangular area ratio obtained during exercise was inversely correlated with dynamic hyperinflation and exercise limitation in patients with COPD (Varga et al., 2016). The parameters in a second-order transfer function model of the flow dynamics during forced expiration (the two poles and the steady state gain) identified patients with COPD defined by the existing guidelines with high accuracy (88.2%) and may help in cases where FEV₁/FVC ratio-based diagnosis is uncertain (Topalovic et al., 2014). The multiscale computational modeling, which is the use of computers to simulate and study complex systems using mathematics, physics, and computer science, can be used to analyze expiratory flow in FVL and help evaluate obstructive lung diseases (Burrowes et al., 2014). Incorporating ML and computational modeling into the analysis of the entire FVL is a fruitful avenue for future research.

Evaluation of regional D_LCO may also be an application in which ML can be useful. Traditional approaches to measuring D_LCO, including single-breath, steady-state, and rebreathing methods, treat the lung as a single, well-mixed compartment and produce a single value of D_LCO that is taken to represent an average D_LCO at a given lung volume. Currently the single breath D_LCO measures are calculated by the changes in the concentration of CO relative to an inert gas in one alveolar sample in the expired gas (Graham et al., 2017). In fact, the entire exhalation curves of CO and inert gas after the breath hold can be modeled and analyzed with the help of ML. This could shed light on the non-uniform distribution of D_LCO due to the heterogeneity of blood flow and ventilation distribution that is best observed in slow exhalation, when D_LCO is found to decrease non-linearly from high to low lung volumes (Stokes et al., 1981; Huang et al., 2002). Like the FVL, ML analysis of the entire exhalation curve could improve insight into regional changes in D_LCO over single summary measurements, and provide earlier diagnosis of lung disease, such as emphysema and pulmonary vascular disease. Similarly, ML can also be used to analyze the nitrogen washout curve and data from forced oscillation technique. Such analyses potentially allow for the discovery of novel clinical and physiological phenotypes related to emptying properties of different lung regions (or ventilation heterogeneity) in obstructive lung diseases independent of traditional spirometry indices (Mikamo et al., 2013; Timmins et al., 2014).

Intergrating PFT Data Into Telehealth Applications

In the era of wearables and telehealth, remote respiratory monitoring for chronic respiratory patients has been proposed to reduce hospitalizations, improve self-care, and enhance health-related quality of life. The need for telemonitoring has become more essential with the COVID-19 pandemic restricting in-person pulmonary function testing (Kouri et al., 2020). Handheld spirometers are among the respiratory devices that are suitable for telemonitoring application. The spirometric data can be transmitted via a gateway (e.g., a smartphone) to a cloud repository site. ML may have increased application opportunities in screening and interpreting these mobile pulmonary function data. A smartphone game-based pulmonary function assessment has demonstrated good correlation with those measured by a spirometer in 34 stroke patients with intraclass correlation coefficients for FVC and FEV₁ of > 0.90 (Joo et al., 2018). FEV₁ and FVC measured by a smartphone-connected spirometer agreed with those measured by a conventional spirometer in pediatric patients with cystic fibrosis and asthma (Pearson correlation coefficients > 0.9 for FEV₁ and FVC), but only less than half of the tests were acceptable and reproducible according to the ATS/ERS criteria (Kruizinga et al., 2020). An automated mobile expert diagnostic telehealth system that consists of a spirometer, mobile application, and expert diagnostic system was able to diagnose asthma and COPD with high accuracy (>90%) in 780 patients at remote primary healthcare institutions and hospitals (Gurbeta et al., 2018). An ML algorithm combined with sociodemographic, clinical, and physiological telemonitoring data was better in predicting acute exacerbations of COPD than the two traditional symptoms-counting algorithms (AUC of 0.74 with the ML algorithm vs. 0.60 and 0.58 for the traditional algorithms) (Orchard et al., 2018). ML may be used to extract usable spirometry data from the mobile programs in the wearable devices. Finally, precision medicine can be delivered by tailoring analyses to an individual’s clinical profile and learning from the experience of a patient (Franssen et al., 2019).

Hurdles for Development of ML-Based PFT Programs

Despite the exciting potential for the use of ML in PFT, there are hurdles in the development and commercialization of the ML-assisted PFT interpretation programs. These include: (1) the need for high quality representative data, (2) the existence of inherent biases in historical data, (3) the need for development and constant update of validated endpoints on which to train ML models, (4) the existence of different formats for data acquisition and sharing in PFT software by different vendors, and (5) the need for collaboration amongst clinicians, biomedical engineers, and information technologists to acquire large data sets to develop and validate such algorithms. Our medical system is reluctant to entrust a machine with a task that a human can do, particularly due to the “black-box” nature of fully automated systems. In recognition of these hurdles and the need to regulate the development, production and monitoring of ML models in medical devices, the US Food and Drug Administration (FDA) in April 2019 published a discussion paper that proposed a framework for AI/ML-based Software as a Medical Device (SaMD) (USFDA, 2019). The FDA purports to risk stratify SaMDs based on the intended use of SaMD-derived information for healthcare decision making and the risk profile of the individual patient. Based on this stratification, ML-based PFT software could be used for “treating and diagnosing,” “driving clinical management” as well as “informing clinical management” in all tiers of risk, “critical,” “serious,” and “non-serious” (USFDA, 2019). The American Society of Mechanical Engineers has also introduced standards for the ML-based PFT software as an SaMD (V&V40) and machine learning (V&V70) (ASME, 2021). There are working groups on patient-specific models, computational modeling of medical devices, and machine learning under V&V40. This standard can be used by the practitioner as a framework to assess the device/software using sound engineering judgment. Such a nationally accepted framework would be key to addressing quality, liability, privacy, reimbursement, and regulatory issues of AI/ML-based software and may serve to provide an impetus for amalgamations of an integrated man-and-machine approach into daily medical practice.

Hurdles notwithstanding, there is significant potential for the use of ML in pulmonary function assessment. Future research should focus on how ML may improve, simplify, enhance, and expedite PFT interpretation, and integrate PFT parameters with imaging and clinical information to discover novel physiological markers that can enhance the diagnostic sensitivity and specificity of pulmonary diseases. All these developments would represent significant advances that could be the future of PFTs, the oldest test still in use in clinical medicine (MacIntyre, 2012).

Data Availability Statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.

Author Contributions

PG, AC, and Y-CH: develop the concept and write the draft and edit the manuscript. AC, AB, CH, HL, PL, and HC: provide discussion on the machine learning and edit the manuscript. PG, NM, and Y-CT: provide discussion on the pulmonary function test and edit the manuscript. All authors contributed to the article and approved the submitted version.

Funding

The work was supported by an internal fund from the Division of Pulmonary, Allergy and Critical Care Medicine, Department of Medicine, Duke University Medical Center.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Aikins, J. S., Kunz, J. C., Shortliffe, E. H., and Fallat, R. J. (1983). PUFF: an expert system for interpretation of pulmonary function data. Comput. Biomed. Res. 16, 199–208. doi: 10.1016/0010-4809(83)90021-6

CrossRef Full Text | Google Scholar

ASME (2021). Verification, Validation and Uncertainty Quantification (VVUQ). New York, NY: ASME.

Google Scholar

Badnjevic, A., Cifrek, M., Koruga, D., and Osmankovic, D. (2015). Neuro-fuzzy classification of asthma and chronic obstructive pulmonary disease. BMC Med. Inform. Decis. Mak. 15(Suppl. 3):S1. doi: 10.1186/1472-6947-15-S3-S1

PubMed Abstract | CrossRef Full Text | Google Scholar

Bodduluri, S., Nakhmani, A., Reinhardt, J. M., Wilson, C. G., McDonald, M. L., Rudraraju, R., et al. (2020). Deep neural network analyses of spirometry for structural phenotyping of chronic obstructive pulmonary disease. JCI Insight 5:e132781.

Google Scholar

Burrowes, K. S., Doel, T., and Brightling, C. (2014). Computational modeling of the obstructive lung diseases asthma and COPD. J. Transl. Med. 12(Suppl. 2):S5.

Google Scholar

Chang, A., Cadaret, L. M., and Liu, K. (2020). Machine learning in electrocardiography and echocardiography: technological advances in clinical cardiology. Curr. Cardiol. Rep. 22:161.

Google Scholar

Chen, Y. S., Cheng, C. H., Chen, S. F., and Jhuang, J. Y. (2020). Identification of the framingham risk score by an entropy-based rule model for cardiovascular disease. Entropy (Basel) 22:1406. doi: 10.3390/e22121406

PubMed Abstract | CrossRef Full Text | Google Scholar

Deo, R. C. (2015). Machine learning in medicine. Circulation 132, 1920–1930.

Google Scholar

Dominelli, P. B., Molgat-Seon, Y., Foster, G. E., Dominelli, G. S., Haverkamp, H. C., Henderson, W. R., et al. (2016). Quantifying the shape of maximal expiratory flow-volume curves in healthy humans and asthmatic patients. Respir. Physiol. Neurobiol. 220, 46–53. doi: 10.1016/j.resp.2015.09.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Enright, P. (2006). Flawed interpretative strategies for lung function tests harm patients. Eur. Respir. J. 27, 1322–1323. doi: 10.1183/09031936.06.00009006

PubMed Abstract | CrossRef Full Text | Google Scholar

Franssen, F. M., Alter, P., Bar, N., Benedikter, B. J., Iurato, S., Maier, D., et al. (2019). Personalized medicine for patients with COPD: where are we? Int. J. Chron. Obstruct. Pulmon Dis. 14, 1465–1484. doi: 10.2147/copd.s175706

PubMed Abstract | CrossRef Full Text | Google Scholar

Graham, B. L., Brusasco, V., Burgos, F., Cooper, B. G., Jensen, R., Kendrick, A., et al. (2017). 2017 ERS/ATS standards for single-breath carbon monoxide uptake in the lung. Eur. Respir. J. 49:1600016. doi: 10.1183/13993003.00016-2016

PubMed Abstract | CrossRef Full Text | Google Scholar

Gurbeta, L., Badnjevic, A., Maksimovic, M., Omanovic-Miklicanin, E., and Sejdic, E. (2018). A telehealth system for automated diagnosis of asthma and chronical obstructive pulmonary disease. J. Am. Med. Inform. Assoc. 25, 1213–1217. doi: 10.1093/jamia/ocy055

PubMed Abstract | CrossRef Full Text | Google Scholar

Holt, N. R., Thompson, B. R., Miller, B., and Borg, B. M. (2019). Substantial variation exists in spirometry interpretation practices for airflow obstruction in accredited lung function laboratories across Australia and New Zealand. Intern. Med. J. 49, 41–47. doi: 10.1111/imj.14047

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, Y. C., O’Brien, S. R., and MacIntyre, N. R. (2002). Intrabreath diffusing capacity of the lung in healthy individuals at rest and during exercise. Chest 122, 177–185. doi: 10.1378/chest.122.1.177

PubMed Abstract | CrossRef Full Text | Google Scholar

Ioachimescu, O. C., and Stoller, J. K. (2020). An alternative spirometric measurement. area under the expiratory flow-volume curve. Ann. Am. Thorac. Soc 17, 582–588. doi: 10.1513/annalsats.201908-613oc

PubMed Abstract | CrossRef Full Text | Google Scholar

Joo, S., Lee, K., and Song, C. (2018). A comparative study of smartphone game with spirometry for pulmonary function assessment in stroke patients. Biomed. Res. Int. 2018:2439312.

Google Scholar

Kouri, A., Gupta, S., Yadollahi, A., Ryan, C. M., Gershon, A. S., To, T., et al. (2020). Addressing reduced laboratory-based pulmonary function testing during a pandemic. Chest 158, 2502–2510. doi: 10.1016/j.chest.2020.06.065

PubMed Abstract | CrossRef Full Text | Google Scholar

Kraan, J., van der Mark, T. W., and Koeter, G. H. (1989). Changes in maximum expiratory flow-volume curve configuration after treatment with inhaled corticosteroids. Thorax 44, 1015–1021. doi: 10.1136/thx.44.12.1015

PubMed Abstract | CrossRef Full Text | Google Scholar

Kruizinga, M. D., Essers, E., Stuurman, F. E., Zhuparris, A., van Eik, N., Janssens, H. M., et al. (2020). Technical validity and usability of a novel smartphone-connected spirometry device for pediatric patients with asthma and cystic fibrosis. Pediatr. Pulmonol. 55, 2463–2470. doi: 10.1002/ppul.24932

PubMed Abstract | CrossRef Full Text | Google Scholar

Krumpe, P., Weigt, G., Martinez, N., Marcum, R., and Cummiskey, J. M. (1982). Computerized rapid analysis of pulmonary function test: use of a least mean squares correlation for interpretation of data. Comput. Biol. Med. 12, 295–307. doi: 10.1016/0010-4825(82)90033-6

CrossRef Full Text | Google Scholar

MacIntyre, N. R. (2012). The future of pulmonary function testing. Respir Care 57, 154–161. doi: 10.4187/respcare.01422

PubMed Abstract | CrossRef Full Text | Google Scholar

Malerba, M., Radaeli, A., Olivini, A., Damiani, G., Ragnoli, B., Sorbello, V., et al. (2016). Association of FEF25-75% impairment with bronchial hyperresponsiveness and airway inflammation in subjects with asthma-like symptoms. Respiration 91, 206–214. doi: 10.1159/000443797

PubMed Abstract | CrossRef Full Text | Google Scholar

Mikamo, M., Shirai, T., Mori, K., Shishido, Y., Akita, T., Morita, S., et al. (2013). Predictors of phase III slope of nitrogen single-breath washout in COPD. Respir. Physiol. Neurobiol. 189, 42–46. doi: 10.1016/j.resp.2013.06.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Miller, M. R., Quanjer, P. H., Swanney, M. P., Ruppel, G., and Enright, P. L. (2011). Interpreting lung function data using 80% predicted and fixed thresholds misclassifies more than 20% of patients. Chest 139, 52–59. doi: 10.1378/chest.10-0189

PubMed Abstract | CrossRef Full Text | Google Scholar

Mlodzinski, E., Stone, D. J., and Celi, L. A. (2020). Machine learning for pulmonary and critical care medicine: a narrative review. Pulm Ther. 6, 67–77. doi: 10.1007/s41030-020-00110-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Orchard, P., Agakova, A., Pinnock, H., Burton, C. D., Sarran, C., Agakov, F., et al. (2018). Improving prediction of risk of hospital admission in chronic obstructive pulmonary disease: application of machine learning to telemonitoring data. J. Med. Internet Res. 20:e263. doi: 10.2196/jmir.9227

PubMed Abstract | CrossRef Full Text | Google Scholar

Pellegrino, R., Viegi, G., Brusasco, V., Crapo, R. O., Burgos, F., Casaburi, R., et al. (2005). Interpretative strategies for lung function tests. Eur. Respir. J. 26, 948–968. doi: 10.1183/09031936.05.00035205

PubMed Abstract | CrossRef Full Text | Google Scholar

Quanjer, P. H., Weiner, D. J., Pretto, J. J., Brazzale, D. J., and Boros, P. W. (2014). Measurement of FEF25-75% and FEF75% does not contribute to clinical decision making. Eur. Respir. J. 43, 1051–1058. doi: 10.1183/09031936.00128113

PubMed Abstract | CrossRef Full Text | Google Scholar

Rajkomar, A., Dean, J., and Kohane, I. (2019). Machine learning in medicine. N. Engl. J. Med. 380, 1347–1358.

Google Scholar

Ross, J., Bates, D. V., Dean, E., and Abboud, R. T. (1992). Discordance of airflow limitation and ventilatory inhomogeneity in asthma and cystic fibrosis. Clin. Invest. Med. 15, 97–102.

Google Scholar

Seymour, C. W., Kennedy, J. N., Wang, S., Chang, C. H., Elliott, C. F., and Xu, Z. (2019). Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 321, 2003–2017. doi: 10.1001/jama.2019.5791

PubMed Abstract | CrossRef Full Text | Google Scholar

Sidey-Gibbons, J. A. M., and Sidey-Gibbons, C. J. (2019). Machine learning in medicine: a practical introduction. BMC Med. Res. Methodol. 19:64. doi: 10.1186/s12874-019-0681-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Stokes, D. L., MacIntyre, N. R., and Nadel, J. A. (1981). Nonlinear increases in diffusing capacity during exercise by seated and supine subjects. J. Appl. Physiol. Respir. Environ. Exerc. Physiol. 51, 858–863. doi: 10.1152/jappl.1981.51.4.858

PubMed Abstract | CrossRef Full Text | Google Scholar

Timmins, S. C., Diba, C., Schoeffel, R. E., Salome, C. M., King, G. G., and Thamrin, C. (2014). Changes in oscillatory impedance and nitrogen washout with combination fluticasone/salmeterol therapy in COPD. Respir. Med. 108, 344–350. doi: 10.1016/j.rmed.2013.10.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Topalovic, M., Das, N., Burgel, P. R., Daenen, M., Derom, E., Haenebalcke, C., et al. (2019a). Artificial intelligence outperforms pulmonologists in the interpretation of pulmonary function tests. Eur. Respir. J. 53:1801660. doi: 10.1183/13993003.01660-2018

PubMed Abstract | CrossRef Full Text | Google Scholar

Topalovic, M., Das, N., and Janssens, W. (2019b). Artificial intelligence for pulmonary function test interpretation. Eur. Respir. J. 53:1900782. doi: 10.1183/13993003.00782-2019

PubMed Abstract | CrossRef Full Text | Google Scholar

Topalovic, M., Exadaktylos, V., Decramer, M., Troosters, T., Berckmans, D., and Janssens, W. (2014). Modelling the dynamics of expiratory airflow to describe chronic obstructive pulmonary disease. Med. Biol. Eng. Comput. 52, 997–1006. doi: 10.1007/s11517-014-1202-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Topalovic, M., Laval, S., Aerts, J. M., Troosters, T., Decramer, M., Janssens, W., et al. (2017). Automated interpretation of pulmonary function tests in adults with respiratory complaints. Respiration 93, 170–178. doi: 10.1159/000454956

PubMed Abstract | CrossRef Full Text | Google Scholar

USFDA (2019). Proposed Regulatory Framework for Modifications to Artificial Intelligence/Machine Learning (AI/ML)-Based Software as a Medical Device (SaMD). Silver Spring, MD: USFDA.

Google Scholar

Uthoff, J., Stephens, M. J., Newell, J. D. Jr., Hoffman, E. A., Larson, J., Koehn, N., et al. (2019). Machine learning approach for distinguishing malignant and benign lung nodules utilizing standardized perinodular parenchymal features from CT. Med. Phys. 46, 3207–3216. doi: 10.1002/mp.13592

PubMed Abstract | CrossRef Full Text | Google Scholar

Varga, J., Casaburi, R., Ma, S., Hecht, A., Hsia, D., Somfay, A., et al. (2016). Relation of concavity in the expiratory flow-volume loop to dynamic hyperinflation during exercise in COPD. Respir. Physiol. Neurobiol. 234, 79–84. doi: 10.1016/j.resp.2016.08.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: pulmonary function test, flow-volume loop, machine learning, artificial intelligence, spirometry, lung volumes, DLCO

Citation: Giri PC, Chowdhury AM, Bedoya A, Chen H, Lee HS, Lee P, Henriquez C, MacIntyre NR and Huang Y-CT (2021) Application of Machine Learning in Pulmonary Function Assessment Where Are We Now and Where Are We Going? Front. Physiol. 12:678540. doi: 10.3389/fphys.2021.678540

Received: 09 March 2021; Accepted: 02 June 2021;
Published: 24 June 2021.

Edited by:

Stefano Schena, Johns Hopkins Medicine, United States

Reviewed by:

Henggui Zhang, The University of Manchester, United Kingdom
Tinen Lee Iles, University of Minnesota Twin Cities, United States

Copyright © 2021 Giri, Chowdhury, Bedoya, Chen, Lee, Lee, Henriquez, MacIntyre and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuh-Chin T. Huang, eXVoY2hpbi5odWFuZ0BkdWtlLmVkdQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.