Skip to main content

ORIGINAL RESEARCH article

Front. Public Health, 09 May 2024
Sec. Aging and Public Health
This article is part of the Research Topic Aging-Related Sarcopenia and Frailty: Prevalence, Risk Factors and Prediction Models View all 11 articles

Physical frailty identification using machine learning to explore the 5-item FRAIL scale, Cardiovascular Health Study index, and Study of Osteoporotic Fractures index

Chen-Cheng Yang,,Chen-Cheng Yang1,2,3Po-Hong ChenPo-Hong Chen4Cheng-Hong YangCheng-Hong Yang5Chia-Yen DaiChia-Yen Dai3Kuei-Hau LuoKuei-Hau Luo3Tzu-Hua ChenTzu-Hua Chen6Hung-Yi Chuang,,
Hung-Yi Chuang3,7,8*Chao-Hung KuoChao-Hung Kuo9
  • 1Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung City, Taiwan
  • 2Department of Occupational and Environmental Medicine, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University, Kaohsiung City, Taiwan
  • 3Department of Occupational and Environmental Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung City, Taiwan
  • 4Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung City, Taiwan
  • 5Department of Information Management, Tainan University of Technology, Tainan, Taiwan
  • 6Department of Family Medicine, Kaohsiung Municipal Ta-tung Hospital, Kaohsiung Medical University, Kaohsiung City, Taiwan
  • 7Research Center for Precision Environmental Medicine, Kaohsiung Medical University, Kaohsiung City, Taiwan
  • 8Department of Public Health and Environmental Medicine, Kaohsiung Medical University, Kaohsiung City, Taiwan
  • 9Department of Internal Medicine, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University, Kaohsiung City, Taiwan

Background: Physical frailty is an important issue in aging societies. Three models of physical frailty assessment, the 5-Item fatigue, resistance, ambulation, illness and loss of weight (FRAIL); Cardiovascular Health Study (CHS); and Study of Osteoporotic Fractures (SOF) indices, have been regularly used in clinical and research studies. However, no previous studies have investigated the predictive ability of machine learning (ML) for physical frailty assessment. The aim was to use two ML algorithms, random forest (RF) and extreme gradient boosting (XGBoost), to predict these three physical frailty assessment models.

Materials and methods: Questionnaires regarding demographic characteristics, lifestyle habits, living environment, and physical frailty assessment were answered by 445 participants aged 60 years and above. The RF and XGBoost algorithms were used to assess their scores for the three physical frailty indices. Furthermore, feature importance and Shapley additive explanations (SHAP) were used to determine the important physical frailty factors.

Results: The XGBoost algorithm obtained higher accuracy for predicting the three physical frailty indices; the areas under the curve obtained by the XGBoost algorithm for the 5-Item FRAIL, CHS, and SOF indices were 0.84. 0.79, and 0.69, respectively. The feature importance and SHAP of the XGBoost algorithm revealed that systolic blood pressure, diastolic blood pressure, age, and body mass index play important roles in all three physical frailty models.

Conclusion: The XGBoost algorithm has a more accurate predictive rate than RF across all three physical frailty assessments. Thus, ML can be a useful tool for the early detection of physical frailty.

Introduction

Physical frailty has become an important issue in the geriatric population of super-aging societies. It is a condition wherein susceptibility to stressors increases, especially in the older adults population, (1) resulting in undesirable health consequences, such as falling, stroke, disability, hospitalization, institutionalization, and death (25). The prevalence of physical frailty ranges from 3.9–51.4%, (68) influenced by different nationalities, socioeconomic conditions, and, most importantly, the assessment tool. Currently, there is no gold-standard diagnostic tool for assessing physical frailty. Several assessments have been established, including Fried’s phenotype model (9) and the physical frailty index in Rockwood’s cumulative deficit model (10). These assessments help identify persons with physical frailty who are at high risk of adverse consequences and provide an opportunity to counteract the evolution of adverse sequelae (11).

Machine learning (ML), a subset of artificial intelligence (AI), is a method of self-learning to provide solutions (12, 13). According to scholars such as Arthur Samuel, ML provides computers with the ability to learn without explicit programming. Therefore, ML can be classified as a computer science (14). Nevertheless, ML algorithms can be classified as “supervised” or “non-supervised” (15). Supervised ML involves training the model on predictions of relationships between features and outputs from data, whereas non-supervised ML involves searching for relevant structures within a dataset (15). The advantage of supervised ML is that it can achieve a high classification rate using a large amount of labeled data (16). Random forest (RF), initially published by Breiman, is a non-parametric learning algorithm wherein classification results are determined through voting on multiple decision trees (17). It has the advantage of reducing outliers and is less susceptible to overfitting, resulting in higher classification accuracy in many applications (18). RF is widely used in mass spectrometry, soil mapping, eye-state estimation, and remote sensing imaging (19). The extreme gradient boosting (XGBoost) algorithm, proposed by Chen, (20) randomly selects subsets to iteratively fit a single predictor and obtain a minimized loss function, and introduces a stochastic gradient boosting procedure. Through regularization, Boost can reduce the risk of overfitting and improve generalisability (21). It has been applied to detect abnormal satellite engineering parameters, personal credit risk assessment, and urban water resources (22).

However, there are a limited number of studies on using ML for predicting health conditions of the older adults, and there are no studies on predicting their physical frailty status. We aimed to employ two supervised ML methods, RF and XGBoost, to explore three physical frailty assessment indices and construct prediction models. The physical frailty assessment indices were the 5-Item fatigue, resistance, ambulation, illness, and loss of weight (FRAIL) scale; Cardiovascular Health Study (CHS) index; and Study of Osteoporotic Fractures (SOF) index.

Materials and methods

Participants

The participants were included after obtaining informed consent and approval from the Institutional Review Board. We randomly selected community residents from three urban districts in Kaohsiung City, and randomly selected participants according to the proportion of the population over 60 years old. Participants were included to this study after informed consent. The inclusion criteria were: (1) aged 60 years and above, (2) ability to respond to a questionnaire, and (3) allowing for a physical assessment. The exclusion criteria were: (1) suffering from a mental disability or psychological disease, (2) unwillingness to provide informed consent and inability to cooperate with the study, and (3) acute hospitalization within the 3 months prior to the study. From April–October 2022, 445 participants were recruited for the study. This study was approved by the Kaohsiung Medical University Hospital Institutional Review Board [IRB number: KMUHIRB-E(I)-20220048].

Measurements and questionnaire

All the participants were assessed through one-to-one interviews. After they completed the questionnaire and physical frailty assessment, we obtained their demographic characteristics, including sex, age, living environment, education level, and smoking and drinking habits. Elementary school education or no education was considered “low education.” The participants’ past histories were documented using their medical records obtained from their National Health Insurance cards. Physical examinations of height, weight, and blood pressure were also performed. The assessment indices for physical frailty included the (1) 5-Item FRAIL, (23) (2) CHS (Fried’s Frailty Phenotype), (24) and (3) SOF (25). Two researchers independently entered the data and confirmed their accuracy.

Three tools for physical frailty assessment

The Geriatric Advisory Panel developed the 5-Item FRAIL scale, which comprises five items: (1) exhaustion, (2) weakness, (3) slowness while walking, (4) low activity, and (5) weight loss. Two items—fatigue and weight loss—were considered biological factors; another two—resistance and ambulation—were considered functional factors; and the last item was considered to involve deficit accumulation because of illness. The 5-Item FRAIL scale categorizes participants’ health statuses based on their scores as physical frail (3–5), physical pre-frail (1–2), and physical non-frail (0) (23).

The CHS index, a biological model of physical frailty, comprises five components: (1) unintentional weight loss, (2) feeling of exhaustion, (3) decreased physical activity, (4) slow walking speed, and (5) weakness, which are also used to classify health statuses based on scores as physical frail (3–5), physical pre-frail (1–2), and physical non-frail (0) (9, 24).

The SOF index comprises two factors with three components: (1) inability to complete five chair rises or suffering from weight loss, representing biological factors, and (2) reduced energy levels, representing a functional factor, which are also used to classify health statuses based on scores as physical frail (2–3), physical pre-frail (1), and physical non-frail (0) (25).

Machine learning

The RF algorithm, developed by Breiman in 2001, (17) is an ensemble learning bagging algorithm (26). RF involves random sampling of the original training dataset, creating a new classifier for each sample, (27) and voting on the results generated by each classifier. The result is determined by voting on the results generated by each classifier, and the category with the largest number of votes constitutes the final result (28). RF requires minimal pruning and has no overfitting risk. Furthermore, it has high tolerance for outliers and noise, high adaptability to new samples, and good stability. Therefore, RF is suitable for parallel computing, even for high-dimensional data, with faster training speed and higher computing performance (29). The RF decision tree is built by selecting a feature at the root node and partitioning the training dataset into subsets of values of the selected feature (30). The information gain (IG) for partitioning training data y into subsets (yi) is calculated as follows Equation (1):

IG=iyiyEyi    (1)

where Eyi is the entropy of setyiand is calculated as Equation (2):

Eyi=j=1nPjlog2Pj    (2)

The XGBoost algorithm, developed by Chen, (20) can be applied to handle regression and classification problems (31). It originated from the gradient boosting decision tree algorithm, which was modified to improve its generalisability and convergence rate (32). Boosting is an ensemble learning algorithm that converts weak classifier iterative learning into a strong classifier algorithm (32). It produces a new decision tree at each iteration based on the residuals of the previous one (33). XGBoost enhances the regularization of the loss function as a whole to create an objective function and improve the performance of the algorithm, (34) which is described in Equation (3).

Jθ=Lθ+Rθ    (3)

where θis the parameter for data training, L is the loss function, and R is the regularization. Because the decision tree is the base model, the output of model yî is an ensemble of k decision trees and is computed as follows Equation (4):

yî=i=1kfkχi,fkF    (4)

where χi is the ith sample in the training set and F is the decision tree value.

Loss function L is calculated as follows Equation (5):

L=inyîyi+kΩfk    (5)
Ωfk=ΥΤ+12λw2    (6)

where T is the number of trees in the leaf and w is the leaf weight in Equation (6).

Evaluation metrics

To evaluate the performances of the RF and XGBoost algorithms for classifying the participant assessments on the 5-Item FRAIL, CHS, and SOF indices into robust, pre-frail, and frail, we employed the common evaluation indicators for ML classification: Accuracy (Equation 7), Precision (Equation 8), Recall (Equation 9), and F1 score (Equation 10): (35).

Accuracy=TP+TNTP+FP+TN+FN×100    (7)
Precision=TPTP+FP×100    (8)
Recall=TPTP+FN×100    (9)
F1score=Precision×Recall×2Precision+Recall    (10)

Shapley additive explanations (SHAP)

SHAP, proposed by Lundberg and Lee in 2017, (36) is a framework for a unified interpretation of different ML prediction models (37). It is a Shapley value based on game theory (38) that explains the impact of each feature on an ML prediction (39). It is useful for both single- and full-feature interpretability; therefore, it can be used for the entire dataset to explain the influence of each feature on the prediction (39).

Statistics

Descriptive statistics were used to analyse the mean and dispersion of continuous variables, including age and physical frailty scores. Numbers and proportions were used to evaluate categorical variables such as sex, smoking, and alcohol consumption. Furthermore, the participants were divided into groups according to their physical frailty status. The scores for the 5-Item FRAIL, CHS, and SOF indices were classified into physical non-frail, physical pre-frail, and physical frail groups. Statistical analyses were performed using IBM SPSS version 20 and Python (version 3.8.8).

Results

Demographic characteristics

In total, 445 participants satisfied the inclusion criteria. They were classified into physical non-frail, physical pre-frail, and physical frail groups according to their scores on the three indices and their demographic characteristics were determined, as listed in Table 1. According to the 5-Item FRAIL scale, 196 (44.04%), 184 (41.35%), and 65 (14.61%) participants were classified into physical non-frail, physical pre-frail, and physical frail groups, respectively. According to the CHS index, 144 (32.36%), 145 (32.58%), and 156 (35.06%) participants were classified into physical non-frail, physical pre-frail, and physical frail groups, respectively. According to the SOF index, 230 (51.69%), 152 (34.16%), and 63 (14.15%) participants were classified into physical non-frail, physical pre-frail, and physical frail groups, respectively. The average age of the participants was 68.75 years, and their average body mass index (BMI) was 25.39. They comprised 163 men (36.63%) and 282 women (63.37%). Moreover, 38.20% (170) had low levels of education (elementary school only or no education), 9.66% (40) lived alone, 4.49% (19) were smokers, and 10.11% (41) consumed alcohol.

Table 1
www.frontiersin.org

Table 1. Demographic characteristics for model prediction according to the three physical frailty indices: 5-Item FRAIL, CHS, and SOF.

ML algorithms: RF and XGBoost

The XGBoost and RF predictions were compared based on accuracy, recall, precision, and F1 score. Compared with RF, XGBoost predicted the 5-Item FRAIL scale, CHS index, and SOF index with higher accuracy (Table 2). The receiver operating characteristic (ROC) curve was used to estimate model performance, with the ordinate and abscissa representing the frequencies of true and false positives, respectively. For the 5-Item FRAIL scale, the area under the ROC curve (AUC) of the RF algorithm was 0.78, and that of the XGBoost algorithm was 0.84, as shown in Figure 1A. For the CHS index, AUC of RF was 0.76, and that of the XGBoost was 0.79, as shown in Figure 1B. For the SOF index, AUC of RF was 0.62, and that of XGBoost was 0.69, as shown in Figure 1C. In summary, XGBoost had a better predictive ability than RF.

Table 2
www.frontiersin.org

Table 2. Scores for the three physical frailty indices, 5-Item FRAIL, CHS, and SOF predicted using RF and XGBoost.

Figure 1
www.frontiersin.org

Figure 1. Results of the two machine learning algorithms for the (A) 5-Item Fatigue, Resistance, Ambulation, Illness, and Loss of Weight (FRAIL) scale prediction, (B) Cardiovascular Health Study (CHS) index prediction, and (C) Study of Osteoporotic Fracture (SOF) index prediction.

Feature importance

Feature importance was determined using the XGBoost algorithm. The F-score indicates the number of times a feature is split during model training (42). The higher the score, the more important the feature and the greater its impact on the classification results (43). Figures 2AC show the feature importance in the 5-Item FRAIL, CHS, and SOF indices, respectively. In all three, systolic blood pressure, diastolic blood pressure, age, and BMI have the top four F-score values.

Figure 2
www.frontiersin.org

Figure 2. Feature importance for the (A) 5-Item FRAIL scale, (B) CHS index, and (C) SOF index.

SHAP

SHAP shows the contribution of important features across the dataset. The x-axis represents the Shapley value and the y-axis represents the important features in the dataset, which are sorted according to their Shapley values. In the SHAP graph, the red points indicate that the value of the data is higher, and blue points indicate that the value of the data is lower. Figure 3A shows the SHAP values of the top 20 features in the 5-Item FRAIL scale, wherein the eigenvalues of age, diastolic blood pressure, systolic blood pressure, and BMI all affected the predicted value to some extent, and polypharmacy showed a positive correlation, indicating that the larger the feature value, the higher its contribution to the prediction. Figure 3B shows the SHAP values of the top 20 characteristics of the CHS index, where the eigenvalues of age, diastolic blood pressure, systolic blood pressure, and BMI affect the predicted value to some extent, and polypharmacy and urology disorders are positively correlated, indicating that the characteristics with larger values contribute more to the model prediction. Figure 3C shows the SHAP values of the top 20 SOF features. The eigenvalues of age, systolic blood pressure, BMI, and diastolic blood pressure affected the predicted value.

Figure 3
www.frontiersin.org

Figure 3. SHAP values for the (A) 5-Item FRAIL scale, (B) CHS index, and (C) SOF index.

Post-stratification of HTN

Table 3 shown the proportion of HTN or non-HTN in the three frailty assessments. Compared with physical non-frail population, HTN take significantly larger proportion in physical frail population in all three assessment classifications.

Table 3
www.frontiersin.org

Table 3. The post-stratification of HTN and non-HTN in the three frailty assessments, 5-Item FRAIL scale, CHS index, and SOF index.

Discussion

To compare RF and XGBoost, the same data were used for the training and testing evaluation. Overall, XGBoost performed better than RF. A significant difference was observed between high recall and low precision, as shown in Table 2. The recall rate is calculated by dividing the true positives by anything that should have been predicted as positive. Precision refers to the number of actual positives among the positive predictions, and a high recall rate indicates that the number of false positives are low, which is generally desirable. In summary, the XGBoost algorithm achieved a better prediction rate.

The exceptional predictive accuracy of XGBoost compared to Random Forest is the result of several unique techniques and features integral to XGBoost’s approach. Notably, its Gradient Boosting Framework allows for systematic improvements in predictions by specifically addressing errors from previous training rounds, employing gradient descent to reduce loss with each new addition (40). Additionally, XGBoost incorporates a regularization term in its objective function, which serves to prevent overfitting by penalizing overly complex models, thus fostering more generalizable and robust predictions. It also employs a sophisticated tree pruning method, which ensures the retention of only the most beneficial structures. Furthermore, XGBoost’s built-in routine for handling missing values, which intelligently decides the best course of action to minimize loss, significantly enhances its predictive capabilities (44). These combined features not only enhance XGBoost’s efficiency but also establish it as a formidable tool in machine learning competitions and applications where prediction accuracy is paramount.

The uniqueness of this study is that it employed ML to explore and address the characteristics of physical frailty predictions. The RF algorithm is a widely used ML algorithm in many fields (41) and has high accuracy, robustness, and the ability to handle high-dimensional data (30). It has been applied to the Minnesota Multiphasic Personality Inventory scale, and resulted in better classification and prediction (45). The XGBoost algorithm is a new ensemble learning method with an excellent implementation performance. Compared to other classifiers, XGBoost is anti-overfitting, highly efficient, entails low computational cost, and has better generalisability and accuracy compared to other ML algorithms (46, 47). The XGBoost algorithm has been previously applied to mental health prediction. Six ML algorithms were used to predict mental health using electronic medical records, of which XGBoost obtained the highest AUC value (48). Therefore, ML, especially the XGBoost algorithm, is better for classification and prediction of the three physical frailty indices: 5-Item FRAIL, CHS, and SOF.

Our study suggests that the 5-Item FRAIL is more aligned or similar to the SOF Index when it comes to classifying individuals who are physically frail. This implies that both tools might share common criteria or assess similar aspects of frailty, making them more interchangeable or comparable for identifying frail individuals. When it comes to classifying physical pre-frailty, the CHS Index is said to be closer to the SOF Index (49). This means that for identifying individuals who are not fully frail but have some signs of frailty (pre-frail), the CHS Index and SOF Index might share more similarities or provide more consistent classifications compared to other combinations of indices or scales. The result implies a comparison of the effectiveness or similarity of different frailty assessment tools, which is crucial for research, clinical practice, and policy-making, as identifying and managing frailty can help improve quality of life, reduce healthcare costs, and delay or prevent the progression to disability.

In this study, we used the SHAP tool and XGBoost algorithm to determine feature importance for a better understanding of these predictors. Figures 2, 3 show that among the top 20 important features, the influences of age, diastolic blood pressure, systolic blood pressure, and BMI on the prediction of the ML algorithms can be clearly understood. This indicates that a higher age is associated with higher physical frailty. For glioma grading, Cheng et al. applied the deep neural network model and SHAP tool, which not only shows the importance of every feature on the outcome but also indicates the influences of the associations between features on the predictions (50). For patients with severe COVID-19 intubation, Fleuren et al. applied the SHAP and found predictors of extubation failure, including ventilatory settings, inflammatory parameters, neurological status, and BMI (51). Hathaway et al. (52) conducted supervised learning through SHAP by identifying the most relevant and novel cardiac biomarkers for forecasting diabetes mellitus development, and discovered that this approach may be a potential guideline for investigating disease pathogenesis and discovering novel biomarkers in the future. For predicting infant autopsy outcome, Booth et al. used three models for model training, including decision tree, RF, and gradient boosting. Fundamental data items associated with determining the medical cause of death, including the most important items, such as age at death and cardiovascular and respiratory histological findings, were recognized using model feature importance, with the XGBoost algorithm being the most effective (53). The SHAP method and its feature importance classification can further assist clinicians in expanding their knowledge of the fundamental mechanisms by which predictors affect the output of ML models for health outcomes.

In our study, hypertension is recognized as one of the important predictive factors in the frailty among older adults. Studies have shown that hypertension can contribute to the development of frailty by affecting cardiovascular health, leading to impairments in physical function and an increased risk of adverse health outcomes (54). Research by Fried et al. (9) in the criteria for frailty, highlight the relationship between hypertension and frailty, suggesting that managing hypertension could be crucial in preventing or mitigating frailty in the older population. Our study represents the first instance of utilizing ML techniques to explore this domain, and remarkably, we have found results that align closely with those of previous studies.

This study had several limitations. First, it was a cross-sectional study that could only demonstrate associations and not infer causality. Further longitudinal studies are required to determine the causality between the possible risk factors and physical frailty. Second, we used self-reported questionnaires, and the results may have been influenced by recall biases such as memory, mood, or cognition. Third, ML models require a large amount of historical data for training to ensure that the model is not biased, (55) and it must be combined with datasets from other medical institutions to improve their predictive ability (56), such as Goh’s study, which aim to develop a predictive model for bacteremia in septic patients using machine learning methods, analysing data from an emergency department (57). Fourthly, the economic factor, a critical determinant that could significantly influence physical frailty through insufficient access to nutrition and healthcare, was omitted from the machine learning models. This oversight highlights the necessity of integrating economic considerations into future research. Incorporating this factor into subsequent studies will allow for a more comprehensive analysis, potentially uncovering deeper insights into the dynamics between economic status and physical frailty. Fifth, because the dataset is inherently predictive, when the sample size is small, models may face challenges. One of these challenges is the high sensitivity to outliers, which may overly emphasize anomalies in the samples, leading the ML model to believe that these outliers have a greater impact (52). Due to limitations in the dataset, the model may overfit to the training data, especially when using derived models like classification trees. This means that during training, the model may generate a branch for each patient sample, and such a complex model may not generalize well to new, (58) unseen data because it overly caters to the details and noise in the training data. Furthermore, training ML models is costly, and stakeholders, such as governments and major hospitals, must be persuaded, trained, and educated on ML applications; therefore, the adoption of ML algorithms is another challenge. These issues must be addressed to obtain the optimal gains in predictive accuracy (55). In light of these limitations encountered in this study, there are several promising avenues for deepening future research. Primarily, undertaking longitudinal studies emerges as a critical next step to establish causality between risk factors and physical frailty, moving beyond the associations observed in a cross-sectional framework. Additionally, future studies should consider employing objective measures alongside or in place of self-reported questionnaires to mitigate the impact of recall bias and enhance the reliability of data. The integration of economic factors into ML models is another vital area for exploration, aiming to capture the nuanced impacts of socioeconomic status on physical frailty. This inclusion promises a more rounded analysis and could reveal intricate dynamics that have been previously overlooked. Expanding the datasets for ML training by incorporating data from a variety of medical institutions will also be crucial in improving the models’ predictive accuracy and reducing bias. Lastly, addressing the challenges related to the cost and complexity of ML model training, as well as fostering stakeholder engagement, are essential steps for the broader adoption and application of ML in healthcare research. These focused directions not only aim to rectify the limitations of the current study but also pave the way for more comprehensive and impactful future research on physical frailty.

Conclusion

This study demonstrated that two machine learning models are used for physical frailty assessing by the 5-item FRAIL scale, CHS index, and SOF index. XGBoost model is more precise predictive rate than RF model in all the three physical frailty models. Machine learning might be a useful instrument for early detection of physical frailty in the future. Furthermore, this study highlights the transformative potential of machine learning, especially the XGBoost algorithm’s efficacy in frailty assessments, for advancing early detection practices in healthcare. By integrating the XGBoost model, this research not only promises significant improvements in health care but also emphasizes the importance of such findings in informing health policy development. Furthermore, it offers practical guidance for healthcare professionals on leveraging these insights to enhance frailty management strategies for the aging population.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Kaohsiung Medical University Hospital Institutional Review Board. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

C-CY: Conceptualization, Data curation, Funding acquisition, Writing – original draft. P-HC: Formal analysis, Methodology, Software, Writing – original draft. C-HY: Conceptualization, Formal analysis, Methodology, Resources, Writing – review & editing. C-YD: Investigation, Writing – review & editing. K-HL: Formal analysis, Investigation, Writing – review & editing. T-HC: Investigation, Writing – review & editing. H-YC: Conceptualization, Formal analysis, Supervision, Writing – review & editing. C-HK: Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Science and Technology Council, R.O.C., Taiwan (grant number 112-2314-B-037-038), by Kaohsiung Medical University (grant number N-N112227, NSYSU-KMU-113-P25), by Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University (grant number R-11101, S-11103), and by Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University (grant number O-11217) from the Occupational Injury and Disease Diagnosis and Treatment Medical Institutions Accreditation Management Subsidy from the Occupational Safety and Health Administration of the Ministry of Labor, R.O.C., Taiwan.

Acknowledgments

We would like to thank all the participants, Kaohsiung Municipal Siaogang Hospital, and Kaohsiung Medical University.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

5-Item FRAIL, 5-Item fatigue, resistance, ambulation, illness and loss of weight; CHS, Cardiovascular Health Study; SOF, Study of Osteoporotic Fractures; RF, Random Forest; XGBoost, extreme gradient boosting; SHAP, Shapley additive explanations.

References

1. Xue, QL. The frailty syndrome: definition and natural history. Clin Geriatr Med. (2011) 27:1–15. doi: 10.1016/j.cger.2010.08.009

PubMed Abstract | Crossref Full Text | Google Scholar

2. Makizako, H, Shimada, H, Doi, T, Tsutsumimoto, K, and Suzuki, T. Impact of physical frailty on disability in community-dwelling older adults: a prospective cohort study. BMJ Open. (2015) 5:e008462. doi: 10.1136/bmjopen-2015-008462

PubMed Abstract | Crossref Full Text | Google Scholar

3. Burton, JK, Stewart, J, Blair, M, Oxley, S, Wass, A, Taylor-Rowan, M, et al. Prevalence and implications of frailty in acute stroke: systematic review & meta-analysis. Age Ageing. (2022) 51:afac064. doi: 10.1093/ageing/afac064

PubMed Abstract | Crossref Full Text | Google Scholar

4. Boyd, CM, Xue, QL, Simpson, CF, Guralnik, JM, and Fried, LP. Frailty, hospitalization, and progression of disability in a cohort of disabled older women. Am J Med. (2005) 118:1225–31. doi: 10.1016/j.amjmed.2005.01.062

Crossref Full Text | Google Scholar

5. McKenzie, K, Ouellette-Kuntz, H, and Martin, L. Frailty as a predictor of institutionalization among adults with intellectual and developmental disabilities. Intellect Dev Disabil. (2016) 54:123–35. doi: 10.1352/1934-9556-54.2.123

Crossref Full Text | Google Scholar

6. Siriwardhana, DD, Hardoon, S, Rait, G, Weerasinghe, MC, and Walters, KR. Prevalence of frailty and prefrailty among community-dwelling older adults in low-income and middle-income countries: a systematic review and meta-analysis. BMJ Open. (2018) 8:e018195. doi: 10.1136/bmjopen-2017-018195

PubMed Abstract | Crossref Full Text | Google Scholar

7. O'Caoimh, R, Galluzzo, L, Rodríguez-Laso, Á, Van der Heyden, J, Ranhoff, AH, Lamprini-Koula, M, et al. Prevalence of frailty at population level in European ADVANTAGE joint action member states: a systematic review and meta-analysis. Ann Ist Super Sanita. (2018) 54:226–38. doi: 10.4415/ann_18_03_10

PubMed Abstract | Crossref Full Text | Google Scholar

8. Coelho-Junior, HJ, Marzetti, E, Picca, A, Calvani, R, Cesari, M, and Uchida, MC. Prevalence of Prefrailty and frailty in South America: a systematic review of observational studies. J Frailty Aging. (2020) 9:197–213. doi: 10.14283/jfa.2020.22

PubMed Abstract | Crossref Full Text | Google Scholar

9. Fried, LP, Tangen, CM, Walston, J, Newman, AB, Hirsch, C, Gottdiener, J, et al. Frailty in older adults: evidence for a phenotype. J Gerontol A Biol Sci Med Sci. (2001) 56:M146–57. doi: 10.1093/gerona/56.3.m146

Crossref Full Text | Google Scholar

10. Mitnitski, AB, Mogilner, AJ, and Rockwood, K. Accumulation of deficits as a proxy measure of aging. ScientificWorldJournal. (2001) 1:323–36. doi: 10.1100/tsw.2001.58

PubMed Abstract | Crossref Full Text | Google Scholar

11. Walston, J, Buta, B, and Xue, QL. Frailty screening and interventions: considerations for clinical practice. Clin Geriatr Med. (2018) 34:25–38. doi: 10.1016/j.cger.2017.09.004

PubMed Abstract | Crossref Full Text | Google Scholar

12. Habib, M, Wang, Z, Qiu, S, Zhao, H, and Murthy, AS. Machine learning based healthcare system for investigating the association between depression and quality of life. IEEE J Biomed Health Inform. (2022) 26:2008–19. doi: 10.1109/jbhi.2022.3140433

PubMed Abstract | Crossref Full Text | Google Scholar

13. Dormosh, N, Schut, MC, Heymans, MW, Maarsingh, O, Bouman, J, van der Velde, N, et al. Predicting future falls in older people using natural language processing of general practitioners' clinical notes. Age Ageing. (2023) 52:afad046. doi: 10.1093/ageing/afad046

PubMed Abstract | Crossref Full Text | Google Scholar

14. Painuli, D, Mishra, D, Bhardwaj, S, and Aggarwal, M. Forecast and prediction of COVID-19 using machine learning. Data Sci COVID-19. (2021):381–97. doi: 10.1016/B978-0-12-824536-1.00027-7

Crossref Full Text | Google Scholar

15. Chen, K, and Wang, S. Semi-supervised learning via regularized boosting working on multiple semi-supervised assumptions. IEEE Trans Pattern Anal Mach Intell. (2011) 33:129–43. doi: 10.1109/tpami.2010.92

Crossref Full Text | Google Scholar

16. Zhang, W, Pan, Z, and Hu, Y. Exploring PolSAR images representation via self-supervised learning and its application on few-shot classification. IEEE Geosci Remote Sens Lett. (2022) 19:1–5. doi: 10.1109/LGRS.2022.3198135

Crossref Full Text | Google Scholar

17. Breiman, L. Random forests. Mach Learn. (2001) 45:5–32. doi: 10.1023/A:1010933404324

Crossref Full Text | Google Scholar

18. Guo, Q, Zhang, J, Guo, S, Ye, Z, Deng, H, Hou, X, et al. Urban tree classification based on object-oriented approach and random Forest algorithm using unmanned aerial vehicle (UAV) multispectral imagery. Remote Sens. (2022) 14:3885. doi: 10.3390/rs14163885

Crossref Full Text | Google Scholar

19. Han, T, Jiang, D, Zhao, Q, Wang, L, and Yin, K. Comparison of random forest, artificial neural networks and support vector machine for intelligent diagnosis of rotating machinery. Trans Inst Meas Control. (2018) 40:2681–93. doi: 10.1177/0142331217708242

Crossref Full Text | Google Scholar

20. Budholiya, K, Shrivastava, SK, and Sharma, V. An optimized XGBoost based diagnostic system for effective prediction of heart disease. J King Saud Univ Comput Inf Sci. (2022) 34:4514–23. doi: 10.1016/j.jksuci.2020.10.013

Crossref Full Text | Google Scholar

21. Zhou, Y, Xue, J, Chen, S, Zhou, Y, Liang, Z, Wang, N, et al. Fine-resolution mapping of soil Total nitrogen across China based on weighted model averaging. Remote Sens. (2020) 12:85. doi: 10.3390/rs12010085

Crossref Full Text | Google Scholar

22. Li, X, Wu, X, Sun, M, Yang, S, and Song, W. A novel intelligent leakage monitoring-warning system for sustainable rural drinking water supply. Sustain For. (2022) 14:6079. doi: 10.3390/su14106079

Crossref Full Text | Google Scholar

23. Abellan van Kan, G, Rolland, YM, Morley, JE, and Vellas, B. Frailty: toward a clinical definition. J Am Med Dir Assoc. (2008) 9:71–2. doi: 10.1016/j.jamda.2007.11.005

Crossref Full Text | Google Scholar

24. Chan, DC, Tsou, HH, Yang, RS, Tsauo, JY, Chen, CY, Hsiung, CA, et al. A pilot randomized controlled trial to improve geriatric frailty. BMC Geriatr. (2012) 12:58. doi: 10.1186/1471-2318-12-58

PubMed Abstract | Crossref Full Text | Google Scholar

25. Ensrud, KE, Ewing, SK, Taylor, BC, Fink, HA, Cawthon, PM, Stone, KL, et al. Comparison of 2 frailty indexes for prediction of falls, disability, fractures, and death in older women. Arch Intern Med. (2008) 168:382–9. doi: 10.1001/archinternmed.2007.113

PubMed Abstract | Crossref Full Text | Google Scholar

26. Sun, J, Yu, H, Zhong, G, Dong, J, Zhang, S, and Yu, H. Random Shapley forests: cooperative game-based random forests with consistency. IEEE Trans Cybern. (2022) 52:205–14. doi: 10.1109/tcyb.2020.2972956

PubMed Abstract | Crossref Full Text | Google Scholar

27. Amasyali, MF, and Ersoy, OK. Classifier ensembles with the extended space Forest. IEEE Trans Knowl Data Eng. (2014) 26:549–62. doi: 10.1109/TKDE.2013.9

Crossref Full Text | Google Scholar

28. Azar, AT, Elshazly, HI, Hassanien, AE, and Elkorany, AM. A random forest classifier for lymph diseases. Comput Methods Prog Biomed. (2014) 113:465–73. doi: 10.1016/j.cmpb.2013.11.004

Crossref Full Text | Google Scholar

29. Zhang, L, Liu, K, Wang, Y, and Omariba, ZB. Ice detection model of wind turbine blades based on random Forest classifier. Energies. (2018) 11:2548. doi: 10.3390/en11102548

Crossref Full Text | Google Scholar

30. Haque, N, Jamshed, A, Chatterjee, K, and Chatterjee, S. Accurate sensing of power transformer faults from dissolved gas data using random Forest classifier aided by data clustering method. IEEE Sensors J. (2022) 22:5902–10. doi: 10.1109/jsen.2022.3149409

Crossref Full Text | Google Scholar

31. Weldegebriel, HT, Liu, H, Haq, AU, Bugingo, E, and Zhang, D. A new hybrid convolutional neural network and eXtreme gradient boosting classifier for recognizing handwritten Ethiopian characters. IEEE Access. (2020) 8:17804–18. doi: 10.1109/ACCESS.2019.2960161

Crossref Full Text | Google Scholar

32. Jiang, Y, Tong, G, Yin, H, and Xiong, N. A pedestrian detection method based on genetic algorithm for optimize XGBoost training parameters. IEEE Access. (2019) 7:118310–21. doi: 10.1109/ACCESS.2019.2936454

Crossref Full Text | Google Scholar

33. Xie, J, Li, Z, Zhou, Z, and Liu, S. A novel bearing fault classification method based on XGBoost: the fusion of deep learning-based features and empirical features. IEEE Trans Instrum Meas. (2021) 70:1–9. doi: 10.1109/TIM.2020.3042315

Crossref Full Text | Google Scholar

34. Mahmud, SMH, Chen, W, Jahan, H, Liu, Y, Sujan, NI, and Ahmed, S. iDTi-CSsmoteB: identification of drug–target interaction based on drug chemical structure and protein sequence using XGBoost with over-sampling technique SMOTE. IEEE Access. (2019) 7:48699–714. doi: 10.1109/ACCESS.2019.2910277

Crossref Full Text | Google Scholar

35. Aldhyani, THH, Alsubari, SN, Alshebami, AS, Alkahtani, H, and Ahmed, ZAT. Detecting and analyzing suicidal ideation on social media using deep learning and machine learning models. Int J Environ Res Public Health. (2022) 19:12635. doi: 10.3390/ijerph191912635

PubMed Abstract | Crossref Full Text | Google Scholar

36. Lundberg, SM, and Lee, S-I. A unified approach to interpreting model predictions. Adv Neural Inf Proces Syst. (2017) 30:4768–77.

Google Scholar

37. Wang, M, Zheng, K, Yang, Y, and Wang, X. An explainable machine learning framework for intrusion detection systems. IEEE Access. (2020) 8:73127–41. doi: 10.1109/ACCESS.2020.2988359

Crossref Full Text | Google Scholar

38. Singh, P, and Sharma, A. Interpretation and classification of arrhythmia using deep convolutional network. IEEE Trans Instrum Meas. (2022) 71:1–12. doi: 10.1109/TIM.2022.3204316

Crossref Full Text | Google Scholar

39. Alwarthan, S, Aslam, N, and Khan, IU. An explainable model for identifying at-risk student at higher education. IEEE Access. (2022) 10:107649–68. doi: 10.1109/ACCESS.2022.3211070

Crossref Full Text | Google Scholar

40. Friedman, JH. Greedy function approximation: a gradient boosting machine. Ann Stat. (2001) 29:1189–232. doi: 10.1214/aos/1013203451

Crossref Full Text | Google Scholar

41. Dong, L, Xing, L, Liu, T, Du, H, Mao, F, Han, N, et al. Very high resolution remote sensing imagery classification using a fusion of random Forest and deep learning technique—subtropical area for example. IEEE J Sel Top Appl Earth Obs. (2019) 13:113–28. doi: 10.1109/JSTARS.2019.2953234

Crossref Full Text | Google Scholar

42. Li, N, Li, B, and Gao, L. Transient stability assessment of power system based on XGBoost and factorization machine. IEEE Access. (2020) 8:28403–14. doi: 10.1109/ACCESS.2020.2969446

Crossref Full Text | Google Scholar

43. Gu, X, Han, Y, and Yu, J. A novel lane-changing decision model for autonomous vehicles based on deep autoencoder network and XGBoost. IEEE Access. (2020) 8:9846–63. doi: 10.1109/ACCESS.2020.2964294

Crossref Full Text | Google Scholar

44. Chen, T, and Guestrin, C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; (2016) San Francisco, California, USA

Google Scholar

45. Kim, S, Lee, HK, and Lee, K. Screening of mood symptoms using MMPI-2-RF scales: an application of machine learning techniques. J Pers Med. (2021) 11:812. doi: 10.3390/jpm11080812

PubMed Abstract | Crossref Full Text | Google Scholar

46. Zhang, W, Zhao, X, and Li, Z. A comprehensive study of smartphone-based indoor activity recognition via Xgboost. IEEE Access. (2019) 7:80027–42. doi: 10.1109/ACCESS.2019.2922974

Crossref Full Text | Google Scholar

47. He, S, Li, B, Peng, H, Xin, J, and Zhang, E. An effective cost-sensitive XGBoost method for malicious URLs detection in imbalanced dataset. IEEE Access. (2021) 9:93089–96. doi: 10.1109/ACCESS.2021.3093094

Crossref Full Text | Google Scholar

48. Półchłopek, O, Koning, NR, Büchner, FL, Crone, MR, Numans, ME, and Hoogendoorn, M. Quantitative and temporal approach to utilising electronic medical records from general practices in mental health prediction. Comput Biol Med. (2020) 125:103973. doi: 10.1016/j.compbiomed.2020.103973

PubMed Abstract | Crossref Full Text | Google Scholar

49. Malmstrom, TK, Miller, DK, and Morley, JE. A comparison of four frailty models. J Am Geriatr Soc. (2014) 62:721–6. doi: 10.1111/jgs.12735

PubMed Abstract | Crossref Full Text | Google Scholar

50. Cheng, J, Gao, M, Liu, J, Yue, H, Kuang, H, Liu, J, et al. Multimodal disentangled Variational autoencoder with game theoretic interpretability for glioma grading. IEEE J Biomed Health Inform. (2022) 26:673–84. doi: 10.1109/jbhi.2021.3095476

PubMed Abstract | Crossref Full Text | Google Scholar

51. Fleuren, LM, Dam, TA, Tonutti, M, de Bruin, DP, Lalisang, RCA, Gommers, D, et al. Predictors for extubation failure in COVID-19 patients using a machine learning approach. Crit Care. (2021) 25:448. doi: 10.1186/s13054-021-03864-3

PubMed Abstract | Crossref Full Text | Google Scholar

52. Hathaway, QA, Roth, SM, Pinti, MV, Sprando, DC, Kunovac, A, Durr, AJ, et al. Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics. Cardiovasc Diabetol. (2019) 18:78. doi: 10.1186/s12933-019-0879-0

PubMed Abstract | Crossref Full Text | Google Scholar

53. Booth, J, Margetts, B, Bryant, W, Issitt, R, Hutchinson, C, Martin, N, et al. Machine learning approaches to determine feature importance for predicting infant autopsy outcome. Pediatr Dev Pathol. (2021) 24:351–60. doi: 10.1177/10935266211001644

PubMed Abstract | Crossref Full Text | Google Scholar

54. Xu, Q, Ou, X, and Li, J. The risk of falls among the aging population: a systematic review and meta-analysis. Front Public Health. (2022) 10:902599. doi: 10.3389/fpubh.2022.902599

PubMed Abstract | Crossref Full Text | Google Scholar

55. Mohd Zebaral Hoque, J, Ab Aziz, NA, Alelyani, S, Mohana, M, and Hosain, M. Improving water quality index prediction using regression learning models. Int J Environ Res Public Health. (2022) 19:13702. doi: 10.3390/ijerph192013702

PubMed Abstract | Crossref Full Text | Google Scholar

56. Goh, V, Chou, Y-J, Lee, C-C, Ma, M-C, Wang, WYC, Lin, C-H, et al. Predicting bacteremia among septic patients based on ED information by machine learning methods: a comparative study. Diagnostics. (2022) 12:2498. doi: 10.3390/diagnostics12102498

PubMed Abstract | Crossref Full Text | Google Scholar

57. Samuel, AL. Some studies in machine learning using the game of checkers. IBM J Res Dev. (1959) 3, 44:207–26.

Google Scholar

58. Shanthini, A, Vinodhini, G, Chandrasekaran, RM, and Supraja, P. A taxonomy on impact of label noise and feature noise using machine learning techniques. Soft Comput. (2019) 23:8597–607. doi: 10.1007/s00500-019-03968-7

Crossref Full Text | Google Scholar

Keywords: machine learning, physical frailty, model, prediction, XGBoost

Citation: Yang C-C, Chen P-H, Yang C-H, Dai C-Y, Luo K-H, Chen T-H, Chuang H-Y and Kuo C-H (2024) Physical frailty identification using machine learning to explore the 5-item FRAIL scale, Cardiovascular Health Study index, and Study of Osteoporotic Fractures index. Front. Public Health. 12:1303958. doi: 10.3389/fpubh.2024.1303958

Received: 28 September 2023; Accepted: 22 March 2024;
Published: 09 May 2024.

Edited by:

Xiaolei Liu, Sichuan University, China

Reviewed by:

Pedro Miguel Gaspar, University of Trás-os-Montes and Alto Douro, Portugal
Mack Shelley, Iowa State University, United States

Copyright © 2024 Yang, Chen, Yang, Dai, Luo, Chen, Chuang and Kuo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hung-Yi Chuang, ericch@kmu.edu.tw

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.