Use of Machine Learning and Statistical Algorithms to Predict Hospital Length of Stay Following Colorectal Cancer Resection: A South African Pilot Study

Achilonu, Okechinyere J.; Fabian, June; Bebington, Brendan; Singh, Elvira; Nimako, Gideon; Eijkemans, Rene M. J. C.; Musenge, Eustasius

doi:10.3389/fonc.2021.644045

ORIGINAL RESEARCH article

Front. Oncol., 01 October 2021

Sec. Surgical Oncology

Volume 11 - 2021 | https://doi.org/10.3389/fonc.2021.644045

Use of Machine Learning and Statistical Algorithms to Predict Hospital Length of Stay Following Colorectal Cancer Resection: A South African Pilot Study

Okechinyere J. Achilonu^1*

June Fabian^2,3

Brendan Bebington^3,4

Elvira Singh^1,5

Gideon Nimako^1,6

Rene M. J. C. Eijkemans⁷

Eustasius Musenge¹

¹Division of Epidemiology and Biostatistics, School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
²Medical Research Council/Wits University Rural Public Health and Health Transitions Research Unit (Agincourt), School of Public Health, Faculty of Health Sciences, University of Witwatersrand, Johannesburg, South Africa
³Wits Donald Gordon Medical Centre, School of Clinical Medicine, Faculty of Health Sciences, University of Witwatersrand, Johannesburg, South Africa
⁴Department of Surgery, Faculty of Health Science, University of the Witwatersrand Faculty of Science, Parktown, Johannesburg, South Africa
⁵National Cancer Registry, National Health Laboratory Service, Johannesburg, South Africa
⁶Industrialization, Science, Technology and Innovation Hub, African Union Development Agency (AUDA-NEPAD), Johannesburg, South Africa
⁷Julius Center for Health Sciences and Primary Care, University Medical Center, Utrecht University, Utrecht, Netherlands

The aim of this pilot study was to develop logistic regression (LR) and support vector machine (SVM) models that differentiate low from high risk for prolonged hospital length of stay (LOS) in a South African cohort of 383 colorectal cancer patients who underwent surgical resection with curative intent. Additionally, the impact of 10-fold cross-validation (CV), Monte Carlo CV, and bootstrap internal validation methods on the performance of the two models was evaluated. The median LOS was 9 days, and prolonged LOS was defined as greater than 9 days post-operation. Preoperative factors associated with prolonged LOS were a prior history of hypertension and an Eastern Cooperative Oncology Group score between 2 and 4. Postoperative factors related to prolonged LOS were the need for a stoma as part of the surgical procedure and the development of post-surgical complications. The risk of prolonged LOS was higher in male patients and in any patient with lower preoperative hemoglobin. The highest area under the receiving operating characteristics (AU-ROC) was achieved using LR of 0.823 (CI = 0.798–0.849) and SVM of 0.821 (CI = 0.776–0.825), with each model using the Monte Carlo CV method for internal validation. However, bootstrapping resulted in models with slightly lower variability. We found no significant difference between the models across the three internal validation methods. The LR and SVM algorithms used in this study required incorporating important features for optimal hospital LOS predictions. The factors identified in this study, especially postoperative complications, can be employed as a simple and quick test clinicians may flag a patient at risk of prolonged LOS.

1 Introduction

Surgical resection remains the principal treatment modality for patients with colorectal cancer (CRC), and the primary aim is to cure the disease (1–3). However, the economic burden of CRC treatment from presentation to post-surgery supportive care is high, requiring more cost-effective management plans that will benefit the patient or the healthcare providers (1, 4). South Africa has a two-tiered healthcare system that includes a national health insurance system servicing approximately 20% of the population and a state health system servicing the remaining majority of the population (5). Irrespective of the health sector to which patients present for care, there is a need to develop prediction models that might identify those at increased risk of prolonged hospitalization during their treatment for CRC. Length of stay (LOS) is an easily accessible indicator to measure resource utilization, which speaks to performance and efficiency. A prolonged LOS impacts resource allocation and has been associated with increased risk of several postoperative complications, contracting hospital infections, and hospital readmission (6, 7). An accurate prognosis prediction of LOS is desirable for healthcare management, hospital resource utilization, successful treatment, and discharge planning, especially in low- to middle-income countries such as South Africa. Once a prognosis model is established, efforts can be directed toward identifying risk factors to reduce hospital LOS.

Hospital LOS greater than the mean or median has been used to define prolonged LOS (6, 8–10). Due to variations in patient care and management or response to treatment, the median as a central tendency is consistently and considerably used as a better indicator of LOS than the mean. In any of these measures, several predictors of LOS have been identified, which vary across studies. Factors such as patient age at diagnosis and surgical complications have been consistently recognized in most studies. There is an increase in the use of traditional statistical approaches, such as logistic regression model in predicting LOS (4, 6–9). However, in a multifactorial prediction, detecting interactions and assessing the combination of statistically significant predictors may be challenging with standard statistical procedures. Studies have reported that a more reliable and improved prognosis prediction is achievable using machine learning (ML) and artificial intelligence approaches (11, 12).

Francis et al. (10) investigated the use of a multilayered perceptron neural network (MLPNN) to predict delayed discharge and readmission after CRC surgical resection. The dataset consists of 275 patients who were scheduled for laparoscopic surgery between 2002 and 2009. A median LOS greater than 6 days was used to define prolonged hospital LOS (10). The MLPNN model achieved an area under the receiving operating characteristics (AU-ROC) of 0.817, which was slightly higher when compared with that of logistic regression (AU-ROC = 0.807) using a split-sample method. Independent validation with an insufficient sample size has been shown to be misleading in many studies (13, 14). The study of Francis et al. (10) failed to report the confidence interval of the AU-ROC estimates, which made it difficult to measure the uncertainties in the performance estimates of the model. A study by Stoean et al. (15) estimated LOS using the ensemble of support vector machine (SVM), neural network, logistic regression (LR), and decision tree algorithms. A total of 368 patients were analyzed, and the length of stay was divided into three categories. Using random cross-validation (CV) with 30 repeats, the authors showed the highest accuracy of 73.14 ± 4.37, achieved by the ensemble approach.

Internal validation refers to a validation based on the test data from a similar population (16). Internal validation methods such as CV and bootstrap aim to provide more accurate estimates of the performance of a predictive model as compared to the split-sample method (13). CV is a sophisticated resampling approach and has become the standard procedure in estimating the internal validity of a predictive model. However, studies have shown that, in some settings, the bootstrap method outperforms CV (13, 17). With this in mind, we differentiated our study from previous studies by comparing the efficiency of the repeated 10-fold CV, Monte Carlo CV, and bootstrap (0.632 method with replacement) methods for predictive SVM and LR models. Our overall aim was to develop classifiers to distinguish short from prolonged hospital LOS and identify previously unrecognized features that influence hospital LOS. Prediction models for hospital LOS for CRC patients undergoing surgery in South Africa have not been developed. Such models can contribute valuable information to healthcare providers that would, ideally, enhance the care of affected patients and improve the efficiency of healthcare provision.

2 Material and Methods

2.1 Study Data

The dataset was extracted from the 2015–2019 CRC in South Africa (CRCSA) study, a multi-ethnic urban cohort study conducted in Johannesburg, South Africa. The CRCSA study aimed to improve local statistical reporting and the clinical management of patients with CRC. In total, 716 adult patients were recruited during the study period. The methodology of the CRCSA study has been detailed in a prior publication (18). Of the total sample (n = 716), we extracted 383 patients undergoing surgical resection with curative intent, irrespective of whether the surgical method was laparoscopic or open. These patients underwent different surgical procedures and were grouped into segmental colectomies, major resections, and others, which was included as a variable in the predictive analysis. Palliative surgery and surgery for local and distant metastatic disease were not considered in this study. Ethical approval for this study was obtained from the Human Research Ethics Committee (Medical) of the University of the Witwatersrand, Johannesburg, South Africa (M1911131).

Four hospitals from the University of Witwatersrand Academic Teaching Hospital complex were included in the CRCSA study, namely, Wits Donald Gordon Medical Centre (WDGMC), a private academic teaching hospital, Charlotte Maxeke Johannesburg Academic Hospital (CMJAH), Chris Hani Baragwanath Academic Hospital (CHBAH), and Edenvale Hospital. Of these, the former three hospitals function as tertiary referral centers, while the latter, Edenvale Hospital, functions as a secondary treatment center. All the patients in public hospitals were grouped as a new variable “hospital” with two categories, “public” and “private.” Patients treated at WDGMC were categorized as “private,” and those receiving care at CMJAH, CHBAH, and Edenvale Hospitals were categorized as “public.” Some of the patients, especially in the public hospitals, experienced longer waiting times for surgery after hospital admission. Hence, the primary outcome variable (LOS) was based on the number of days spent in the hospital following surgery. LOS was defined in days as the interval between the day of surgery and the day of hospital discharge. A prolonged hospital LOS was defined as LOS that exceeded 9 days, which is the median LOS in the CRC study. Clinical data captured in the CRCSA study were based on the literature and clinical domain knowledge. The clinical information included socio-economic and demographic characteristics, family history, laboratory and clinical testing, and medical and surgical histories (Table 1). Data pre-processing included feature engineering and imputation of missing values.

TABLE 1

Table 1 Features assessed in the prediction models.

Overall, 83% of CRC patients on the CRCSA database had completed records with no missing information (Figure 1). For those patients with incomplete observations, the missingness within each variable was not related to its value or any other variable in the database. We identified 25 out of 69 variables with one or more missing value(s). The proportion of missing values for each variable with missing records was computed. The variable pre-surgical hemoglobin (Hb) had the highest proportion of missing values, with about 5% missingness. Little’s missing completely at random (MCAR) test (19) demonstrated that missingness was completely at random (p = 0.304) (Figure 1). The MissForest imputation method (20) was used to replace missing values. MissForest is a non-parametric method of imputation based on the random forest algorithm (21). The out-of-bag errors estimated by the MissForest method were 0.04 for the continuous variables and 0.16 for the categorical variables imputed in this study, thus validating the reliability of the MissForest method.

FIGURE 1

Figure 1 Missing data pattern and proportion in the CRCSA study.

2.2 Choice of Classifier

2.2.1 Logistic Regression

We considered a classification problem of separating a set of training samples belonging to two classes: in this study context, short LOS or prolonged LOS.

\begin{array}{l} (x_{i}, y_{i}), x_{i} \in R^{n}, y_{i} \in {- 1, + 1}, i = 1, 2, \dots, n & (1) \end{array}

Where x_i is an n-dimensional real valued features that belong to either one of the two classes (y_i ∈{–1, + 1}). The objective is to define a function [f(x) = y] that can correctly classify patients into one of the two classes based on the feature vector. LR is a statistical technique in which the response variable (y) has a binomial distribution (10, 15). Given a set of features x_i, LR regression determines the membership probability for one of the two classes using

\begin{array}{l} P = \frac{e^{β + β_{i} x_{i}}}{1 + e^{β + β_{i} x_{i}}}, & (2) \end{array}

where β = (β₁, β₂, …, β_m) ∈R^m and are determined by maximum-likelihood estimation. LR has gained popularity in predicting hospital LOS.

2.2.2 Support Vector Machine

SVM is a machine learning algorithm introduced by Vapnik (22). Its application has been promoted in different studies due to its capacity to perform classification and regression based on statistical learning theory and structural risk minimization. Also, it has the ability to handle high-dimensional datasets and linear and nonlinear problems with high performance accuracy (15). Considering the example in Equation 1, where the classes are linearly separable, SVM finds a maximum or optimal hyperplane that gives the greatest separation between the positive and the negative classes (between short LOS and prolonged LOS). A separating hyperplane is defined as w.x + b = 0, where w is the weight vector, which denotes the orientation of the hyperplane, and b denotes bias term. A set of hyperplane margins (||w||^–1) are maximized based on a Lagrangian multiplier (α_i) to identify the optimal margin. The classifier function can be expressed as:

\begin{array}{l} f (x) = sgn (\sum_{i}^{n} y_{i}, α_{i} x . x_{i} + b) & (3) \end{array}

for data that are linearly separable. In this study, we identified that the nonlinear (radial basis function) variant of SVM was more appropriate. In this case, the input features are mapped into a high-dimensional space and the optimal margin is constructed using the radial kernel function, k(x₁, x_j) = exp(–||x_i – x_j||²/2σ²). Hence, the classifier function can be expressed as:

\begin{array}{l} f (x) = sgn (\sum_{i}^{n} y_{i}, α_{i} k x . x_{i} + b) & (4) \end{array}

2.3 Model Development and Validation for Prognosis Prediction

We developed predictive models using LR and SVM (with radial basis kernel) with the “caret package” implemented in the R software. The least absolute shrinkage and selection operator (LASSO) was used to select the most informative features (23). LASSO, which is a penalized regression method, greatly depends on the choice of the tuning parameter (λ) to select the optimal model. The LASSO regression formulation is defined as:

\begin{array}{l} {minimize}_{β} \sum_{i = 1}^{n} {(y_{i} - β_{0} - \sum_{j = 1}^{p} B_{j} X_{i j})}^{2} + λ \sum_{j = 1}^{p} | β_{j} | & (5) \end{array}

where β denotes the regression coefficients and λ the tuning parameter. The objective of tuning a ML hyperparameter is to limit model overfitting because it would lead to poor performance on unseen data. The hyperparameter λ was tuned by using an internal 10-fold CV (repeated 30 times), Monte Carlo CV, and bootstrap (both of which were repeated 10 times). To optimize the choice of the hyperparameter, grid search over λ = {0.01,0.038 by 0.0001} was performed. The hyperparameter value resulting in the highest AU-ROC was selected as the best λ for the final model. Features with non-zero coefficients in the LASSO model were identified as the most relevant features and are used as input features for the LR and SVM models. In addition, the ranking of the feature importance according to each specified model was visualized. For the SVM classifier, the hyperparameters were tuned using the three validation methods specified above for the LASSO model. The values of the radial basis function kernel width (σ) and the loss penalty term (C) were searched over the ranges of σ = {0.0005,0.005,0.045,0.05,0.08,0.01,0.10,1.00} and C = {0.1,0.5,1.0,1.50,1.60,1.65,1.89,1.95,2.00}. Using the default parameters, we fitted the LR models with the three validation methods, as stated previously. The average fit of each classifier per model fitted was used to determine its performance. For the two classifiers used in this study (LR and SVM), a total of six models were developed based on the three internal validation methods.

2.4 Evaluation

The ROC and calibration plots were used to evaluate the performance of the LR and SVM models across the different resampling methods. These methods offer ways to visualize the quality of a prediction model (24). ROC is used to characterize the performance of a predictive model across a set of possible thresholds between the sensitivity and specificity of the model (24). The probability values of a classifier are retrieved, and a point is specified. Values higher than the threshold are classified as positive (prolonged LOS); otherwise, they are classified as negative (short LOS). The higher the performance, the better the algorithm discriminates patients into the outcome classes. A perfect classifier would yield a point at the 0,1 of the ROC space (AU-ROC = 1.0), while a line of no discrimination would yield points at the diagonal. A calibration plot is a line plot that shows the agreement between the actual outcome and the predicted outcome given by the model. For instance, if the LR model predicts a 40% risk of a prolonged hospital LOS for a patient in this study, it is expected that approximately 40 out of 100 patients with such prediction should have the observed frequency of prolonged LOS. A perfect calibrated plot should have a line along the 45 line; hence, the closer the points to the diagonal, the more reliable are the model predictions. The evaluation plots were estimated using the pROC, caret, and ggplots packages in R software. The Wilcoxon signed-rank test was used to conduct a pairwise comparison of the LR and SVM models to examine whether the differences in the estimates of AU-ROC are statistically significant. Two-sided tests were employed, and a p-value <0.05 was considered statistically significant. This non-parametric test has been used in several studies for the comparison of predictive models (25). We also compared the sensitivity and specificity of the models to evaluate the model performance further. The R source codes for this study have been deposited in the Github platform to reproduce the study models (https://github.com/KechJay/HLOS_LS).

3 Results

A total of 383 patients who underwent CRC resection between 2015 and 2020 were studied. Of these, 53.5% had an open surgical procedure, 38.1% had a complete laparoscopic procedure, and 8.4% had a laparoscopic-assisted procedure or a laparoscopic procedure that was converted to an open procedure. The mean age of the 383 patients was 58 (±12.9) years, with equal proportions of men and women. Most patients were of self-reported white (46.9%) and black (36.8%) ethnicity. In total, 202 (52.7%) patients were treated in a private facility and 181 (47.3%) patients were treated in public facilities. Figure 2 shows no consistent pattern in the distribution of the median LOS for private and public hospitals across the study period, and there was no significant difference in LOS when comparing private and public facilities (OR = 0.77, p = 0.214). Overall, when the admissions were pooled over the study period, the median LOS was 9 days, with 53% of patients spending more than 9 days in the hospital. The median LOS values for open and non-open surgical procedures were the same (9 days), 9 days for elective surgery, and 8 days for non-elective surgery.

FIGURE 2

Figure 2 Bar plot illustrating the median hospital length of stay across the study period and study sites.

The LASSO feature selection method identified a set of eight features, which were consistent across the three validation methods. The selected features were used as input to the LR and SVM models. Table 2 provides the distribution of the input features across the outcome variable (hospital LOS). According to the LR model, most of the predictive features are significantly associated with the risk of prolonged LOS at a 5% significant level (Table 2). The risk of prolonged LOS is reduced with female patients and patients with increased hemoglobin levels (measured preoperatively). Other factors, such as having a postoperative complication, stoma, and patients presenting with a history of hypertension, increased the risk of prolonged LOS. For instance, the results showed that a postoperative complication was a major significant risk of a protracted hospital LOS. Of the patients with prolonged LOS, 69% had postoperative complications compared with 31.1% of patients without postoperative complications. Results from the LR model showed that patients with postoperative complications had approximately 14 times the odds of experiencing prolonged LOS compared with patients without postoperative complications. The relevance of these features is depicted in Figure 3. According to LR and SVM, a postoperative complication was the most relevant feature in this study. Most of the features were ranked higher in LR compared to that in SVM. Tumor grade differentiation and anesthetic grading assessment are the least ranked features based on the two models.

TABLE 2

Table 2 Factors predicting prolonged length of stay (LOS) in colorectal cancer (CRC) patients after surgical resection.

FIGURE 3

Figure 3 Variable importance ranked by LR and SVM.

The mean, standard deviation, and confidence intervals of the predictive models with the three validation methods are shown in Table 3. We also generated the ROC curve to visualize the predictive performance of the LR and SVM models (Figure 4). In these figures, the Monte Carlo method appears to demonstrate a slightly higher performance than that of the other validation methods in each predictive model, with the AU-ROCs reaching 82.3% and 82.1% for LR and SVM, respectively. However, it is known that a narrow confidence interval gives a narrower uncertainty for the ROC estimate (a more precise estimate). The bootstrap method resulted in a narrower confidence interval compared to that of the other methods in the two models. Nonetheless, we cannot claim that any of the methods is preferred because, for each internal validation technique, the LR and SVM models showed similarities in performance, and these hold for all the summary statistics. As expected, the Wilcoxon signed-ranked test showed no significant difference between the models across the different internal validation methods. The SVM model with the bootstrap validation method had the highest sensitivity (81.8%), which indicates that 81.8% of the patients were correctly classified into the prolonged hospital LOS class. The SVM with the 10-fold CV showed the most heightened sensitivity of 72%, although with the highest standard error.

TABLE 3

Table 3 Average prediction results and standard deviations obtained by the logistic regression (LR) and support vector machine (SVM) models.

FIGURE 4

Figure 4 Comparison of area under the receiver operating characteristic curves (AU-ROC) across the resampling methods for (A) LR and (B) SVM.

Furthermore, the calibration plots (Figures 5A, B) demonstrated that the two models showed considerable agreement between the observed and predicted probabilities across the three internal validation methods. If the points are above the diagonal line, the predicted probabilities are minor; otherwise, they are too large compared to the observed probabilities. The LR model fitted with the bootstrap method appeared to have good calibration, except at the bottom left and right, where the model under- and over-predicted the probabilities. However, the SVM model with the Monte Carlo method seemed to be better calibrated than the SVM with other methods. Overall, the models showed little deviation, thus confirming good calibration.

FIGURE 5

Figure 5 Comparison of the calibration plots across the resampling methods for (A) LR and (B) SVM.

A further analysis was conducted to determine factors that also predisposed patients to postoperative complications. Eight variables were identified by the LASSO method and were used to fit an LR model. The risk factors for postoperative complications are summarized in Table 4. Factors such as surgical type, the use of chemotherapy treatment, and having a pre-therapeutic or intraoperative complication increased the chances of a postoperative complication.

TABLE 4

Table 4 Determinants of postoperative complications based on logistic regression.

4 Discussion

CRC is one of the most common cancers affecting South Africans, and models that improve the care delivered to patients and simultaneously enhance efficiency for service providers are desperately needed (15, 26). In this pilot study, the first of its kind for South Africa, we used machine learning models to determine the median LOS and investigated the factors impacting LOS for those with CRC undergoing surgical resection with curative intent.

Kelly et al. (6) noted that the median LOS values varied across studies and depend on the healthcare systems. We have acknowledged the longer median LOS observed in this study compared to other studies with shorter median LOS in higher-income countries. Lack of screening, the delay in diagnosis and treatment, inaccessibility to ICU bed postoperatively, and the inability to implement the enhanced recovery after surgery (ERAS) protocol may account for the longer median LOS observed in this study. Therefore, we assume that it is logical to extract the LOS from the study data because it is specific to the study population and speaks to the colorectal cancer healthcare in the studied region.

We investigated the ability of LR and SVM to predict hospital LOS and the effects of repeated 10-fold CV, Monte Carlo CV, and bootstrap internal validation methods on the models. Our study showed that these two models could effectively predict hospital LOS with high AU-ROC. Previous studies on LOS using machine learning procedures have shown that the accuracy in predicting prolonged LOS for patients undergoing CRC resection can be improved using ML procedures (10, 13, 14). In addition, ML encourages the reproducibility and generalizability of the developed model. It is noteworthy that the LR and SVM models reached predictive accuracy values of 0.823 and 0.821, respectively. These are slightly higher than that achieved by the MLPNN method used in Francis et al. (10). Our model accuracy measures (79% and 77%) for LR and SVM are higher than that achieved in the study by Stoean et al. (15). In the studies of Francis et al. (10) and Stoean et al. (15), other models were shown to outperform LR. Our study showed good performance with LR, even higher when compared with those of MLPNN and other models (results not shown). This indicates that performance may depend on the study data and the modeling procedure. We found no significant difference between the results of the two models used in this study; however, models with bootstrap followed by Monte Carlo CV methods resulted in minimum variability compared to those that used repeated 10-fold CV.

This study showed that the identified predictive risk of prolonged hospital LOS relates primarily to patient-related factors. Anemia in CRC increases blood transfusion risk during surgery and consequently prolongs hospital LOS (8, 27, 28). Preoperative hemoglobin was tested in the CRCSA study. Although this variable was modeled on a constant level, the relationship between this variable and those requiring blood transfusion at the time of surgery was unknown. However, the findings of this study correlated with those of previous studies (8, 27, 28). Several studies support the relationship between the requirement for a stoma and risk of prolonged hospital LOS (10, 27, 29). Furthermore, stoma type and length are also associated with prolonged hospital LOS (30). We found no significant impact of stoma type on prolonging LOS; however, having stoma formation compared to not having stoma formation increases the odds of extending LOS to about 2.5 times. As seen in other published studies, our study also confirms the significance of hypertension, Eastern Cooperative Oncology Group (ECOG) performance, the American Society of Anesthesiologists (ASA) grading assessment, and grade of differentiation on LOS after colorectal surgery (31–33).

Among the demographic features in this study, sex was the only one that significantly influenced prolonged LOS, and this was well described in other studies. The male patients in this study had a longer median LOS of 10 days than female patients who had a median LOS of 8 days. Hence, female patients had a 55% reduced odds of prolonged LOS compared to male patients. Male patients also showed the likelihood of increased postoperative complications. Previous studies have found that the rate of postoperative complications was significantly higher in male patients than that in female patients (34, 35). The feature selection method showed no evidence of a patient’s age being associated with prolonged LOS for patients undergoing CRC surgery. In a study done by Leung et al. (9), a patient’s age was shown not to have a significant impact on the hospital LOS. However, some studies have suggested that a patient’s age is one of the informative risk factors for prolonged LOS (6, 36). The median age of the patients in this study was 60 years (range = 18–91 years). Using this information, we further categorized the patients into age groups: <60 years (191 patients) and ≥60 years (192 patients). A comparison of the LOS between these age groups showed that both groups have a median LOS of 9 days. This further highlights the similarity in the LOS of these patients, irrespective of age.

A postoperative complication is a relatively fixed risk factor that influences LOS, which has been shown in previous studies (8, 9). This is a testament to the validity and reliability of both the study data and the modeling approaches used in the present study. In the ranking of variable importance in this study, postoperative complication was ranked 100% by LR and SVM, suggesting that it is a strong determinant of LOS. Most patients in this study had only one postoperative complication each. The majority of the postoperative complications experienced by the patients were ileus, surgical wound sepsis, anastomotic leak/breakdown, and access collection, with the first two being the most recorded complications. Also, the most common postoperative complication in this study, according to the Clavien–Dindo classification, is grade II (56%). A postoperative complication has also been identified as a factor that influences a patient’s overall survival (37).

It is essential to investigate further possible pre- or perioperative factors that predispose patients to postoperative complications. Our findings further showed that factors such as the type of procedure, pre-therapeutic and intraoperative complications, gender, preoperative chemotherapy, staging, and hospital category predispose a patient to postoperative complications. Previous studies supported this (34, 35, 38, 39). If these factors influencing postoperative complications in patients undergoing CRC surgery are identified and controlled, the impact of this variable on LOS may decline drastically, and the overall post-surgical quality of life of the patients may be improved.

Several strengths of the current study should be acknowledged. This was a population-based study, which includes patients diagnosed with CRC who underwent surgical resection within public and private hospitals in the Johannesburg region, linked to the Witwatersrand. These four hospitals have both private and public healthcare facilities, which serve the most extensive urban population in South Africa. There is a high level of confidence in the follow-up of the patients in this study, with a detailed collection of an array of local and established patient information. In addition, the model developed in this study could be extrapolated to other cancer centers across South Africa and may also apply to the general population of South Africa, given that the hospitals in this study are mostly referral hospitals. The study is also subject to a few limitations. There may have been surgical advances, treatment, and patient management across the study period that may have affected the hospital LOS patterns experienced by these patients. Also, inter-hospital variations may impact the hospital LOS because different hospitals may have disparities in hospital admission policies. In addition, the absence of the implementation of ERAS as a standard of care could be another limitation of this study.

Although the sample size in this study was small, and we agree it is a limitation, this is the first of such studies from South Africa and Sub-Saharan Africa. Despite its relatively small sample size, this is the first longitudinal cohort study to describe the socio-demographics, risk factors, treatment, and outcomes of those diagnosed with colorectal cancer in Johannesburg, South Africa (18), hence, a valuable analysis that will hopefully serve as a basis for a broader validation in our setting. We have also validated the predictive models internally using three validation methods. Since there is no available external source test data to validate these models externally, we proposed to externally validate the current predictive models and ascertain the generalizability of these models using the upcoming longitudinal CRC study.

In conclusion, this study demonstrated the ability of LR and SVM to produce a clinically helpful model in predicting patient hospital LOS with high performance. The association established in this study may enable clinicians to implement changes in patient care pathways. If data were collected in the pre-and the post-hospital environment, a broader understanding of poor outcomes would be achieved. Seeing the bigger picture may enable clinicians to depersonalize poor outcomes and focus on measures beyond the failings of individuals. Such a strategy would not only improve clinical outcomes but is also likely to improve efficiency and, therefore, favorably impact the cost of care for patients with CRC. Finally, this study demonstrates that there is so much needed to be done in order to enable the management of CRC in South Africa to be comparable to that of the developed world.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

Ethical approval for this study was obtained from the Human Research Ethics Committee (Medical) of the University of the Witwatersrand, Johannesburg, South Africa (M1911131). The patients/participants provided written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

OA conceptualized, analyzed the data, compiled the results, and wrote the manuscript. DB and JF provided the data and reviewed the manuscript. ES and GN reviewed the manuscript. ME and EM supervised the concept development and reviewed the manuscript. All authors contributed to the article and approved the submitted version.

Funding

The CRCSA study was funded by the Medical Research Council of South Africa through the Wits/SAMRC Common Epithelial Cancer Research Centre (CECRC) Grant (Paul Ruff, principal investigator). The DELTAS Africa Initiative supports the present study. The DELTAS Africa Initiative is an independent funding scheme of the African Academy of Sciences’s (AAS) Alliance for Accelerating Excellence in Science in Africa (AESA). The New Partnership supports it for Africa’s Development Planning and Coordinating Agency (NEPAD Agency), with funding from the Welcome Trust [grant 107754/Z/15/Z- DELTAS Africa Sub-Saharan Africa Consortium for Advanced Biostatistics (SSACAB) programmer] and the UK government.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors want to acknowledge the patients who participated in the CRCSA study and the staff of WDGMC that curated the database, especially Mrs. Soneni Maphosa for her valuable explanation of the study design and variables.

References

1. Jansman FG, Postma MJ, Brouwers JR. Cost Considerations in the Treatment of Colorectal Cancer. Pharmacoeconomics (2007) 25:537–62. doi: 10.2165/00019053-200725070-00002

PubMed Abstract | CrossRef Full Text | Google Scholar

2. West MA, Wischmeyer PE, Grocott MP. Prehabilitation and Nutritional Support to Improve Perioperative Outcomes. Curr Anesthesiol Rep (2017) 7:340–9. doi: 10.1007/s40140-017-0245-2

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Bongers BC, Dejong CH, den Dulk M. Enhanced Recovery After Surgery Programmes in Older Patients Undergoing Hepatopancreatobiliary Surgery: What Benefits Might Prehabilitation Have? Eur J Surg Oncol (2020) 47(3):551–9. doi: 10.1016/j.ejso.2020.03.211

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Perng DS, Lu IC, Shi HY, Lin CW, Liu KW, Su YF, et al. Incidence Trends and Predictors for Cost and Average Lengths of Stay in Colorectal Cancer Surgery. World J Gastroenterol: WJG (2014) 20:532. doi: 10.3748/wjg.v20.i2.532

CrossRef Full Text | Google Scholar

5. Maphumulo WT, Bhengu BR. Challenges of Quality Improvement in the Healthcare of South Africa Post-Apartheid: A Critical Review. Curationis (2019) 42:1–9. doi: 10.4102/curationis.v42i1.1901

CrossRef Full Text | Google Scholar

6. Kelly M, Sharp L, Dwane F, Kelleher T, Comber H. Factors Predicting Hospital Length-of-Stay and Readmission After Colorectal Resection: A Population-Based Study of Elective and Emergency Admissions. BMC Health Serv Res (2012) 12:77. doi: 10.1186/1472-6963-12-77

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Vicendese D, Marvelde LT, McNair PD, Whitfield K, English DR, Taieb SB, et al. Hospital Characteristics, Rather Than Surgical Volume, Predict Length of Stay Following Colorectal Cancer Surgery. Aust New Z J Public Health (2020) 44:73–82. doi: 10.1111/1753-6405.12932

CrossRef Full Text | Google Scholar

8. Tartter PI. Determinants of Postoperative Stay in Patients With Colorectal Cancer. Dis Colon Rectum (1988) 31:694–8. doi: 10.1007/BF02552587

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Leung AM, Gibbons R, Vu HN. Predictors of Length of Stay Following Colorectal Resection for Neoplasms in 183 Veterans Affairs Patients. World J Surg (2009) 33:2183–8. doi: 10.1007/s00268-009-0148-6

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Francis N, Luther A, Salib E, Allanby L, Messenger D, Allison A, et al. The Use of Artificial Neural Networks to Predict Delayed Discharge and Readmission in Enhanced Recovery Following Laparoscopic Colorectal Cancer Surgery. Tech Coloproctol (2015) 19:419–28. doi: 10.1007/s10151-015-1319-0

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Cruz JA, Wishart DS. Applications of Machine Learning in Cancer Prediction and Prognosis. Cancer Inf (2006) 2:59–77. doi: 10.1177/117693510600200030

CrossRef Full Text | Google Scholar

12. Vollmer S, Mateen BA, Bohner G, Király FJ, Ghani R, Jonsson P, et al. Machine Learning and Artificial Intelligence Research for Patient Benefit: 20 Critical Questions on Transparency, Replicability, Ethics, and Effectiveness. BMJ (2020) 368(l6927):1–12. doi: 10.1136/bmj.l6927

CrossRef Full Text | Google Scholar

13. Steyerberg EW, Harrell FE Jr., Borsboom GJ, Eijkemans M, Vergouwe Y, Habbema JDF. Internal Validation of Predictive Models: Efficiency of Some Procedures for Logistic Regression Analysis. J Clin Epidemiol (2001) 54:774–81. doi: 10.1016/S0895-4356(01)00341-9

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Steyerberg EW, Harrell FE. Prediction Models Need Appropriate Internal, Internal–External, and External Validation. J Clin Epidemiol (2016) 69:245–7. doi: 10.1016/j.jclinepi.2015.04.005

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Stoean R, Stoean C, Sandita A, Ciobanu D, Mesina C. Ensemble of Classifiers for Length of Stay Prediction in Colorectal Cancer, In: Advances in Computational Intelligence-Lecture Notes in Computer Science, volume 9094 Switzerland AG: Springer (2015). pp. 444–57.

Google Scholar

16. Mondol MH. Rahman MS. A Comparison of Internal Validation Methods for Validating Predictive Models for Binary Data With Rare Events. J Stat Res (2017) 51:131–44. doi: 10.47302/jsr.2017510203

CrossRef Full Text | Google Scholar

17. Efron B, Tibshirani R. Improvements on Cross-Validation: The 632+ Bootstrap Method. J Am Stat Assoc (1997) 92:548–60. doi: 10.1080/01621459.1997.10474007

CrossRef Full Text | Google Scholar

18. Bebington B, Singh E, Fabian J, Jan Kruger C, Prodehl L, Surridge D, et al. Design and Methodology of a Study on Colorectal Cancer in Johannesburg, South Africa. JGH Open (2018) 2:139–43. doi: 10.1002/jgh3.12061

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Little RJ. A Test of Missing Completely at Random for Multivariate Data With Missing Values. J Am Stat Assoc (1988) 83:1198–202. doi: 10.1080/01621459.1988.10478722

CrossRef Full Text | Google Scholar

20. Stekhoven DJ, Bühlmann P. Missforest—non-Parametric Missing Value Imputation for Mixed-Type Data. Bioinformatics (2012) 28:112–8. doi: 10.1093/bioinformatics/btr597

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Achilonu OJ, Fabian J, Musenge E. Modeling Long-Term Graft Survival With Time-Varying Covariate Effects: An Application to a Single Kidney Transplant Centre in Johannesburg, South Africa. Front Public Health (2019) 7:201. doi: 10.3389/fpubh.2019.00201

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Vapnik V. The Nature of Statistical Learning Theory. (2013). doi: 10.1007/978-1-4757-3264-1

CrossRef Full Text | Google Scholar

23. Tibshirani R. Regression Shrinkage and Selection via the Lasso: A Retrospective. J R Stat Soc (Stat Methodol) (2011) 73:273–82. doi: 10.1111/j.1467-9868.2011.00771.x

CrossRef Full Text | Google Scholar

24. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al. Assessing the Performance of Prediction Models: A Framework for Some Traditional and Novel Measures. Epidemiol (Cambridge Mass) (2010) 21:128. doi: 10.1097/EDE.0b013e3181c30fb2

CrossRef Full Text | Google Scholar

25. Nguyen KA, Chen W, Lin BS, Seeboonruang U. Using Machine Learning-Based Algorithms to Analyze Erosion Rates of a Watershed in Northern Taiwan. Sustainability (2020) 12:2022. doi: 10.3390/su12052022

CrossRef Full Text | Google Scholar

26. Adonis L, An R, Luiz J, Mehrotra A, Patel D, Basu D, et al. Provincial Screening Rates for Chronic Diseases of Lifestyle, Cancers and Hiv in a Health-Insured Population. South Afr Med J (2013) 103:309–12. doi: 10.7196/SAMJ.6686

CrossRef Full Text | Google Scholar

27. Smart NJ, Aaojkrfn White P. Deviation and Failure of Enhanced Recovery After Surgery Following Laparoscopic Colorectal Surgery: Early Prediction Mode. Colorectal Dis (2012) 14:77. doi: 10.1111/j.1463-1318.2012.03096.x

CrossRef Full Text | Google Scholar

28. Khanbhai M, Shah M, Cantanhede G, Ilyas S, Richards T. The Problem of Anaemia in Patients With Colorectal Cancer. Int Scholarly Res Notices (2014) 2014(547914):1–4. doi: 10.1155/2014/547914

CrossRef Full Text | Google Scholar

29. Munk-Madsen P, Eriksen J, Kehlet H, Gögenur I. Why Still in Hospital After Laparoscopic Colorectal Surgery Within an Enhanced Recovery Programme? Colorectal Dis (2019) 21:1438–44. doi: 10.1111/codi.14762

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Parmar K, Zammit M, Smith A, Kenyon D, Lees N, Manchester G, et al. A Prospective Audit of Early Stoma Complications in Colorectal Cancer Treatment Throughout the Greater Manchester and Cheshire Colorectal Cancer Network. Colorectal Dis (2011) 13:935–8. doi: 10.1111/j.1463-1318.2010.02325.x

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Flynn DE, Mao D, Yerkovich ST, Franz R, Iswariah H, Hughes A, et al. The Impact of Comorbidities on Post-Operative Complications Following Colorectal Cancer Surgery. PloS One (2020) 15(12):e0243995. doi: 10.1371/journal.pone.0243995

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Lobato L, Ferreira PCA, Wick EC, Kiran RP, Remzi FH, Kalady MF, et al. Risk Factors for Prolonged Length of Stay After Colorectal Surgery. J Coloproctol (Rio Janeiro) (2013) 33:22–7. doi: 10.1590/S2237-93632013000100005

CrossRef Full Text | Google Scholar

33. Ahmed J, Lim M, Khan S, McNaught C, MacFie J. Predictors of Length of Stay in Patients Having Elective Colorectal Surgery Within an Enhanced Recovery Protocol. Int J Surg (2010) 8:628–32. doi: 10.1016/j.ijsu.2010.07.294

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Biondo S, Gálvez A, Ramírez E, Frago R, Kreisler E. Emergency Surgery for Obstructing and Perforated Colon Cancer: Patterns of Recurrence and Prognostic Factors. Tech Coloproctol (2019) 23:1141–61. doi: 10.1007/s10151-019-02110-x

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Vermeer N, Backes Y, Snijders H, Bastiaannet E, Liefers G, Moons L, et al. National Cohort Study on Postoperative Risks After Surgery for Submucosal Invasive Colorectal Cancer. BJS Open (2019) 3:210. doi: 10.1002/bjs5.50125

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Faiz O, Haji A, Burns E, Bottle A, Kennedy R, Aylin P. Hospital Stay Amongst Patients Undergoing Major Elective Colorectal Surgery: Predicting Prolonged Stay and Readmissions in Nhs Hospitals. Colorectal Dis (2011) 13:816–22. doi: 10.1111/j.1463-1318.2010.02277.x

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Khuri SF, Henderson WG, DePalma RG, Mosca C, Healey NA, Kumbhani DJ, et al. Determinants of Long-Term Survival After Major Surgery and the Adverse Effect of Postoperative Complications. Ann Surg (2005) 242:326. doi: 10.1097/01.sla.0000179621.33268.83

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Swellengrebel H, Marijnen C, Verwaal V, Vincent A, Heuff G, Gerhards M, et al. Toxicity and Complications of Preoperative Chemoradiotherapy for Locally Advanced Rectal Cancer. Br J Surg (2011) 98:418–26. doi: 10.1002/bjs.7315

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Kambakamba P, Dindo D, Nocito A, Clavien P, Seifert B, Schäfer M, et al. Intraoperative Adverse Events During Laparoscopic Colorectal Resection—Better Laparoscopic Treatment But Unchanged Incidence. Lessons Learnt From a Swiss Multi-Institutional Analysis of 3,928 Patients. Langenbeck’s Arch Surg (2014) 399:297–305. doi: 10.1007/s00423-013-1156-4

CrossRef Full Text | Google Scholar

Keywords: colorectal cancer, hospital length of stay, risk factors, support vector machine, logistic regression, internal validation techniques, prediction models, South Africa

Citation: Achilonu OJ, Fabian J, Bebington B, Singh E, Nimako G, Eijkemans RMJC and Musenge E (2021) Use of Machine Learning and Statistical Algorithms to Predict Hospital Length of Stay Following Colorectal Cancer Resection: A South African Pilot Study. Front. Oncol. 11:644045. doi: 10.3389/fonc.2021.644045

Received: 21 December 2020; Accepted: 31 August 2021;
Published: 01 October 2021.

Edited by:

Gianluca Rompianesi, University of Naples Federico II, Italy

Reviewed by:

Kazuhiko Yoshimatsu, Kawasaki Medical School, Japan
Giovanni Domenico Tebala, Oxford University Hospitals NHS Trust, United Kingdom
Gianluca Pagano, University of Naples Federico II, Italy

Copyright © 2021 Achilonu, Fabian, Bebington, Singh, Nimako, Eijkemans and Musenge. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Okechinyere J. Achilonu, YWNoaWxvbnUub2tlY2hpbnllcmVAZ21haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.