ORIGINAL RESEARCH article

Front. Oncol., 17 August 2023

Sec. Gastrointestinal Cancers: Colorectal Cancer

Volume 13 - 2023 | https://doi.org/10.3389/fonc.2023.1199868

Development and validation of a web-based predictive model for preoperative diagnosis of localized colorectal cancer and colorectal adenoma

  • Clinical Laboratory, DongYang People’s Hospital, Dongyang, Zhejiang, China

Abstract

Background:

Localized colorectal cancer (LCC) has obscure clinical signs, which are difficult to distinguish from colorectal adenoma (CA). This study aimed to develop and validate a web-based predictive model for preoperative diagnosis of LCC and CA.

Methods:

We conducted a retrospective study that included data from 500 patients with LCC and 980 patients with CA who were admitted to Dongyang People’s Hospital between November 2012 and June 2022. Patients were randomly divided into the training (n=1036) and validation (n=444) cohorts. Univariate logistic regression, least absolute shrinkage and selection operator regression, and multivariate logistic regression were used to select the variables for predictive models. The area under the curve (AUC), calibration curve, decision curve analysis (DCA), and clinical impact curve (CIC) were used to evaluate the performance of the model.

Results:

The web-based predictive model was developed, including nine independent risk factors: age, sex, drinking history, white blood cell count, lymphocyte count, red blood cell distribution width, albumin, carcinoembryonic antigen, and fecal occult blood test. The AUC of the prediction model in the training and validation cohorts was 0.910 (0.892–0.929) and 0.894 (0.862–0.925), respectively. The calibration curve showed good consistency between the outcome predicted by the model and the actual diagnosis. DCA and CIC showed that the predictive model had a good clinical application value.

Conclusion:

This study first developed a web-based preoperative prediction model, which can discriminate LCC from CA and can be used to quantitatively assess the risks and benefits in clinical practice.

1 Introduction

Colorectal cancer (CRC) is the second leading cause of cancer-related death (). However, for patients with localized colorectal cancer (LCC), the 5-year survival rate after timely surgical treatment can reach 90% (). Generally, endoscopy predicts potential malignant tumors based on the size and shape of colorectal tumors and ultimately guides tumor treatment (). LCC and colorectal adenoma (CA) are local lesions that require different treatment approaches. Patients with LCC should undergo laparoscopic or open surgery as soon as it is practicable () and may require chemotherapy and adjuvant radiation before surgery (). On the contrary, patients with CA can be treated using selective endoscopic removal based on their preference (). However, LCC has obscure clinical signs, and is difficult to distinguish from CA, which depends on biopsy and pathological evaluation ().

Pathological diagnosis is the gold standard for differentiating between benign and malignant colorectal tumors. However, an endoscopic biopsy is an invasive examination that can lead to complications, such as bleeding, perforation, and infection; thus, it is limited to the patient’s willingness and compliance (). Due to the advancements in molecular diagnostic technology, DNA () and microRNA () are now being used as CRC biomarkers. However, they are expensive with unstable diagnostic performance limiting their use in clinical settings. Therefore, fecal occult blood test (FOBT) () and serum carcinoembryonic antigen (CEA) () detection are preferred as CRC biomarkers because of their accessibility and affordability. However, for CRC, particularly in the early stage, a single detection biomarker has limited sensitivity and a high probability of misdiagnosis. To select the appropriate treatment method and reduce the rate of misdiagnosis of LCC, it is crucial to create a diagnostic prediction model with excellent diagnostic performance using readily accessible and affordable markers.

In previous studies, there are many potential predictive biomarkers for CRC preoperative diagnosis models, such as platelet-related parameters (), red blood cell distribution width (RDW) (), and hemoglobin (). However, some of those studies had a small sample size () and almost all of them included patients with advanced CRC (, ). To our knowledge, patients with advanced CRC present with more pronounced clinical symptoms; as such, less sensitive indicators have an exaggerated role in the prediction model, resulting in the reduced diagnostic performance of the LCC prediction model.

In this single-center retrospective study, we used clinical and laboratory data of patients from 2012 to 2022 to develop and validate the first web-based predictive model for preoperative diagnosis of LCC and CA.

2 Materials and methods

The design and reporting of this study were guided by transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) ().

2.1 Study population

Demographic and clinical data of 2070 patients were collected from the medical records at the Dongyang People’s Hospital between November 2012 and June 2022. The inclusion criteria were adult patients (≥ 18 years) who were diagnosed with CRC or CA using histopathology. The exclusion criteria were (1): preoperative anti-tumor treatment (2), patients diagnosed with advanced CRC (Tumor-Node-Metastasis [TNM] stage III-IV) according to TNM staging of the eighth edition of the American Joint Committee on Cancer (3), patients with other primary malignant tumors, and (4) patients with clinical data loss rate >10%. According to the inclusion and exclusion criteria, 500 patients with LCC and 980 patients with CA were finally included in this study (Figure 1).

Figure 1

The study was approved by the ethics committee of Dongyang People’s Hospital. The data analysis was anonymous. Informed consent was waived since this is a retrospective study and the diagnosis as well as treatment of the patients were not affected.

2.2 Data collection

Preoperative data of eligible patients were extracted, including age, sex, routine blood parameters, FOBT, serum albumin, serum glucose, and tumor biomarkers. Routine blood parameters included white blood cell (WBC), neutrophil, and lymphocyte counts, as well as hemoglobin, RDW, platelet count, and mean platelet volume. Selected tumor biomarkers were CEA and carbohydrate antigen 199. FOBT-positive patients were defined as those with a positive immunochemical test and guaiac-based test ≥ 1+. When the proportion of missing values in the variables was less than 10%, the missing data were filled using multiple imputations in the two cohorts respectively (). For outliers, which were defined as values other than 1-99% in continuous variables winsorized by 1% on both sides (), artificial discrimination was used in categorical variables. In addition, data on smoking, drinking, diabetes, and hypertension statuses of the eligible patients were collected. A history of smoking or drinking was noted if smoking or the use of alcohol was reported in a patient’s medical record.

2.3 Statistical analysis

Included patients were randomly divided into the training (70%) and validation (30%) cohorts. Data were analyzed using Stata version 14.0 (Stata Corp LP, College Station, TX, USA) and R software version 4.1.0. Continuous variables are expressed as means and standard deviations or medians and interquartile intervals as appropriate. A Wilcox test or Student’s t-test was used to assess between-group differences. Categorical variables were compared using the chi-square test and are presented as quantities (percentages).

Data from the training cohort were used for the development of the prediction model. Univariable logistic regression analysis was used preliminarily to screen candidate variables. The least absolute shrinkage and selection operator (LASSO) regression analysis was then used to remove collinear independent variables to prevent overfitting (). Subsequently, indicators with coefficients that were not zero were included in a multivariable logistic regression to complete the final selection of variables. Variables with statistical significance (P<0.05) were included in the prediction model. The prediction probability of the model was calculated using the following formula:

A web-based prediction model was developed based on the “DynNom” and “rsconnect” packages in R software (). The prediction probability can be automatically calculated after inputting the expression of each variable.

The performance of the model in the training and validation cohort was assessed. After data were normalized, the discriminative ability was expressed using the area under the receiver operating characteristic (ROC) curve (AUC). The Youden index was used to determine the best cut-off value, and the corresponding sensitivity, specificity, accuracy, positive prediction value (PPV), negative prediction value (NPV), positive likelihood ratio (PLR), as well as negative effect ratio (NLR), were calculated. To examine the consistency between the actual risk of LCC and the probability predicted using the new model, calibration curves were plotted and the Hosmer–Lemeshow goodness of fit test was performed. The clinical utility of the decision curve analysis (DCA) and clinical impact curve (CIC) was used to demonstrate the clinical utility of the prediction model ().

3 Results

3.1 Patient characteristics

Demographic and clinical data were extracted from medical records of 2070 patients between November 2012 and June 2022. After data were excluded based on our criteria, records of 1480 patients (LCC: n=500 and CA: n=980) were finally included in this study (Figure 1). Among LCC patients, five (1.0%), 161 (32.2%), and 334 (66.8%) patients had TNM stage 0, I, and II disease, respectively. The patients were then randomly divided into the training (n=1036) and validation (n=444) cohorts at a ratio of 7:3. There was no statistical difference between the training and validation cohorts in each variable (Table 1).

Table 1

VariablesTraining CohortValidation CohortP
CA Group
(n = 684)
LCC Group
(n = 352)
CA Group
(n = 296)
LCC Group
(n = 148)
Age, years58.4 ± 11.365.6 ± 12.157.1 ± 11.365.1 ± 10.80.132
Male, n (%)
Female, n (%)
485 (70.9)
199 (29.1)
193 (54.8)
159 (45.2)
214 (72.3)
82 (27.7)
81 (54.7)
67 (45.3)
0.711
Smoking history, n (%)246 (36.0)105 (29.8)120 (40.5)43 (29.1)0.294
Drinking history, n (%)225 (32.9)139 (39.5)107 (36.1)45 (30.4)0.739
Comorbidities, n (%)
 Hypertension190 (27.8)134 (38.1)70 (23.6)54 (36.5)0.199
 Diabetes35 (5.1)27 (7.7)19 (6.4)17 (11.5)0.132
Laboratory test
White blood cell count, 10 9/L5.6 (4.8–6.7)6.1 (4.9–7.4)5.7 (4.8–6.7)6.2 (5.0–7.6)0.616
Neutrophil count, 10 9/L3.4 (2.8–4.3)3.9 (2.9–5.0)3.4 (2.8–4.2)3.9 (3.0–5.1)0.905
Lymphocyte count, 10 9/L1.7 (1.4–2.0)1.5 (1.2–1.8)1.7 (1.4–2.1)1.6 (1.2–1.9)0.083
Hemoglobin, g/L146 (136–156)129 (114–141)147 (136–157)126 (114–140)0.866
Red blood cell distribution width, %12.5 (12.2–13.0)13 (12.5–14)12.6 (12.2–13.1)13.2 (12.5–14)0.414
Platelet count, 10 9/L215 (180.5–253)238 (195–288)215 (183–255)237.5 (186–289.5)0.909
Mean platelet volume, fL10.5 (10–11.2)10.2 (9.6–10.8)10.6 (10–11.3)10 (9.4–10.8)0.944
Glucose, mg/dL5.15 (4.8–5.6)5.1 (4.7–5.6)5.1 (4.8–5.6)5.2 (4.8–6.1)0.749
Albumin, g/L44.3 (42.4–46.1)40.5 (36.6–43.4)44.1 (42.3–46.2)40.6 (36.1–43.6)0.902
Carcinoembryonic antigen, ng/mL2.2 (1.5–3.3)3.7 (2.2–6.9)2.2 (1.5–3.3)3.3 (2.0–5.3)0.212
Carbohydrate antigen 199, U/mL9.6 (6.5–14.6)10.7 (6.4–17.6)10.1 (6.4–14.5)10.2 (6.4–17.6)0.747
Fecal occult blood test, n (%)153 (22.4)279 (79.3)77 (26.0)120 (81.1)0.341
TNM stage, n (%)0.867
 Stage 04 (1.1)1 (0.7)
 Stage I112 (31.8)49 (33.1)
 Stage II236 (67.1)98 (66.2)

Preoperative baseline characteristics of patients in the training and validation cohorts.

LCC, localized colorectal cancer; CA, colorectal adenoma; TNM, tumor-node-metastasis.

3.2 Variable selection and development of a web-based predictive model

As shown in Table 2, in the training cohort, there were significant differences in 16 clinical parameters between LCC and CA groups using univariable logistic regression. Subsequently, 15 potential predictors without multicollinearity were identified using LASSO regression analysis (Table 2, Supplementary Figure 1). Finally, age, sex, drinking history, WBC, lymphocyte count, RDW, albumin, CEA, and FOBT were considered independent predictors of LCC in patients according to the results of multivariate logistic regression (Table 3). The probability of LCC can be calculated according to the following formula: Probability (LCC) = 1/(1 + exp (− (-1.703 + 0.024 × age -1.035 × sex (male=1, female=0) + 0.482 × drinking history (yes=1, no=0) + 0.215 × WBC - 0.694 × lymphocyte count + 0.334 × RDW - 0.131 × albumin + 0.203 × CEA + 2.357 × FOBT (positive=1, negative=0)))).

Table 2

VariablesUnivariable logistic analysisLASSO regression analysis
OR (95% CI)PLambda.min = 0.0023
Age1.06 (1.04–1.07)<0.0010.0248
Sex (male=1, female=0)0.50 (0.38–0.65)<0.001-0.6563
Smoking history (yes=1, no=0)0.76 (0.57–1.00)0.048-0.0539
Drinking history (yes=1, no=0)1.33 (1.02–1.74)0.0350.4375
Hypertension (yes=1, no=0)1.60 (1.22–2.10)0.0010.0425
Diabetes (yes=1, no=0)1.54 (0.92–2.59)0.103/
White blood cell count1.15 (1.08–1.23)<0.0010.1896
Neutrophil count1.27 (1.17–1.38)<0.0010
Lymphocyte count0.49 (0.38–0.63)<0.001-0.6331
Hemoglobin0.95 (0.94–0.96)<0.001-0.0117
Red blood cell distribution width1.85 (1.61–2.14)<0.0010.1904
Platelet count1.01 (1.00–1.01)<0.0010.0029
Mean platelet volume0.63 (0.55–0.72)<0.001-0.0030
Glucose1.02 (0.95–1.09)0.552/
Albumin0.78 (0.75–0.81)<0.001-0.1077
Carcinoembryonic antigen1.32 (1.24–1.40)<0.0010.1713
Carbohydrate antigen 1991.02 (1.01–1.03)<0.0010.0017
Fecal occult blood test
(positive=1, negative=0)
13.26 (9.69–18.16)<0.0012.2308

Univariate logistic analysis and LASSO regression analysis in the training cohort.

CI, confidence interval; OR, odds ratio; LASSO, least absolute shrinkage and selection operator.

Table 3

VariablesMultivariate logistic analysis
βOR (95% CI)PβOR (95% CI)P
Age0.0261.03
(1.01–1.05)
0.0060.0241.02
(1.01–1.04)
0.006
Sex (male=1, female=0)-0.7340.48
(0.29–0.79)
0.004-1.0350.35
(0.23–0.54)
<0.001
Smoking history (yes=1, no=0)-0.1370.87
(0.52–1.46)
0.602
Drinking history (yes=1, no=0)0.5321.70
(1.07–2.70)
0.0240.4821.62
(1.07–2.45)
0.023
Hypertension (yes=1, no=0)0.0731.08
(0.71–1.62)
0.726
White blood cell count0.2071.23
(1.09–1.38)
0.0010.2151.24
(1.11–1.39)
<0.001
Lymphocyte count-0.7010.50
(0.33–0.74)
0.001-0.6940.50
(0.34–0.74)
0.001
Hemoglobin-0.0110.99
(0.98–1.00)
0.083
Red blood cell distribution width0.2151.24
(1.02–1.50)
0.0270.3341.40
(1.18–1.65)
<0.001
Platelet count0.0031.00
(1.00–1.01)
0.069
Mean platelet volume-0.1290.88
(0.71–1.09)
0.231
Albumin-0.1100.90
(0.85–0.95)
<0.001-0.1310.88
(0.83–0.92)
<0.001
Carcinoembryonic antigen0.2021.22
(1.13–1.32)
<0.0010.2031.22
(1.14–1.32)
<0.001
Carbohydrate antigen 19-90.0021.00
(0.99–1.01)
0.719
Fecal occult blood test
(positive=1, negative=0)
2.2879.84
(6.71–14.43)
<0.0012.35710.56
(7.24–15.42)
<0.001
Intercept-0.054//-1.703

Multivariate logistic analysis in the training cohort.

CI, confidence interval; OR, odds ratio.

The web-based dynamic prediction model developed using the selected variables can be used through the following link: https://ly11219.shinyapps.io/dynnomapp/.The interface of this webpage is shown in Figure 2. Figure 2A displays the input nomograph interface, in which users can adjust the expression of each item. Figure 2B shows a graphical summary of the LCC probability and 95% confidence interval (CI) predicted for the three patients according to the nomogram. The page also provides a numerical summary, shown in Figure 2C.

Figure 2

3.3 Evaluation of the performance of the prediction model

The AUC in the training and validation cohorts was 0.910 (0.892–0.929) and 0.894 (0.862–0.925), respectively (Figure 3A). There was no significant difference in the diagnostic performance of the prediction model between the two cohorts (P=0.379). The optimal cut-off value of the probatility of the prediction model was 26.41%. The result of sensitivity, specificity, PPV, NPV, PLR, and NLR used to distinguish between LCC and CA was 86.4%, 80.1%, 82.2%, 69.1%, 91.9%, 4.342, and 0.170 in the training cohort, as well as 85.1%, 79.1%, 81.1%, 67.0%, 91.4%, 4.07 and 0.188 in the validation cohort, respectively (Table 4). The results of the Hosmer–Lemeshow goodness of fit test showed good consistency between the outcome predicted by the model and the actual diagnosis, which was reflected in both the training and validation cohorts (Figures 3B, C).

Figure 3

Table 4

Training CohortValidation Cohort
Area under the curve0.9100.894
95% CI lower0.8920.862
95% CI upper0.9290.925
Sensitivity86.4%85.1%
Specificity80.1%79.1%
Accuracy82.2%81.1%
Positive predictive value69.1%67.0%
Negative predictive value91.9%91.4%
Positive log-likelihood ratio4.3424.07
Negative log-likelihood ratio0.1700.188

Predictive performance of the models used to estimate the risk of LCC.

CI, confidence interval; LCC, localized colorectal cancer.

When drawing the DCA to reflect the advantages of the new model, we added a comparison between the new model and the two common CRC screening indicators, CEA and FOBT. In the training and validation cohorts, the threshold probability was between 0.05 and 1.00. The performance of the prediction model was better than that of the CEA, FOBT, and two extreme cases (treat-none and treat-all) as shown in Figures 4A, B. For example, when the risk threshold was set to 0.3 (i.e. if the LCC probability of the patient was >30%), the patient would receive further treatment. In the training cohort, the net benefit of the new prediction model was 0.25, which was higher than that of of the FOBT (0.20), CEA (0.10), treat-all (0.05), and treat-none (0.00).

Figure 4

CIC results also showed a similar number of high-risk LCC patients predicted by the new model and the actual number of LCC patients in both the training and validation cohorts (Figures 4C, D). For instance, when the risk threshold was set to 0.4, almost 400 out of every 1000 persons in the training cohort were deemed at high risk, and approximately 350 of them were diagnosed with LCC.

4 Discussion

CA is a benign tumor in the colorectal region, of which only 5% will eventually advance to CRC, and the overall tumor progression is slow (). Preoperative differentiation between LCC and CA is helpful to reduce unnecessary treatment and promote the early detection of CRC. In most cases, the differentiation between LCC and CA patients depends on invasive colonoscopy (). In this study, we found that age, sex, drinking history, WBC, lymphocyte count, RDW, albumin, CEA, and FOBT were independent predictors of LCC in patients, and successfully developed a web-based prediction model. Through evaluating calibration and validation, we believe that our prediction model has a good discrimination performance and clinical application value.

To predict LCC and CA, a wide range of variables were considered in this model when selecting preoperative markers. Age and sex play a significant role in the diagnosis of many tumors, including CRC () and lung cancer (). In this study, being female and older were considered risk factors for LCC. Previous studies reported smoking and drinking as risk factors for CRC, and these were associated with a poor prognosis (). The current study also showed that individuals with a drinking history were at a higher risk of being diagnosed with LCC, but there was no association between LCC and smoking. Routine blood cell parameters are often used as inflammatory markers to reflect the patient’s inflammatory immune status. Various studies have shown that higher neutrophil-lymphocyte and platelet-lymphocyte ratios or a high systemic immune inflammatory index (platelet count × neutrophil count/lymphocyte count) were associated with a higher tumor stage, worse differentiation level, and worse prognosis of CRC (). Our prediction model used single blood cell parameters, such as WBC, lymphocyte count, or RDW, instead of the ratio between parameters, which reduces the steps of numerical conversion and simplifies the calculation process. Serum albumin not only reflects the nutritional status of patients but also has a negative correlation with the inflammatory reaction in vivo. It is an independent predictor of the prognosis of CRC (). In this study, patients with lower albumin levels had a higher probability of LCC diagnosis. CEA and FOBT are currently widely used non-invasive markers for screening CRC (, ), which played a significant role in this prediction model and had better diagnostic performance than being used individually.

Previous studies attempted to use molecular detection for early diagnosis of CRC; however, this is expensive and incomparable to colonoscopy or fecal immunochemical tests (). Some studies have used markers of the systemic inflammatory response as diagnostic tools for CRC (, ). However, the results of these trials have limited diagnostic performance in individuals with early CRC because a large number of patients with advanced CRC were included. Additionally, the static nomograph model requires the manual calculation of the prediction probability corresponding to the total score, which is less intuitive. This study included nine easily available and inexpensive preoperative variables and developed a web-based prediction model. As a preoperative prediction tool, the model is not only easy to popularize and use, but also has a high prediction accuracy and good discrimination characteristics.

Our study has some limitations. First, since this is a single-center retrospective study, it is necessary to validate our model with data from external prospective studies. Second, neither the diagnostic information of patients with advanced CRC nor the prognosis was included in this study. In most cases, the model was mainly used as a tool for early screening of LCC and CA. Third, other indicators with potential predictive value, such as gene expression and coagulation markers, were not included due to restrictions posed by retrospective data as well as the feasibility of sample collection and costs. To validate the accuracy of our findings, a multicenter prospective investigation is required.

5 Conclusion

A web-based preoperative prediction model incorporating nine preoperative variables was developed. The model can directly and quantitatively assess the risks and benefits in clinical practice and has strong performance in recognizing LCC and CA.

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee of Dongyang People’s Hospital. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

YL contributed to the research concept and design. HG and JJ conducted data collection and analysis. All authors participated in the first draft of this manuscript. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1199868/full#supplementary-material

Abbreviations

CRC, Colorectal cancer; LCC, localized colorectal cancer; CA, colorectal adenoma; FOBT, fecal occult blood test; CEA, carcinoembryonic antigen; TNM, Tumor-Node-Metastasis; WBC, white blood cell; RDW, red blood cell distribution width; LASSO, least absolute shrinkage and selection operator; ROC, receiver operating characteristic; AUC, arear under the receiver operating characteristic curve; PPV, positive prediction value; NPV, negative prediction value; PLR, positive likelihood ratio; NLR, negative effect ratio; DCA, decision curve analysis; CIC, clinical impact curve; CI, confidence interval.

References

  • 1

    SungHFerlayJSiegelRLLaversanneMSoerjomataramIJemalAet al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660

  • 2

    MillerKDNogueiraLDevasiaTMariottoABYabroffKRJemalAet al. Cancer treatment and survivorship statistics, 2022. CA: Cancer J Clin (2022) 72(5):409–36. doi: 10.3322/caac.21731

  • 3

    BronzwaerMESDeplaAvan LelyveldNSpanierBWMOosterhoutYHvan LeerdamMEet al. Quality assurance of colonoscopy within the Dutch national colorectal cancer screening program. Gastrointestinal Endoscopy (2019) 89(1):113. doi: 10.1016/j.gie.2018.09.011

  • 4

    ShengSZhaoTWangX. Comparison of robot-assisted surgery, laparoscopic-assisted surgery, and open surgery for the treatment of colorectal cancer: A network meta-analysis. Medicine (2018) 97(34):e11817. doi: 10.1097/md.0000000000011817

  • 5

    BensonABVenookAPAl-HawaryMMArainMAChenYJCiomborKKet al. NCCN guidelines insights: rectal cancer, version 6.2020. J Natl Compr Cancer Network JNCCN (2020) 18(7):806–15. doi: 10.6004/jnccn.2020.0032

  • 6

    AaronsCBShanmuganSBleierJI. Management of Malignant colon polyps: current status and controversies. World J Gastroenterol (2014) 20(43):16178–83. doi: 10.3748/wjg.v20.i43.16178

  • 7

    DasVKalitaJPalM. Predictive and prognostic biomarkers in colorectal cancer: A systematic review of recent advances and challenges. Biomedicine pharmacotherapy = Biomedecine pharmacotherapie (2017) 87:819. doi: 10.1016/j.biopha.2016.12.064

  • 8

    KimSYKimHSParkHJ. Adverse events related to colonoscopy: Global trends and future challenges. World J Gastroenterol (2019) 25(2):190204. doi: 10.3748/wjg.v25.i2.190

  • 9

    MüllerDGyőrffyB. DNA methylation-based diagnostic, prognostic, and predictive biomarkers in colorectal cancer. Biochim Biophys Acta Rev Cancer (2022) 1877(3):188722. doi: 10.1016/j.bbcan.2022.188722

  • 10

    ToiyamaYOkugawaYFleshmanJRichard BolandCGoelA. MicroRNAs as potential liquid biopsy biomarkers in colorectal cancer: A systematic review. Biochim Biophys Acta Rev Cancer (2018) 1870(2):274–82. doi: 10.1016/j.bbcan.2018.05.006

  • 11

    LiJNYuanSY. Fecal occult blood test in colorectal cancer screening. J Digestive Dis (2019) 20(2):62–4. doi: 10.1111/1751-2980.12712

  • 12

    RaoHWuHHuangQYuZZhongZ. Clinical value of serum CEA, CA24-2 and CA19-9 in patients with colorectal cancer. Clin Lab (2021) 67(4). doi: 10.7754/Clin.Lab.2020.200828

  • 13

    WuYYZhangXQinYYQinJQLinFQ. Mean platelet volume/platelet count ratio in colorectal cancer: a retrospective clinical study. BMC Cancer (2019) 19(1):314. doi: 10.1186/s12885-019-5504-9

  • 14

    SongYHuangZKangYLinZLuPLinQet al. Clinical usefulness and prognostic value of red cell distribution width in colorectal cancer. BioMed Res Int (2018) 2018:9858943. doi: 10.1155/2018/9858943

  • 15

    LiHLinJXiaoYZhengWZhaoLYangXet al. Colorectal cancer detected by machine learning models using conventional laboratory test data. Technol Cancer Res Treat (2021) 20:15330338211058352. doi: 10.1177/15330338211058352

  • 16

    ZáhorecRMarekVWaczulikovaIVeselovskýTPalajJKečk隊et al. Predictive model using hemoglobin, albumin, fibrinogen, and neutrophil-to-lymphocyte ratio to distinguish patients with colorectal cancer from those with benign adenoma. Neoplasma (2021) 68(6):1292–300. doi: 10.4149/neo_2021_210331N435

  • 17

    HuangJXuPChenY. A retrospective study from a single center to identify hematological factors that distinguish between patients with colorectal carcinoma and colorectal adenoma. Med Sci monitor Int Med J Exp Clin Res (2022) 28:e936745. doi: 10.12659/msm.936745

  • 18

    Hernandez-AinsaMVelamazanRLanasACarrera-LasfuentesPPiazueloE. Blood-cell-based inflammatory markers as a useful tool for early diagnosis in colorectal cancer. Front Med (2022) 9:843074. doi: 10.3389/fmed.2022.843074

  • 19

    MoonsKGAltmanDGReitsmaJBIoannidisJPMacaskillPSteyerbergEWet al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Internal Med (2015) 162(1):W173. doi: 10.7326/m14-0698

  • 20

    AustinPCWhiteIRLeeDSvan BuurenS. Missing data in clinical research: A tutorial on multiple imputation. Can J Cardiol (2021) 37(9):1322–31. doi: 10.1016/j.cjca.2020.11.010

  • 21

    SharmaSChatterjeeS. Winsorization for robust bayesian neural networks. Entropy (Basel Switzerland) (2021) 23(11). doi: 10.3390/e23111546

  • 22

    FriedmanJHastieTTibshIraniR. Regularization paths for generalized linear models via coordinate descent. J Stat Software (2010) 33(1):122.

  • 23

    JalaliAAlvarez-IglesiasARoshanDNewellJ. Visualising statistical models using dynamic nomograms. PloS One (2019) 14(11):e0225253. doi: 10.1371/journal.pone.0225253

  • 24

    KerrKFBrownMDZhuKJanesH. Assessing the clinical impact of risk prediction models with decision curves: guidance for correct interpretation and appropriate use. J Clin Oncol Off J Am Soc Clin Oncol (2016) 34(21):2534–40. doi: 10.1200/jco.2015.65.5654

  • 25

    KeumNGiovannucciE. Global burden of colorectal cancer: emerging trends, risk factors and prevention strategies. Nat Rev Gastroenterol Hepatol (2019) 16(12):713–32. doi: 10.1038/s41575-019-0189-8

  • 26

    ChenHLiNRenJFengXLyuZWeiLet al. Participation and yield of a population-based colorectal cancer screening programme in China. Gut (2019) 68(8):1450–7. doi: 10.1136/gutjnl-2018-317124

  • 27

    ErichsenRBaronJAStoffelEMLaurbergSSandlerRSSørensenHT. Characteristics and survival of interval and sporadic colorectal cancer patients: a nationwide population-based cohort study. Am J Gastroenterol (2013) 108(8):1332–40. doi: 10.1038/ajg.2013.175

  • 28

    LiuDSunXLiuALiLLiSLiJet al. Predictive value of a novel Asian lung cancer screening nomogram based on artificial intelligence and epidemiological characteristics. Thorac Cancer (2021) 12(23):3130–40. doi: 10.1111/1759-7714.14140

  • 29

    KlarichDSBrasserSMHongMY. Moderate alcohol consumption and colorectal cancer risk. Alcoholism Clin Exp Res (2015) 39(8):1280–91. doi: 10.1111/acer.12778

  • 30

    WalterVJansenLUlrichARothWBläkerHChang-ClaudeJet al. Alcohol consumption and survival of colorectal cancer patients: a population-based study from Germany. Am J Clin Nutr (2016) 103(6):1497–506. doi: 10.3945/ajcn.115.127092

  • 31

    GongJHutterCBaronJABerndtSCaanBCampbellPTet al. A pooled analysis of smoking and colorectal cancer: timing of exposure and interactions with environmental factors. Cancer epidemiology Biomarkers Prev Publ Am Assoc Cancer Research cosponsored by Am Soc Prev Oncol (2012) 21(11):1974–85. doi: 10.1158/1055-9965.epi-12-0692

  • 32

    WalterVJansenLHoffmeisterMUlrichAChang-ClaudeJBrennerH. Smoking and survival of colorectal cancer patients: population-based study from Germany. Int J Cancer (2015) 137(6):1433–45. doi: 10.1002/ijc.29511

  • 33

    YangJXuHGuoXZhangJYeXYangYet al. Pretreatment inflammatory indexes as prognostic predictors for survival in colorectal cancer patients receiving neoadjuvant chemoradiotherapy. Sci Rep (2018) 8(1):3044. doi: 10.1038/s41598-018-21093-7

  • 34

    LinSQXieHLGeYZRuanGTZhangQSongMMet al. Association between systemic inflammation and water composition and survival in colorectal cancer. Front Oncol (2022) 12:896160. doi: 10.3389/fonc.2022.896160

  • 35

    SongYYangYGaoPChenXYuDXuYet al. The preoperative neutrophil to lymphocyte ratio is a superior indicator of prognosis compared with other inflammatory biomarkers in resectable colorectal cancer. BMC Cancer (2017) 17(1):744. doi: 10.1186/s12885-017-3752-0

  • 36

    ChanJCChanDLDiakosCIEngelAPavlakisNGillAet al. The lymphocyte-to-monocyte ratio is a superior predictor of overall survival in comparison to established biomarkers of resectable colorectal cancer. Ann Surg (2017) 265(3):539–46. doi: 10.1097/sla.0000000000001743

  • 37

    ZhangNNingFGuoRPeiJQiaoYFanJet al. Prognostic values of preoperative inflammatory and nutritional markers for colorectal cancer. Front Oncol (2020) 10:585083. doi: 10.3389/fonc.2020.585083

  • 38

    LisiDHassanCCrespiM. Participation in colorectal cancer screening with FOBT and colonoscopy: an Italian, multicentre, randomized population study. Digestive liver Dis Off J Ital Soc Gastroenterol Ital Assoc Study Liver (2010) 42(5):371–6. doi: 10.1016/j.dld.2009.07.019

  • 39

    HeissJABrennerH. Epigenome-wide discovery and evaluation of leukocyte DNA methylation markers for the detection of colorectal cancer in a screening setting. Clin Epigenet (2017) 9:24. doi: 10.1186/s13148-017-0322-x

  • 40

    PeterseEFPMeesterRGSde JongeLOmidvariAHAlarid-EscuderoFKnudsenABet al. Comparing the cost-effectiveness of innovative colorectal cancer screening tests. J Natl Cancer Institute (2021) 113(2):154–61. doi: 10.1093/jnci/djaa103

  • 41

    JinPYouPFangJKangQGuFCaiYet al. Comparison of performance of two stool DNA tests and a fecal immunochemical test in detecting colorectal neoplasm: A multicenter diagnostic study. Cancer epidemiology Biomarkers Prev Publ Am Assoc Cancer Research cosponsored by Am Soc Prev Oncol (2022) 31(3):654–61. doi: 10.1158/1055-9965.epi-21-0991

  • 42

    KilincalpSÇobanŞAkinciHHamamcıMKaraahmetFCoşkunYet al. Neutrophil/lymphocyte ratio, platelet/lymphocyte ratio, and mean platelet volume as potential biomarkers for early detection and monitoring of colorectal adenocarcinoma. Eur J Cancer Prev Off J Eur Cancer Prev Organisation (ECP) (2015) 24(4):328–33. doi: 10.1097/cej.0000000000000092

Summary

Keywords

colorectal cancer, colorectal adenoma, predictive model, preoperative diagnosis, dynamic nomogram

Citation

Lu Y, Guo H and Jiang J (2023) Development and validation of a web-based predictive model for preoperative diagnosis of localized colorectal cancer and colorectal adenoma. Front. Oncol. 13:1199868. doi: 10.3389/fonc.2023.1199868

Received

28 April 2023

Accepted

03 August 2023

Published

17 August 2023

Volume

13 - 2023

Edited by

Sharon R. Pine, University of Colorado Anschutz Medical Campus, United States

Reviewed by

Sergey Achkasov, Ryzhikh National Medical Research Centre of Coloproctology, Russia; Dipendra Khadka, Wonkwang University School of Medicine, Republic of Korea; Qisong Zhang, Guangxi University, China

Updates

Copyright

*Correspondence: Yan Lu,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics