- 1Department of Gastroenterology, The Fourth Affiliated Hospital of Anhui Medical University, Hefei, China
- 2Department of Burns, The First Affiliated Hospital of Anhui Medical University, Hefei, China
Objectives: To develop and validate a predictive model estimating the likelihood of pathological upgrading in patients with colorectal intraepithelial neoplasia (IN).
Methods: Using data from 158 patients with colorectal IN confirmed by endoscopic biopsy, we employed Least Absolute Shrinkage and Selection Operator (LASSO) regression followed by multivariate logistic regression to identify key predictive factors. A nomogram was constructed based on the selected variables. The performance of these models was assessed using calibration curves, the area under the receiver operating characteristic curve (AUC), and the Hosmer-Lemeshow goodness-of-fit test. Furthermore, decision curve analysis (DCA) was utilized to evaluate the practical utility of the models, thereby exploring their potential clinical applications.
Results: Four variables—rectal location, surface erosion, lesion size ≥ 30 mm, and villous histology—were incorporated into the nomogram. The model demonstrated strong discrimination (AUC = 0.822; 95% CI: 0.744–0.899) and good calibration (Hosmer–Lemeshow χ2 = 1.731, p = 0.973). Internal validation yielded a consistent AUC of 0.813. DCA confirmed the model’s broad clinical utility.
Conclusion: This nomogram accurately predicts pathological upgrading in colorectal IN, allowing clinicians to identify high-risk patients early and tailor management accordingly.
1 Introduction
Colorectal cancer (CRC) is a malignant disease, ranking in the top three of the most common cancers in the world, ranking third in incidence and cancer-related mortality (1–3). The disease continues to pose a serious threat to global health. In China, the rapid economic development and the aging of the population have led to a continuous rise in the number of new cases of colorectal cancer and related deaths (4). The disease has become a long-standing urgent problem in the field of public health in China, highlighting the urgent need to make continuous progress in early screening, prevention strategies and treatment plans.
Early detection of colorectal cancer is crucial to improve the survival rate of patients. Research shows that when colorectal cancer is in the early stage and undergoing surgical treatment, the 5-year survival rate is close to 90% (5, 6). On the contrary, once the disease progresses to a late stage or metastasis occurs, the survival rate will drop below 10%. This sharp contrast highlights the life-saving significance of timely diagnosis and treatment. A large number of clinical studies further confirmed that the early identification and removal of precancerous lesions through colonoscopy can significantly reduce the incidence and mortality of colorectal cancer (7–10). For example, a long-term study by Zauber et al. showed that the colorectal cancer-related mortality rate of patients undergoing colonoscopic adenoma resection was reduced by 53% (7).
Colorectal cancer develops through a recognized adenoma-cancer sequence, in which intraepithelial tumor (IN) is a key precancerous stage (11–13). This concept highlights the clinical importance of accurately assessing the risk level of IN to guide timely intervention. Unlike other gastrointestinal malignant tumors such as gastric cancer, colorectal tumors limited to the mucosal layer rarely metastasize to regional lymph nodes. When complete endoscopic resection is achieved, such cases can usually be cured (14). However, once the lesion invades the submucosal layer, the treatment plan must include regional lymph node sweep. Therefore, it is very important to distinguish between high-grade intraepithelial tumor and early invasive cancer before surgery. Accurate preoperative evaluation lays the foundation for the selection of individualized treatment strategies.
At present, endoscopic forceps (EFB) are still the most reliable way to identify the characteristics of colorectal tumors. Colonoscopy plays an important role in detecting early lesions of colorectal cancer. However, its accuracy may be affected by a variety of factors, including insufficient tissue sampling depth, misjudgment caused by internal differences in the tumor, and differences in the interpretation of the results by pathologists (15–17). For this reason, the results of preoperative biopsy often do not match the final postoperative pathology report, which in turn affects the treatment decision-making. In addition, colonoscopy is an invasive and expensive detection method, which also requires intestinal preparation, which brings discomfort to patients. Therefore, it is urgent to develop simple and non-invasive follow-up tools.
Against this background, it is of great clinical significance to develop an accessible and low-cost tool to predict the potential progress of IN. Among the existing prediction methods, nomogram are particularly prominent as a practical and reliable method. By integrating multiple clinical variables and patient characteristics, it allows clinicians to estimate the probability of specific outcomes and supports more individualized decision-making (18). Although numerous predictive models have been developed for assessing colorectal cancer risk, only a limited number specifically address the pathological progression of IN (19–21).
This study was designed to explore a key question: whether a nomogram based on routine clinical indicators can accurately estimate the likelihood of pathological advancement in intestinal IN. We propose that integrating clinical, endoscopic, and pathological characteristics may yield a predictive tool with strong accuracy and meaningful clinical value.
2 Materials and methods
2.1 Patient selection
Data were collected on 158 patients diagnosed with IN through EFB at Chaohu Hospital, affiliated with Anhui Medical University, between January 2021 and January 2024. These patients subsequently underwent either EMR or ESD at our institution. The inclusion criteria for the study were defined as follows: (1) a preoperative diagnosis of LGIN or HGIN confirmed through biopsy via EFB; (2) completion of radical surgical resection with a biopsy-to-surgery interval of 6 months or less; (3) availability of comprehensive clinical and pathological data; (4) absence of any prior radiotherapy, chemotherapy, or other antitumor treatments before surgery; and (5) no history of malignancies outside the gastrointestinal tract. The exclusion criteria were: (1) abnormally elevated preoperative tumor markers, including CA19–9, CA125, or CEA; (2) inconsistent conclusions from independent pathological reviews conducted by two pathologists; and (3) a final postoperative pathological diagnosis that did not align with intraepithelial neoplasia, such as a diagnosis of inflammatory polyp, hyperplastic polyp, or invasive carcinoma.
2.2 Data collection
Clinical data collection was conducted using the hospital’s electronic medical record system. The data collected encompassed the following categories: (1) Patient demographics, which included gender, age, body mass index (BMI), smoking history, family history of colorectal cancer, and the presence of concomitant metabolic syndrome (MS); (2) Laboratory assessments, specifically carcinoembryonic antigen (CEA) levels measured within 1 week prior to tumor resection; (3) The quantity of biopsy specimens obtained; (4) Endoscopic characteristics, which involved the maximum tumor diameter, morphological classification as pedunculated or sessile, tumor count, anatomical location, and the presence of hyperemia/erythema, villous architecture, and ulceration. Tumor count was further categorized as either solitary or multiple, while tumor location was classified into rectum, sigmoid colon, descending colon, transverse colon, or ascending colon; (5) Categorized data regarding pathological assessment, where “pathological non-upgrade” was defined as concordant preoperative and postoperative diagnoses. Conversely, the “pathological upgrade” group was characterized by discrepancies between preoperative and postoperative diagnoses, with the latter indicating a higher-grade lesion. This specifically included two scenarios: (a) the upgrading of low-grade intraepithelial neoplasia to high-grade intraepithelial neoplasia, and (b) the transition from low-grade or high-grade intraepithelial neoplasia to a more severe pathological state.
2.3 Evaluation of pathological findings
All endoscopies were performed by experienced gastroenterologists (attending physician or above with at least 5 years of endoscopy experience). Performing a colonoscopy using a white-light endoscope. If abnormal areas (such as polyps or mucosal lesions) are detected, targeted biopsy sampling and further examination are performed using chromoendoscopy and magnifying endoscopy. FFPE biopsies are cut into sections, stained with H&E, and used for an initial diagnosis. To ensure the accuracy of pathology reports, senior pathologists review the diagnostic findings and make corrections when necessary. However, in cases of disagreement, the severity of the disease shall prevail.
2.4 Sample size estimation
The sample size was estimated using the empirical method known as events per variable (EPV) (22). Research using statistical simulations has indicated that for logistic regression, an empirical criterion of an EPV of at least 10 is recommended, meaning the number of events should be ten times the number of independent variables included to ensure robust results. The predictive model ultimately confirmed in this study comprises four predictor variables, with a total of 43 positive outcome events observed in the study cohort. Therefore, events per predictor variable is 10.75 (43/4), meeting and exceeding the empirical standard of EPV ≥ 10.
2.5 Statistical analysis
SPSS 24.0 and R 4.4.1 were used to carry out the statistical analysis. Statistical analysis involved comparing group percentages using the chi-square test or Fisher’s exact test. Continuous variables conforming to a normal distribution were expressed as the mean ± standard deviation, and comparisons between groups were conducted by a t-test; otherwise, they were expressed as the median (P25–P75), and a Mann–Whitney U test was used for intergroup comparisons. A univariate logistic regression analysis was initially employed to explore potential risk factors. Subsequently, the Lasso regression method was utilized to identify a significant combination of risk factors. Finally, a multivariate logistic regression was performed to predict pathological upgrading in colorectal IN patients, thereby constructing a predictive model and developing a nomogram. First, the Hosmer-Lemeshow test was applied to assess the model’s goodness-of-fit. Second, the ability of the model to distinguish between different outcomes was tested using the AUC. Third, a calibration curve was drawn to check how well the predicted results matched the actual outcomes. Finally, DCA was used to assess the model’s value in clinical practice. A p-value below 0.05 was considered to show statistical significance.
3 Results
3.1 Pathological findings following endoscopic therapy
Among the 158 patients who received endoscopic treatment, 43 showed pathological progression after the procedure, giving a progression rate of 27.2%. Of these patients, 111 were first diagnosed with LGIN based on biopsy results. Among the LGIN group, 24 cases showed progression—22 developed into HGIN, and 2 progressed to CRC, with a total progression rate of 21.6%. In addition, 47 patients were diagnosed with HGIN by biopsy, and 19 of them later developed CRC, resulting in a progression rate of 40.4%. The detailed data are shown in Table 1.
Table 1. Pathological upgrading in 158 patients with colorectal intraepithelial neoplasia following endoscopic treatment.
3.2 Baseline characteristic
After applying the inclusion and exclusion criteria, a total of 158 patients diagnosed with colorectal intraepithelial neoplasia by EFB were included in the study. The group consisted of 119 men (75.3%) and 39 women (24.7%). The median age of the patients was 57 years, with ages ranging from 24 to 91 years. Statistically significant differences were identified between the pathology-graded and ungraded groups concerning baseline clinical and pathological characteristics, including family history of colorectal cancer, number of biopsy specimens, maximum tumor diameter, tumor location, tumor hyperemia/erythema, tumor villous structure, and tumor ulceration (P < 0.05). Conversely, no significant differences were found between the two groups regarding gender, age, smoking history, BMI, preoperative CEA levels, presence of metabolic syndrome, pedunculated versus non-pedunculated tumors, or number of tumors (P > 0.05). Detailed findings are presented in Table 2.
3.3 Selection of variable
3.3.1 Univariate analysis of factors influencing pathological progression
Univariate analysis demonstrated that several factors, including a family history of colorectal cancer, the number of biopsy specimens, maximum tumor diameter, tumor location, presence of hyperemia, villous architecture, and ulceration, were correlated with pathological upgrade following endoscopic treatment in patients with colonic adenomatous intraepithelial neoplasia (P < 0.05). Conversely, variables such as sex, age, BMI, smoking history, metabolic syndrome (MS), preoperative CEA level, tumor morphology (pedunculated or sessile), and the number of tumors did not exhibit a significant association with pathological upgrade (P > 0.05). Comprehensive results are provided in Table 3.
3.3.2 Selection of predictor variables using LASSO regression
In this study, 158 patients with colorectal adenomas were included for analysis. The postoperative pathological grade was used as the dependent variable. LASSO regression was applied to identify the predictive factors, as shown in Figure 1A. The results of a 10-fold cross-validation, depicted in Figure 1B, indicated a λ_min value of 0.03815227 (log λ = −3.26617) and a λ_1se value of 0.08030687 (log λ = −2.5219). To keep the model both simple and accurate, λ_1se was chosen as the best penalty coefficient. Based on this criterion, four variables were identified as significant predictors: tumor location, villous architecture, surface erosion, and maximum tumor diameter.
Figure 1. (A) Coefficient profiles of the variables in the LASSO regression model. Each curve represents the trajectory of a variable coefficient as the regularization parameter (λ) changes. This figure illustrates how the coefficients shrink with increasing penalty, identifying variables with the strongest predictive contributions. (B) Selection of the optimal penalty parameter in LASSO regression via cross-validation. The plot shows the relationship between mean cross-validated error and log(λ). The dotted lines indicate λ_min (minimum error) and λ_1se (simpler model within one standard error). λ_1se was selected as the optimal penalty for balancing model simplicity and predictive performance.
3.3.3 Multivariate logistic regression analysis and identification of independent risk factors
The four variables identified through LASSO regression were subsequently incorporated into a multivariate logistic regression model, and additional variable selection was carried out using a forward stepwise approach. The results indicated that rectal tumor location, villous architecture, surface ulceration, and a maximum tumor diameter of 30 mm or greater were independent risk factors for pathological progression following endoscopic resection of colorectal adenomas (all P < 0.05) (Table 4). To better visualize the magnitude of association between each factor and disease progression, a forest plot was generated based on the multivariate logistic regression model (Figure 2).
Table 4. Multivariate logistic regression analysis of factors associated with pathological upgrading.
Figure 2. Forest plot of multivariate logistic regression analysis for risk factors of pathological upgrading in patients with colorectal intraepithelial neoplasia: - 3.521 + 1.530 × Maximum tumor diameter + 1.464 × Erosion + 0.391 × Location + 1.078 × Villus.
3.4 Nomogram for predicting pathologic escalation
To estimate the probability of disease progression in patients with colorectal adenocarcinoma, a nomogram was developed utilizing the results from a multivariate logistic regression analysis (see Figure 3). The model incorporated four predictors: tumor location, villous architecture, surface ulceration, and a maximum tumor diameter (MTD) of 30 mm or greater. Each predictor was assigned a specific score on the upper axis of the nomogram. The individual scores were aggregated to obtain a total score, which was then projected onto the total score axis. By drawing a vertical line downward from this point to the probability axis, the estimated likelihood of pathological upgrade in patients with colorectal IN could be determined. For example, a total score of 190 means the chance of a pathological upgrade is more than 50%, while a score of 300 suggests the probability is above 90%.
3.5 Model validation and evaluation
The ROC curve analysis showed that the predictive model reached an AUC of 0.822, with a 95% CI of 0.744–0.899, for predicting pathological upgrading in patients with colorectal intraepithelial neoplasia (see Figure 4A). The model had a sensitivity of 76.74% and a specificity of 74.78%. Internal validation produced an AUC of 0.813 (95% CI: 0.787–0.823) (see Figure 4B). In the calibration curve, the x-axis shows the predicted probability of pathological upgrading, and the y-axis shows the actual observed probability (see Figure 5). The diagonal line represents an ideal situation where the predicted and observed values are the same. The model is considered accurate when the calibration curve is close to this diagonal line. The Hosmer–Lemeshow test gave a chi-square value of 1.731 and a P-value of 0.973 (P > 0.05), which means the model had good fit and reliable predictions. To assess clinical usefulness, DCA was used to estimate the net benefit of using the nomogram to predict pathological progression in patients with colorectal intraepithelial neoplasia (see Figure 6).
Figure 4. (A) Receiver operating characteristic (ROC) curve of the predictive model for pathological upgrading in colorectal IN patients. The model achieved an AUC of 0.822 (95% CI: 0.744–0.899), indicating good discriminative ability. (B) ROC curve of the internal validation of the predictive model. The internal validation yielded an AUC of 0.813 (95% CI: 0.787–0.823), confirming the model’s robustness and stability.
Figure 5. Calibration curve of the predictive model for pathological upgrading. The calibration curve shows strong agreement between predicted and observed probabilities, demonstrating good model calibration (Hosmer–Lemeshow test: χ2 = 1.731, P = 0.973).
Figure 6. Decision curve analysis (DCA) of the predictive model for pathological upgrading. In this analysis, the horizontal axis denotes the high-risk threshold probability, whereas the vertical axis illustrates the net benefit contingent upon this threshold. Solid lines depict the net benefit associated with the predictive model, while dashed lines represent the “full treatment” and “full non-treatment” strategies, respectively. The decision curve analysis reveals that when the threshold probability ranges from 0 to 85%, the implementation of this predictive model for decision-making confers a superior net clinical benefit in comparison to both the “full intervention” and “full non-intervention” strategies. This finding underscores the substantial clinical utility of the model.
4 Discussion
Colorectal cancer (CRC) is a common malignant tumor of the digestive system, marked by high rates of illness and death. The outcome of CRC is closely related to how early it is diagnosed and treated. Therefore, finding CRC at an early stage is very important, as it allows timely treatment and greatly improves both survival and prognosis. IN is an important precancerous stage in the development of CRC. Accurate risk assessment of IN is a key part of clinical decision-making, as it helps determine whether a patient should undergo endoscopic removal or surgery. However, current preoperative diagnoses based on EFB often differ from final postoperative pathology results. These differences usually occur because of sampling mistakes and variations in how pathologists interpret tissue samples. Thus, there is an urgent clinical need to create a dependable tool that can predict the risk of pathological progression of IN before surgery. Such a tool would help doctors design more personalized and effective treatment plans for patients.
In order to solve this clinical dilemma, we have developed a nomogram to provide doctors with a preoperative risk assessment tool. The clinical application of the nomogram can be implemented through the following steps: first, according to the clinical characteristics of the patient, such as lesion size, morphological characteristics, pathological grade, etc., the score is determined on the corresponding variable axis; second, the scores of each variable are summed to obtain the total score; finally, the corresponding risk probability of pathological upgrading is read on the total score axis. For example, a patient with a large lesion size, specific morphological characteristics, and high-risk pathological classification may have a total score corresponding to a 65% risk of progression, suggesting that surgical treatment should be considered. This intuitive tool enables doctors to conduct individualized risk assessment before surgery, providing a quantitative basis for formulating accurate treatment plans and thus optimizing the clinical decision-making process.
The LASSO method was more effective than the stepwise selection approach in handling multicollinearity and reducing overfitting in the regression model. This study first uses univariate analysis to screen variables, and then selects variables through LASSO regression and multivariate logical regression. In the end, four variables with the strongest predictive value were retained: tumor size (≥≥ 30 mm), villi structure, ulcer and rectal site. The model based on these four prediction factors shows strong identification ability and reliable calibration. Overall, these findings have important clinical value, indicating that the model can be used as a practical tool for identifying patients at high risk of disease progression.
This study analyzed 158 patients with colorectal intraepithelial tumor, of which 43 cases were pathologically upgraded, with an overall progression rate of 27.2%. Among the patients who were first diagnosed with LGIN, 24 cases (21.6%) showed progress; while among the patients with HGIN, 19 cases (40.4%) had further deterioration. These observations highlight the malignant potential of HGIN. As a key stage of the adenoma-cancer sequence, the molecular and genetic changes of HGIN often foreshadow the early initiation of malignant lesions. Once such lesions are confirmed, regardless of the endoscopic performance, they should be regarded as “cancerous cancer that is about to worsen” and should be removed in time through surgery. In contrast, although the overall upgrade rate of LGIN is low, its progress is not random. If the lesion is accompanied by high-risk characteristics such as surface erosion, large diameter or located in the rectum, the risk of upgrading will increase significantly. Therefore, clinicians should attach the same importance to LGIN lesions that conform to the above characteristics under endoscopy, rather than underestimating the risk based on baseline pathological grading alone. This discovery helps to accurately identify high-risk individuals in heterogeneous LGIN populations, thus optimizing clinical treatment decisions.
Many studies have shown that EFB alone has limited diagnostic accuracy and often fails to reflect the true histological type of the lesion. For lesions measuring 10–20 mm and 20–40 mm, the mismatch rates between biopsy results and final pathology were 35 and 37%, respectively. A small study by Pugliese et al. (23) also found that differences between EFB and postoperative pathology increased as lesion size grew. Therefore, for larger lesions, endoscopists should be more cautious about the possibility of pathological upgrading after surgery and recognize the limited reliability of EFB results. In line with previous research, this study found that a MTD of 30 mm or more was an independent risk factor for higher grades of intraepithelial neoplasia. Adenocarcinomas usually have mixed histological structures (24). As lesions become larger, the chance of obtaining enough cancerous tissue through forceps biopsy decreases, which reduces the accuracy of EFB samples. Thus, larger tumors are more likely to produce unrepresentative biopsy specimens. In addition, the risk of malignant transformation also increases with tumor size, which may further explain why larger lesions are linked to higher pathological grades.
Early studies have shown that adenomas with villous components, especially those containing more than 25% villous tissue, carry a much higher risk of progressing to invasive carcinoma or high-grade dysplasia after surgery compared with other types of adenomas (25). Villous structures are often underestimated in biopsy samples because of sampling errors. They also show stronger biological activity and higher cancer potential. Therefore, complete removal and close follow-up of these lesions are strongly recommended. At the molecular level, villous adenomas share more features with carcinomas. They often contain driver mutations such as APC and show genomic instability, such as a high homologous recombination deficiency (HRD) score, which supports their tendency for fast malignant change (26). In addition, about 60% of villous adenomas express vascular endothelial growth factor (VEGF), which may promote blood vessel growth and tumor spread (27). Studies have also found that villous adenomas have the lowest 5-year cancer-specific survival rate (74.1%), compared with 84.2% for tubular adenocarcinoma and 81.5% for tubulovillous adenocarcinoma, suggesting a stronger tendency toward malignant transformation (28).
In endoscopic operations, it is very important to accurately identify and completely remove colorectal lesions accompanied by erosion or ulcers. Theoretically, the erosion area is more prone to malignancy due to hypoxic microenvironment (HIF-1α↑, p53 mutation), and the surface necrotic tissue is prone to biopsy errors (29). Studies have shown that there is a link between colorectal cancer and the occurrence of tumor erosion ulcers, or even an independent link (30, 31). The research of foreign scholar Hong Junbo found that the escalation rate of tumor erosion is as high as 42.6%, and the probability of pathological escalation of tumor erosion is 7.12 times that of non-erosion (OR = 7.12; 95% CI, 3.91–12.94; P < 0.001) (32). For high-risk erosion lesions, accurate identification and thorough resection under endoscopy is the core link to prevent pathological escalation, avoid follow-up risks and guide correct clinical decision-making.
From an anatomical point of view, the rectum and sigmoid colon are more prone to chronic inflammation, and long-term exposure to fecal carcinogens (33). These areas repeatedly undergo the cycle of mucosal damage, regeneration and repair, which increases the risk of DNA replication errors, which leads to genetic mutations that promote tumor development (34, 35). In addition, the abundant blood vessels and lymphatic networks in this area enable the accumulation of inflammatory mediators and growth factors, forming a microenvironment conducive to tumor transformation. The size of the tumor (especially the maximum diameter) is also recognized as an important determinant of malignant progression. The research of Hong Junbo and others shows that the rectum is an independent risk factor for the atypical progression of appendicitis, and the risk of abnormal progression of distal adenoma is 3.29 times higher than that of proximal lesions (P < 0.01) (32). Consistent with the above findings, our research results show that the risk of malignant transformation of rectal adenomas is 1.48 times higher than that of adenomas in other parts.
This research has made several notable contributions to this field. It proposes a new predictive model aimed at assessing the possibility of pathological progression in patients with IN. The model serves as a practical tool that may assist clinicians in identifying high-risk individuals early, thereby improving intervention timing and potentially reducing mortality associated with colorectal cancer. Unlike most existing models that are mainly dedicated to predicting the risk of colorectal cancer, this study focusses on the transition stage represented by IN. By targeting this intermediate stage, the model fills an important gap in the current risk stratification system. Another advantage of the model is that it only relies on conventional clinical variables, so that its explanation is concise and suitable for daily clinical practice. The effectiveness of the model was evaluated by calibration curve, ROC analysis and decision curve analysis. The results showed that the model showed strong clinical practicality, satisfactory accuracy and reliable identification performance in different risk categories.
There are some significant limitations in this study. First of all, it adopts a single-center retrospective analysis design, which may introduce the risk of selection bias and measurement bias. Although the single-center design has advantages, such as maintaining the consistency of the patient’s demographic characteristics, endoscopic operation and diagnostic standards, so as to ensure that the data is more uniform. However, this feature also limits the universality of the model, and it is difficult to promote it to other hospitals, populations or clinical environments. Another limitation lies in the retrospective characteristics of the data itself. Because the research relies on existing medical records, some variables are missing or observer bias, which may affect the overall reliability of the model. Although techniques such as LASSO regression and internal self-lifting verification are used to minimize the fit, the model has not been evaluated by an independent external data set. Therefore, additional verification through large-scale multi-center prospective research is crucial to confirm its wide applicability and predictive robustness.
Looking forward to the future, several directions can further strengthen this research. The next stage will focus on multi-center prospective research to realize the external verification of the model. By collecting different levels of hospital and cross-regional data, we aim to rigorously evaluate the real generalization ability of the model. At the same time, we will focus on developing online prediction tools or integrating the model into the hospital’s electronic health record system as an integral part of the clinical decision-making support platform. Such integration will realize the real-world performance verification of the model, assist clinicians to more accurately assess preoperative risks and formulate personalized treatment plans. In addition, the use of prospective research design can realize structured postoperative follow-up, so as to evaluate the impact of the model on the long-term prognosis and the overall course of the disease. This method is expected to eventually bridge the gap between predictive modeling and clinical practice application.
5 Conclusion
This study analyzed a number of clinical pathological data of patients with colorectal endoepithelial tumor, including the number of biopsy specimens, laboratory indicators, endoscopic characteristics and basic demographic information. Based on these variables, a prediction model is constructed to evaluate the probability of pathological progression. The study identified four independent risk factors: tumor site, mucosal erosion, focal diameter ≥ 30 mm, and villi histological composition. Based on these predictive indicators, we have developed predictive charts to assist clinicians in assessing the risk of disease progression more accurately. This visualization tool can realize the early identification of high-risk patients, provide a basis for the selection of appropriate treatment plans, and help improve the effectiveness of surgery. By guiding timely intervention, it may also help decrease the overall incidence of colorectal cancer and related mortality.
Data availability statement
The original contributions presented in the study are included in this article/supplementary material, further inquiries can be directed to this corresponding author.
Ethics statement
The studies involving humans were approved by Chaohu Hospital Ethics Committee at Anhui Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this study did not involve any patient’s private information.
Author contributions
CZ: Writing – original draft, Investigation, Methodology, Software. LL: Supervision, Writing – review & editing. SW: Formal analysis, Software, Visualization, Writing – original draft. SJ: Resources, Validation, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgments
The authors thank all the doctors and patients who provided data to support this study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Baidoun F, Elshiwy K, Elkeraie Y, Merjaneh Z, Khoudari G, Sarmini M, et al. Colorectal cancer epidemiology: recent trends and impact on outcomes. Curr Drug Targets. (2021) 22:998–1009. doi: 10.2174/1389450121999201117115717
2. Patel S, Karlitz J, Yen T, Lieu C, Boland C. The rising tide of early-onset colorectal cancer: a comprehensive review of epidemiology, clinical features, biology, risk factors, prevention, and early detection. Lancet Gastroenterol Hepatol. (2022) 7:262–74. doi: 10.1016/S2468-1253(21)00426-X
3. Sinicrope F. Increasing incidence of early-onset colorectal cancer. N Engl J Med. (2022) 386:1547–58. doi: 10.1056/NEJMra2200869
4. Yang Y, Han Z, Li X, Huang A, Shi J, Gu J. Epidemiology and risk factors of colorectal cancer in China. Chin J Cancer Res. (2020) 32:729–41. doi: 10.21147/j.issn.1000-9604.2020.06.06
5. Miller K, Siegel R, Lin C, Mariotto A, Kramer J, Rowland J, et al. Cancer treatment and survivorship statistics, 2016. CA Cancer J Clin. (2016) 66:271–89. doi: 10.3322/caac.21349
6. Duan B, Bai J, Qiu J, Wang J, Tong C, Wang X, et al. Histone-lysine N-methyltransferase Setd7 is a potential serum biomarker for colorectal cancer patients. EBioMedicine. (2018) 37:134–43. doi: 10.1016/j.ebiom.2018.10.036
7. Zauber A, Winawer S, O’Brien M, Lansdorp-Vogelaar I, van Ballegooijen M, Hankey B, et al. Colonoscopic polypectomy and long-term prevention of colorectal-cancer deaths. N Engl J Med. (2012) 366:687–96. doi: 10.1056/NEJMoa1100370
8. Pan J, Xin L, Ma Y, Hu L, Li Z. Colonoscopy reduces colorectal cancer incidence and mortality in patients with non-malignant findings: a meta-analysis. Am J Gastroenterol. (2016) 111:355–65. doi: 10.1038/ajg.2015.418
9. Xie Y, Gao Q, Cai G, Sun X, Zou T, Chen S, et al. Fecal Clostridium symbiosum for noninvasive detection of early and advanced colorectal cancer: test and validation studies. EBioMedicine. (2017) 25:32–40. doi: 10.1016/j.ebiom.2017.10.005
10. Zhu J, Chen G, Li Z, Peng Z, Xu L, Dong G, et al. Colonoscopic screening is associated with reduced colorectal cancer incidence and mortality: a systematic review and meta-analysis. J Cancer. (2020) 11:5953–63. doi: 10.7150/jca.46661
11. Rafeeq A, Sheikh J, Wani A, Owais A, Malik A, Sofi S, et al. Emerging trends in colorectal cancer: dysregulated signaling pathways. Int J Mol Med. (2021) 47:82. doi: 10.3892/ijmm.2021.4847
12. Zheng X, Song J, Yu C, Zhou Z, Liu X, Yu J, et al. Single-cell transcriptomic profiling unravels the adenoma-initiation role of protein tyrosine kinases during colorectal tumorigenesis. Signal Transduct Target Ther. (2022) 7:60. doi: 10.1038/s41392-022-00881-8
13. Sedlak J, Yilmaz ÖH, Roper J. Metabolism and colorectal cancer. Annu Rev Pathol. (2023) 18:467–92. doi: 10.1146/annurev-pathmechdis-031521-041113
14. Choi J, Jung S, Shim K, Cho W, Keum B, Byeon J, et al. Meta-analysis of predictive clinicopathologic factors for lymph node metastasis in patients with early colorectal carcinoma. J Korean Med Sci. (2015) 30:398–406. doi: 10.3346/jkms.2015.30.4.398
15. Lim H, Jung H, Park Y, Na H, Ahn J, Choi J, et al. Discrepancy between endoscopic forceps biopsy and endoscopic resection in gastric epithelial neoplasia. Surg Endosc. (2014) 28:1256–62. doi: 10.1007/s00464-013-3316-6
16. Hwang M, Kim K, Kim A, Lee S, Jang B, Kim T. Histologic discrepancy between endoscopic forceps biopsy and endoscopic mucosal resection specimens of colorectal polyp in actual clinical practice. Intest Res. (2018) 16:475–83. doi: 10.5217/ir.2018.16.3.475
17. Park Y, Kim G, Park D, Lee S, Lee M, Lee B, et al. Histopathologic discrepancies between endoscopic forceps biopsy and endoscopic resection specimens in superficial esophageal squamous neoplasms. J Gastroenterol Hepatol. (2019) 34:1058–65. doi: 10.1111/jgh.14571
18. Wu H, Huan C, Hu Y, Xiao S, Xu T, Guo M, et al. Development and validation of a nomogram for predicting all-cause mortality in patients with hemodialysis having pulmonary hypertension. Cardiorenal Med. (2023) 13:282–91. doi: 10.1159/000533674
19. Cueto-López N, García-Ordás M, Dávila-Batista V, Moreno V, Aragonés N, Alaiz-Rodríguez R. A comparative study on feature selection for a risk prediction model for colorectal cancer. Comput Methods Programs Biomed. (2019) 177:219–29. doi: 10.1016/j.cmpb.2019.06.001
20. Zheng Y, Hua X, Win A, MacInnis R, Gallinger S, Le Marchand L, et al. A new comprehensive colorectal cancer risk prediction model incorporating family history, personal characteristics, and environmental factors. Cancer Epidemiol Biomarkers Prev. (2020) 29:549–57. doi: 10.1158/1055-9965.EPI-19-0929
21. Xu J, Wang Y, Li X, Shao Y, Han Z, Zhang J, et al. Prediction model using readily available clinical data for colorectal cancer in a Chinese population. Am J Med Sci. (2022) 364:59–65. doi: 10.1016/j.amjms.2022.01.011
22. Riley R, Ensor J, Snell K, Harrell F, Martin G, Reitsma J, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. (2020) 368:m441. doi: 10.1136/bmj.m441
23. Pugliese V, Gatteschi B, Aste H, Nicolò G, Munizzi F, Giacchero A, et al. Value of multiple forceps biopsies in assessing the malignant potential of colonic polyps. Tumori. (1981) 67:57–62. doi: 10.1177/030089168106700111
24. Ching V, Hay K, Hui I, Vandeleur A, Har P, Rahman T, et al. Diagnostic yield of upper gastrointestinal tract endoscopy and colonoscopy in patients with iron deficiency anaemia while on direct oral anticoagulants. Intern Med J. (2025) 55:467–73. doi: 10.1111/imj.16616
25. Jiang Y, Wang J, Chen Y, Sun H, Dong Z, Xu S. Discrepancy between forceps biopsy and resection in colorectal polyps: a 1686 paired screening-therapeutic colonoscopic finding. Ther Clin Risk Manag. (2022) 18:561–9. doi: 10.2147/TCRM.S358708
26. Yang C, Xiang E, Chen P, Fang X. Evolutionary history of adenomas to colorectal cancer in FAP families. Front Genet. (2024) 15:1391851. doi: 10.3389/fgene.2024.1391851
27. Balasubramanian S, Priyathersini N, Johnson T. Expression of vascular endothelial growth factor (VEGF) in colorectal adenoma and carcinoma in a tertiary care center. Cureus. (2022) 14:e31393. doi: 10.7759/cureus.31393
28. Emile S, Garoufalia Z, Wignakumar A, Wexner S. Cancer-specific survival of colorectal adenocarcinomas according to the type of pre-existing adenoma: a surveillance, epidemiology, and end results registry analysis. Surgery. (2025) 184:109468. doi: 10.1016/j.surg.2025.109468
29. Wu W, Hu Z, Zhao Q, Zhang X, Zhang H, Wang H, et al. Down-regulation of hypoxia-inducible factor-1α and downstream glucose transporter protein-1 gene by β-elemene enhancing the radiosensitivity of lung adenocarcinoma transplanted tumor. Onco Targets Ther. (2020) 13:11627–35. doi: 10.2147/OTT.S275956
30. Yu L, Li N, Zhang X, Wang T, Chen W. Analysis of 234 cases of colorectal polyps treated by endoscopic mucosal resection. World J Clin Cases. (2020) 8:5180–7. doi: 10.12998/wjcc.v8.i21.5180
31. Tanaka S, Saitoh Y, Matsuda T, Igarashi M, Matsumoto T, Iwao Y, et al. Evidence-based clinical practice guidelines for management of colorectal polyps. J Gastroenterol. (2021) 56:323–35. doi: 10.1007/s00535-021-01776-1
32. Hong J, Wang Y, Deng J, Qi M, Zuo W, Hao Y, et al. Potential factors predicting histopathological upgrade discrepancies between endoscopic forceps biopsy of colorectal low-grade intraepithelial neoplasia and endoscopic resection specimens. Biomed Res Int. (2022) 2022:1915458. doi: 10.1155/2022/1915458
33. Lam A, Duloy A, Keswani R. Quality indicators for the detection and removal of colorectal polyps and interventions to improve them. Gastrointest Endosc Clin N Am. (2022) 32:329–49. doi: 10.1016/j.giec.2021.12.010
34. Dong J, Ma T, Xu Y, Li P, Chen W, Tu J, et al. Characteristics and potential malignancy of colorectal juvenile polyps in adults: a single-center retrospective study in China. BMC Gastroenterol. (2022) 22:75. doi: 10.1186/s12876-022-02151-x
Keywords: predictive model, nomogram, colorectal intraepithelial neoplasia, pathological upgrade, risk factor
Citation: Zhang C, Lu L, Wu S and Jin S (2025) Establishing a nomogram on the risk of pathological escalation of intestinal intraepithelial neoplasia in patients with colorectal intraepithelial neoplasia: a retrospective study. Front. Med. 12:1670165. doi: 10.3389/fmed.2025.1670165
Received: 05 August 2025; Revised: 11 November 2025; Accepted: 17 November 2025;
Published: 28 November 2025.
Edited by:
Zhuang Aobo, Xiamen University, ChinaReviewed by:
Ivan Šoša, University of Rijeka, CroatiaSheng Zheng, Third People’s Hospital of Yunnan Province, China
Copyright © 2025 Zhang, Lu, Wu and Jin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Liang Lu, ZW1haWxAdW5pLmVkdToxOTg2NzY0NTY4QHFxLmNvbQ==
Liang Lu1*