Using machine learning to predict lymph node metastasis in patients with renal cell carcinoma: A population-based study

Zhang, Yuhan; Yi, Xinglin; Tang, Zhe; Xie, Pan; Yin, Na; Deng, Qiumiao; Zhu, Lin; Luo, Hu; Peng, Kanfu

doi:10.3389/fpubh.2023.1104931

ORIGINAL RESEARCH article

Front. Public Health, 24 March 2023

Sec. Digital Public Health

Volume 11 - 2023 | https://doi.org/10.3389/fpubh.2023.1104931

This article is part of the Research TopicDigital Health Quality, Acceptability, and Cost: Steps to Effective Continuity of Cancer CareView all 7 articles

Using machine learning to predict lymph node metastasis in patients with renal cell carcinoma: A population-based study

Yuhan Zhang¹

Xinglin Yi²

Zhe Tang¹

Pan Xie¹

Na Yin¹

Qiumiao Deng¹

Lin Zhu¹

Hu Luo²^*

Kanfu Peng¹^*

¹Department of Nephrology, Third Military Medical University Southwest Hospital, Chongqing, China
²Department of Respiratory Medicine Center, Third Military Medical University Southwest Hospital, Chongqing, China

Background: Lymph node (LN) metastasis is strongly associated with distant metastasis of renal cell carcinoma (RCC) and indicates an adverse prognosis. Accurate LN-status prediction is essential for individualized treatment of patients with RCC and to help physicians make appropriate surgical decisions. Thus, a prediction model to assess the hazard index of LN metastasis in patients with RCC is needed.

Methods: Partial data were extracted from the Surveillance, Epidemiology, and End Results (SEER) database. Data of 492 individuals with RCC, collected from the Southwest Hospital in Chongqing, China, were used for external validation. Eight indicators of risk of LN metastasis were screened out. Six machine learning (ML) classifiers were established and tuned, focused on predicting LN metastasis in patients with RCC. The models were integrated with big data analytics and ML algorithms. Based on the optimal model, we developed an online risk calculator and plotted overall survival using Kaplan–Meier analysis.

Results: The extreme gradient-boosting (XGB) model was superior to the other models in both internal and external trials. The area under the curve, accuracy, sensitivity, and specificity were 0.930, 0.857, 0.856, and 0.873, respectively, in the internal test and 0.958, 0.935, 0.769, and 0.944, respectively, in the external test. These parameters show that XGB has an excellent ability for clinical application. The survival analysis showed that patients with predicted N1 tumors had significantly shorter survival (p < 0.0001).

Conclusion: Our study shows that integrating ML algorithms and clinical data can effectively predict LN metastasis in patients with confirmed RCC. Subsequently, a freely available online calculator (https://xinglinyi.shinyapps.io/20221004-app/) was built, based on the XGB model.

1. Introduction

Renal cell carcinoma (RCC) roughly comprises 90% of kidney cancer and the incidence of RCC is currently increasing (1). According to a recent report, there were nearly 400,000 new cases and 170,000 kidney cancer-related deaths worldwide in 2018 (2). Clear-cell RCC is the main type of RCC; other subtypes include papillary RCC and chromophobe RCC (3).

Although, with the gradual improvement in imaging detection, the clinical diagnosis and treatment rate of RCC are increasing annually, metastatic RCC is associated with poor prognosis (4, 5). Numerous previous studies have shown that in RCC, lymph node (LN) metastasis is strongly associated with distant metastasis. Clinically, accurate LN status assessment can improve the diagnosis and treatment of RCC (6, 7). Babaian et al. (8) noted that LN dissection (LND) can improve the 5-year survival rate of patients with LN involvement, suggesting that patients with renal cancer undergoing LND may have better survival. LN infiltration is one of the most important predictors of tumor progression and mortality (9). Therefore, it is imperative to investigate the strong predictors of LN metastasis and accurately identify at-risk patients who require LND.

Unclear imaging findings and low intraoperative positive biopsy rates have delayed the diagnosis and identification of early LN metastasis in RCC, thus limiting the therapeutic effect of LND (10). Although computed tomography (CT) and magnetic resonance imaging (MRI) can detect LNs ~1 cm in diameter, enlarged LNs are not necessarily a sign of metastasis. The specificity of using only a single imaging method to diagnose LN metastasis is poor. Published literature suggests that use of a combination of MRI and F-fluoro-2-deoxyglucose positron emission tomography (FDG-PET) may provide the highest accuracy, however, the capacity to perform such imaging is limited by the high cost and the limited availability of the required equipment (6). Therefore, a reliable and accurate predictive tool for the screening of high-risk populations and risk evaluation of LN metastasis in RCC is urgently required. To date, few studies have assessed the predictors of LN metastasis in patients with RCC.

The Surveillance, Epidemiology, and End Results (SEER) database was developed by the US National Cancer Institute and is available for open access analysis. We aimed to build a reliable and accurate tool based on machine learning (ML) algorithms using an extensive number of patients with RCC from the SEER database and the Southwest Hospital in Chongqing, China, for use in screening patients at high risk of developing LN metastasis.

2. Materials and methods

2.1. Patient information

Patient information was obtained from the SEER research database, which is widely used for the analysis of clinical cancer databases worldwide. Information on LN status was obtained from records of SEER database variables based on the seventh edition of the American Joint Committee on Cancer (AJCC) tumor-node-metastasis (TNM) staging; The records had sufficient data on imaging and pathological findings to enable LN status to be assessed. The inclusion criterion was patients in the SEER database with histologically diagnosed kidney cancer diagnosed from 2010 to 2017. Patients with any of the following exclusion criteria were excluded: (1) age younger than 20 years, (2) unknown tumor laterality, (3) unknown tumor size, (4) unknown TNM stage, (5) more than one primary tumor, or (6) unknown tumor grade.

As a result, 52,199 eligible patients were enrolled in this retrospective cohort study. Subsequently, we randomly divided the data of these patients into a training set (N = 36,539) and an internal test set (N = 15,660). Based on the training cohort, we constructed a predictive model by combining clinicopathological variables and the 7th TNM classification of the AJCC. In addition, the data of 492 patients from the Southwest Hospital in Chongqing, China were utilized as the external validation cohort to further validate the applicability of ML models. The process of selecting data for the study is shown in Figure 1. Three investigators collated the data, of whom two were responsible for data extraction, and the other investigator performed accuracy checks.

FIGURE 1

Figure 1. Patient selection flowchart.

2.2. Data preprocessing and feature selection

All common features, including age, sex, tumor laterality, TNM stage, tumor size, tumor histology, race, and tumor grade, included in the analysis were reviewed by clinicians and filtered using the SEERStat software (8.4.0.1 version). Age was classified in four categories: <50, 50–59, 60–69, and ≥70 years. TNM stage was classified according to the 7th AJCC TNM classification. Based on the World Health Organization classification scheme, histological categories included the following: 8120, transitional cell carcinoma; 8,255, adenocarcinoma with mixed subtypes; 8,260, papillary adenocarcinoma; 8,310, clear-cell adenocarcinoma; 8,312, RCC; and 8,317, chromophobe renal carcinoma and other rare subtypes. The chi-squared test was applied to evaluate the differences between categorical variables, and the t-test was applied to analyze the differences between continuous variables. Univariable logistic regression (LR) analysis was performed to evaluate the risk factors for predicting LN metastasis in the training cohort of patients with RCC. Statistical significance was set at p < 0.05. The odds ratio (OR) was calculated using backward stepwise selection with the 95% confidence interval (CI). All statistical analyses were performed using R software version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria).

2.3. Model development and performance assessments

ML algorithms, a form of artificial intelligence, generally transcend traditional regression methods in predicting outcomes. The following six ML algorithms were used to build the models: LR, extreme gradient-boosting (XGB), random forest, support vector machine, artificial neural network (ANN), and decision tree models. The area under the receiver operating characteristic (ROC) curve (AUC), accuracy, sensitivity, and specificity were included in the study to assess the predictive power of each ML algorithm. The decision clinical analysis (DCA) curve and clinical utility curve (CUC) were utilized to examine clinical applicability. Furthermore, based on the best-performing model, we built a web-based online calculator and used the Kaplan–Meier method to predict the overall survival (OS) and to compare the survival outcome between patients with predicted N1 and N0 tumors.

2.4. Correlation analysis and variable importance

After features were screened out using univariable LR, we performed a Spearman correlation analysis to evaluate the relevant degree among these variables. The relevant index included three levels: 0–0.4, low; 0.4–0.7, intermediate; and ≥0.7, high. The relative variable importance was ranked using the permutation method in the best-performing model.

3. Results

3.1. Clinical characteristics of patients

After screening, 52,691 patients were enrolled, and 10 variables were included in this study. There were no significant differences between the training and internal validation cohorts in any of these characteristics (Table 1). Owing to geographic and ethnic differences and the limited sample size, there were statistically significant differences between the external validation cohort and training set, in most variables except for sex and N stage. In addition, there were no statistically significant differences in variable distribution between patients with and without LN metastases (Table 2).

TABLE 1

Table 1. Characteristics in the training, internal test, and external test cohorts.

TABLE 2

Table 2. Characteristics of the patients presenting with and without lymph node metastases.

3.2. Univariable and multivariable logistic regression analysis

Eight risk factors associated with LN metastases, including age, sex, tumor laterality, T stage, tumor size, tumor histology, tumor grade, and M stage, were identified using univariable LR analysis (Table 3). In this study, ML algorithms utilized these risk factors to develop six available models. In the multivariable LR analysis, age (50–59 years) and tumor laterality (right) were dependent protective factors for LN metastasis, and T stage (T2, T3, T4), M stage (M1), tumor grade (III, IV), and tumor histology of transitional cell carcinoma were independent risk factors for LN metastasis.

TABLE 3

Table 3. Univariable and multivariable logistic regression analyses of risk factors for LN metastasis in patients with RCC.

3.3. Correlation analysis

The correlations between the variables selected as predictors were analyzed and visualized in a heatmap (Figure 2) using Spearman’s rank correlation coefficient. Among these, T stage, tumor size, tumor grade, and M stage were associated with N stage. However, none of the variables showed a strong linear relationship.

FIGURE 2

Figure 2. Heatmap of the correlation of patients’ clinical and pathological features.

3.4. Performance of the machine learning algorithms

To improve the stability and determine the optimal hyperparameters of the six ML algorithm models, a 10-fold cross-validation method was applied in this study. As shown in Figure 3, the XGB model had the best performance on ROC curve analysis (AUC = 0.931, Std = 0.002) of the six algorithms tested, and the ANN model also showed good performance (AUC = 0.930, Std = 0.002). The results of the comprehensive review process are shown in Table 4. XGB was superior to the other models in both internal and external trials. The AUC, accuracy, sensitivity, and specificity were 0.930, 0.857, 0.856, and 0.873, respectively, in the internal test and 0.958, 0.935, 0.769, and 0.944, respectively, in the external test. The ROC curves of the internal and external tests are shown in Figure 4. In the performance comparison of the ML algorithms, an AUC closer to 1 indicates that the model is better than models with lower AUCs. Therefore, we selected the XGB model as the final prediction model. We then ranked the importance of predictor variables for the XGB model using a permutation test (Figure 5). The M stage, tumor size, T stage, and tumor grade all had a marked impact on the forecast results. The DCA curve showed that XGB had the highest clinical applicability (Figure 6), which indicates that using the model could help clinicians to identify which patients with RCC may have LN metastases. The probability density plot (Figure 7A) depicting predictive distribution showed that the AUC was highest when the predictive score was 0.045. The CUC plot (Figure 7B) also showed good clinical applicability.

FIGURE 3

Figure 3. Ten-fold cross-validation of the receiver operating characteristic curves of the six machine learning models in the training cohort.

TABLE 4

Table 4. Predictive performance of the algorithms’ internal and external tests.

FIGURE 4

Figure 4. Receiver operating characteristic curves of the six algorithms in the internal (A) and external tests (B).

FIGURE 5

Figure 5. Feature importance ranks in extreme gradient boosting.

FIGURE 6

Figure 6. Decision clinical analysis curves of algorithms in the training (A), internal test (B), and external test (C) sets.

FIGURE 7

Figure 7. Probability density plot (A) and the clinical impact curve (B) of extreme gradient-boosting model.

3.5. Web-based calculator

A web-based online predictor was built based on the most predictive XGB algorithm to enable clinicians to predict the risk of LN metastasis in patients with RCC.¹ The calculator was easy to use, and physicians were able to enter variables in the option box to calculate the probability of developing LN metastasis for each patient with RCC (Figure 8). The result was automatically presented by clicking the “predict” button. The calculator was published online and can be found at: see text footnote 1.

FIGURE 8

Figure 8. Web-based calculator created using the extreme gradient-boosting algorithm.

3.6. Survival analysis

We analyzed the survival outcome according to the XGB predictive results. Survival analysis using the Kaplan–Meier method showed that the XGB model had fine discriminative ability for predicting OS (p < 0.0001; Figure 9). The survival analysis showed that patients with predicted N1 had a significantly shorter survival time (p < 0.0001).

FIGURE 9

Figure 9. The Kaplan–Meier curve of overall survival comparing patients with N1 and N0 lymph node status based on the extreme gradient-boosting (XGB) model.

4. Discussion

LN metastasis is the main metastatic pathway of RCC. Moreover, several studies have confirmed that patients with RCC with LN metastasis are more likely to develop distant metastasis (11). Once the lesion spreads to a distance, the 5-year survival rate decreases to ~10% (12). Surgical resection is still the primary treatment for RCC owing to its insensitivity to chemotherapy and radiotherapy. Recently, the value of LND in patients with RCC has been a topic of academic debate. Although some studies have not shown a benefit of LND, most studies suggest that LND promotes accurate tumor staging, improves the prognosis, and prolongs OS in patients with RCC (13–15). The 2018 European Association of Urology guidelines for RCC, state that patients with insidious involvement of lymph nodes (cN0/pN1 cM0) who undergo LND have improved prognosis and survival (16, 17). A clinical study revealed that some patients with RCC who underwent isolated LN resection, had good long-term survival (18). However, these recommendations are not supported by strong evidence, and previous studies have not identified which patients derive the greatest benefit from LND owing to the uncertainty of LN metastasis in renal cancer, which often make clinicians face difficulties in selecting the surgical method and scope.

Clinically, the diagnosis of LN metastases using CT or MRI is often difficult. Imaging examination has high specificity but relatively low sensitivity in the identification of LN metastasis in patients with RCC (19). The diameter of a normal LN is usually 1 cm as the upper limit of clinical imaging, but there are often undetected micrometastases (20). Therefore, a convenient tool to accurately predict LN metastasis is urgently required to guide LND. The resection of positive LNs is thought to be a direct therapy method that effectively blocks future LN metastasis.

Currently, precision medicine is summarized into four concepts: predictive, personalized, preventive, and participatory. Integrating big data with ML algorithms is becoming a clinical necessity. Recent large-scale studies have selected this novel and efficient means to investigate the clinical issues related to the early diagnosis and treatment of cancer metastasis; however, few studies have attempted to evaluate the risk of LN metastasis in patients with confirmed RCC using ML methods (21, 22). For example, Pin et al. (23) reported that patient age at surgery, largest tumor diameter, presence of preoperative symptoms, cT stage, cN stage, and serum biomarkers were associated with LN metastases in patients with RCC. Similarly, Wenle et al. (24) showed that a nomogram-based prediction method based on quantifying the risk of LN invasion in patients with RCC may have practical clinical application. Compared with traditional statistical methods, ML can construct an optimal mathematical model and constantly adjust the model parameters to effectively prevent overfitting. Therefore, in this study, six ML algorithms were developed and validated to predict LN metastasis in patients with RCC. Among them, the XGB model showed outstanding performance in both the internal and external validations, with an AUC of 0.930 and 0.958 in the internal and external tests, respectively. The DCA curve and CUC also showed good applicability.

Our analysis confirmed that the M stage, tumor size, T stage, and tumor grade were the four most important determinants in forecasting the results, which is consistent with the results of previous studies (10, 25, 26). Umberto et al. (10). found that the M stage at diagnosis and tumor size were the most important independent predictors of LN metastases. In addition, patients with RCC with a tumor diameter > 7 cm (cT2a or higher) have been shown to have a significantly increased risk of LN progression (25). Several researchers have found that LN metastases in patients with RCC are dependent on many factors, but especially on tumor size and tumor grade. Pantuck et al. (26) found that only 6% of tumors with Fuhrman grades 1–2 had LN involvement, and 26% of tumors with high Fuhrman grade had LN involvement. In patients with RCC, LN metastasis is more common in men than in women. Smoking and alcohol consumption are well-established risk factors in RCC (27) and are associated with a relatively high incidence of LN metastasis in men with RCC. A novel finding of this study is that patients with renal cancer whose tumor site was on the left side had a higher risk of LN metastases. A retrospective study using two different national databases showed that patients with left-sided RCC were more likely to have a higher tumor grade, LN positivity, and distant metastasis than those with right-sided disease (28). The spread patterns of the lymphatic system may have contributed to these differences. The effect of the side of the tumor on prognosis warrants further in-depth study.

This study has several strengths. Based on a large sample size and internal and external validations, our study ensured credibility and authenticity. In addition, the survival outcomes of the N1 group were worse than those of the N0 group, which is consistent with the results of previous studies. Several studies have shown that 5-year survival rates are often <40% for patients with LN involvement (18, 29). This further indirectly verified the reliability of the prediction results of this study. Nevertheless, this study also has some limitations. First, the use of retrospective data may have led to data bias. Second, this study did not include specific biochemical indicators. Although this avoids the effects of differences in levels of testing at different institutions, some specific biochemical parameters that were omitted warrant further investigation. In addition, owing to the inevitable differences in the level of diagnosis and treatment in different countries or regions, the external validation cohort only included Chinese patients, and so the generalizability to other countries is unclear. However, our study represents an important step forward in developing a model to predict the risk of LN metastasis in patients with confirmed RCC.

In conclusion, our study showed that integrating ML algorithms and clinical data can effectively predict LN metastasis in patients with RCC. Subsequently, a freely available online calculator (see text footnote 1) based on the XGB model was built to quantify the risk of LN metastasis in patients with RCC conveniently and accurately.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by Ethics Committee of the First Affiliated Hospital of PLA Army Medical University. The patients/participants provided their written informed consent to participate in this study.

Author contributions

YZ and XY: conceptualization. KP: methodology and project administration. XY: software. YZ: validation, writing—original draft preparation, and writing—review and editing. PX: formal analysis. ZT: investigation. NY: resources. LZ: data curation. QD: visualization. HL: supervision. All authors contributed to the article and approved the submitted version.

Funding

The work was funded by the National Natural Science Foundation of China (authorization number: 81873606).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

1. ^https://xinglinyi.shinyapps.io/20221004-app/

References

1. Gray, RE, and Harris, GT. Renal cell carcinoma: diagnosis and management. Am Fam Physician. (2019) 99:179–84.

Google Scholar

2. Bray, F, Ferlay, J, Soerjomataram, I, Siegel, RL, Torre, LA, and Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2018) 68:394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Ricketts, CJ, De Cubas, AA, Fan, H, Smith, CC, Lang, M, Reznik, E, et al. The cancer genome atlas comprehensive molecular characterization of renal cell carcinoma. Cell Rep. (2018) 23:313–326.e5. doi: 10.1016/j.celrep.2018.03.075

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Barata, PC, and Rini, BI. Treatment of renal cell carcinoma: current status and future directions. CA Cancer J Clin. (2017) 67:507–24. doi: 10.3322/caac.21411

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Turajlic, S, Swanton, C, and Boshoff, C. Kidney cancer: the next decade. J Exp Med. (2018) 215:2477–9. doi: 10.1084/jem.20181617

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Tadayoni, A, Paschall, AK, and Malayeri, AA. Assessing lymph node status in patients with kidney cancer. Transl Androl Urol. (2018) 7:766–73. doi: 10.21037/tau.2018.07.19

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Srivastava, A, Rivera-Núñez, Z, Kim, S, Sterling, J, Farber, NJ, Radadia, KD, et al. Impact of pathologic lymph node-positive renal cell carcinoma on survival in patients without metastasis: evidence in support of expanding the definition of stage IV kidney cancer. Cancer. (2020) 126:2991–3001. doi: 10.1002/cncr.32912

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Babaian, KN, Kim, DY, Kenney, PA, Wood, CG Jr, Wong, J, Sanchez, C, et al. Preoperative predictors of pathological lymph node metastasis in patients with renal cell carcinoma undergoing retroperitoneal lymph node dissection. J Urol. (2015) 193:1101–7. doi: 10.1016/j.juro.2014.10.096

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Sun, JX, Liu, CQ, Zhang, ZB, Xia, QD, Xu, JZ, An, Y, et al. A novel predictive model of pathological lymph node metastasis constructed with preoperative independent predictors in patients with renal cell carcinoma. J Clin Med. (2023) 12:441. doi: 10.3390/jcm12020441

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Capitanio, U, Deho, F, Dell’Oglio, P, Larcher, A, Capogrosso, P, Nini, A, et al. Lymphadenopathies in patients with renal cell carcinoma: clinical and pathological predictors of pathologically confirmed lymph node invasion. World J Urol. (2016) 34:1139–45. doi: 10.1007/s00345-015-1747-5

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Dudani, S, de Velasco, G, Wells, JC, Gan, CL, Donskov, F, Porta, C, et al. Evaluation of clear cell, papillary, and Chromophobe renal cell carcinoma metastasis sites and association with survival. JAMA Netw Open. (2021) 4:e2021869. doi: 10.1001/jamanetworkopen.2020.21869

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Mitchell, TJ, Turajlic, S, Rowan, A, Nicol, D, Farmery, JHR, O'Brien, T, et al. Timing the landmark events in the evolution of clear cell renal cell cancer: TRACERx renal. Cells. (2018) 173:611–623.e17. doi: 10.1016/j.cell.2018.02.020

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Feuerstein, MA, Kent, M, Bazzi, WM, Bernstein, M, and Russo, P. Analysis of lymph node dissection in patients with ≥7-cm renal tumors. World J Urol. (2014) 32:1531–6. doi: 10.1007/s00345-013-1233-x

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Whitson, JM, Harris, CR, Reese, AC, and Meng, MV. Lymphadenectomy improves survival of patients with renal cell carcinoma and nodal metastases. J Urol. (2011) 185:1615–20. doi: 10.1016/j.juro.2010.12.053

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Crispen, PL, Breau, RH, Allmer, C, Lohse, CM, Cheville, JC, Leibovich, BC, et al. Lymph node dissection at the time of radical nephrectomy for high-risk clear cell renal cell carcinoma: indications and recommendations for surgical templates. Eur Urol. (2011) 59:18–23. doi: 10.1016/j.eururo.2010.08.042

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Ljungberg, B, Albiges, L, Abu-Ghanem, Y, Bedke, J, Capitanio, U, Dabestani, S, et al. European Association of Urology guidelines on renal cell carcinoma: the 2022 update. Eur Urol. (2022) 82:399–410. doi: 10.1016/j.eururo.2022.03.006

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Kuusk, T, Klatte, T, Zondervan, P, Lagerveld, B, Graafland, N, Hendricksen, K, et al. Outcome after resection of occult and non-occult lymph node metastases at the time of nephrectomy. World J Urol. (2021) 39:3377–83. doi: 10.1007/s00345-021-03633-5

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Gershman, B, Moreira, DM, Thompson, RH, Boorjian, SA, Lohse, CM, Costello, BA, et al. Renal cell carcinoma with isolated lymph node involvement: long-term natural history and predictors of oncologic outcomes following surgical resection. Eur Urol. (2017) 72:300–6. doi: 10.1016/j.eururo.2016.12.027

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Geller, JI, Ehrlich, PF, Cost, NG, Khanna, G, Mullen, EA, Gratias, EJ, et al. Characterization of adolescent and pediatric renal cell carcinoma: a report from the Children's oncology group study AREN03B2. Cancer. (2015) 121:2457–64. doi: 10.1002/cncr.29368

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Coll, DM, and Smith, RC. Update on radiological imaging of renal cell carcinoma. BJU Int. (2007) 99 (5 Pt B): 1217–22) 99:1217–22. doi: 10.1111/j.1464-410X.2007.06824.x

CrossRef Full Text | Google Scholar

21. Singh, NP, Bapi, RS, and Vinod, PK. Machine learning models to predict the progression from early to late stages of papillary renal cell carcinoma. Comput Biol Med. (2018) 100:92–9. doi: 10.1016/j.compbiomed.2018.06.030

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Lee, Y, Ryu, J, Kang, MW, Seo, KH, Kim, J, Suh, J, et al. Machine learning-based prediction of acute kidney injury after nephrectomy in patients with renal cell carcinoma. Sci Rep. (2021) 11:15704. doi: 10.1038/s41598-021-95019-1

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Li, P, Peng, C, Xie, Y, Wang, L, Gu, L, Wu, S, et al. A novel preoperative Nomogram for predicting lymph node invasion in renal cell carcinoma patients without metastasis. Cancer Manag Res. (2019) 11:9961–7. doi: 10.2147/CMAR.S218254

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Li, W, Wang, B, Dong, S, Xu, C, Song, Y, Qiao, X, et al. A novel Nomogram for prediction and evaluation of lymphatic metastasis in patients with renal cell carcinoma. Front Oncol. (2022) 12:851552. doi: 10.3389/fonc.2022.851552

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Dell’Oglio, P, Larcher, A, Muttin, F, Di Trapani, E, Trevisani, F, Ripa, F, et al. Lymph node dissection should not be dismissed in case of localized renal cell carcinoma in the presence of larger diseases. Urol Oncol. (2017) 35:662.e9–662.e15. doi: 10.1016/j.urolonc.2017.07.010

CrossRef Full Text | Google Scholar

26. Pantuck, AJ, Zisman, A, Dorey, F, Chao, DH, Han, KR, Said, J, et al. Renal cell carcinoma with retroperitoneal lymph nodes: role of lymph node dissection. J Urol. (2003) 169:2076–83. doi: 10.1097/01.ju.0000066130.27119.1c

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Scelo, G, and Larose, TL. Epidemiology and risk factors for kidney cancer. J Clin Oncol. (2018) 36:3574–81. doi: 10.1200/JCO.2018.79.1905

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Strauss, A, Uhlig, J, Lotz, J, Trojan, L, and Uhlig, A. Tumor laterality in renal cancer as a predictor of survival in large patient cohorts: a STROBE compliant study. Medicine (Baltimore). (2019) 98:e15346. doi: 10.1097/MD.0000000000015346

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Yu, KJ, Keskin, SK, Meissner, MA, Petros, FG, Wang, X, Borregales, LD, et al. Renal cell carcinoma and pathologic nodal disease: implications for American joint committee on cancer staging. Cancer. (2018) 124:4023–31. doi: 10.1002/cncr.31661

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: renal cell carcinoma, lymph node, metastasis, machine learning, online calculator

Citation: Zhang Y, Yi X, Tang Z, Xie P, Yin N, Deng Q, Zhu L, Luo H and Peng K (2023) Using machine learning to predict lymph node metastasis in patients with renal cell carcinoma: A population-based study. Front. Public Health 11:1104931. doi: 10.3389/fpubh.2023.1104931

Received: 22 November 2022; Accepted: 13 March 2023;
Published: 24 March 2023.

Edited by:

Tania Estapé, FEFOC, Spain

Reviewed by:

Shailesh Kumar Tripathi, Rajendra Institute of Medical Sciences, India
Fabrizio Consorti, Sapienza University of Rome, Italy

Copyright © 2023 Zhang, Yi, Tang, Xie, Yin, Deng, Zhu, Luo and Peng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hu Luo, bHVvaHVjeUAxNjMuY29t; Kanfu Peng, MzkyOTA2Nzg2QHFxLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.