Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 24 May 2022
Sec. Genitourinary Oncology
This article is part of the Research Topic The Application of Artificial Intelligence In Diagnosis, Treatment and Prognosis In Urologic Oncology View all 11 articles

A Deep Learning System to Predict the Histopathological Results From Urine Cytopathological Images

  • 1Department of Urology, Peking University First Hospital, Peking University, Beijing, China
  • 2Institute of Urology, Peking University, Beijing, China
  • 3National Urological Cancer Center, Beijing, China
  • 4School of Cyber Science and Technology, Beihang University, Beijing, China
  • 5R&D Department, Yizhun Medical AI Co. Ltd, Beijing, China

Background: Although deep learning systems (DLSs) have been developed to diagnose urine cytology, more evidence is required to prove if such systems can predict histopathology results as well.

Methods: We retrospectively retrieved urine cytology slides and matched histological results. High-power field panel images were annotated by a certified urological pathologist. A deep learning system was designed with a ResNet101 Faster R-CNN (faster region-based convolutional neural network). It was firstly built to spot cancer cells. Then, it was directly used to predict the likelihood of the presence of tissue malignancy.

Results: We retrieved 441 positive cases and 395 negative cases. The development involved 387 positive cases, accounting for 2,668 labeled cells, to train the DLS to spot cancer cells. The DLS was then used to predict corresponding histopathology results. In an internal test set of 85 cases, the area under the curve (AUC) was 0.90 (95%CI 0.84–0.96), and the kappa score was 0.68 (95%CI 0.52–0.84), indicating substantial agreement. The F1 score was 0.56, sensitivity was 71% (95%CI 52%–85%), and specificity was 94% (95%CI 84%–98%). In an extra test set of 333 cases, the DLS achieved 0.25 false-positive cells per image. The AUC was 0.93 (95%CI 0.90–0.95), and the kappa score was 0.58 (95%CI 0.46–0.70) indicating moderate agreement. The F1 score was 0.66, sensitivity was 67% (95%CI 54%–78%), and specificity was 92% (95%CI 88%–95%).

Conclusions: The deep learning system could predict if there was malignancy using cytocentrifuged urine cytology images. The process was explainable since the prediction of malignancy was directly based on the abnormal cells selected by the model and can be verified by examining those candidate abnormal cells in each image. Thus, this DLS was not just a tool for pathologists in cytology diagnosis. It simultaneously provided novel histopathologic insights for urologists.

Introduction

Urothelial carcinoma (UC) is one of the most common cancers worldwide (1). UCs are often multifocal and tend to recur. Thus, thorough screening and frequent surveillance are mandatory. The diagnosis of UC typically relies on the histopathological assessment of tissue resected by cystoscopy, yet it is an invasive approach and not easily accessible.

Urine cytology has played an important role in the screening and surveillance of UCs for many years for its effective, inexpensive, noninvasive nature (24). However, in the context of urine cytology, there is currently no gold standard for cyto-histo correlation in urine (5).

One possible solution could be to use the deep learning systems (DLSs) to build such links. DLSs have demonstrated a capacity superior to manual workflows in shifting through massive images to retrieve similar patterns and establish associations with novel traits in many medical data analysis tasks (68). Three studies have successfully automated the urine cytology diagnosis through the use of DLSs (911). One of these previous studies has further proven that DLSs might be able to determine the malignant potential of tumors more accurately than classical cytology (11). Researchers used a 16-layer convolutional neural network (CNN). Weights trained for initial UC cell detection were reused for the first 7 layers. New training with histopathological data started at the eighth layer. The DLS determined the presence of stromal invasion and performed a nuclear grading of tumor cells in the corresponding histological specimens. Therefore, DLSs have become a method that could potentially link cytopathology findings with histopathology results.

In this study, we hypothesized that routine urine cytology images contain information about the presence of malignant tissue in urinary tracts. The rationale for this cyto-histo correlation is that malignant tissues in urinary tracts undergo constant exfoliation, which sheds tumor cells and influences tumor cell morphology in urine. Building on those previous studies, we systematically investigated the presence of such correlation and aimed to capture it through our DLS. We trained and tested the DLS to spot UC cells in cytology images before using it to predict if a case would get malignant surgical pathology within the next 1 year. The results demonstrated that the DLS could predict the presence of malignancy and display such associations between cytopathology and histopathology through likelihood even without further training with histopathology data. Thus, DLS cytology can be used as not only a pathological tool to assist cytopathological diagnosis but also a novel risk-stratification tool to predict histopathology. This could help urologists make therapeutic decisions.

Materials and Methods

Data Acquisition

All images were obtained from the archival glass of hematoxylin and eosin-stained urine cytocentrifugation cytology from consecutive patients who underwent examination, surgery, or both at Peking University First Hospital from 2014 to 2020 (Figure 1, Table 1). Urine cytology was routinely diagnosed using Papanicolaou’s classification at our institute (12). Classes III, IV, and V were defined as positive; class I and II (atypical) were defined as negative.

FIGURE 1
www.frontiersin.org

Figure 1 The data acquisition process is illustrated here. We collected the cytology images retrospectively and consecutively from Sep 2014 to Jan 2020 and built a series of data sets for training, validation, and tests. For those who underwent surgeries within the next 1 year, surgical results were also followed. The preliminary test set was only used for cancer cell detection, while the internal and extra test sets were used for both cancer cell detection and malignancy prediction.

TABLE 1
www.frontiersin.org

Table 1 Baseline characteristics.

Among the 441 positive cases (patients diagnosed with UC based on cytologic examination), 211 received surgery (surgery within the next 1 year, if not otherwise clarified), all of which were diagnosed with UC based on histological examination (Table 2).

TABLE 2
www.frontiersin.org

Table 2 Surgical follow-up.

Among the 395 negative cases (patients diagnosed with benign diseases based on cytologic examination), all received surgery, of which 333 were diagnosed with UC based on histological examination and the rest 62 were diagnosed with benign disease based on histological examination (Table 2). For the above 333 cases with contradicted cytopathological and histopathological results, a blinded pathologist’s review was carried out to check for overlooked cancer cells. As a result, 63 cases actually had cancer cells in their cytology images confirmed by a pathologist and should be deemed as positive cases.

From the original slides, 1,280 × 960-pixel, Joint Photographic Experts Group (jpeg) format images were exported: 466 images from positive cases and 417 images from negative cases. Subsequently, the training–validation set and preliminary test set, internal test set, and extra test set were defined by the different cytopathological and histopathological diagnoses. The training–validation set and preliminary test set were allocated by 8:1 stochastically. This study was approved by the institutional review board of Peking University First Hospital.

Deep Learning System

We built our DLS on a ResNet101 Faster-RCNN (Figure 2). ResNet101 is a 101-layer Reidual Network proposed on the 2016 IEEE Conference on Computer Vision and Pattern Recognition by He et al. (13). Deeper and restructured, ResNet101 has shown a high performance in many contexts of use including skin lesion detection and brain disease detection in magnetic resonance images (14, 15).

FIGURE 2
www.frontiersin.org

Figure 2 The overall design of the deep learning system. The ResNet 101 Faster-RCNN detected the UC cells while assigning a possibility to each of the cell, and an additional classifier picked the maximal possibility and predicted the histopathological malignant state according to the set threshold.

Faster-RCNN, short for faster region-based convolutional neural network, is a CNN that combines object detection and classification into one network (16). It extracts features, makes detections through these features, and quantifies the degree of fit at each detection using a value of possibility ranging from the worst of 0 to the optimum of 100. Evidence has shown that Faster-RCNN is especially good at detecting objects at multiple scales and aspect ratios, such as abnormal cervical cells in cytology images and cancer regions in colorectal biopsies (17, 18).

The model was implemented in Python 3.8 using TensorFlow (1.12.0) and Keras (2.0.3). Malignant cells with remarkable atypia in the jpeg images were annotated by a certified urological pathologist using the open-source software LabelMe (19). The images were then divided into 175 × 200-pixel panel subimages automatically, which were used for the training of the DLS.

ResNet101 was pretrained on the ImageNet database consisting of 1.2 million training images, with 1,000 classes of objects (20). The weight pretrained with ImageNet was used to initiate the weights of all convolutional layers, and all weights were trained with cytology images afterwards. The images passed through 33 convolution blocks and then through 1 dense layer. The SoftMax function was used as the activation function.

The training set and validation set were further allocated by 5:1 stochastically, for early stopping during network training to suppress overfitting. Spatial augmentation, including 90° rotation and vertical and horizontal flip, was applied in network training. We set 80 as the maximum epoch and stopped training if validation loss did not improve after 15 epochs.

For the prediction of malignancy, an additional classifier was added at the end of the initial DLS. The function of the classifier was to select the highest value of the possibilities in an image and to make a binary classification (benign or malignant) by comparing this value to the threshold (Figure 2). More details were provided in the Supporting Materials (Supporting File 1).

Evaluation Metrics

Performance was evaluated based on the testing results.

For the detection of UC cells, the annotations served as the reference standard. We used sensitivity, accuracy, and average false-positive cells per image. Sensitivities and accuracies were calculated using following formulas:

Sensitivity=True PositivesTotal Annotations; Accuracy=True PositivesTrue Positives + False Positives

For the prediction of the malignancy, the surgical results served as the reference standard. We used sensitivity, specificity, the F1 score, and kappa score for evaluation. Cohen kappa scores reflect the agreement of the DLS with the pathologist reference standard (21). F1 scores were calculated using the following formula:

F1 Score=2×Precision×RecallPrecision+Recall

Results

Development of the Deep Learning System

The first step of our model is to detect just cancer cells. Therefore, we only annotated the cancer cells in the jpeg images for training and validation purposes, and treated all the other cells in the same image as background (Supporting File1). In total, 1,364 cells from 411 1,280 × 960-pixel images were labelled. A total of 1953 subimages were obtained. Each subimage contained at least one label in it. Sub-images were subsequently randomly allocated into the training and validation sets by the ratio of 5:1, and subjected to the pretrained DLS, as mentioned in Materials and Methods. Both the total loss and the system accuracy stabilized after 45–50 epochs for the validation set. A final model was chosen at 48 epochs when the total loss for the validation set hit the lowest point of 1.6. It was also where the classification accuracy for the validation set hit the highest point of 0.77.

Deep Learning System Performance to Detect Cancer Cells

We evaluated the ability of the DLS to detect cancer cells in the three test sets (Figure 3). We took advantage of the value of possibility generated by the DLS and adopted it as the threshold for cell detection.

FIGURE 3
www.frontiersin.org

Figure 3 The examples of snapshot images from positive cases and the results by the deep learning system were provided. The malignant cells detected were labeled by the Faster-RCNN, and the possibilities of each detection were also shown.

For all sets, sensitivities increased at the cost of more cells mistakenly spotted as malignant by the DLS (Figure 4A). The accuracy initially increased with the thresholds for cell detection, and the rate of increase slowed down by approximately 50–55 (Figure 4B). Such a trend was observed in both preliminary and internal tests. Therefore, we chose 55 as the optimal threshold. Under this threshold, the sensitivity is 41% for the preliminary test at the cost of an average of 3.09 false-positive cells per image, 36% for the internal test at the cost of an average of 0.72 false-positive cells per image, and 41% for the extra test at the cost of an average of 0.31 false-positives cell per image (Figure 4A). The accuracy of cancer cell detection was 50.0, 50.3, and 14.5 for the preliminary test, internal test, and extra test, respectively (Figure 4B).

FIGURE 4
www.frontiersin.org

Figure 4 The DLS performance to detect malignant cells is illustrated. (A) As the system tried to find more malignant cells (to achieve higher sensitivity), it made more mistakes classifying benign cells as malignant. The sensitivities at the optimal threshold for three sets were marked. (B) The accuracy at different thresholds. Thresholds were represented by their middle value (for example, x = 52.5 represented the interval of 50–55 points). Higher thresholds tend to have better detection accuracy. The performance at the optimal threshold of 50–55 points is highlighted by the dashed line. At this point, the increasing rates began to slow down.

For the subgroups of the extra test set, results are also calculated under the optimal threshold of 55: for the positive 64 images, sensitivity is 41% with an average of 0.95 false-positive cells per image; for the 281 negative images, the average false-positive cell per image is 0.16 while the sensitivity is not available due to no true positive (cancer cell).

Deep Learning System Performance to Predict Malignancy

We evaluated whether the exact same DLS can predict malignancy with no further transfer learning. Because the gold standard of malignancy is histopathology, we paired cytology with its corresponding surgical pathology. A total of 97 1,280 × 960-pixel images in the internal test set and 345 1,280 × 960-pixel images in the extra test set that could be paired with corresponding histopathological specimens were used. Here, we proposed a hypothesis that a case with positive cytology was more likely to have malignant surgical pathology. Therefore, the maximal value of possibility (the threshold for cell detection) in each image was adopted as the threshold for malignancy prediction.

Notably, the DLS was able to predict malignancy through cytology images. For the internal test, the AUC was 0.90 (95%CI 0.84–0.96) (Figure 5A). The highest kappa score is 0.71 at the threshold of 57 for malignancy prediction, and the highest F1 score is 0.78 at the threshold of 58 (Table 3). For the extra test, the AUC was 0.93 (95%CI 0.90–0.95) (Figure 5A). The highest kappa score is 0.60 at the threshold of 52 for malignancy prediction, and the highest F1 score is 0.69 at the threshold of 52 (Table 4).

FIGURE 5
www.frontiersin.org

Figure 5 The DLS performance to predict malignancy is illustrated. (A) The receiver operating characteristics curve for the performance to predict malignancy for the internal and extra tests. (B) The distribution of maximal possibilities in the internal test set. Most images from benign histology were scored below 55. (C) The distribution of maximal possibilities in the extra test set. Most images from negative cases were also scored below 55 (highlighted by the dashed line).

TABLE 3
www.frontiersin.org

Table 3 Performance on the internal test set.

TABLE 4
www.frontiersin.org

Table 4 Performance on the extra test set.

Under the optimal threshold (threshold=55), the DLS also achieved good performance. For the internal test, 4 images that scored higher than 55 have benign histologic results (Figure 5B). Thus, sensitivity is 71% (95%CI 52%–85%); specificity is 94% (95%CI 84%–98%); the F1 score is 0.76; and there was a substantial agreement with the reference standard (kappa = 0.68 [95%CI 0.52–0.84]). For the extra test, 22 images that scored higher than 55 have benign histologic results (Figure 5C). Thus, sensitivity is 67% (95%CI 54%–78%); specificity is 92% (95%CI 88%–95%); the F1 score is 0.66; and there was a moderate agreement with the reference standard (kappa = 0.58 [95%CI 0.46–0.70]).

Discussion

In this study, we developed a DLS to predict the likelihood of the presence of tissue malignancy through urine cytopathology. Notably, the system achieved an AUC of 0.90 for the internal test and of 0.93 for the extra test. Under the optimal threshold, sensitivity is 71%, and specificity is 94% for the internal test; sensitivity is 67%, and specificity is 92% for the extra test. These results proved that the DLS was able to predict the presence of malignant tissue merely from urine cytology images.

It has been fully demonstrated that deep learning models can be used to establish a cytology diagnosis system. Vaickus et al. were the first to show that the analysis of urine cytology specimens could be reliably automated. They achieved an accuracy of more than 90% using a hybrid deep-learning and morphometric algorithm (9). Pantanowitz et al. further proved this idea using a much larger data set. They used a pure neural network to exploit and integrate both slide-level and cell-level features and achieved a sensitivity of 79.5% and a specificity of 84.5% for cytopathological diagnosis (10). For both studies, features were carefully engineered to ensure biological interpretability and reproducibility. Features such as the nuclear-cytoplasm ratio, chromatin quality, and the quantity of cells were included (Figure 6). Such a design made the system explainable and enables it to fulfill the aim to assist pathologists in cytology reading.

FIGURE 6
www.frontiersin.org

Figure 6 Different designs in between different DLSs. The current DLS could make both cytopathological diagnosis and histopathological prediction, yet it did not need histopathological data in its training.

Although the principal indications for the use of cytology include the diagnosis, follow-up, and monitoring of patients with urothelial tumors, the gold standard for the diagnosis of tumor is still histopathology. Therefore, the cyto-histo correlation has become the bottleneck for the use of cytology in clinical settings. Previous studies have shown that deep learning techniques could discern subtle differences in image features that are not readily noticeable to pathologists between tissues from patients with different genetic subtypes, cancer grades, and survival (68). Thus, it was reasonable to wonder if deep learning could also distinguish differences in cytology images with different histopathological results.

Fujita et al. were the first to observe that the DLS could not only accurately detect UC cells but also distinguish characteristics traditionally determined using histopathology. They probed two specific characters: whether the lesions were invasive and whether the lesions were high grade. For both, the DLS achieved an AUC higher than 0.86 and F1 score higher than 0.82 (11). The study used a mixing-training model with the first half trained with cytopathological data and the other half trained with histopathological data (Figure 6). The results proved that at least part of the features could be shared for cytopathology diagnosis and histopathology prediction. This laid the foundation for a complete cyto-histo correlation.

For the purpose of proving this correlation more stringently, we improved the design of this study on the basis of previous ones. First, we trained the system with only positive cytology images instead of adding negative images as in previous studies. Therefore, this design could insure all the features used by the DLS derived from malignant cases. Second, during malignancy prediction, we did not train the initial DLS again. Instead, we added an additional classifier at the end of the initial DLS. This design rendered two models that shared a same set of convolutional network and detector and, thus, a same set of features. The additional classifier was designed based on the biological meaning of the degree of fit calculated by the DLS. During cancer cell detection, the degree of fit was the likelihood of a cell to be a cancer cell. Therefore, the maximal degree of fit in a certain image represented the likelihood of a case to get malignancy. There were no manually designed features in the DLS. Therefore, the process was explainable since the prediction of malignancy was directly based on the abnormal cells selected by the model and can be verified by examining those candidate abnormal cells in each image. Thus, the design made the DLS not only possible to testify the hypothesis better but also explainable in a unique way.

The test sets in this study included preliminary, internal, and extra test sets. The extra test set came from patients whose cytology was initially diagnosed as negative but later proved to have malignant histopathological results. Overlooked cancer cells in these images were carefully revised in the blinded review at the beginning of this study. Results showed that the extra test set had higher AUC than the internal test set during malignancy prediction. This indicated that the DLS had a good performance for the extra test despite the fact that these cases were among the most difficult to diagnose by traditional cytology. However, the sensitivity and specificity for the extra test under the optimal threshold were lower than that for the internal test. Moreover, during cancer cell detection, the accuracy for the extra test was also lower than that for the preliminary and the internal tests under the optimal threshold, and the accuracy for the extra test was still increasing after the threshold reached the optimal threshold. This indicated that the current optimal threshold using in this study, which was chosen based on the preliminary and internal tests, might not serve as the optimal threshold for the extra test. Future studies are needed for a better strategy to find the optimal threshold for DLS application. At the same time, it is also important to understand that pathologists are not able to spot every true cancer cell since no cytopathological scoring system used for cytology diagnosis at present is perfect. A DLS learnt from pathologists’ annotations inevitably inherited these bias and errors. Therefore, as pathologists had failed to perform excellent in the extra test set themselves, the DLS would only detect cancer cells with additional difficulties.

The results of the DLS performance to predict malignancy showed a relatively high specificity. The sensitivity, however, was not as good as the specificity. This indicated that most cases predicted to have tissue malignancy were indeed patients with UC, while some patients with UC were not successfully identified. This may be attributed to the fact that the DLS still needs further improvement or that some UCs do not present morphologically abnormal cells in urine. Meanwhile, it was not necessary to spot every cancer cell to make a prediction. Instead, it was adequate to find most of the cancer cells while mistaking as few normal cells as possible. These successfully spotted cancer cells were likely to be among the most atypical and, thus, gave the highest scores in an image. Biologically, it was also the patients whose cells in cytology had a higher degree of atypia who were more likely to get UC. Future studies are needed to identify UC in those patients without abnormal cells in urine.

There is currently no gold standard for cyto-histo correlation in urine. Many argue that a negative cytology with a concurrent positive surgical result is not a false negative. Similarly, a positive urine followed by a negative surgical result is not a false positive. However, results in this current study imply that when the DLS predicts a malignant state, it might focus on characteristics that are partially same with those used for UC cell detection. This indicates that there are features on cytology images correlated to histopathology results. Nevertheless, due to the lack of technical maneuver to untangle representative features in Faster-RCNN, we are not able to define each feature and apply them in classical cytology. This is one of the limitations of this study. Another limitation is that this is a retrospective study in a single center. Multicentered prospective studies are warranted to further prove these findings.

Collectively, the current results demonstrated that DLS cytology could be used to predict the likelihood of a case to have histological confirmed malignancy through a cyto-histo correlation. If a DLS can serve as a risk-stratification tool to distinguish clinically relevant malignancy at the time of cytology, urologists can plan in time therapeutic strategies at lower cost that benefit more patients.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Ethics Statement

The studies involving human participants were reviewed and approved by the ethics committee of biological and medical research of Peking University First Hospital. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

YL, SJ, and WY set up the experimental design. YL, QS, and YF collected and selected the data. SJ was mainly responsible for the high-performance computing and statistical analysis. QS was responsible for annotations and the morphology interpretation in the cytology and histology. LC provided the basic algorithm model. LC and SF provided guidance on deep learning and computer vision techniques. YL wrote the manuscript. WY and HP were responsible for conception, and they supervised the work. All authors reviewed the manuscript and agree with its contents.

Funding

This study was funded by National Natural Science Foundation of China (81870518 to WY and 62073012 to HP).

Conflict of Interest

Authors YL, SJ, QS, LC, SF, YF, HP, and WY are on a pending patent (China 202110493539.5). Author LC was employed by the company Yizhun Medical AI Co. Ltd.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We are grateful to the patients and urological pathologists at the Peking University First Hospital. YG, HH, and YM are acknowledged for their advice on the setup of the model. XY and LZ are acknowledged for their advice and guidance on the illustration of figures. We express our sincere appreciation for the grants from the National Natural Science Foundation of China (81870518 to WY and 62073012 to HP).

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.901586/full#supplementary-material

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Witjes JA, Bruins HM, Cathomas R, Comperat EM, Cowan NC, Gakis G, et al. European Association of Urology Guidelines on Muscle-Invasive and Metastatic Bladder Cancer: Summary of the 2020 Guidelines. Eur Urol (2020) 79(1):82–104. doi: 10.1016/j.eururo.2020.03.055

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Flaig TW, Spiess PE, Agarwal N, Bangs R, Boorjian SA, Buyyounouski MK, et al. Bladder Cancer, Version 3.2020, NCCN Clinical Practice Guidelines in Oncology. J Natl Compr Canc Netw (2020) 18(3):329–54. doi: 10.6004/jnccn.2020.0011

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Roupret M, Babjuk M, Burger M, Capoun O, Cohen D, Comperat EM, et al. European Association of Urology Guidelines on Upper Urinary Tract Urothelial Carcinoma: 2020 Update. Eur Urol (2021) 79(1):62–79. doi: 10.1016/j.eururo.2020.05.042

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Crothers BA. Cytologic-Histologic Correlation: Where Are We Now, and Where Are We Going? Cancer Cytopathol (2018) 126(5):301–8. doi: 10.1002/cncy.21991

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Kather JN, Heij LR, Grabsch HI, Loeffler C, Echle A, Muti HS, et al. Pan-Cancer Image-Based Detection of Clinically Actionable Genetic Alterations. Nat Canc (2020) 1(8):789–99. doi: 10.1038/s43018-020-0087-6

CrossRef Full Text | Google Scholar

7. Fu Y, Jung AW, Torne RV, Gonzalez S, Vöhringer H, Shmatko A, et al. Pan-Cancer Computational Histopathology Reveals Mutations, Tumor Composition and Prognosis. Nat Canc (2020) 1(8):800–10. doi: 10.1038/s43018-020-0085-8

CrossRef Full Text | Google Scholar

8. Mi H, Bivalacqua TJ, Kates M, Seiler R, Black PC, Popel AS, et al. Predictive Models of Response to Neoadjuvant Chemotherapy in Muscle-Invasive Bladder Cancer Using Nuclear Morphology and Tissue Architecture. Cell Rep Med (2021) 2(9):100382. doi: 10.1016/j.xcrm.2021.100382

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Vaickus LJ, Suriawinata AA, Wei JW, Liu X. Automating the Paris System for Urine Cytopathology-A Hybrid Deep-Learning and Morphometric Approach. Cancer Cytopathol (2019) 127(2):98–115. doi: 10.1002/cncy.22099

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Sanghvi AB, Allen EZ, Callenberg KM, Pantanowitz L. Performance of an Artificial Intelligence Algorithm for Reporting Urine Cytopathology. Cancer Cytopathol (2019) 127(10):658–66. doi: 10.1002/cncy.22176

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Nojima S, Terayama K, Shimoura S, Hijiki S, Nonomura N, Morii E, et al. A Deep Learning System to Diagnose the Malignant Potential of Urothelial Carcinoma Cells in Cytology Specimens. Cancer Cytopathol (2021) 129(12):984–95. doi: 10.1002/cncy.22443

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Owens CL, Vandenbussche CJ, Burroughs FH, Rosenthal DL. A Review of Reporting Systems and Terminology for Urine Cytology. Cancer Cytopathol (2013) 121(1):9–14. doi: 10.1002/cncy.21253

PubMed Abstract | CrossRef Full Text | Google Scholar

13. He K, Zhang X, Ren S, Sun J. Deep Residual Learning for Image Recognition(2016) (Accessed 2016 IEEE Conference on Computer Vision and Pattern Recognition).

Google Scholar

14. Dai D, Dong C, Xu S, Yan Q, Li Z, Zhang C, et al. Ms RED: A Novel Multi-Scale Residual Encoding and Decoding Network for Skin Lesion Segmentation. Med Imag Anal (2022) 75:102293. doi: 10.1016/j.media.2021.102293

CrossRef Full Text | Google Scholar

15. Chen H, Dou Q, Yu L, Qin J, Heng PA. VoxResNet: Deep Voxelwise Residual Networks for Brain Segmentation From 3D MR Images. Neuroimage. (2018) 170:446–55. doi: 10.1016/j.neuroimage.2017.04.041

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Ren S, He K, Girshick R, Sun J. Faster R-CNN: Towards Real-Time Object Detection With Region Proposal Networks. IEEE Trans Pattern Anal Mach Intell (2017) 39(6):1137–49. doi: 10.1109/TPAMI.2016.2577031

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Li X, Xu Z, Shen X, Zhou Y, Xiao B, Li TQ. Detection of Cervical Cancer Cells in Whole Slide Images Using Deformable and Global Context Aware Faster RCNN-FPN. Curr Oncol (2021) 28(5):3585–601. doi: 10.3390/curroncol28050307

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Ho C, Zhao Z, Chen XF, Sauer J, Saraf SA, Jialdasani R, et al. A Promising Deep Learning-Assistive Algorithm for Histopathological Screening of Colorectal Cancer. Sci Rep (2022) 12(1):2222. doi: 10.1038/s41598-022-06264-x

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Russell BC, Torralba A, Murphy KP, Freeman WT. LabelMe: A Database And Web-Based Tool for Image Annotation. Int J Of Comput Vision (2008) 77(1-3):157–73. doi: 10.1007/s11263-007-0090-8

CrossRef Full Text | Google Scholar

20. Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, et al. ImageNet Large Scale Visual Recognition Challenge. Int J Comput Vis (2015) 115(3):211–52. doi: 10.1007/s11263-015-0816-y

CrossRef Full Text | Google Scholar

21. Fleiss JL, Cohen J, Everitt BS. Large Sample Standard Errors of Kappa And Weighted Kappa. Psycholog Bullet (1969) 72:323–7. doi: 10.1037/h0028106

CrossRef Full Text | Google Scholar

Keywords: cyto-histo correlation, deep learning, urothelial carcinoma, urine cytology, convolutional neural network

Citation: Liu Y, Jin S, Shen Q, Chang L, Fang S, Fan Y, Peng H and Yu W (2022) A Deep Learning System to Predict the Histopathological Results From Urine Cytopathological Images. Front. Oncol. 12:901586. doi: 10.3389/fonc.2022.901586

Received: 22 March 2022; Accepted: 19 April 2022;
Published: 24 May 2022.

Edited by:

Jian Lu, Peking University Third Hospital, China

Reviewed by:

Kazutoshi Fujita, Osaka University, Japan
Satoshi Nojima, Osaka University, Japan

Copyright © 2022 Liu, Jin, Shen, Chang, Fang, Fan, Peng and Yu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wei Yu, yuweif@126.com; Hao Peng, penghao@act.buaa.edu.cn

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.