AUTHOR=Sathianathen Niranjan J. , Heller Nicholas , Tejpaul Resha , Stai Bethany , Kalapara Arveen , Rickman Jack , Dean Joshua , Oestreich Makinna , Blake Paul , Kaluzniak Heather , Raza Shaneabbas , Rosenberg Joel , Moore Keenan , Walczak Edward , Rengel Zachary , Edgerton Zach , Vasdev Ranveer , Peterson Matthew , McSweeney Sean , Peterson Sarah , Papanikolopoulos Nikolaos , Weight Christopher 

TITLE=Automatic Segmentation of Kidneys and Kidney Tumors: The KiTS19 International Challenge

JOURNAL=Frontiers in Digital Health

VOLUME=Volume 3 - 2021

YEAR=2022

URL=https://www.frontiersin.org/journals/digital-health/articles/10.3389/fdgth.2021.797607

DOI=10.3389/fdgth.2021.797607

ISSN=2673-253X

ABSTRACT=Purpose: Clinicians rely on imaging features to calculate the complexity of renal masses based on validated scoring systems. These scoring methods are labor-intensive and are subject to interobserver variability. Artificial intelligence has been increasingly utilized by the medical community to solve such issues. However, developing reliable algorithms are usually time-consuming and costly. We created an international community-driven competition (KiTS19) to develop and identify the best system for automatic segmentation of kidneys and kidney tumors in contrast to CT and report the results. Methods: A training and test set of CT scans that were manually annotated by trained individuals were generated from consecutive patients undergoing renal surgery for whom demographic, clinical and outcome data were available. The KiTS19 Challenge was a machine learning competition hosted on grand-challenge.org in conjunction with an international conference. Teams were given 3 months to develop their algorithm using a full-annotated training set of images and an unannotated test set was released for 2 weeks from which average Sørensen-Dice coefficient between kidney and tumor regions across all 90 test cases. Results: There were 100 valid submissions that were based on deep neural networks but there were differences in pre-processing strategies, architectural details, and training procedures. The winning team scored a 0.974 kidney Dice and a 0.851 tumor Dice resulting in 0.912 composite scores. Automatic segmentation of the kidney by the participating teams performed comparably to expert manual segmentation but was less reliable when segmenting the tumor. Conclusion: Rapid advancement in automated semantic segmentation of kidney lesions is possible with relatively high accuracy when the data is released publicly, and participation is incentivized. Our findings demonstrate the potential of adopting AI in the medical field.