SYSTEMATIC REVIEW article

Front. Oncol., 11 October 2024

Sec. Gastrointestinal Cancers: Colorectal Cancer

Volume 14 - 2024 | https://doi.org/10.3389/fonc.2024.1424044

Application of Artificial Intelligence in the diagnosis and treatment of colorectal cancer: a bibliometric analysis, 2004–2023

  • 1. Department of Oncology, Wuxi Hospital Affiliated to Nanjing University of Chinese Medicine, Wuxi, China

  • 2. Department of Traditional Chinese Medicine, Jiangyin Nanzha Community Health Service Center, Wuxi, China

  • 3. Department of General Surgery, Jiangyin Hospital Affiliated to Nanjing University of Chinese Medicine, Wuxi, China

Article metrics

View details

10

Citations

5,4k

Views

1,6k

Downloads

Abstract

Background:

An increasing number of studies have turned their lens to the application of Artificial Intelligence (AI) in the diagnosis and treatment of colorectal cancer (CRC).

Objective:

To clarify and visualize the basic situation, research hotspots, and development trends of AI in the diagnosis and treatment of CRC, and provide clues for research in the future.

Methods:

On January 31, 2024, the Web of Science Core Collection (WoSCC) database was searched to screen and export the relevant research published during 2004-2023, and Cite Space, VoSviewer, Bibliometrix were used to visualize the number of publications, countries (regions), institutions, journals, authors, citations, keywords, etc.

Results:

A total of 2715 pieces of literature were included. The number of publications grew slowly until the end of 2016, but rapidly after 2017, till to the peak of 798 in 2023. A total of 92 countries, 3997 organizations, and 15,667 authors were involved in this research. Chinese scholars released the highest number of publications, and the U.S. contributed the highest number of total citations. As to authors, MORI, YUICHI had the highest number of publications, and WANG, PU had the highest number of total citations. According to the analysis of citations and keywords, the current research hotspots are mainly related to “Colonoscopy”, “Polyp Segmentation”, “Digital Pathology”, “Radiomics”, “prognosis”.

Conclusion:

Research on the application of AI in the diagnosis and treatment of CRC has made significant progress and is flourishing across the world. Current research hotspots include AI-assisted early screening and diagnosis, pathology, and staging, and prognosis assessment, and future research is predicted to put weight on multimodal data fusion, personalized treatment, and drug development.

1 Introduction

Colorectal cancer (CRC) is a common malignant tumor of the gastrointestinal tract, and the global morbidity and mortality of CRC rank third and second among all cancers, respectively (1). According to Cancer incidence and mortality in China, 2022, China is inflicted with a high incidence of CRC, and its morbidity and mortality rank second and fourth among all malignant tumors, respectively (2). Contributors to CRC include gender, genetic factors, and family factors in addition to smoking, obesity, and poor lifestyle (3). Currently, CRC is usually diagnosed by laboratory tests, endoscopy, imaging, and histopathology, and treated with endoscopy, surgery, radiotherapy, chemotherapy, targeted therapy, and immunotherapy (4). Despite these diagnostic and treatment technologies, CRC is till prevalent in China, as shown by its continuous increase in morbidity and mortality in men during 2014-2018 (2). Artificial intelligence (AI) is revolutionizing the diagnosis and treatment of CRC, as shown by the advances in high-precision medical image analysis, endoscopic data processing, and digital pathology assessment, as well as personalized treatment and robot-assisted surgery. AI also help to better allocate and utilize medical resources and healthcare services (59).

Bibliometrics is a cross-cutting science that uses mathematical and statistical tools to quantitatively analyze scientific literature and provides standardized evaluation criteria to reduce subjective bias compared to traditional review studies (10). It can help researchers quickly understand the classic literature and core authors in a certain field as well as the research hotspots and development trends, in order to enhance the research efficiency and assist scientific research decisions (11, 12).

Recent years have witnessed an increasing number of studies related to the application of AI in CRC diagnosis. However, very few researchers have used bibliometric methods to systematically analyze studies in this area. In the only few bibliometric studies on related topics, they all focus on the analysis of the basic status of the research, without a detailed and extended analysis of the current research hotspots and future research trends (13, 14). Based on the Web of Science (WoS) core collection database, the present bibliometric study uses Cite Space (15), VoSviewer (16), Bibliometrix (17) to analyze the status quo, hotspots, and trends in the research of AI-related CRC diagnosis or treatment during 2004-2023, aiming to provide new ideas and clues for related research work.

2 Materials and methods

2.1 Data sources and search strategy

The WoS is the most comprehensive, systematic, and authoritative database available for bibliometric analysis, due to its rich bibliometric indicators and inclusion of high-quality journals from around the world (18). The data included in this study were obtained from the Web of Science Core Collection (WoSCC) database (https://www.webofscience.com/wos/woscc/basic-search). The search was conducted on January 31, 2024. Our search topic was divided into two parts, technical terms and disease terms, and for a broader search, we looked for relevant synonyms in the Pubmed MeSH Database and referred to recent bibliometric studies related to AI (19) or colorectal cancer (20) to finalize the following search formula: TS=(Rectal Neoplasm* OR Rectal Tumor* OR Rectal Cancer* OR Rectum Neoplasm* OR Rectum Cancer* OR Cancer of the Rectum OR Cancer of Rectum OR Colorectal Neoplasm* OR Colorectal Tumor* OR Colorectal Cancer* OR Colorectal Carcinoma* OR Colonic Neoplasm* OR Colon Neoplasm* OR Cancer of Colon OR Colon Cancer* OR Cancer of the Colon OR Colonic Cancer*) AND TS=(artificial intelligence OR Computational Intelligence OR machine learn* OR deep learn* OR artificial neural network OR machine intelligence).In the filtering function of the WoSCC database, we have set it so that only studies published as “article” and “review” and in “English” from January 1, 2004 to December 31, 2023 were screened. The process of literature screening is shown in Figure 1. A total of 2,715 articles were finally included, and the complete records of these articles (including title, authors, sources, abstracts, references, etc.) were exported in “download_txt” format for subsequent analysis.

Figure 1

2.2 Methods of statistical analysis

Microsoft Office Excel 2019 was used to draw the infographics of the annual publication volume and the descriptive tables of various types of data. VOSviewer (v.1.6.20) was used to carry out co-authorship and co-occurrence network analyses among countries (regions), institutions, authors, and journals. High-frequency co-cited literature, highly cited literature, and keywords were presented through network visualization, overlay visualization, and density visualization. Journal biplot overlays, co-cited literature clusters, keyword clusters, and timeline plots of keyword clusters were plotted using CiteSpace (v.6.3.R1 64-bit advanced) and combined with citation databases for literature analysis. Thematic trend analysis of keywords was run on the Bibliometrix (v4.1.4) based on R software (v4.3.2). The changes of research hotspots over time in this research area were visualized.

3 Results

3.1 General information about the data

A total of 2715 records were searched out of the WoS and imported into the Bibliometrix. The overall situation is shown in Table 1.

Table 1

DescriptionResults
Documents2715
Articles2336
Reviews379
Timespan2004:2023
Sources (Journals, Books, etc)722
Annual Growth Rate %22.85
Document Average Age3.41
Average citations per doc18.74
References89817

Basic data information.

3.2 Analysis of the annual volume and its changing trend

The annual numbers of articles were imported into Microsoft Office Excel 2019 to plot the trend of this index (Figure 2). It showed that the annual number of articles grew slowly, and never exceeded 50 before 2016, but rapidly after 2017, especially between 2020 and 2023, till to a peak of 798 in 2023.

Figure 2

3.3 Analysis of countries (regions)

The analysis of VOSviewer software showed that a total of 92 countries contributed to the research on AI in CRC diagnosis and treatment. The top 10 countries with the most publications are listed in Table 2, in which China ranked the first with 927 articles, the USA second with 658 articles, and the UK third with 240 articles. Although the USA took the second place in the total number of publications, its total number of citations reached 21,532 (the largest in the world) and average number of citations reached 32.7234, which indicates that the USA has a significant influence in this research field.

Table 2

RankCountryDocumentsPercentage (%)CitationsAvg. citationsTSL
1China92734.14%1355014.617320
2USA65824.24%2153232.7234647
3United Kingdom2408.84%740630.8583393
4Germany2047.51%533626.1569347
5Italy1917.03%338417.7173292
6Japan1886.92%350618.6489188
7South Korea1636.00%231814.220979
8Netherlands1204.42%379031.5833272
9India1194.38%10228.588273
10Canada1043.83%274726.4135156

Top 10 countries (regions) by number of publications.

TLS, Total link strength.

In addition, the average numbers of citations in the Netherlands (31.5833), the United Kingdom (30.8583), Canada (26.4153), and Germany (26.1569) were also relatively high, which may imply that the studies in these countries are generally of high quality. It is particularly noteworthy that although Sweden only ranked 17th in the number of publications (47 in total), its average number of citations was as high as 41.2128, which ranked the first among all the countries, implying that the quality of the research results in Sweden is high.

Using the VOSviewer software, we performed a collaborative network mapping for the 32 countries with at least 20 publications (Figure 3), showing that international collaborations had appeared among a wide range of countries or regions in North America, Europe, and Asia, with the United States, the United Kingdom, Germany, China, and Italy as top centers.

Figure 3

3.4 Analysis of organizations

According to the results from the VOSviewer software, a total of 3,997 organizations have participated in the research on AI in CRC diagnosis and treatment. Table 3 lists the top 10 organizations having published the most publications, in which Sun Yat-sen University ranked the first (70 articles), followed by Fudan University (53 articles), Chinese Academy of Sciences (51 articles), Harvard Medical School (50 articles), and Zhejiang University (50 articles). Among these top 10 institutions, Harvard Medical School had the most total citations (2,362), followed by the Chinese Academy of Sciences (1,707) and Sun Yat-sen University (1,313).

Table 3

RankOrganizationCountryDocumentsCitationsTSL
1Sun Yat-sen UnivChina70131385
2Fudan UnivChina5367646
3Chinese Acad SciChina51170779
4Harvard Med SchUSA50236293
5Zhejiang UnivChina50118611
6Shanghai Jiao Tong UnivChina4846045
7Sichuan UnivChina4767433
8Southern Med UnivChina4659552
9Stanford UnivUSA36157429
10Natl Canc CtrItaly3490345

Top 10 organizations in terms of number of articles issued.

It is worth noting that although Mayo Clinic did not enter the top 10 in the number of publications (25), its total number of citations was as high as 3,662, topping all other institutions, indicating that its research quality was the highest.

In addition, a network involving 71 organizations having published at least 15 articles was constructed by the VOSviewer software (Figure 4). There were strong links between several institutions. The top five in TSL (Total Score Length) included the University of Oslo, Harvard Medical School, Sun Yat-sen University, Chinese Academy of Sciences, and Showa University, reflecting their leadership and influence in the research on AI in CRC diagnosis and treatment.

Figure 4

3.5 Analysis of authors

The results from the VOSviewer software showed that a total of 15,667 authors were involved in the research on AI in CRC diagnosis and treatment. As shown in Table 4, the top three authors with the most publications in this field were MORI, YUICHI (23), followed by REPICI, ALESSANDRO (20), and HASSAN, CESARE (18). As for the total number of citations, the top three authors were WANG, PU (10 articles with 1101 citations), BERZIN, TYLER M. (9 articles with 1088 citations) and KATHER, JAKOB NIKOLAS (17 articles with 1077 citations).

Table 4

RankAuthorsDocumentsCitationsTSL
1MORI, YUICHI23758164
2REPICI, ALESSANDRO20701108
3HASSAN, CESARE1866694
4KATHER, JAKOB NIKOLAS17107745
5KUDO, SHIN-EI17603131
6MISAWA, MASASHI16660124
7PICKHARDT, PERRY J.1543625
8MORI, KENSAKU1449997
9LIU, ZAIYI1314754
10SAITO, YUTAKA1342555

Top 10 authors in terms of number of articles published.

In the VOSviewer software, a network of 234 authors with at least 5 publications was constructed (Figure 5), showing intense mutual collaborations between these authors. In this collaborative network, the top five authors in terms of TSL (Total Score Length) included MORI, YUICHI, KUDO, SHIN-EI, MISAWA, MASASHI, REPICI, ALESSANDRO, and MORI, KENSAKU, which indicated that they play more important roles in the collaborative network in this research area.

Figure 5

3.6 Analysis of journals and co-citations

Through the results from the VOSviewer software, we found that a total of 722 journals had published research on AI in CRC diagnosis and treatment. Among them, Cancers published the largest number of articles (110 articles). In terms of the number of citations, IEEE Transactions On Medical Imaging led with 3232 citations. Table 5 provides details of the top 10 journals in the number of articles published, where Computers In Biology And Medicine carried the highest impact factor of 7.7 in 2022.

Table 5

RankJournalDocumentsCitationsIFJCRJournalCo-citationsIFJCR
1Cancers11010415.2Q2Gastrointest Endosc24457.7Q1
2Frontiers In Oncology995784.7Q2Sci Rep-Uk22884.6Q2
3Scientific Reports8123954.6Q2Gastroenterology224629.4Q1
4Diagnostics613953.6Q2Nature161864.8Q1
5Ieee Access454373.9Q2Plos One16133.7Q2
6Medical Physics4414203.8Q2Gut158224.5Q1
7Plos One354773.7Q2New Engl J Med1542158.5Q1
8WorldJournal Of Gastroenterology353604.3Q2J Clin Oncol149345.4Q1
9Applied Sciences-Basel304902.7Q3Lect Notes Comput Sc1426————
10Computers In Biology And Medicine302687.7Q1Endoscopy14239.3Q1

Basic information of the top 10 journals in terms of number of publications and co-citations.

IF, Impact Factor (2022); JCR, Journal Citation Reports Subdivision (2022).

In terms of citation frequency, the top three journals were Gastrointestinal Endoscopy (2,445), Scientific Reports (2,288), and Gastroenterology (2,246), as shown in Table 5. Among these cited journals, The New England Journal of Medicine carried the highest impact factor of 158.5 in 2022.

Figure 6 shows a two-plot overlay mapping based on citing and cited journals in the CiteSpace software, revealing these journals were mainly cited through four paths: Literature published in journals in the fields of “Molecular, Biology, Genetics”, “Health, Nursing, Medicine” was cited by literature published in journals in the fields of “Medicine, Medical, Clinical”, “Molecular, Biology, Immunology”. This shows a complex network of intersections and influences between research areas.

Figure 6

3.7 Analysis of references

The analysis in the VOSviewer software showed that a total of 89,755 references were cited in 2715 articles. As shown in Table 6, the top 10 most cited literature included. In addition to three epidemiological studies on tumor incidence and mortality [Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries (21), Cancer Statistics, 2017 (22), Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries (1)], and a study examining the relationship between adenoma detection rate and CRC mortality risk [Adenoma Detection Rate and Risk of Colorectal Cancer and Death (23)], the remaining six pieces of literatures (2429) focus on the application of convolutional neural networks in colonoscopy image recognition.

Table 6

RankTitleCo-citationsYear of publication
1Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries2692018
2U-Net: Convolutional Networks for Biomedical Image Segmentation2032015
3Deep Residual Learning for Image Recognition1862016
4Cancer Statistics, 20171682017
5Very Deep Convolutional Networks for Large-Scale Image Recognition1512014
6Real-time automatic detection system increases colonoscopic polyp and adenoma detection rates: A prospective randomized controlled study1342019
7Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries1312021
8Adenoma Detection Rate and Risk of Colorectal Cancer and Death1262014
9ImageNet classification with deep convolutional neural networks1252017
10Deep learning localizes and identifies polyps in real-time with 96% accuracy in screening1152018

Top 10 co-cited references.

The co-cited literature was clustered and analyzed in Citespace software (Figure 7), where the major clusters were mainly related to “Colonoscopy”, “Polyp Segmentation”, “Digital Pathology”, “Lymph Node Metastasis” and “Radiomics “.

Figure 7

3.8 Analysis of highly cited studies

Highly cited studies are often regarded as authoritative in the field of research, as they provide key evidence and information on research progresses and trends. In 2715 pieces of results from the VOSviewer software, a total of 83 studies with more than 100 citations were screened out, of which 37 were strong interconnections, and their density is visualized in Figure 8. Table 7 shows the top 10 cited literature, which were published mainly between 2016 and 2019.

Figure 8

Table 7

RankTitleCitationsYear of publication
1Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning?16872016
2Locality Sensitive Deep Learning for Detection and Classification of Nuclei in Routine Colon Cancer Histology Images7132016
3Genomic and Molecular Landscape of DNA Damage Repair Deficiency across The Cancer Genome Atlas6202018
4Genome-wide cell-free DNA fragmentation in patients with cancer5612019
5The Applications of Radiomics in Precision Diagnosis and Treatment of Oncology: Opportunities and Challenges4372019
6Real-time automatic detection system increases colonoscopic polyp and adenoma detection rates: a prospective randomized controlled study4122019
7Predicting survival from colorectal cancer histology slides using deep learning: A retrospective multicenter study3882019
8Deep Learning Localizes and Identifies Polyps in Real Time With 96% Accuracy in Screening Colonoscopy3832018
9Real-time differentiation of adenomatous and hyperplastic diminutive colorectal polyps during analysis of unaltered videos of standard colonoscopy using a deep learning model3582017
10Deep learning-based tissue analysis predicts outcomes in colorectal cancer3342018

Top 10 cited publications.

Among these highly cited studies, an article by TAJBAKHSH N et al. (30) published in IEEE Transactions on Medical Imaging, entitled “Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning?”, received the largest number of 1687 citations. This article discussed how to more effectively apply convolutional neural networks to medical image analysis. The second most cited was an article by SIRINUKUNWATTANA K et al. (31) in the IEEE Transactions on Medical Imaging entitled “Locality Sensitive Deep Learning for Detection and Classification of Nuclei in Routine Colon Cancer Histology Images”, with 713 citations. It described the use of deep learning techniques to assist in the diagnosis and staging of CRC. Another representative study was “Genomic and Molecular Landscape of DNA Damage Repair Deficiency across The Cancer Genome Atlas” published by KNIJNENBURG TA et al. (32) in Cell Reports, cited for 620 times. This study involved the application of AI in identifying tumor gene expression patterns.

Among the top 10 cited studies, three focused on diagnostic colonoscopy, two on medical image analysis, two on genomic studies, two on prognostic analysis, and one on exploration for digital pathology. This broad coverage highlights the diverse and active research in this field.

3.9 Keywords analysis

3.9.1 Keyword co-occurrence analysis

Co-occurrence of these keywords can reveal the most active topics within a research field. After importing the data into the VOSviewer software, the results showed a total of 4882 keywords. The top 20 keywords with the highest frequency were displayed (Table 8). The keywords that appeared more than 500 times included machine learning, deep learning, and CRC. Then, the popular keywords were categorized according to the type and functionality of AI models used in these studies (Table 9). The top five types of AI models were machine learning, deep learning, convolutional neural networks, artificial neural networks, and transfer learning; while in terms of functionality, the top five were colonoscopy, radiomics, prognosis, magnetic resonance imaging, and endoscopy. In addition, the VOSviewer software was used to plot the 54 keywords with more than 20 occurrences into a time-stacked graph (Figure 9), which demonstrated the evolution of keywords over time. As shown by gradation of colors in the graph, a color closer to yellow indicates that this keyword is more recent. As can be seen from the timeline in the bottom right corner of the figure, the top keywords were more concentrated between 2020 and 2022. Relatively new keywords included computed tomography, attention mechanism, immunotherapy, lymph node metastasis, and polyp segmentation. The concentrations of these keywords reflected the latest trends and developments in the research.

Table 8

RankkeywordsOccurrencesRankkeywordsOccurrences
1machine learning54711prostate cancer76
2deep learning54512prognosis70
3colorectal cancer52813Magnetic resonance imaging61
4artificial intelligence39914endoscopy56
5colonoscopy15615classification48
6rectal cancer13616artificial neural network46
7convolutional neural network13117computer-aided diagnosis46
8radiomics12118biomarkers45
9cancer11619digital pathology45
10colon cancer10920segmentation44

Top 20 keywords in terms of frequency of occurrence.

Table 9

classificationkeywordsOccurrencesclassificationkeywordsOccurrences
modelmachine learning547functioncolonoscopy156
deep learning545radiomics121
convolutional neural network131prognosis70
artificial neural network46Magnetic resonance imaging61
transfer learning43endoscopy56

Classification of hot keywords.

Figure 9

3.9.2 Keyword clustering and theme trend analysis

Keywords were clustered by CiteSpace software (Figure 10A), revealing the knowledge structure of a particular topic. A Q-value (Modularity) of 0.8317 and an S-value (Silhouette) of 0.9526 indicated that the clusters formed were highly structured and differentiated. Among the top 10 clusters identified (numbered 0-9), cluster #2 focused on polyp detection and covered keywords such as system, optical biopsy, and management; cluster #3 focused on inflammatory bowel disease and contained terms such as endoscopy, confocal laser microendoscopy, capsule endoscopy, and colonoscopy; cluster #5 dealt with digital pathology, with keywords such as microsatellite instability, image analysis, colonoscopy, and panoramic slice images; and cluster #6 focused on gene expression and highlighted the importance of deep learning, artificial intelligence, weighted gene co-expression network analysis, cells, and other concepts; cluster #7 focused on computer-assisted detection, and contained terms such as computed tomography, tumor heterogeneity, texture analysis, etc.; cluster #8 focused on cancer classification, with feature selection, gene expression data, deep learning, and quality metrics as its keywords.

Figure 10

In the CiteSpace, we tracked the evolution of keywords over time by plotting a timeline of keyword clustering (Figure 10B). Using the Bibliometrix, we created thematic trend charts based on keywords over the period from 2016 to 2023 (Figure 10C). We found that early research focused on CRC diagnosis, classification, and biomarkers, whereas recent research tended to focus on areas such as AI-assisted colorectoscopy, digital pathology, image analysis, gene expression, and prognosis prediction. By analyzing the topic trend map, we found that the hot keywords in current research included lymph node metastasis, computed tomography (CT), and tumor microenvironment, which may guide the direction of future research.

4 Discussion

4.1 Analysis of the basic situation of the study

This bibliometric analysis reviews the research on AI in CRC diagnosis and treatment in the past 20 years. We found that the number of publications per year grew slowly and never rose above 50 during the period from 2004 to 2016. However, since 2017, this number began to grow significantly, especially between 2020 and 2023, indicating that this research has ushered in a new era since 2017. More encouraging research is expected in the future.

In the world, China ranked first and the USA second in the number of publications, but China sit far below the USA in the total number of citations. In addition, China’s average number of citations was also lower than that of countries such as Sweden, the United States, the Netherlands, the United Kingdom, Canada, and Germany. According to the TSL index, the United States collaborated most frequently with other countries (regions). These data suggest that although China has released the highest number of publications, the overall quality and authority of its research remain to be improved. The United States undoubtedly occupies a central position in this field. High-quality research is mainly contributed by developed countries in Europe and the United States.

At the institutional level, Sun Yat-sen University devoted the highest number of publications, while the Mayo Clinic gained the highest number of citations. Harvard Medical School and Chinese Academy of Sciences ranked high in the number of publications, number of citations, and TSL, demonstrating their heavy contributions in this research area. As for individual contributors, MORI, YUICHI had the highest number of publications for his research on AI-assisted colonoscopy, while WANG, PU was the most cited author for his research on the effect of AI-assisted colonoscopy on adenoma detection. Their contributions were focused on AI-assisted colonoscopy. Finally, based on the total number of citations and impact factor values, the journals “Gastrointest Endosc” and “Gastroenterology” enjoyed the highest reputation in this field.

4.2 Current research hotspots

According to the analysis of highly cited studies, co-cited references, and keywords, the current research hotspots are mainly categorized into the following three areas:

4.2.1 Early screening and diagnosis

AI has greatly advanced the development of medical imaging. Features on CT, MRI, and other medical images can be extracted, quantified, and based on to identify and distinguish tumors. AI helps radiologists to detect not only CRC, but also tiny polyps or other early lesions, thus improving diagnostic accuracy and efficiency (33, 34). In colonoscopy, AI can locate and facilitate precise border segmentation of the lesions (24). Deep learning models, such as convolutional neural networks (CNN), can analyze colonoscopy video images in real-time to automatically detect polyps or other abnormal tissues. Some polyps that are small, isochromatic, flat, unclear at borders, hidden by folds, and located at the edges of the field of view, have been easily overlooked by endoscopists, but can now by accurately identified by AI (27, 29, 35). Computer-aided diagnostic (CADx) systems allow optical biopsy and characterization of polyps or other diseased tissues with an accuracy comparable to or higher than that of experienced endoscopists (36, 37).

4.2.2 Pathology and staging

Histopathologic analysis is regarded as the gold standard for assessing cancer diagnosis and prognosis. Whole Slide Imaging (WSI) is being widely around the world used for scanning and digitizing whole tissue sections. AI can extract pathological evidence from these digitized images to support decision-making on lesion type, grading, and other key characteristics (38, 39). To more accurately detect the nuclei, a team of researchers has combined Neighborhood Integration Predictor (NEP) with CNN to quantitate tissue components in the whole slide image (31). Staging of CRC is closely related to lymph node metastasis, lymph nodes with micro-metastases on whole slide images can currently be detected by deep learning models without manual annotation (40). In the future, the application of AI models in this field will be even broader.

4.2.3 Prognostic assessment

An AI model can combine one patient’s clinical information, pathological results, genetic test results, and other multi-dimensional data to assess the prognosis and provide a strategy for personalized treatment (41). Based on images of CRC tissue samples, researchers have combined a CNN with a recurrent architecture to train a deep learning model for predicting CRC prognosis. This model can assess CRC morphology, histopathology, microenvironment more accurately than an experienced physician (42, 43). AI can also automatically extract multi-dimensional prognostic features (such as tumor size, shape, edge) from CT, MRI, and other medical images for prognostic assessment (44). A “meta-model” has been established to assist in clinical decision-making and predict the survival of cancer patients, even their clinical data are incomplete (45). These AI models have substantially sharpened physicians’ ability to predict the outcomes of CRC.

4.3 Future research trends

As indicated by the results from the citation analysis and the topic trend analysis, future research of AI in CRC diagnosis and treatment is expected to focus on the following three aspects:

4.3.1 Multimodal data fusion

Most of current AI models are operated in a single paradigm, which limits their application in a broader clinical context. The fusion of paradigms can increase the robustness and accuracy of diagnostic and prognostic AI models. Multimodal fusion of histological images and genomic features has demonstrated higher accuracy in tumor grading and molecular typing than single-peak deep networks trained only on histological and genomic data (46). Faisal Mahmood’s team at Harvard Medical School are exploring the application of AI for multimodal data integration in oncology. Their AI model can discover new associations within and across modalities which can explain variations in outcomes or resistance to treatment, as well as new tumor biomarkers and therapeutic targets (47, 48). AI has demonstrated the ability to analyze complementary multimodal data streams about CRC. Multiorganomics techniques and AI algorithms will become a research hotspot in the future and drive the development of precision medicine for cancer (49).

4.3.2 Personalized treatment and precision medicine

AI can tail out an individualized treatment plan by analyzing a patient’s genetic information, biomarkers, clinical data, and lifestyle. It has shown that AI can predict microsatellite instability (MSI) and defective mismatch repair (dMMR) in CRC patients, thus facilitating the establishment of personalized drug regimens (50). Various algorithms, which can be run on APPs, have been trained to predict a wide range of genetic mutations, molecular tumor subtypes, gene expression signatures, and standard pathological biomarkers in routine pathological tissues (51). With the continuous advancement of AI technology, the treatment of CRC in the future will be more personalized, precise, and efficient.

4.3.3 Drug development

AI can be used to optimize peptide synthesis and molecular design, virtualize molecular docking, quantitate conformational relationships, drug repositioning, protein misfolding, protein-protein interactions, and identify molecular pathways in polypharmacy (52). Using AI, an IncRNA-related ceRNA regulatory network containing 144 core genes has been constructed for CRC, thereby easing the exploration of targeted treatments (53). In summary, AI has aroused a sea change in drug development.

5 Limitation

Given that other databases may contain valuable literature, use of only the WoSCC database may have limited the significance of the study. There may have been a language bias from the inclusion of only publications in English, rather than other languages. The quality of selected literature was not assessed, which may have affected the interpretation of the results. Recent high-quality articles were not included into the analysis, which may have discounted the reliability of our findings.

6 Conclusion

In the research on AI in CRC diagnosis and treatment, the United States makes a significant contribution by publishing a large number of articles with high qualities and impacts. China also occupies an important position in the volume of published articles, but there is still room for improvement in the overall quality of research. Currently, the main research hotspots are put on early screening and diagnosis (including imaging, colonoscopy, etc.), pathological diagnosis and staging, and prognostic assessment. Future research may turn to multimodal data, personalized treatment, precision medicine, as well as drug development.

Statements

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

LS: Methodology, Writing – original draft, Writing – review & editing. RZ: Data curation, Software, Writing – original draft. YG: Data curation, Formal analysis, Writing – review & editing. LH: Data curation, Writing – review & editing. CJ: Conceptualization, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported by the National Natural Science Foundation of China (Grant No. 82274269), Traditional Chinese Medicine Technology Development Plan of Jiangsu Provincial Administration of Traditional Chinese Medicine (ZD202324), and Top Talent Support Program for young and middle-aged people of Wuxi Health Committee (BJ2023064).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  • 1

    SungHFerlayJSiegelRLLaversanneMSoejomataramIJemalAet al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660

  • 2

    HanBZhengRZengHWangSSunKChenRet al. Cancer incidence and mortality in China, 2022. J Natl Cancer Center. (2024) 4:47–53. doi: 10.1016/j.jncc.2024.01.006

  • 3

    DekkerETanisPJVleugelsJLAKasiPMWallaceMB. Colorectal cancer. Lancet. (2019) 394:1467–80. doi: 10.1016/S0140-6736(19)32319-0

  • 4

    MiMWengSXuZHuHWangYYuanY. CSCO guidelines for colorectal cancer version 2023: Updates and insights. Chin J Cancer Res = Chung-kuo yen cheng yen chiu. (2023) 35:233–8. doi: 10.21147/j.issn.1000-9604.2023.03.02

  • 5

    BhinderBGilvaryCMadhukarNSElementoO. Artificial intelligence in cancer research and precision medicine. Cancer Discovery. (2021) 11:900–15. doi: 10.1158/2159-8290.CD-21-0090

  • 6

    MitsalaATsalikidisCPitiakoudisMSimopoulosCTsarouchaAk. Artificial intelligence in colorectal cancer screening, diagnosis and treatment. A New Era Curr Oncol (Toronto Ont). (2021) 28:1581–607. doi: 10.3390/curroncol28030149

  • 7

    KumarSKMiskovicVBlasiakASundarRPedrocchiALGPearsonATet al. Artificial intelligence in clinical oncology: from data to digital pathology and treatment. Am Soc Clin Oncol Educ book Am Soc Clin Oncol Annu Meeting. (2023) 43:e390084. doi: 10.1200/EDBK_390084

  • 8

    SharmaAKumarRYadavGGargP. Artificial intelligence in intestinal polyp and colorectal cancer prediction. Cancer Lett. (2023) 565:216238. doi: 10.1016/j.canlet.2023.216238

  • 9

    SpadacciniMMassimiDMoriYAlfaroneLFugazzaAMaselliRet al. Artificial intelligence-aided endoscopy and colorectal cancer screening. Diagnostics (Basel Switzerland). (2023) 13:1102. doi: 10.3390/diagnostics13061102

  • 10

    MukherjeeDLimWMKumarSDonthuN. Guidelines for advancing theory and practice through bibliometric research. J Business Res. (2022) 148:101–15. doi: 10.1016/j.jbusres.2022.04.042

  • 11

    ÖztürkOKocamanRKanbachDK. How to design bibliometric research: an overview and a framework proposal. Rev Manag Sci. (2024), 1–29. doi: 10.1007/s11846-024-00738-0

  • 12

    EllegaardOWallinJA. The bibliometric analysis of scholarly production: How great is the impact? Scientometrics. (2015) 105:1809–31. doi: 10.1007/s11192-015-1645-z

  • 13

    HuangPFengZShuXWuAWangZHuTet al. A bibliometric and visual analysis of publications on artificial intelligence in colorectal cancer (2002-2022). Front Oncol. (2023) 13:1077539. doi: 10.3389/fonc.2023.1077539

  • 14

    LiuGZhaoJTianGLiSLuY. Visualizing knowledge evolution trends and research hotspots of artificial intelligence in colorectal cancer: A bibliometric analysis. Front Oncol. (2022) 12:925924. doi: 10.3389/fonc.2022.925924

  • 15

    ChenCSongM. Visualizing a field of research: A methodology of systematic scientometric reviews. PloS One. (2019) 14:e0223994. doi: 10.1371/journal.pone.0223994

  • 16

    Nj van ELW. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics. (2010) 84:523–38. doi: 10.1007/s11192-009-0146-3

  • 17

    AriaMCuccurulloC. bibliometrix: An R-tool for comprehensive science mapping analysis. J Informetrics. (2017) 11:959–75. doi: 10.1016/j.joi.2017.08.007

  • 18

    ChenC. Searching for intellectual turning points: progressive knowledge domain visualization. Proc Natl Acad Sci USA. (2004) 101:5303–10. doi: 10.1073/pnas.0307513100

  • 19

    TangRZhangSDingCZhuMGaoY. Artificial intelligence in intensive care medicine: bibliometric analysis. J Med Internet Res. (2022) 24:e42185. doi: 10.2196/42185

  • 20

    LongDMaoCZhangZZouJZhuY. Visual analysis of colorectal cancer and gut microbiota: A bibliometric analysis from 2002 to 2022. Medicine. (2023) 102:e35727. doi: 10.1097/MD.0000000000035727

  • 21

    BrayFFerlayJSoerjomataramISiegelRLTorreLAJemalA. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. (2018) 68:394–424. doi: 10.3322/caac.21492

  • 22

    SiegelRLMillerKDJemalA. Cancer statistics, 2017. CA: Cancer J Clin. (2017) 67:7–30. doi: 10.3322/caac.21387

  • 23

    DaCCdJArMWkZJkLCaDet al. Adenoma detection rate and risk of colorectal cancer and death. New Engl J Med. (2014) 370:1298–306. doi: 10.1056/NEJMoa1309086

  • 24

    RonnebergerOFischerPBroxT. U-net: convolutional networks for biomedical image segmentation. In: NavabNHorneggerJWellsWMFrangiAF, editors. Medical image computing and computer-assisted intervention – MICCAI 2015. Lecture notes in computer science. Springer International Publishing, Cham (2015). p. 234–41. doi: 10.1007/978-3-319-24574-4_28

  • 25

    Deep residual learning for image recognition | IEEE conference publication | IEEE xplore. Available online at: https://ieeexplore.ieee.org/document/7780459 (Accessed March 28, 2024).

  • 26

    SimonyanKZissermanA. Very deep convolutional networks for large-scale image recognition. Comput Sci. (2014). doi: 10.48550/arXiv.1409.1556

  • 27

    Real-time automatic detection system increases colonoscopic polyp and adenoma detection rates: a prospective randomised controlled study . Available online at: https://pubmed.ncbi.nlm.nih.gov/30814121/ (Accessed March 28, 2024).

  • 28

    KrizhevskyASutskeverIHintonG. ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst. (2012) 25:84–90. doi: 10.1145/3065386

  • 29

    GregorUPriyamTTalalAMohitMFaridJWilliamKet al. Deep learning localizes and identifies polyps in real time with 96% Accuracy in screening colonoscopy. Gastroenterology. (2018) 155:1069–1078.e8. doi: 10.1053/j.gastro.2018.06.037

  • 30

    TajbakhshNShinJYGuruduSRHurstRTKendallCBGotwayMBet al. Convolutional neural networks for medical image analysis: full training or fine tuning? IEEE Trans Med Imaging. (2016) 35:1299–312. doi: 10.1109/TMI.2016.2535302

  • 31

    SirinukunwattanaKRazaSEATsangY-WSneadDRJCreeIARajpootNM. Locality sensitive deep learning for detection and classification of nuclei in routine colon cancer histology images. IEEE Trans Med Imaging. (2016) 35:1196–206. doi: 10.1109/TMI.2016.2525803

  • 32

    KnijnenburgTAWangLZimmermannMTChambweNGaoGFCherniackADet al. Genomic and molecular landscape of DNA damage repair deficiency across the cancer genome atlas. Cell Rep. (2018) 23:239254.e6. doi: 10.1016/j.celrep.2018.03.076

  • 33

    LiuZWangSDongDWeiJFangCZhouXet al. The applications of radiomics in precision diagnosis and treatment of oncology: opportunities and challenges. Theranostics. (2019) 9:1303–22. doi: 10.7150/thno.30309

  • 34

    Pk SPGYcLSfCJfYDdOYjC. Localization of colorectal cancer lesions in contrast-computed tomography images via a deep learning approach. Bioengineering (Basel Switzerland). (2023) 10:972. doi: 10.3390/bioengineering10080972

  • 35

    WangPLiuXBerzinTMBrownJRGLiuPZhouCet al. Effect of a deep-learning computer-aided detection system on adenoma detection during colonoscopy (CADe-DB trial): a double-blind randomised study. Lancet Gastroenterol Hepatol. (2020) 5:343–51. doi: 10.1016/S2468-1253(19)30411-X

  • 36

    HouwenBBSLHazewinkelYGiotisIVleugelsJLAMostafaviNSvan PuttenPet al. Computer-aided diagnosis for optical diagnosis of diminutive colorectal polyps including sessile serrated lesions: a real-time comparison with screening endoscopists. Endoscopy. (2023) 55:756–65. doi: 10.1055/a-2009-3990

  • 37

    CollinsTBencteuxVBenedicentiSMorettiVMitaMTBarbieriVet al. Automatic optical biopsy for colorectal cancer using hyperspectral imaging and artificial neural networks. Surg Endoscopy. (2022) 36:8549–59. doi: 10.1007/s00464-022-09524-z

  • 38

    SuALeeHTanXSuarezCJAndorNNguyenQet al. A deep learning model for molecular label transfer that enables cancer cell identification from histopathology images. NPJ Precis Oncol. (2022) 6:14. doi: 10.1038/s41698-022-00252-0

  • 39

    TiwariSFalahkheirkhahKChengGBhargavaR. Colon cancer grading using infrared spectroscopic imaging-based deep learning. Appl Spectrosc. (2022) 76:475. doi: 10.1177/00037028221076170

  • 40

    ChuangWYChenCCYuWHYehCJChangSHUengSHet al. Identification of nodal micrometastasis in colorectal cancer using deep learning on annotation-free whole-slide images. Modern pathology : an Off J United States Can Acad Pathology Inc. (2021) 34:1901–11. doi: 10.1038/s41379-021-00838-2

  • 41

    QiuHDingSLiuJWangLWangX. Applications of artificial intelligence in screening, diagnosis, treatment, and prognosis of colorectal cancer. Curr Oncol (Toronto Ont). (2022) 29:1773–95. doi: 10.3390/curroncol29030146

  • 42

    BychkovDLinderNTurkkiRNordlingSKovanenPEVerrillCet al. Deep learning based tissue analysis predicts outcome in colorectal cancer. Sci Rep. (2018) 8:3395. doi: 10.1038/s41598-018-21758-3

  • 43

    KatherJNKrisamJCharoentongPLueddeTHerpelEWeisCAet al. Predicting survival from colorectal cancer histology slides using deep learning: A retrospective multicenter study. PloS Med. (2019) 16:e1002730. doi: 10.1371/journal.pmed.1002730

  • 44

    BeraKBramanNGuptaAVelchetiVMadabhushiA. Predicting cancer outcomes with radiomics and artificial intelligence in radiology. Nat Rev Clin Oncol. (2022) 19:132–46. doi: 10.1038/s41571-021-00560-7

  • 45

    BaronJMParanjapeKLoveTSharmaVHeaneyDPrimeM. Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support. J Am Med Inf Association : JAMIA. (2021) 28:605–15. doi: 10.1093/jamia/ocaa254

  • 46

    ChenRJLuMYWangJWilliamsonDFKRodigSJLindemanNIet al. Pathomic fusion: an integrated framework for fusing histopathology and genomic features for cancer diagnosis and prognosis. IEEE Trans Med Imaging. (2022) 41:757–70. doi: 10.1109/TMI.2020.3021387

  • 47

    LipkovaJChenRJChenBLuMYBarbieriMShaoDet al. Artificial intelligence for multimodal data integration in oncology. Cancer Cell. (2022) 40:1095–110. doi: 10.1016/j.ccell.2022.09.012

  • 48

    ChenRJLuMYWilliamsonDFKChenTYLipkovaJNoorZet al. Pan-cancer integrative histology-genomic analysis via multimodal deep learning. Cancer Cell. (2022) 40:865–78.e6. doi: 10.1016/j.ccell.2022.07.004

  • 49

    HeXLiuXZuoFShiHJingJ. Artificial intelligence-based multi-omics analysis fuels cancer precision medicine. Semin Cancer Biol. (2023) 88:187–200. doi: 10.1016/j.semcancer.2022.12.009

  • 50

    EchleAGrabschHIQuirkePvan den BrandtPAWestNPHutchinsGGAet al. Clinical-grade detection of microsatellite instability in colorectal tumors by deep learning. Gastroenterology. (2020) 159:14061416.e11. doi: 10.1053/j.gastro.2020.06.021

  • 51

    KatherJNHeijLRGrabschHILoefflerCEchleAMutiHSet al. Pan-cancer image-based detection of clinically actionable genetic alterations. Nat Cancer. (2020) 1:789–99. doi: 10.1038/s43018-020-0087-6

  • 52

    GuptaRSrivastavaDSahuMTiwariSAmbastaRKKumarP. Artificial intelligence to deep learning: machine intelligence approach for drug discovery. Mol Divers. (2021) 25:1315–60. doi: 10.1007/s11030-021-10217-3

  • 53

    HuDZhangBYuMShiWZhangL. Identification of prognostic biomarkers and drug target prediction for colon cancer according to a competitive endogenous RNA network. Mol Med Rep. (2020) 22:620–32. doi: 10.3892/mmr.2020.11171

Summary

Keywords

colorectal cancer, Artificial Intelligence, CiteSpace, VOSviewer, Bibliometrix, bibliometrics

Citation

Sun L, Zhang R, Gu Y, Huang L and Jin C (2024) Application of Artificial Intelligence in the diagnosis and treatment of colorectal cancer: a bibliometric analysis, 2004–2023. Front. Oncol. 14:1424044. doi: 10.3389/fonc.2024.1424044

Received

13 May 2024

Accepted

23 September 2024

Published

11 October 2024

Volume

14 - 2024

Edited by

Cheng-Xiong Xu, Chongqing University, China

Reviewed by

Calin Cainap, University of Medicine and Pharmacy Iuliu Hatieganu, Romania

Jung Lee, Milwaukee School of Engineering, United States

Updates

Copyright

*Correspondence: Chunhui Jin,

†These authors have contributed equally to this work and share first authorship

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics