- 1Department of Clinical Medicine, Medical College, Yangzhou University, Yangzhou, China
- 2Department of Medical Imaging, Northern Jiangsu People’s Hospital, Clinical Medical College of Yangzhou University, Yangzhou, China
- 3Digestive Department of Northern Jiangsu People’s Hospital, Clinical Medical College of Yangzhou University, Yangzhou, China
Background: Artificial intelligence (AI) has gained widespread attention because of its potential applications in the clinical management of inflammatory bowel disease (IBD). However, bibliometric analyses of the literature published in this field are currently lacking. This study aimed to characterize the development trends and identify research hotspots in the application of AI to IBD through a bibliometric approach.
Methods: Literature related to the application of AI in IBD, published between 2000 and 2024, was retrieved from the Web of Science Core Collection. Microsoft Office Excel 2021 was used to analyze and visualize the annual number of publications. Charticulator was utilized to create country cross chord charts. CiteSpace was employed to visualize collaboration networks among authors, institutions, and countries, generate timeline visualizations and perform a burst analysis of references and keywords.
Results: The bibliometric analysis included 1,136 publications published between the years 2000 and 2024. The number of annual publications showed a rapid growth trend. The USA (n = 337) and Harvard University (n = 47) had the most published papers. Stidham, Ryan (n = 21) published the highest number of articles. The core journals shaping the field included Scientific Reports, Inflammatory Bowel Diseases, and PLoS One. The timeline view and burst analysis of references and keywords revealed that the research hotspots focused on radiomics, endoscopy, natural language processing (NLP), and personalized medicine after 2020.
Conclusion: A growing number of researchers have engaged in exploring the application of AI in IBD, with the USA contributing the most to this field. Currently, the research hotspots mainly involve radiomics, endoscopy, NLP, and personalized medicine. This study provides valuable information for researchers evaluating the application of AI in IBD.
1 Introduction
Inflammatory bowel diseases (IBD), including Crohn’s disease (CD) and ulcerative colitis (UC), are characterized by chronic intestinal inflammation (1). The clinical symptoms of IBD include abdominal pain, diarrhea, weight loss, and bloody stools. The disease can lead to serious complications such as stricture, fistula, infection, and even cancer (2). Moreover, the incidence of IBD is increasing globally each year, particularly in developing countries. Consequently, IBD has become a major global public health concern (3, 4). In recent years, immunomodulators and biologic agents have led to significant advancements in the treatment of IBD. However, due to the high clinical heterogeneity of the disease and individual variations in patient response to treatment, challenges persist (5). This highlights the urgent need to develop more precise diagnostic, assessment, and treatment strategies.
Artificial intelligence (AI) has shown great potential in the diagnosis and treatment of IBD. AI can effectively identify the early signs of disease, predict disease progression, and assist in the development of personalized treatment plans (6–8). In addition, AI has demonstrated significant potential for drug discovery and facilitated the development of new therapies (9). Moreover, the continuous expansion of datasets and optimization of algorithms will further advance the application of AI in developing personalized therapies for IBD.
Several review articles have explored the application of AI in IBD; however, these reviews often lack objective quantitative analyses and rely overly on the subjective understanding of researchers. Consequently, these reviews are often variable and subjective when presenting research findings. In contrast, bibliometrics, as a quantitative analysis tool, can objectively and systematically analyze authors, institutions, countries, references, and keywords within a field (10). Hence, bibliometrics have become an essential tool for tracking research trends and predicting future directions and is widely applied in medical research. However, a comprehensive analysis of the literature in this field is lacking.
Therefore, this study aimed to utilize bibliometric methods to systematically analyze research on AI in IBD since the 21st century, identify major research contributors, examine the distribution of current research outcomes, identify research hotspots, and explore the frontiers of this field, thereby establishing a systematic and comprehensive knowledge framework.
2 Materials and methods
2.1 Data sources and retrieval strategies
The Web of Science Core Collection (WoSCC) is globally recognized as one of the most authoritative databases and is widely considered the preferred platform for bibliometric analysis (11). The data analyzed in this study included all literature on the application of AI in IBD since 2000, which was retrieved on January 11, 2025. The search strategy is described in Supplementary material 1.
The initial search results retrieved 1,402 documents. After filtering the data to include only “article” and “review article” types, 1,148 documents were obtained. After excluding four non-English documents and eight documents from the fields of zoology and veterinary science, 1,136 documents were included in the final analysis. The search and screening process is illustrated in Figure 1. The data were exported in “plain text” format from the “Full Record and Cited References” section of the WoSCC platform. In addition, citation reports for the top 10 most-cited articles; the top 10 most prolific authors, institutions, and countries; and the top 15 journals by publication volume were obtained from the WoSCC.
2.2 Data processing and analysis
Data deduplication was performed using CiteSpace software (6.3. R3, Advanced), and no duplicate records were obtained. Keywords with similar meanings, singular and plural forms, and abbreviations were merged. For example, “inflammatory bowel disease (IBD),” “IBD,” “inflammatory bowel diseases,” and “inflammatory bowel disease” were merged into “inflammatory bowel disease.” CiteSpace was used to visualize collaboration networks among authors, institutions, and countries, generate keyword and reference timeline visualization maps, and identify keywords and references with the strongest citation bursts. Microsoft Office Excel 2021 and Charticulator were used to visualize the annual number of publications and construct country cross chord charts, respectively.
2.3 Interpretation of charts and indicators
Each node in the collaboration network, using by CiteSpace, represents an entity. The color of a node indicates the year of its first appearance. The lines between nodes indicate the degree of collaboration between them (12). A notable advantage of network visualization is its ability to quickly reveal the structure and relationships among the nodes and accurately highlight key and influential nodes (13). In the burst analysis, the timeline from 2002 to 2024 is represented by lines where light-green segments indicate periods when the entity has not yet appeared in the dataset, dark-green segments denote active periods of research, and red segments indicate burst periods of research (14).
The average number of citations per publication (AC/P) can be used to measure the citation impact of a scholar, institution, or country within a specific period, thereby assessing its research influence (15). The H-index is used to evaluate the academic achievements of individual researchers and has been increasingly applied to measure the research impact of institutions, countries, and journals (16). The modularity (Q value) and silhouette (S value) values were used to evaluate the community structure and clustering quality of the network (17). A Q value >0.3 indicates a significant clustering structure. The S value is typically used to assess the quality and cohesion of the network clustering. An S value >0.7 indicates good clustering quality.
3 Results
3.1 Scientific output
A total of 1,136 publications were included in the analysis, comprising 937 articles and 199 reviews. Figure 2 illustrates the trend in the annual number of publications and cumulative number of publications. The results indicate that research on the application of AI in IBD is growing rapidly. In 2013, the annual number of publications exceeded 10 for the first time, which surpassed 100 by 2021. In 2024, the annual number of publications reached 427. The exponential regression analysis (R2 = 0.9921) indicated that the cumulative number of publications showed a continuous exponential growth trend.

Figure 2. Annual and cumulative output about the application of AI in IBD related publications from 2002 to 2024.
3.2 Analysis of authors
A total of 7,401 authors were included in the analysis of AI applications in IBD. Figure 3 shows the collaboration network among the authors. Based on the distribution of nodes, authors can be broadly divided into three main groups, with representative scholars including Stidham, Ryan, Iacucci, Marietta, and Kopylov, Uri. Table 1 lists the top 10 most productive authors in terms of the number of publications. Stidham, Ryan W. (21) ranks first, followed by Iacucci, Marietta (16), Waljee, Akbar (15), and Danese, Silvio (15). Regarding the H-index, Stidham, Ryan W. (12) ranks first, followed by Waljee, Akbar (10), Ghosh, Subrata (10), and Higgins, Peter D. R. (10). Regarding the AC/P index, Higgins, Peter D. R. (68.08) ranks first, followed by Stidham, Ryan W. (44.1) and Waljee, Akbar (43.54).

Table 1. The top 10 most productive authors related to the research about application of AI in inflammatory bowel disease.
3.3 Analysis of institutions
More than 4,000 institutions have contributed to research on the AI in IBD. A total of 445 nodes were identified using CiteSpace analysis (Figure 4). Table 2 lists the 10 most productive institutions based on the number of publications. The top three institutions in terms of the number of publications and H-index were Harvard University (n = 47, H-index = 24), University of California System (n = 42, H-index = 18), and University of Michigan (n = 32, H-index = 17). The top three institutions in terms of AC/P were Massachusetts General Hospital (81.91), Harvard University (64.34), and the University of Michigan (48.16).

Table 2. The top 10 most productive institutions related to the research about application of AI in inflammatory bowel disease.
3.4 Analysis of countries/regions
A total of 65 countries/regions have published papers in related fields. Figure 5A shows the collaboration network among the countries/regions. Network analysis revealed that countries such as the USA, United Kingdom, and Italy were among the first to publish research on AI in the context of IBD. However, in recent years, the number of publications from China has grown rapidly. The top 10 most productive countries, ranked by the number of published papers, are listed in Table 3. The USA leads in terms of publication numbers, AC/P, and the H-index, with publications comprising 29.67% of the global total. China (327) and the United Kingdom (115) ranked second and third, respectively, although the number of publications from each country was less than 100. Figure 5B shows the collaboration among the top 20 countries, with the closest collaboration between Canada and the USA, followed by the United Kingdom and the USA, and China and the USA. Thus, the USA maintains a high level of collaboration with multiple countries globally. In contrast, collaborative relationships among other countries are relatively weak, especially across continents.

Table 3. The top 10 most productive countries related to the research about application of AI in inflammatory bowel disease.
3.5 Analysis of journals
A total of 458 journals published studies on the application of AI in IBD. The top 15 most productive journals are listed in Table 4. Scientific Reports published the most articles (n = 44), followed by Inflammatory Bowel Diseases (n = 41) and PLoS One (n = 27). Inflammatory Bowel Diseases had the highest H-index of 19. Data on the Impact Factors (IF) and Journal Citation Reports (JCR) quartiles were obtained from the JCR. Among the top 15 journals, Gastroenterology had the highest impact factor (IF = 25.7), and all journals were in Q1, except for Diagnostics and Frontiers in Microbiology, which were in Q2.

Table 4. The top 15 most productive journals related to the research about application of AI in inflammatory bowel disease.
3.6 Analysis of references
Table 5 presents the 10 most-cited articles. The article by Kieft et al. (18), published in Microbiome in 2020, had the highest number of citations (n = 472). This study introduced VIBRANT, which significantly improved the recovery quality of viral genomes using neural networks and a new v-score metric. This revealed specific viral populations associated with CD, providing a new perspective on microbiome-host interactions. Additionally, the Sequence Kernel Association Test method, proposed by Ionita-Laza et al. (19), combined genome-wide association studies (GWAS) and whole-exome sequencing data to enhance the effectiveness of genetic research on diseases such as CD. This method paved the way for new applications of AI in genetic data analysis. The article by Lee et al. (20) was ranked third and introduced a sequence-based gkm-SVM algorithm. This algorithm accurately predicted the impact of regulatory variants using deltaSVM scores, aiding in the identification of functional regulatory variants associated with autoimmune diseases.
A reference timeline map was constructed using CiteSpace (Figure 6A), with references clustered based on title words. The Q value was 0.8151 and the S value was 0.9269, indicating significant and highly reliable clustering results. Nine clusters were obtained: “#1 artificial intelligence,” “#2 ulcerative colitis,” “#3 gut microbiota,” “#4 comparative efficacy,” “#7 Crohn’s disease,” “#8 genetic prediction,” “#13 random walk,” “#14 infiltration-associated biomarker,” and “#18 predicting potential microbe-disease association.” References with higher citation burst intensity typically have broader scientific impact and greater influence on subsequent research. Figure 6B shows the 25 references with the strongest citation bursts. The three references with the highest burst strengths were: “The treatment-naive microbiome in new-onset Crohn’s disease” (strength = 11.3) (21), “Worldwide incidence and prevalence of inflammatory bowel disease in the 21st century: a systematic review of population-based studies” (strength = 10.43) (4), and “Classification of Pediatric Inflammatory Bowel Disease using Machine Learning” (strength = 8.81) (22).

Figure 6. (A) Reference timeline visualization map. (B) Top 25 references with the strongest citation bursts.
3.7 Analysis of keywords
The 626 keywords were distributed across 12 clusters with reliable clustering results (Figure 7A) (Q = 0.4725, S = 0.7551). The clusters, ranked from largest to smallest, were “#0 artificial intelligence,” “#1 capsule endoscopy,” “#2 remission,” “#3 inflammatory bowel disease,” “#4 cancer,” “#5 radiomics,” “#6 metabolomics,” “#7 cytokine,” “#8 machine learning,” “#9 prediction model,” “#10 adenoma detection,” and “#11 natural products.” The timeline visualization of keywords illustrates the evolution of research development. Over time, the research hotspots have gradually shifted from IBD, cancer, metabolomics, and adenoma detection to remission, capsule endoscopy, radiomics, and natural products. Keyword burst detection was used to identify keywords with a rapid increase in frequency over a short period, thereby revealing research hotspots and emerging trends. Figure 7B shows the 25 keywords with the strongest citation bursts. The keywords that have emerged since 2020 include colonoscopy, natural language processing (NLP), random forest, score, enteroscopy, personalized medicine, short-chain fatty acids, and clinical trials.

Figure 7. (A) Keyword timeline visualization map. (B) Top 25 keywords with the strongest citation bursts.
Figure 8 illustrates the eight principal investigation modalities applied in AI research on IBD, summarizing the full spectrum of data types and analytical approaches identified in our systematic analysis. These modalities include endoscopy, imaging, histopathology, omics, gut microbiome, laboratory data, structured clinical information, and NLP.
4 Discussion
In this study, we analyzed 1,136 publications on the application of AI in IBD retrieved from the WOSCC. The number of publications in this field showed a rapid growth trend, with 81.86% of all the papers published in the last 5 years. This rapid expansion reflects the presence of several concurrent drivers. First, the open sourcing of deep learning frameworks, such as TensorFlow in 2015 and PyTorch in 2017, has removed licensing barriers and enabled gastroenterology teams to train convolutional neural networks and transformer models on colonoscopy images, radiology scans, and electronic health record data. Second, the EU Horizon 2020 program on “AI for Health Imaging” has supported multiple multinational collaborative studies. The Notice of Special Interest NOT-CA-24-031 by NIH encourages the analysis and clinical validation of AI tools across various disease domains. Public–private initiatives such as “AI for Health” by Microsoft further lowered costs for small and mid-sized research centers by providing cloud computing resources, datasets, and engineering support. Third, strategic policy signals and data-sharing infrastructures such as the 2017 New Generation Artificial Intelligence Development Plan by China prioritized medical AI. Finally, the SECURE-IBD registry established during the COVID-19 pandemic generated large-scale clinical datasets of patients with IBD, significantly shortening the model development cycles. Given the persistence of these enabling factors, publications on AI in IBD are expected to increase rapidly.
The top 10 authors, institutions, and countries were mainly concentrated in Europe, North America, and Asia. The top 10 countries in terms of publication volume are all developed countries or major economies, suggesting a close relationship between the level of economic development and the distribution of academic output; higher economic development promotes academic output in AI applications for IBD. The outstanding performance of USA in AI applications for IBD is attributed to its early leading position in AI research, ample research funding, diverse population providing rich data resources, comprehensive policy and regulatory support, advanced medical infrastructure, and extensive international collaborations. Collectively, these factors contribute to the USA’s leading position and academic output in this field. Notably, China has also demonstrated rapid growth in academic output (Figure 5A). As of 2024, the total annual publication volume for China has reached 327 articles, approaching that of the USA. These changes reflect China’s rapid progress in this field. However, China has a relatively low H-index and AC/P, due to the recent rapid development of AI, which has not allowed sufficient time for publications to accumulate high citation counts. Therefore, Chinese scholars must focus on enhancing their research impact and increasing international collaboration and exchange.
Additionally, the incidence of IBD appears to have influenced the distribution of current research outputs. North America has the highest annual standardized incidence rate (ASR) of IBD, whereas Oceania has the lowest ASR. The East Asian region had the highest average annual growth rate in IBD incidence, whereas the Central European region experienced the most rapid decline in ASR (23). This trend mirrors the distribution of research output on AI in IBD, indicating a close relationship between changes in incidence rates and research output.
AI in IBD is closely associated with the collaborative development of gut microbiome studies and cancer research (Figures 6A, 7A). AI can significantly enhance the ability to identify biomarkers and therapeutic targets by integrating the gut microbiome data. Our results are consistent with the bibliometric analyses conducted by Zhang et al. (24) on IBD and gut microbiota, who indicates that countries such as the USA, China, and Italy, along with institutions such as Harvard Medical School, University of California, and University College Cork, are major contributors to research on IBD and gut microbiota. Furthermore, our results suggest that AI has played a crucial role in related studies on IBD and cancer. Bibliometric analyses conducted by Zhang et al. (25) similarly showed that countries such as China, Germany, Japan, the United Kingdom, and the USA are major contributors to research on colorectal cancer and IBD. Moreover, the increasing number of annual publications from China observed in this study are consistent with increasing research in the fields of IBD and colorectal cancer in China since 2018.
Among the top 15 journals in terms of publications, all are Q1 journals except for “Diagnostics” and “Frontiers in Microbiology,” which are ranked Q2. This suggests that AI in IBD has received widespread attention and achieved high recognition of its quality. The predominance of Q1 journals reflects the profound impact and academic value of AI in IBD. When selecting a journal for submission, researchers should prioritize highly productive and influential journals, as they provide an important platform for AI in IBD and ensure that research findings are recognized and disseminated to a broader academic community.
Based on the timeline view of references and keywords combined with burst detection analysis, this research can be broadly categorized into three distinct periods:
In the first phase (before 2012), the application of AI in IBD research was in its early stages, with fewer than 10 annual publications. Early studies relied primarily on single-modality data, such as genetic, histopathological, omics, or endoscopic data, to construct diagnostic models for IBD or IBD-related cancers or adenomas, focusing mainly on predicting the efficacy of steroid therapies. Artificial neural networks successfully distinguished IBD-related colorectal tumors from sporadic adenomas/carcinomas in the field of genetic analysis (26). Whereas, the combination of Raman spectroscopy and support vector machines achieved 98.9% accuracy in distinguishing among CD, UC, and healthy tissues in the field of histopathology (27). Decision-tree ensemble algorithms in omics research have successfully identified biomarkers (28). Additionally, endoscopic image analysis has shown great potential in clinical applications. Based on the morphological features of ulcers, a classification and regression tree (CART) model achieved 92% accuracy in distinguishing Behçet’s disease from CD (29). In terms of treatment, the CART algorithm analysis revealed that BclI and NALP1 gene polymorphisms are closely related to steroid response, emphasizing the role of genetic and demographic factors in predicting steroid efficacy in young patients with IBD (30). Moreover, with the expansion of data volume and optimization of technology, these research directions remain popular topics in the field. However, these studies did not fully utilize the synergistic effects of multimodal data. Therefore, in the future, research on AI should focus on a deeper integration of multimodal data to further enhance the accuracy and generalization ability of models, aiming to overcome the translational bottleneck from laboratory research to clinical application.
In the second phase (2012–2019), the application of AI in IBD research grew rapidly. This phase is marked by the widespread application of machine learning algorithms in areas such as drug therapy, gut microbiota, and GWAS. Predictive models for drug therapy provided strong support for personalized therapy and early intervention in clinical settings. This enabled the prediction of treatment outcomes using immunosuppressants and biologics. For example, a dynamic risk assessment system based on a random forest algorithm predicted the risk of hospitalization and steroid use in patients with IBD by integrating predictive factors such as demographic characteristics, laboratory indicators, and medication information (31). The model effectively predicted remission outcomes of thiopurine treatment in patients with IBD by analyzing laboratory indicators and age, leading to a reduction in steroid use, hospitalization, and surgery rates (32). Another algorithm based on week 6 laboratory data from vedolizumab treatment identified patients with CD who could achieve steroid-free remission by week 52 (33). The AI model provided new perspectives and methods for the early diagnosis and prognostic assessment of IBD by analyzing changes in the gut microbiota. Particularly, the machine learning model distinguished patients with CD from healthy controls based on differences in microbial communities, indicating that patients with IBD exhibited distinct dysbiosis characteristics that significantly differed from those of other immune diseases (34). The neural network algorithm vedoNet, which integrated the gut microbiome and clinical data, significantly enhanced the ability to predict clinical remission following vedolizumab treatment. These results suggested that early dynamic changes in the microbiome could serve as key markers of the IBD treatment response (35). Furthermore, GWAS revealed genetic factors associated with diseases by analyzing large-scale associations between individual genetic variations and diseases or traits. A model trained using machine learning on data from a known GWAS study of IBD-related genes successfully identified 347 new potential IBD risk genes and provided new insights into the pathogenesis of IBD (36). The gkm-SVM method, proposed by Lee et al. (20), accurately identified GWAS SNPs and their functional regulatory roles in the genomic context using deltaSVM technology and provided a powerful tool for discovering new IBD risk SNPs. However, this stage of research faced various challenges, including data heterogeneity, limited model generalizability, and difficulties in clinical translation. Interdisciplinary collaboration and big data sharing were the key drivers of the advancement of AI in IBD research.
In the third phase (after 2020), research on AI in IBD is expected to undergo rapid growth. Research hotspots during this period will primarily focus on radiomics, endoscopy, NLP, and personalized medicine. Radiomics involves the high-throughput extraction of quantitative features from medical imaging. When combined with machine learning algorithms, these features can uncover potential biological information and provide new perspectives for disease diagnosis, prognosis evaluation, and treatment decision-making. For instance, the radiomic features of magnetic resonance enterography, when integrated with clinical variables and simplified magnetic resonance activity index scores, can effectively predict the timing of surgery for CD (37). Additionally, radiomics models based on computed-tomography enterography have demonstrated clinical superiority in distinguishing intestinal fibrosis in CD, with diagnostic accuracy significantly surpassing that of traditional visual assessments by radiologists (38). Moreover, in the field of endoscopy, AI can systematically and objectively evaluate mucosal inflammation through endoscopic imaging and histological analysis, predict prognosis, and guide disease management (39). The analysis of images and videos during colonoscopy to assess UC demonstrates high accuracy, effectively evaluating the disease severity, risk of recurrence, and inflammatory activity (40–42). Furthermore, a meta-analysis indicated that AI systems exhibit high diagnostic performance in detecting mucosal healing in UC, although moderate-to-high heterogeneity exists. Thus, standardized and shared AI training helps reduce variability between systems (43). Additionally, capsule endoscopy (CE) is primarily used in CD as a crucial non-invasive diagnostic technique for examining the small intestine. Since assessing CD activity, distinguishing mucosal inflammation, and identifying complications such as strictures and fistulas is essential (44). AI can be used to identify mucosal lesions in CD, including ulcers, erosions, and strictures, by analyzing CE images (45, 46). In contrast, NLP effectively supplements the information gaps in traditional biomarkers by parsing unstructured text from electronic medical records, thereby significantly enhancing the ability to identify treatment complications (47). Breakthroughs in large language models have opened new research opportunities for NLP. For instance, ChatGPT provides accurate and comprehensive responses to real-world inquiries from patients with IBD (48). Large language models can automatically identify adverse events from clinical records and provide timely and precise data to support clinical decision-making (49). Consequently, in the future, more research based on large language models will focus on assisting with differential diagnoses, medication adjustment, efficacy monitoring, and personalized education and support. Nonetheless, issues such as linguistic diversity, data privacy and security, lack of transparency, and over-reliance in clinical applications need to be addressed. Although personalized medicine, in which treatment strategies are tailored based on individual biological characteristics, is becoming increasingly relevant for IBD management, current IBD medications, such as non-specific anti-inflammatory drugs or biologics, can only target a single aspect of the complex multifactorial biological processes. Thus, research using AI is driving the treatment of IBD from broad-spectrum anti-inflammatory drugs for phenotypically similar but biologically heterogeneous patients towards omics-based, highly specific, customized personalized medicine targeting patients with molecularly homogeneous IBD (50).
Despite the rapid expansion of AI research and applications in IBD, several unresolved challenges impede its translation from laboratory to clinical practice. These include the absence of standardized data formats, limited model interpretability and robustness, high risk of overfitting during training, and difficulty in reconciling patient privacy with data security. For instance, radiomics investigations are often confined to single-center, small-sample cohorts. Heterogeneity in imaging devices and scanning protocols obstructs cross-platform validation, and observer bias introduced by manual segmentation further undermines reproducibility. Additionally, AI-endoscopic systems suffer from biased data distributions due to variability in operator skill and image quality, which reduces algorithmic generalizability across populations. NLP models navigate through colloquial expressions and inconsistent terminology in multilingual electronic health records. Despite their promise for patient consultation and literature synthesis, large language models remain prone to hallucinations, lack rigorous fact citations, incur high computational costs, and operate within ambiguous regulatory frameworks. Multi-omics modeling is challenged by inconsistent sampling time points and divergent sequencing platforms, resulting in unstable feature integration. Moreover, most studies rely solely on single-time point retrospective data and lack longitudinal designs with multiple follow-up visits, limiting the ability of AI tools to accurately reflect disease progression and deliver reliable prognostic predictions. Consequently, clinical investigators should establish multicenter prospective databases governed by unified data collection and annotation standards to overcome these obstacles. Algorithm engineers must develop automated image-segmentation methods, cross-device adaptation techniques, and comprehensive model reliability assessment tools while leveraging federated learning or differential-privacy approaches to mitigate overfitting and safeguard patient confidentiality. Furthermore, industry stakeholders and funding bodies can facilitate this progress by supporting open cross-device benchmark datasets and financing prospective cross-platform validation studies. Regulatory agencies should implement sandbox approval processes that align with data protection regulations and require the transparent disclosure of model cards and decision logic. Integrating retrieval-augmented generation and real-time monitoring systems for large language models is essential for reducing hallucination risks. Additionally, patient and public engagement through education on data sharing and the interpretation of results is equally important. Thus, AI can be fully integrated into precision diagnostics and therapeutics for IBD through the coordinated efforts of clinicians, engineers, industry partners, policymakers, and patients.
Nevertheless, this study has some limitations. First, all the data were sourced from the WoSCC and included only English-language literature. In addition, an inherent time lag exists in database indexing, meaning that some articles published in 2024 may not have been indexed, which could result in incomplete coverage of the relevant literature. Second, although every effort was made to design a comprehensive search strategy, some relevant publications were inadvertently excluded. Finally, citation frequency is heavily dependent on publication date, therefore, newly published articles may not have accrued sufficient citations, potentially introducing bias into our analysis.
5 Conclusion
This study demonstrated that since the beginning of the 21st century, research on AI has undergone rapid advancement in the field of IBD and has made significant breakthroughs. The USA leads the research output in this field and exerts the most significant academic influence. Harvard University’s scholarly contributions are particularly notable among academic institutions, ranking first in terms of publication volume. During the initial phases of research, studies primarily focused on developing diagnostic models for IBD and IBD-associated cancers or adenomas using single-modality data, and on predicting the therapeutic efficacy of steroids. As research progressed, hotspots shifted towards machine learning applications in drug therapy, the gut microbiome, and GWAS. Currently, the field primarily focuses on radiomics, endoscopic technologies, NLP, and personalized medicine.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
LX: Conceptualization, Data curation, Formal analysis, Software, Writing – original draft. JZ: Formal analysis, Software, Visualization, Writing – review & editing. CS: Writing – review & editing. GC: Writing – original draft. SG: Conceptualization, Supervision, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgments
The authors would like to thank Editage (www.editage.cn) for English language editing.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1600291/full#supplementary-material
References
1. Liu, D, Saikam, V, Skrada, KA, Merlin, D, and Iyer, SS. Inflammatory bowel disease biomarkers. Med Res Rev. (2022) 42:1856–87. doi: 10.1002/med.21893
2. Agrawal, M, Spencer, EA, Colombel, J-F, and Ungaro, RC. Approach to the management of recently diagnosed inflammatory bowel disease Patients: a user’s guide for adult and pediatric gastroenterologists. Gastroenterology. (2021) 161:47–65. doi: 10.1053/j.gastro.2021.04.063
3. Alatab, S, Sepanlou, SG, Ikuta, K, Vahedi, H, Bisignano, C, Safiri, S, et al. The global, regional, and national burden of inflammatory bowel disease in 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet Gastroenterol Hepatol. (2020) 5:17–30. doi: 10.1016/S2468-1253(19)30333-4
4. Ng, SC, Shi, HY, Hamidi, N, Underwood, FE, Tang, W, Benchimol, EI, et al. Worldwide incidence and prevalence of inflammatory bowel disease in the 21st century: a systematic review of population-based studies. Lancet. (2017) 390:2769–78. doi: 10.1016/S0140-6736(17)32448-0
5. D’Haens, GR, and van Deventer, S. 25 years of anti-TNF treatment for inflammatory bowel disease: lessons from the past and a look to the future. Gut. (2021) 70:1396–405. doi: 10.1136/gutjnl-2019-320022
6. Stidham, RW, Yu, D, Zhao, X, Bishu, S, Rice, M, Bourque, C, et al. Identifying the presence, activity, and status of extraintestinal manifestations of inflammatory bowel disease using natural language processing of clinical notes. Inflamm Bowel Dis. (2023) 29:503–10. doi: 10.1093/ibd/izac109
7. Gomollón, F, Gisbert, JP, Guerra, I, Plaza, R, Pajares Villarroya, R, Moreno Almazán, L, et al. Clinical characteristics and prognostic factors for Crohn’s disease relapses using natural language processing and machine learning: a pilot study. Eur J Gastroenterol Hepatol. (2022) 34:389–97. doi: 10.1097/MEG.0000000000002317
8. Chirra, P, Sleiman, J, Gandhi, NS, Gordon, IO, Hariri, M, Baker, M, et al. Radiomics to detect inflammation and fibrosis on magnetic resonance enterography in stricturing Crohn’s disease. J Crohns Colitis. (2024) 18:1660–71. doi: 10.1093/ecco-jcc/jjae073
9. Fu, Y, Ding, X, Zhang, M, Feng, C, Yan, Z, Wang, F, et al. Intestinal mucosal barrier repair and immune regulation with an AI-developed gut-restricted PHD inhibitor. Nat Biotechnol. (2024). doi: 10.1038/s41587-024-02503-w
10. Peng, Y, Wang, T-T, Wang, J-Z, Wang, H, Fan, R-Y, Gong, L-G, et al. The application of artificial intelligence in thyroid nodules: a systematic review based on bibliometric analysis. Endocr Metab Immune Disord Drug Targets. (2024) 24:1280–90. doi: 10.2174/0118715303264254231117113456
11. Wang, Y, Peng, Y, Wang, T, Li, H, Zhao, Z, Gong, L, et al. The evolution and current situation in the application of dual-energy computed tomography: a bibliometric study. Quant Imaging Med Surg. (2023) 13:6801–13. doi: 10.21037/qims-23-467
12. Wu, Z, Chen, S, Wang, Y, Li, F, Xu, H, Li, M, et al. Current perspectives and trend of computer-aided drug design: a review and bibliometric analysis. Int J Surg. (2024) 110:3848–78. doi: 10.1097/JS9.0000000000001289
13. Xu, L, Zou, J, Sun, C, Chen, G, and Gao, S. Worldwide research trends in Crohn’s disease treatment over the past 2 decades: a bibliometric analysis. Front Pharmacol. (2024) 15:1441785. doi: 10.3389/fphar.2024.1441785
14. Fang, Q, Huang, K, Yao, X, Peng, Y, Kan, A, Song, Y, et al. The application of radiology for dilated cardiomyopathy diagnosis, treatment, and prognosis prediction: a bibliometric analysis. Quant Imaging Med Surg. (2023) 13:7012–28. doi: 10.21037/qims-23-34
15. Wang, J, Zhao, W, Zhang, Z, Liu, X, Xie, T, Wang, L, et al. A journey of challenges and victories: a bibliometric worldview of nanomedicine since the 21st century. Adv Mater. (2024) 36:e2308915. doi: 10.1002/adma.202308915
16. Hirsch, JE. An index to quantify an individual’s scientific research output. Proc Natl Acad Sci USA. (2005) 102:16569–72. doi: 10.1073/pnas.0507655102
17. Sabé, M, Sulstarova, A, Chen, C, Hyde, J, Poulet, E, Aleman, A, et al. A century of research on neuromodulation interventions: a scientometric analysis of trends and knowledge maps. Neurosci Biobehav Rev. (2023) 152:105300. doi: 10.1016/j.neubiorev.2023.105300
18. Kieft, K, Zhou, Z, and Anantharaman, K. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome. (2020) 8:90. doi: 10.1186/s40168-020-00867-0
19. Ionita-Laza, I, Lee, S, Makarov, V, Buxbaum, JD, and Lin, X. Sequence kernel association tests for the combined effect of rare and common variants. Am J Hum Genet. (2013) 92:841–53. doi: 10.1016/j.ajhg.2013.04.015
20. Lee, D, Gorkin, DU, Baker, M, Strober, BJ, Asoni, AL, McCallion, AS, et al. A method to predict the impact of regulatory variants from DNA sequence. Nat Genet. (2015) 47:955–61. doi: 10.1038/ng.3331
21. Gevers, D, Kugathasan, S, Denson, LA, Vázquez-Baeza, Y, Van Treuren, W, Ren, B, et al. The treatment-naive microbiome in new-onset Crohn’s disease. Cell Host Microbe. (2014) 15:382–92. doi: 10.1016/j.chom.2014.02.005
22. Mossotto, E, Ashton, JJ, Coelho, T, Beattie, RM, Mac Arthur, BD, and Ennis, S. Classification of paediatric inflammatory bowel disease using machine learning. Sci Rep. (2017) 7:2427. doi: 10.1038/s41598-017-02606-2
23. Li, C-J, Wang, Y-K, Zhang, S-M, Ren, M-D, and He, S-X. Global burden of inflammatory bowel disease 1990–2019: a systematic examination of the disease burden and twenty-year forecast. World J Gastroenterol. (2023) 29:5751–67. doi: 10.3748/wjg.v29.i42.5751
24. Zhang, A, Wang, F, Li, D, Wang, C-Z, Yao, H, Wan, J-Y, et al. Emerging insights into inflammatory bowel disease from the intestinal microbiota perspective: a bibliometric analysis. Front Immunol. (2023) 14:1264705. doi: 10.3389/fimmu.2023.1264705
25. Zhang, H, Xin, H, Zhao, M, Bi, C, Xiao, Y, Li, Y, et al. Global research trends on the relationship between IBD and CRC: a bibliometric analysis from 2000 to 2023. J Health Popul Nutr. (2024) 43:83. doi: 10.1186/s41043-024-00577-5
26. Selaru, FM, Xu, Y, Yin, J, Zou, T, Liu, TC, Mori, Y, et al. Artificial neural networks distinguish among subtypes of neoplastic colorectal lesions. Gastroenterology. (2002) 122:606–13. doi: 10.1053/gast.2002.31904
27. Bielecki, C, Bocklitz, TW, Schmitt, M, Krafft, C, Marquardt, C, Gharbi, A, et al. Classification of inflammatory bowel diseases by means of Raman spectroscopic imaging of epithelium cells. J Biomed Opt. (2012) 17:0760301. doi: 10.1117/1.JBO.17.7.076030
28. Geurts, P, Fillet, M, de Seny, D, Meuwis, M-A, Malaise, M, Merville, M-P, et al. Proteomic mass spectra classification using decision tree based ensemble methods. Bioinformatics. (2005) 21:3138–45. doi: 10.1093/bioinformatics/bti494
29. Lee, S, Kim, B, Kim, T, and Kim, W. Differential diagnosis of intestinal Behçet’s disease and Crohn’s disease by colonoscopic findings. Endoscopy. (2009) 41:9–16. doi: 10.1055/s-0028-1103481
30. De Iudicibus, S, Stocco, G, Martelossi, S, Londero, M, Ebner, E, Pontillo, A, et al. Genetic predictors of glucocorticoid response in pediatric patients with inflammatory bowel diseases. J Clin Gastroenterol. (2011) 45:e1–7. doi: 10.1097/MCG.0b013e3181e8ae93
31. Waljee, AK, Lipson, R, Wiitala, WL, Zhang, Y, Liu, B, Zhu, J, et al. Predicting hospitalization and outpatient corticosteroid use in inflammatory bowel disease patients using machine learning. Inflamm Bowel Dis. (2018) 24:45–53. doi: 10.1093/ibd/izx0007
32. Waljee, AK, Sauder, K, Patel, A, Segar, S, Liu, B, Zhang, Y, et al. Machine learning algorithms for objective remission and clinical outcomes with thiopurines. J Crohns Colitis. (2017) 11:801–10. doi: 10.1093/ecco-jcc/jjx014
33. Waljee, AK, Liu, B, Sauder, K, Zhu, J, Govani, SM, Stidham, RW, et al. Predicting corticosteroid-free biologic remission with vedolizumab in Crohn’s disease. Inflamm Bowel Dis. (2018) 24:1185–92. doi: 10.1093/ibd/izy031
34. Forbes, JD, Chen, C, Knox, NC, Marrie, R-A, El-Gabalawy, H, De Kievit, T, et al. A comparative study of the gut microbiota in immune-mediated inflammatory diseases-does a common dysbiosis exist? Microbiome. (2018) 6:221. doi: 10.1186/s40168-018-0603-4
35. Ananthakrishnan, AN, Luo, C, Yajnik, V, Khalili, H, Garber, JJ, Stevens, BW, et al. Gut microbiome function predicts response to anti-integrin biologic therapy in inflammatory bowel diseases. Cell Host Microbe. (2017) 21:603–610.e3. doi: 10.1016/j.chom.2017.04.010
36. Isakov, O, Dotan, I, and Ben-Shachar, S. Machine learning-based gene prioritization identifies novel candidate risk genes for inflammatory bowel disease. Inflamm Bowel Dis. (2017) 23:1516–23. doi: 10.1097/MIB.0000000000001222
37. Chirra, P, Sharma, A, Bera, K, Cohn, HM, Kurowski, JA, Amann, K, et al. Integrating radiomics with clinicoradiological scoring can predict high-risk patients who need surgery in Crohn’s disease: a pilot study. Inflamm Bowel Dis. (2023) 29:349–58. doi: 10.1093/ibd/izac211
38. Li, X, Liang, D, Meng, J, Zhou, J, Chen, Z, Huang, S, et al. Development and validation of a novel computed-tomography enterography radiomic approach for characterization of intestinal fibrosis in Crohn’s disease. Gastroenterology. (2021) 160:2303–2316.e11. doi: 10.1053/j.gastro.2021.02.027
39. Maeda, Y, Ditonno, I, Puga-Tejada, M, Zammarchi, I, Santacroce, G, Ghosh, S, et al. Artificial intelligence-enabled advanced endoscopic imaging to assess deep healing in inflammatory bowel disease. eGastroenterology. (2024) 2:e100090. doi: 10.1136/egastro-2024-100090
40. Gottlieb, K, Requa, J, Karnes, W, Chandra Gudivada, R, Shen, J, Rael, E, et al. Central reading of ulcerative colitis clinical trial videos using neural networks. Gastroenterology. (2021) 160:710–719.e2. doi: 10.1053/j.gastro.2020.10.024
41. Ogata, N, Maeda, Y, Misawa, M, Takenaka, K, Takabayashi, K, Iacucci, M, et al. Artificial intelligence-assisted video colonoscopy for disease monitoring of ulcerative colitis: a prospective study. J Crohns Colitis. (2025) 19:jjae080. doi: 10.1093/ecco-jcc/jjae080
42. Fan, Y, Mu, R, Xu, H, Xie, C, Zhang, Y, Liu, L, et al. Novel deep learning-based computer-aided diagnosis system for predicting inflammatory activity in ulcerative colitis. Gastrointest Endosc. (2023) 97:335–46. doi: 10.1016/j.gie.2022.08.015
43. Rimondi, A, Gottlieb, K, Despott, EJ, Iacucci, M, Murino, A, and Tontini, GE. Can artificial intelligence replace endoscopists when assessing mucosal healing in ulcerative colitis? A systematic review and diagnostic test accuracy meta-analysis. Dig Liver Dis. (2024) 56:1164–72. doi: 10.1016/j.dld.2023.11.005
44. Ogata, N, Ohtsuka, K, Ogawa, M, Maeda, Y, Ishida, F, and Kudo, S. Image-enhanced capsule endoscopy improves the identification of small intestinal lesions. Diagnostics. (2021) 11:2122. doi: 10.3390/diagnostics11112122
45. Klang, E, Grinman, A, Soffer, S, Margalit Yehuda, R, Barzilay, O, Amitai, MM, et al. Automated detection of Crohn’s disease intestinal strictures on capsule endoscopy images using deep neural networks. J Crohns Colitis. (2021) 15:749–56. doi: 10.1093/ecco-jcc/jjaa234
46. Klang, E, Barash, Y, Margalit, RY, Soffer, S, Shimon, O, Albshesh, A, et al. Deep learning algorithms for automated detection of Crohn’s disease ulcers by video capsule endoscopy. Gastrointest Endosc. (2020) 91:606–613.e2. doi: 10.1016/j.gie.2019.11.012
47. Stidham, RW, and Takenaka, K. Artificial intelligence for disease assessment in inflammatory bowel disease: how will it change our practice? Gastroenterology. (2022) 162:1493–506. doi: 10.1053/j.gastro.2021.12.238
48. Sciberras, M, Farrugia, Y, Gordon, H, Furfaro, F, Allocca, M, Torres, J, et al. Accuracy of information given by chat GPT for patients with inflammatory bowel disease in relation to ECCO guidelines. J Crohns Colitis. (2024) 18:1215–21. doi: 10.1093/ecco-jcc/jjae040
49. Silverman, AL, Sushil, M, Bhasuran, B, Ludwig, D, Buchanan, J, Racz, R, et al. Algorithmic identification of treatment-emergent adverse events from clinical notes using large language models: a pilot study in inflammatory bowel disease. Clin Pharmacol Ther. (2024) 115:1391–9. doi: 10.1002/cpt.3226
Keywords: artificial intelligence, inflammatory bowel disease, Crohn’s disease, ulcerative colitis, bibliometric, Web of Science
Citation: Xu L, Zou J, Sun C, Chen G and Gao S (2025) The global academic distribution and changes in research hotspots of artificial intelligence in inflammatory bowel disease since 2000. Front. Med. 12:1600291. doi: 10.3389/fmed.2025.1600291
Edited by:
Rahul Kashyap, WellSpan Health, United StatesReviewed by:
Stefan G. Stanciu, Polytechnic University of Bucharest, RomaniaRui Wang, The First Affiliated Hospital of Xi’an Jiaotong University, China
Copyright © 2025 Xu, Zou, Sun, Chen and Gao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sujun Gao, d29vZC5zamdhb0AxNjMuY29t