Skip to main content

SYSTEMATIC REVIEW article

Front. Med., 22 June 2022
Sec. Gastroenterology
Volume 9 - 2022 | https://doi.org/10.3389/fmed.2022.890720

Diagnostic Accuracy of Artificial Intelligence (AI) to Detect Early Neoplasia in Barrett's Esophagus: A Non-comparative Systematic Review and Meta-Analysis

Jin Lin Tan1,2 Mohamed Asif Chinnaratha1,2 Richard Woodman3 Rory Martin4 Hsiang-Ting Chen4 Gustavo Carneiro4 Rajvinder Singh1,2*
  • 1Department of Gastroenterology and Hepatology, Lyell McEwin Hospital, SA Health, Elizabeth Vale, SA, Australia
  • 2Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, SA, Australia
  • 3Flinders Centre for Epidemiology and Biostatistics, College of Medicine and Public Health, Flinders University, Bedford Park, SA, Australia
  • 4Australian Institute for Machine Learning, The University of Adelaide, Adelaide, SA, Australia

Background and Aims: Artificial Intelligence (AI) is rapidly evolving in gastrointestinal (GI) endoscopy. We undertook a systematic review and meta-analysis to assess the performance of AI at detecting early Barrett's neoplasia.

Methods: We searched Medline, EMBASE and Cochrane Central Register of controlled trials database from inception to the 28th Jan 2022 to identify studies on the detection of early Barrett's neoplasia using AI. Study quality was assessed using Quality Assessment of Diagnostic Accuracy Studies – 2 (QUADAS-2). A random-effects model was used to calculate pooled sensitivity, specificity, and diagnostics odds ratio (DOR). Forest plots and a summary of the receiving operating characteristics (SROC) curves displayed the outcomes. Heterogeneity was determined by I2, Tau2 statistics and p-value. The funnel plots and Deek's test were used to assess publication bias.

Results: Twelve studies comprising of 1,361 patients (utilizing 532,328 images on which the various AI models were trained) were used. The SROC was 0.94 (95% CI: 0.92–0.96). Pooled sensitivity, specificity and diagnostic odds ratio were 90.3% (95% CI: 87.1–92.7%), 84.4% (95% CI: 80.2–87.9%) and 48.1 (95% CI: 28.4–81.5), respectively. Subgroup analysis of AI models trained only on white light endoscopy was similar with pooled sensitivity and specificity of 91.2% (95% CI: 85.7–94.7%) and 85.1% (95% CI: 81.6%−88.1%), respectively.

Conclusions: AI is highly accurate at detecting early Barrett's neoplasia and validated for patients with at least high-grade dysplasia and above. Further well-designed prospective randomized controlled studies of all histopathological subtypes of early Barrett's neoplasia are needed to confirm these findings further.

Introduction

Artificial Intelligence (AI) is rapidly evolving in gastrointestinal (GI) endoscopy. This development has been promising in the lower gastrointestinal tract, where various AI-assisted algorithms to detect and diagnose colorectal lesions during colonoscopy have been utilized (1). There are fewer studies on detecting early neoplasia associated with Barrett's esophagus (BE) (1). BE is a metaplastic alteration of the normal esophageal epithelium detected on endoscopic examination and pathologically confirmed as exhibiting intestinal metaplasia. It is considered a precursor to the development of esophageal adenocarcinoma, which carries high mortality (2). According to the Surveillance, Epidemiology, and End Results Program (SEER) database, the incidence of esophageal carcinoma is rising more rapidly than any other form of cancer, with a 6-fold increase demonstrated from 1975 to 2001(3). The overall 5-year survival rate for patients with esophageal adenocarcinoma in the United States is a dismal 21% (4).

Studies have shown that patients in whom adenocarcinoma is detected during endoscopic surveillance for BE are more likely to have early Barret's esophageal cancer, receive curative therapy, and survive longer than symptomatic patients in whom adenocarcinoma is detected (5). Surveillance strategies utilizing white light endoscopy (WLE) and random biopsies have been advocated and critical in recognizing dysplasia. However, dysplastic lesions are still difficult to distinguish from non-dysplastic mucosa as only a tiny fraction of Barrett's esophagus mucosa is randomly biopsied. It is difficult to obtain an accurate assessment of the presence of cancer or dysplasia due to sampling error (6). It has been shown that surveillance endoscopy done at non-expert centers has a lower early BE cancer detection rate (7). Expertise and advanced techniques such as Narrow Band Imaging (NBI) and magnification can improve accuracy in diagnosing early BE cancer but are only available in expert centers (8). Better and more widespread techniques to enhance the accuracy of endoscopic surveillance of early BE cancer is required.

Robust evidence was lacking in previous meta-analyses to support the use of AI in the surveillance of early Barrett's neoplasia, as most included studies were retrospective in nature (1, 9, 10). Moreover, these meta-analyses reviewed the performance of AI in the upper gastro-intestinal tracts of various other pathologies with very few studies on early Barrett's neoplasia. In addition, there is a lack of ground truth on whether images had correlated with actual histopathological reports. While the meta-analyses reported the generic class of AI algorithm used in included studies, the specific type of algorithm used, which is an important consideration for its overall accuracy, were not reported. We provide an updated study aiming to collate ongoing evidence on recently published studies on the utility of AI in detecting dysplasia and early cancer in BE.

Methods

This study was pre-registered with the PROSPERO register (11) and followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses of Diagnostic Test Accuracy (PRISMA-DTA) (12).

Eligibility Criteria

The inclusion criteria utilized the PICO methodology and included:

(i) Participants: Patients with Barrett's esophagus with or without dysplasia;

(ii) Intervention: Use of artificial intelligence or computer-assisted diagnosis for detection of early Barrett's neoplasia;

(iii) Control: Standard surveillance of Barrett's esophagus with White Light Endoscopy (WLE) with or without Narrow Band Imaging (NBI);

(iv) Outcome measures the accuracy, sensitivity, specificity, area under the receiver operating characteristic (AUROC) curves of AI models to detect early Barrett's neoplasia.

The exclusion criteria were as follows: (i) Endoscopic surveillance technique such as volumetric laser endomicroscopy and hyperspectral imaging; (ii) Histological subtypes such as squamous cell carcinoma (iii) Reviews, meta-analyses, editorials, letters, comments (iv) and animal studies.

Search Strategy

An electronic search was performed by Medline, EMBASE and Cochrane Central Register of controlled trials database from inception to the 28th Jan 2022 using the following MeSH terms or free text: “artificial intelligence,” “AI,” “convolutional neural network,” “deep learning,” “computer-assisted diagnosis,” “computer-aided detection,” “Barrett's esophagus,” “dysplasia,” “adenocarcinoma,” “esophageal adenocarcinoma” and “esophageal tumor” (Supplementary Tables 13). The search was limited to human studies, but there were no language restrictions. Two independent reviewers (J.T and R.M) performed an initial literature search and selected relevant studies based on the eligibility criteria. Titles and abstracts were screened to exclude studies that did not address the research questions. Subsequently, the remaining studies were assessed in full for eligibility. Finally, any discrepancies were resolved by consensus between the two reviewers or discussion with a third senior author (M.A.C).

Data Collection and Study Quality Assessment

The following data were extracted from each study: author, year, journal, country or region, types of study, endoscopic imaging modality used, details of artificial intelligence algorithm used, definitions of Barrett's dysplasia, types of controls used, whether images or videos had confirmed corresponding histology, number of patients, endoscopic images or videos, rates of true positivity, false negativity, true negativity, false positivity, sensitivity, specificity, accuracy or area under receiver operating characteristics curve of respective artificial intelligence algorithm used in each study, and whether the study had been validated internally or externally, or achieved a real-time diagnosis. Two investigators (J.T and R.M) extracted the data independently. Study quality was assessed using Quality Assessment of Diagnostic Accuracy Studies – 2 (QUADAS-2) by two independent reviewers (J.T and R.M) (13). Conflicts were resolved by discussion and involvement of a third senior author (M.A.C). QUADAS-2 contains four key domains: patient selection, index test, reference standard, and flow of patients through the study. Each domain is assessed in terms of risk of bias, and the first three are also evaluated in terms of concerns regarding applicability. Risk of bias and applicability were both judged as “low,” “high,” or “unclear”.

Statistical Analysis

A random-effects model, as described by DerSimonian and Laird, was used to calculate the following: pooled sensitivity, specificity and diagnostics odds ratio (DOR). Forest plots and a summary of the receiver operating characteristics (SROC) curves were used to display the outcomes. The SROC plot summarizes the sensitivity and specificity of individual studies on a scatter plot, together with the summary operating point to depict the overall accuracy.

I2, Tau2 statistics and p-value were used to assess heterogeneity. An I2 of >50%, Tau2 of >0.1 or p < 0.05 implies significant heterogeneity. Publication bias was visually assessed using funnel plots and Deek's funnel plot asymmetry test (p < 0.05 implying publication bias). Forest plots were performed using the meta package in R Project for Statistical Computing, developed by the R Foundation (14), and all other statistical analyses were performed using the MIDAS package in STATA Statistical Software (15).

Results

Search Results

The initial search strategy yielded 1,198 studies. Following title and abstract screening, 1,172 articles were excluded. The remaining twenty-six studies were reviewed in full text. Fourteen studies were further excluded as they did not meet the eligibility criteria. Finally, twelve studies that met all the inclusion criteria were included for this meta-analysis (1627). The search strategy and study selection are shown in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1. Modified PRISMA flow diagram of the search strategy and study selection.

Study Characteristics

Table 1 illustrates the study characteristics of included studies. A total of 1,361 patients were included in these studies, with 532,328 images acquired to train the various AI models that were developed. Seven studies were retrospective (16, 17, 19, 22, 24, 26, 27), four studies were prospective (18, 21, 23, 25), and one study had both retrospective and prospective components in their methods (20). In terms of endoscopic imaging modality evaluated, eleven studies included WLE (1626), three studies included NBI (17, 22, 27), and three studies included virtual chromoendoscopy images (i-Scan) (2426). More recent studies from 2019 used Convolutional Neural Networks (CNN) instead of Support Vector Machine (SVM) as the basis of their AI model (17, 1927). Three studies had used a proprietary pre-training dataset of images from various parts of the gastrointestinal tracts known as “GastroNet” (20, 21, 27). Only two studies demonstrated the real-time capabilities of their AI models (21, 22). Eleven studies documented that endoscopic images included were confirmed on histopathological reports (1622, 2427). Among all included studies, there were slight differences in the definitions of dysplastic lesions, including endoscopic images of BE with high-grade dysplasia, T1/intra-mucosal adenocarcinoma, esophageal adenocarcinoma, or a combination of the above. Based on the QUADAS-2 scale, seven studies had low risks of bias, four studies had unclear risks of bias, and one study contained risks of bias (Table 2). There were no concerns regarding the applicability of all included studies.

TABLE 1
www.frontiersin.org

Table 1. Study characteristics of included studies.

TABLE 2
www.frontiersin.org

Table 2. Quality assessment of diagnostic accuracy studies – 2 of included studies.

AI and Detection of Early BE Neoplasia

All twelve studies reported the sensitivity and specificity of their AI model at detecting early BE neoplasia. Eleven studies (16, 1822, 2527), including two studies which are sub-studies of de Groof et al. (20), of 191,278 endoscopic images or frames (for video-based studies) demonstrated that AI is very accurate at detecting early BE neoplasia with the respective pooled sensitivity, specificity and diagnostic odds ratio (DOR): 90.3% [95% CI: 87.1%−92.7%] (Figure 2), 84.4% [95% CI: 80.2–87.9%] (Figure 3) and 48.1 [95% CI: 28.4–81.5] (Figure 4). However, all the primary outcome analyses had significant heterogeneity (I2 of >50%, Tau2 of >0.1 and p < 0.05). Whilst there is significant heterogeneity amongst the included studies, the area under the summary of receiver operating characteristics curve was 0.94 [95% CI: 0.92–0.96] (Figure 5). Funnel plot assessment showed asymmetrical distribution, possibly due to underlying heterogeneity (Figure 6). However, Deek's test did not show statistically significant publication bias with a p-value of 0.18 (Figure 7).

FIGURE 2
www.frontiersin.org

Figure 2. Forest plots of pooled sensitivities for all included studies of AI and the detection of early BE neoplasia.

FIGURE 3
www.frontiersin.org

Figure 3. Forest plots of pooled specificities for all included studies of AI and the detection of early BE neoplasia.

FIGURE 4
www.frontiersin.org

Figure 4. Forest plots of pooled diagnostic odds ratio for all included studies of AI and the detection of early BE neoplasia.

FIGURE 5
www.frontiersin.org

Figure 5. Summary of the receiver operating characteristics (SROC) curve of all included studies.

FIGURE 6
www.frontiersin.org

Figure 6. Funnel plots of all included studies.

FIGURE 7
www.frontiersin.org

Figure 7. Deek's Funnel Plot asymmetry test of all included studies.

Subgroup Analysis–White Light Endoscopy

A subgroup analysis was performed of six studies (16, 1822), with White Light Endoscopy as the mode of imaging modality demonstrated similar pooled sensitivity and specificity: 91.2% [95% CI: 85.7–94.7%] (Figure 8) and 85.1% [95% CI: 81.6–88.1%] (Figure 9). There was significant heterogeneity for pooled sensitivity analysis (I2 = 64%, Tau2 = 0.40 and p < 0.01). Heterogeneity was not significant for pooled specificity analysis (I2 = 11%, Tau2 = 0.29 and p = 0.34).

FIGURE 8
www.frontiersin.org

Figure 8. Forest plots of pooled sensitivities for included studies of AI and the detection of early BE neoplasia (White Light Endoscopy only) – Subgroup analysis.

FIGURE 9
www.frontiersin.org

Figure 9. Forest plots of pooled specificities for included studies of AI and the detection of early BE neoplasia (White Light Endoscopy only) – Subgroup analysis.

Discussion

This systematic review and meta-analysis of more than 500,000 images from 1,361 patients have consistently shown that AI can detect early Barrett's neoplasia accurately. An area under the SROC of 0.94 [95% CI: 0.92–0.96] was achieved with a pooled sensitivity of 90.3% (95 % CI: 87.1–92.7%) pooled specificity of 84.4% (95% CI: 80.2–87.9). Of note, three of the included studies (17, 23, 24) could not be included into this final analysis due to lack of required data but their respective sensitivity and specificity are represented in Table 1.

There is moderate inter-study heterogeneity noted in the included studies. Qualitative assessment of these studies showed multiple factors such as inclusion definitions of dysplastic Barret's lesions, different types of AI algorithm and imaging modality used. Whilst most included studies (1727) used a CNN based AI algorithm, different pre-training datasets and CNN models were used. Similarly, different modalities such as white light endoscopy (WLE), narrow-band imaging (NBI) and “i-Scan” were used in various studies or combinations. All the above are very likely to contribute to the heterogeneity of our primary study outcome.

Subgroup analysis with WLE was consistent showing similar results with pooled sensitivity and specificity: 91.2% [95% CI: 85.7–94.7%] and 85.1% [95% CI: 81.6–88.1%]. Interestingly, in contrast to its pooled sensitivity analysis, pooled specificity analysis was homogenous. This finding could imply that whilst there is an array of AI models with different sensitivities, their respective specificities are limited by the macroscopic details of what optical white light endoscopy can achieve.

Compared to current published data, AI outperformed expert endoscopists in detecting early Barrett's neoplasia. In a benchmark assessment of 17 senior endoscopists conducted by de Groof et al., the following accuracy, sensitivity and specificity were achieved: 74.8% (95% CI: 72–77%), 76.5% (95% CI:72–81%), and 73.1% (95% CI:66–80%) (20). Accurate classification of BE lesions has significant clinical implications as it determines the duration of follow up, types of endoscopic resection required (for example, endoscopic mucosal resection vs. endoscopic submucosal dissection) or whether radiofrequency ablation is required (28). Ultimately, prompt management of early Barrett's neoplasia will likely improve patient mortality and quality of life.

However, without including BE lesions with low-grade dysplasia (LGD) in any of the included studies, it remains a challenge to determine whether AI could indeed detect all dysplastic lesions in patients with BE. Firstly, not all lesions of BE with LGD can be seen macroscopically. One study had demonstrated that of the 75 patients with a confirmed diagnosis of LGD, 52% were visible at the time of endoscopy in expert centers, with a significant contrast of only 12% detected at community centers (29). Moreover, the diagnosis of LGD is further limited by pathologists' poor inter-observer agreement of histopathological criteria (30). Whilst AI algorithm could theoretically improve the rates of detections of LGD, which may benefit community centers, it will be challenging to train an AI model without consistent “ground truth” of what is classified as LGD histologically by pathologists.

From an AI design perspective, instead of using pre-trained CNN models, de Groof et al. developed a model pre-trained using more than 500,000 endoscopic images of the gastrointestinal tract and subsequently refined with images of BE with or without dysplasia, verified by experts and histological reports (20). However, their results are similar to CNN models pre-trained conventionally based on ImageNet, an image database organized according to the WordNet hierarchy such as Hashimoto et al. (22).

There are various strengths to this meta-analysis. Firstly, there are more than 500,000 images from 1,361 patients in this review, which is currently the largest sample size in this patient cohort. Secondly, this meta-analysis is restricted to patients with BE and allowed us to ascertain the performance of AI at detecting early BE neoplasia. Thirdly, this systematic review presented the subtypes of CNN models used in each study which were not included in previously published reviews. Understanding differences in AI models is an essential aspect of interpreting the performance of each study, as AI models can be trained very differently, using different pre-training image databases and various refinement techniques.

Our meta analysis has some limitations. Firstly, most included studies were retrospective in nature. This can result in selection or information bias. Secondly, no studies included LGD due to issues as described in the above section. Thirdly, there was insufficient data to perform a subgroup analysis of the performance of AI vs. endoscopists at detecting early neoplastic BE. Fourthly, several studies (18, 24, 25) only included <100 unique patients in their training model. A concept of “over-fitting” may occur where the AI model becomes too attuned to the limited dataset on which it was trained and therefore loses its applicability to any other datasets. Lastly, there was significant inter-study heterogeneity in this meta-analysis which is likely multifactorial. Given the advancements of different AI techniques, it will be expected that future studies will likely be heterogeneous.

Conclusion

In conclusion, this systematic review and meta-analysis provide updated evidence showing that AI is highly accurate at detecting early Barrett's neoplasia but validated for patients with BE lesions of least high-grade dysplasia and above. Our results support the need for more studies, including AI models to detect macroscopically visible low-grade dysplasia. In addition, well-designed prospective randomized controlled studies are needed to further explore if AI can indeed be effective both for experts and non-expert endoscopists.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.

Author Contributions

JT performed an initial search and review of articles, data extraction, quality assessment of included papers, and prepared the manuscript. RM performed an initial search and review of articles, data extraction, and quality assessment of included papers. RW provided statistical advice and performed statistical analyses on extracted data. RS, MC, and GC provided supervision of the systematic review and meta-analysis. All authors critically reviewed, edited, and approved the final manuscript for submission.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2022.890720/full#supplementary-material

References

1. Zhang SM, Wang YJ, Zhang ST. Accuracy of artificial intelligence-assisted detection of esophageal cancer and neoplasms on endoscopic images: a systematic review and meta-analysis. J Dig Dis. (2021) 22:318–28. doi: 10.1111/1751-2980.12992

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Modiano N, Gerson LB. Barrett's esophagus: incidence, etiology, pathophysiology, prevention and treatment. Ther Clin Risk Manag. (2007) 3:1035–145.

PubMed Abstract | Google Scholar

3. Pera M, Manterola C, Vidal O, Grande L. Epidemiology of esophageal adenocarcinoma. J Surg Oncol. (2005) 92:151–9. doi: 10.1002/jso.20357

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Then EO, Lopez M, Saleem S, Gayam V, Sunkara T, Culliford A, et al. Esophageal cancer: an updated surveillance epidemiology and end results database analysis. World J Oncol. (2020) 11:55–64. doi: 10.14740/wjon1254

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Corley DA, Levin TR, Habel LA, Weiss NS, Buffler PA. Surveillance and survival in Barrett's adenocarcinomas: a population-based study. Gastroenterology. (2002) 122:633–40. doi: 10.1053/gast.2002.31879

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Egger K, Werner M, Meining A, Ott R, Allescher HD, Höfler H, et al. Biopsy surveillance is still necessary in patients with Barrett's oesophagus despite new endoscopic imaging techniques. Gut. (2003) 52:18–23. doi: 10.1136/gut.52.1.18

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Schölvinck DW, van der Meulen K, Bergman J, Weusten B. Detection of lesions in dysplastic Barrett's esophagus by community and expert endoscopists. Endoscopy. (2017) 49:113–20. doi: 10.1055/s-0042-118312

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Wolfsen HC, Crook JE, Krishna M, Achem SR, Devault KR, Bouras EP, et al. Prospective, controlled tandem endoscopy study of narrow band imaging for dysplasia detection in Barrett's Esophagus. Gastroenterology. (2008) 135:24–31. doi: 10.1053/j.gastro.2008.03.019

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Lui TKL, Tsui VWM, Leung WK. Accuracy of artificial intelligence-assisted detection of upper GI lesions: a systematic review and meta-analysis. Gastroint Endo. (2020) 92:821–30. doi: 10.1016/j.gie.2020.06.034

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Bang CS, Lee JJ, Baik GH. Computer-aided diagnosis of esophageal cancer and neoplasms in endoscopic images: a systematic review and meta-analysis of diagnostic test accuracy. Gastrointest Endosc. (2021) 93:1006–15. doi: 10.1016/j.gie.2020.11.025

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Booth A, Clarke M, Dooley G, Ghersi D, Moher D, Petticrew M, et al. The nuts and bolts of PROSPERO: an international prospective register of systematic reviews. Syst Rev. (2012) 1:2. doi: 10.1186/2046-4053-1-2

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Salameh J-P, Bossuyt PM, McGrath TA, Thombs BD, Hyde CJ, Macaskill P, et al. Preferred reporting items for systematic review and meta-analysis of diagnostic test accuracy studies (PRISMA-DTA): explanation, elaboration, and checklist. BMJ. (2020):m2632. doi: 10.1136/bmj.m2632

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. (2011) 155:529–36. doi: 10.7326/0003-4819-155-8-201110180-00009

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Computing RR. A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing (2021).

Google Scholar

15. StataCorp. Stata Statistical Software. Release 17 ed. College Station, TX: StataCorp LLC (2021).

Google Scholar

16. van der Sommen F, Zinger S, Curvers WL, Bisschops R, Pech O, Weusten BL, et al. Computer-aided detection of early neoplastic lesions in Barrett's esophagus. Endoscopy. (2016) 48:617–24. doi: 10.1055/s-0042-105284

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Ebigbo A, Mendel R, Probst A, Manzeneder J, De Souza LA, Papa JP, et al. Computer-aided diagnosis using deep learning in the evaluation of early oesophageal adenocarcinoma. Gut. (2019) 68:1143–5. doi: 10.1136/gutjnl-2018-317573

PubMed Abstract | CrossRef Full Text | Google Scholar

18. de Groof J, van der Sommen F, van der Putten J, Struyvenberg MR, Zinger S, Curvers WL, et al. The Argos project: The development of a computer-aided detection system to improve detection of Barrett's neoplasia on white light endoscopy. United European Gastroenterol J. (2019) 7:538–47. doi: 10.1177/2050640619837443

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Abdelrahim M, Saikou M, Masaike Y, Ohtsuka S, Maeda N, Hossain E, et al. Artificial intelligence using convolutional neural networks for detection of early Barrett's neoplasia. United European Gastroenterol J. (2020) 8(8 SUPPL):114. doi: 10.1136/gutjnl-2020-bsgcampus.6

PubMed Abstract | CrossRef Full Text | Google Scholar

20. de Groof AJ, Struyvenberg MR, van der Putten J, van der Sommen F, Fockens KN, Curvers WL, et al. Deep-learning system detects neoplasia in patients with barrett's esophagus with higher accuracy than endoscopists in a multistep training and validation study with benchmarking. Gastroenterology. (2020) 158:915–29. doi: 10.1053/j.gastro.2019.11.030

PubMed Abstract | CrossRef Full Text | Google Scholar

21. de Groof AJ, Struyvenberg MR, Fockens KN, van der Putten J, van der Sommen F, Boers TG, et al. Deep learning algorithm detection of Barrett's neoplasia with high accuracy during live endoscopic procedures: a pilot study (with video). Gastrointest Endosc. (2020) 91:1242–50. doi: 10.1016/j.gie.2019.12.048

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Hashimoto R, Requa J, Dao T, Ninh A, Tran E, Mai D, et al. Artificial intelligence using convolutional neural networks for real-time detection of early esophageal neoplasia in Barrett's esophagus (with video). Gastroint Endos. (2020) 91:1264–71. doi: 10.1016/j.gie.2019.12.049

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Samarasena JB, Konda VJ, Trindade AJ, Cavaliere KR, Chang K, Hashimoto R, et al. Id: 3522405 detection of early esophageal neoplasia in barrett's esophagus using real time artificial intelligence: a multicenter external video validation study. Gastroint Endos. (2021) 93(6 Supplement):AB195. doi: 10.1016/j.gie.2021.03.438

CrossRef Full Text | Google Scholar

24. Hussein M, Lines D, Puyal JGB, Bowman N, Sehgal V, Toth D, et al. Computer aided diagnosis for the characterisation of dysplasia in Barrett's oesophaguswith magnification endoscopy. Endoscopy. (2021) 53(SUPPL 1):S10. doi: 10.1055/s-0041-1724273

CrossRef Full Text | Google Scholar

25. Hussein M, Puyal JGB, Brandao P, Toth D, Sehgal V, Everson M, et al. Deep neural network for the detection of early neoplasia in Barrett's oesophagus. Gut. (2021) 70(SUPPL 1):A17. doi: 10.1136/gutjnl-2020-bsgcampus.30

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Hussein M, Puyal JGB, Lines D, Sehgal V, Toth D, Everson M, et al. Deep neural network for the localisation of early neoplasia in Barrett's oesophagus withtargeted biopsies. Endoscopy. (2021) 53(SUPPL 1):S8–9. doi: 10.1055/s-0041-1724270

CrossRef Full Text | Google Scholar

27. Struyvenberg MR, de Groof AJ, van der Putten J, van der Sommen F, Baldaque-Silva F, Omae M, et al. A computer-assisted algorithm for narrow-band imaging-based tissue characterization in Barrett's esophagus. Gastrointest Endosc. (2021) 93:89–98. doi: 10.1016/j.gie.2020.05.050

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Weusten B, Bisschops R, Coron E, Dinis-Ribeiro M, Dumonceau JM, Esteban JM, et al. Endoscopic management of Barrett's esophagus: European society of gastrointestinal endoscopy (ESGE) position statement. Endoscopy. (2017) 49:191–8. doi: 10.1055/s-0042-122140

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Tsoi EH, Mahindra P, Cameron G, Williams R, Norris R, Desmond PV, et al. Barrett's esophagus with low-grade dysplasia: high rate of upstaging at Barrett's esophagus referral units suggests progression rates may be overestimated. Gastrointest Endosc. (2021) 94:902–8. doi: 10.1016/j.gie.2021.05.021

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Vennalaganti P, Kanakadandi V, Goldblum JR, Mathur SC, Patil DT, Offerhaus GJ, et al. Discordance among pathologists in the united states and europe in diagnosis of low-grade dysplasia for patients with barrett's esophagus. Gastroenterology. (2017) 152:564–70.e4. doi: 10.1053/j.gastro.2016.10.041

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Barrett's esophagus, dysplasia, esophageal adenocarcinoma, artificial intelligence, deep learning

Citation: Tan JL, Chinnaratha MA, Woodman R, Martin R, Chen H-T, Carneiro G and Singh R (2022) Diagnostic Accuracy of Artificial Intelligence (AI) to Detect Early Neoplasia in Barrett's Esophagus: A Non-comparative Systematic Review and Meta-Analysis. Front. Med. 9:890720. doi: 10.3389/fmed.2022.890720

Received: 06 March 2022; Accepted: 27 May 2022;
Published: 22 June 2022.

Edited by:

Timo Rath, Brigham and Women's Hospital, Harvard Medical School, United States

Reviewed by:

Shubhra Mishra, Post Graduate Institute of Medical Education and Research (PGIMER), India
Peter Klare, Agatharied Hospital GmbH, Germany

Copyright © 2022 Tan, Chinnaratha, Woodman, Martin, Chen, Carneiro and Singh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rajvinder Singh, rajvinder.singh@sa.gov.au

Download