Evolution of research trends in artificial intelligence for breast cancer diagnosis and prognosis over the past two decades: A bibliometric analysis

Syed, Asif Hassan; Khan, Tabrej

doi:10.3389/fonc.2022.854927

ORIGINAL RESEARCH article

Front. Oncol., 23 September 2022

Sec. Breast Cancer

Volume 12 - 2022 | https://doi.org/10.3389/fonc.2022.854927

Evolution of research trends in artificial intelligence for breast cancer diagnosis and prognosis over the past two decades: A bibliometric analysis

1. Department of Computer Science, Faculty of Computing and Information Technology Rabigh (FCITR), King Abdulaziz University, Jeddah, Saudi Arabia
2. Department of Information Systems, Faculty of Computing and Information Technology Rabigh (FCITR), King Abdulaziz University, Jeddah, Saudi Arabia

Article metrics

View details

Citations

6,9k

Views

2,7k

Downloads

A correction has been applied to this article in:

Corrigendum: Evolution of research trends in artificial intelligence for breast cancer diagnosis and prognosis over the past two decades: A bibliometric analysis
1. Read correction

Abstract

Objective:

In recent years, among the available tools, the concurrent application of Artificial Intelligence (AI) has improved the diagnostic performance of breast cancer screening. In this context, the present study intends to provide a comprehensive overview of the evolution of AI for breast cancer diagnosis and prognosis research using bibliometric analysis.

Methodology:

Therefore, in the present study, relevant peer-reviewed research articles published from 2000 to 2021 were downloaded from the Scopus and Web of Science (WOS) databases and later quantitatively analyzed and visualized using Bibliometrix (R package). Finally, open challenges areas were identified for future research work.

Results:

The present study revealed that the number of literature studies published in AI for breast cancer detection and survival prediction has increased from 12 to 546 between the years 2000 to 2021. The United States of America (USA), the Republic of China, and India are the most productive publication-wise in this field. Furthermore, the USA leads in terms of the total citations; however, hungry and Holland take the lead positions in average citations per year. Wang J is the most productive author, and Zhan J is the most relevant author in this field. Stanford University in the USA is the most relevant affiliation by the number of published articles. The top 10 most relevant sources are Q1 journals with PLOS ONE and computer in Biology and Medicine are the leading journals in this field. The most trending topics related to our study, transfer learning and deep learning, were identified.

Conclusion:

The present findings provide insight and research directions for policymakers and academic researchers for future collaboration and research in AI for breast cancer patients.

Introduction

Breast cancer is the most commonly diagnosed cancer among women in most countries (159 of 185 countries), with an estimated 2.3 million women diagnosed with breast cancer in 2020. Moreover, breast cancer is the leading cause of cancer death in women in 110 countries, with 685000 deaths globally (1). However, early detection and prognosis prediction, which involves explicitly estimating the relapse of breast tumors and predicting the 5-year survival rate of the breast cancer patient, can significantly improve patient outcomes (2, 3). In this context, several developed countries have employed extensive mammography, Magnetic Resonance Imaging, breast ultrasound, and thermography-based screening programs for earlier breast cancer (4, 5). However, one of the significant challenges lies in interpreting these images generated by such techniques. In addition, the precision and accuracy achieved by even the best clinicians in detecting breast cancer using mammography vary widely, thus leaving room for further improvements (6, 7). In this context, in the 1990s, Computer-aided software detection was introduced for mammography, and several software assistive applications have been approved for medical use. However, despite initial promising implementations, the software tools of the 1990’s era could not significantly improve the performance of mammography readers in real-world scenarios (7–11).

Over the past few years, AI’s potential in precision oncology has uniquely poised to handle the errors associated with medical image analysis (12–19). AI is centered on developing high-level algorithms to execute complex tasks in clinical settings in radiology to quickly and effectively aid in interpreting image data. The main objective of applying AI to image analysis is to reveal a visual pattern from image data and assist clinicians and mammogram experts in formulating effective clinical decisions about breast cancer detection and survival prediction. In recent years, the field of AI in breast cancer research has seen a resurgence owed to the commendable performances of Deep Learning (DL) in detecting breast cancer and further predicting the 5-years survival of breast cancer using mammography. Studies have shown the capacity of DL to be at par, or in some cases, exceed the performance of human experts in medical–image analysis for the diagnosis and prognosis of breast cancer (20, 21). As the scarcity of mammography experts threatens the availability and sufficiency of breast-screening services worldwide, AI agents’ unique precision and accuracy in an image- analysis could enhance the access to high-quality diagnosis and prognosis of breast cancer. Therefore, the prospects of AI in facilitating clinicians in clinical decision-making and managing breast cancer are manifold and ever-expanding. As the applications of AI in breast cancer diagnosis and prognosis grow, it becomes necessary to comprehend the ongoing research setting and future research trajectory. However, the AI-based research in breast cancer detection and survival prediction does not explore inherent development rules and current research trends and discuss the challenges that the AI will face in diagnosing and prognosis of Breast Cancer. Therefore, to achieve the goal, the present study aims to review the existing research articles through bibliometric analysis to learn about the global progress and trends in the application of AI for breast cancer detection and survival prediction. Bibliometric analysis is a quantitative analysis of research publications to describe the trends in academic literature, the contributions of journals and authors, nations’ productivity in a particular research area, and info regarding research collaborations and cooperation (22–24). In addition, the bibliometric analysis enables monitoring of the patterns and trends of effectual publications in several areas, including healthcare research (25).

Thus, the current bibliometric analysis findings will help researchers, governments, and entrepreneurs understand the Development of AI research in breast cancer diagnosis and prognosis in the last two decades. For research scholars and scientists, the present study results will be helpful to know about the important journals and understand the thematic trends of AI in breast cancer diagnosis and prognosis research. Our study will help governments devise more proficient present and future action strategies centered on AI research and development evolution trends in breast cancer diagnosis and prognosis. In the context of entrepreneurs, the results will help scree the most contributing research organizations toward AI for breast cancer research and also develop a competitive AI market for developing AI applications for breast cancer detection and survival prediction after understanding the collaboration networks of the AI in breast cancer diagnostic and prognostic research area. Moreover, the current study is the first to quantitatively analyze the hot research domains of breast cancer research and the application of AI in cancer detection and survival prediction. Our study portrays the impact of scientificc information by indicating gaps and presenting a meaningful path for future research in AI for breast cancer detection and survival prediction. An overview of the systematic review of AI’s application in breast cancer detection and survival prediction includes eight distinct phases, as shown in Figure 1. As shown in Figure 1, Phase-1 presents the data source and methodology; Phase-2 offers the fundamental bibliometric analysis; Phase-3 shows the conceptual knowledge structure analysis; Phase-4 describes the intellectual knowledge structure analysis; Phase-5 describes the social knowledge structure analysis; Phase-6 lists the current bibliometric limitations; Phase-7 describes the open challenges of AI in breast cancer diagnosis and prognosis research; and finally, Phase-8 describes the concluding remarks.

Figure 1

Materials and methods

Methodology and data sources

Pre-planning

In the pre-planning stage, search queries were selected as tabulated in Supplementary Table S1. The search queries were categorized as 1) key search terms and 2) a combination of key search terms with breast cancer and search items related to the prediction and classification of breast cancer. The key search terms included AI, Machine Learning (ML), and names of different supervised and unsupervised algorithms as tabulated in Supplementary Table S1. The second search terms, as tabulated in Supplementary Table S1, included a combination of search queries in association with “breast cancer and detection,” “breast cancer and classification,” “breast cancer and prognosis detection,” “breast cancer and mortality risk,” “breast cancer and survival,” “breast cancer and prediction,” and finally “breast cancer and microarray gene expression.” A subset of crucial search queries and different combinations of key search terms were selected based on the relevance of the search criteria to AI and its application in breast cancer diagnosis and prognosis research. Our search scope expanded but remained focused on breast cancer by searching literature using key search terms combined with breast cancer and search items that include the word prediction, classification, diagnosis, and prognosis of breast cancer. The idea of adding breast cancer and microarray gene expression criterion with the key search items, namely AI and ML, is to explore and analyze the application of AI and ML in breast cancer research using microarray gene expression data. Since microarray gene expression data plays a significant role in understanding the role of different gene biomarkers in the pathophysiology of breast cancer disease initiation and progression. Thereby employing AI and ML techniques, the most relevant/informative breast cancer gene biomarkers can be screened, and subsequently, classification and deep learning models can be constructed to predict and classify the disease’s different stages. Therefore, the involving gene microarray data with AI helps us understand the evolving role of AI in breast cancer severity, mortality, and survival predictions across the past two decades.

In addition, appropriate research questions were formulated as tabulated in Supplementary Table S2 to provide a comprehensive overview of the knowledge structure and bibliometric and statistical techniques to evaluate the role of AI research in breast cancer detection and survival prediction from the year 2000 to 2021.

Data collection

In the data collection stage, we systematically searched academic articles in WOS core collection and Scopus databases from 1^st January 2000 to 31^st September 2021 that involved AI’s application in breast cancer detection and survival prediction research. The keywords used for the data retrieval are tabulated in Supplementary Table S1. In addition, research articles and review papers written in English were included in the present study. From Scopus 10161 academic publications and ISI WOS, 7277 research publications were retrieved for analysis.

Data refinement

Further, in the data refinement stage, the publications retrieved from WOS and Scopus were refined based on the exclusion criteria tabulated in Supplementary Table S2. In addition, we excluded studies published as books, editorials, letters, conference papers, and academic publications not published in the English language were excluded from our systematic bibliometric review. Lastly, the refined list of publications obtained from Scopus (1737) and WOS (1841) was combined by removing the redundant publications. Therefore, after the refinement process, the total number of articles was reduced to 2641. A systematic workflow of the selection criteria for data collection and refinement is shown in Supplementary Figure S1 and Supplementary Table S3.

Data extraction

We retrieved the metadata from Scopus and WOS as a bibliographic information file (.bib file). The data exported included: (a) authors/editors, (b) authors full name, (c) title, (d) source, (e) authors’ keywords, (f) keywords plus, (g) abstracts, (h) authors affiliations, (i) corresponding authors affiliation, (j) cited references, (j) total citations, (k) highly cited (l) usage counts (m) publication year, (n) DOI, (o) subject category, (p) author identifiers, (q) languages, and (r) funding agencies.

Bibliometric data analysis

The bibliometric analysis enables a researcher to record, access objectively, and process hundreds or thousands of publications to profoundly summarize recent trends in scientific publications in a discipline or specifically in a research area. In the present study, a bibliometric analysis of publications related to the evolution of AI research in breast cancer diagnosis and prognosis from 2000 to date is performed to address the six major queries as tabulated in Supplementary Table S2. The bibliometric data analysis was conducted using biblioshiny (26) to represent the publication patterns and the research trends in implementing AI on breast cancer diagnosis and prognosis. In addition, we intend to statistically explore and evaluate the scientific knowledge structure through the current bibliometric analysis. The basic knowledge structure of a research field can be categorized into three parts such as:

1. Conceptual structure (what literature talks about central themes and trends related to a specific research field)

2. Intellectual structure (How the work of an author influences a given scientific community)

3. Social structure (how authors, institutions, and countries interact with each other)

Firstly, the conceptual structure is explored statistically using thematic mapping (27), thematic evolution, co-occurrence network, and factorial analysis. Secondly, the intellectual knowledge structure was assessed by performing co-citation network analysis (28) and historiography (29). Finally, the social knowledge structure was reviewed based on the collaboration network and collaboration world map. Therefore, upon analyzing the conceptual, intellectual, and social structure, we can understand the knowledge structure of the application of AI in breast cancer diagnosis and prognosis during the last two decades. Thus upon analyzing the knowledge structure of AI in breast cancer in the previous two decades, we will understand the current accomplishments and future open challenges in implementing AI for breast cancer diagnosis and prognosis.

Results

Annual scientific production

The number of publications from 2000 to 2021 shows the evolution of the research and trends in AI for breast cancer diagnosis and prognosis. The current study uses WOS and Scopus databases to mine 2641 academic publications from 2000 to 2021 using the query listed in Supplementary Table S1. As shown in Figure 2, the yearly scientific publication presents variations in scientific contribution in the research field mentioned above within a specified time duration. The analysis shows that the global scientific publication trends in AI for breast cancer diagnosis and prognosis peaked in 2019-2021, with 2020 being the most productive year (456 scientific publications). Thus, the increasing frequency of international academic literature in the last six years (2016 to 2021) depicts a growing intensity of research in AI for breast cancer diagnosis and prognosis. Therefore, we can presume that the research in AI for breast cancer diagnosis and prognosis has attracted the most attention of researchers during the last decade (2011-2021).

Figure 2

Most relevant authors

The current paragraph highlights the most prolific researchers in the field of AI for breast cancer detection and survival predictions in terms of the number of publications in this area and the impact of their publications. Table 1 shows the 15 most prolific authors with their number of publications, total citations, and corresponding h-index. As is evident from Table 1, Zang, Y from Henan Polytechnic University, Jiaozuo, China, has the most number of publications, i.e., 31, closely followed by Wang Y from Hangzhou Dianzi University, Hangzhou, China, Li Y from Chongquing University/Third Military Medical University, Chongqing and Zhang J from Zhejiang Cancer Hospital, Zhejiang Hangzhou, China with 28 publication each author. However, regarding the impact of these publications in terms of total citations, Chen H has the highest citations with 1302 citations, followed by Madabhushi, A, Rangayan, R with 1233 and 1225 citations, respectively. Furthermore, Chen H and Zhang Y is the most contributing author with an h-index of 13, followed by Rangayyan R with 12, Zhang X, and Wang Y with 12 each. Thus, the table suggests that Zang Y, with the highest number of publications, is the most contributing researcher in AI for breast cancer detection and prognosis predictions.

Table 1

Rank	Element	H_index	TC	NP
1.	CHEN H	13	1302	17
2.	ZHANG Y	13	445	31
3.	RANGAYYAN R	12	1225	13
4.	ZHANG X	12	791	21
5.	WANG Y	12	666	28
6.	ZHANG J	12	444	28
7.	WANG J	11	663	24
8.	YANG Y	10	1107	15
9.	CHEN X	10	1080	13
10.	LIU J	10	648	18
11.	LI Y	10	565	28
12.	CHEN Y	10	418	20
13.	SILVA A	10	377	18
14.	MADABHUSHI A	9	1233	10
15.	POLAT K	9	654	10

Tabulation of the 15 most prolific authors with their number of publications (NP), Total Citation (TC), and corresponding h-index (Note the authors are ranked based on h-index and h-index obtained from biblioshiny).

Most relevant organizations

The top 10 most contributing/relevant organizations in AI for breast cancer detection and survival prediction research are represented in Supplementary Figure S2. As per Supplementary Figure S2, there are five most productive organizations, among which Stanford University, USA, is the topmost productive organization with 38 publications, followed by National Taiwan University, Taiwan, with 37 publications, Sun Yat-sen University, China, with 32 publications, University of Malaya, Malaysia with 32 publication and Sichuan University, China, with 30 publications. Moreover, it is remarkable that out of the top 10 organizations globally, four organizations are from China.

Country scientific production

The top 20 contributing countries in AI for breast cancer detection and survival prediction are shown in Table 2. The data tabulated in Table 2 includes the total article published in the given field, total citations, and the average article citations. It appears from Table 2 that there are only two countries (China and USA) producing more than one thousand publications in the AI for breast cancer detection and survival prediction from the year 2000 to 2021. As per Table 2, the Republic of China is the top scientific productive country with 1217 publications, followed by the USA with 1100 publications, and India with 690 publications in AI for breast cancer detection and survival prediction research. The USA is the most influential country with 13015 citations, followed by China and United Kingdom (UK) with 9375 and 3166 citations. Surprisingly, the Netherlands is in twenty positions in terms of publication numbers. However, the average article citation in the Netherland is 82.26, which is the highest among the top twenty countries. Thereby, we can conclude that Netherland significantly impacts research in AI in breast cancer diagnosis and prognosis.

Table 2

Region	Number of Publications	Total Citations	Average Article Citations
CHINA	1217	9375	19.7
USA	1100	13015	34.43
INDIA	690	3153	8.64
UK	273	3166	39.09
CANADA	217	1318	20.28
SPAIN	201	2581	51.62
GERMANY	191	2562	45.75
SOUTH KOREA	189	1445	19.01
IRAN	158	1438	19.43
TURKEY	145	2506	30.19
ITALY	139	822	16.12
AUSTRALIA	125	1819	34.32
MALAYSIA	121	617	10.82
EGYPT	115	1302	21
PAKISTAN	112	532	12.98
SAUDI ARABIA	106	385	9.17
FRANCE	98	493	22.41
BRAZIL	97	908	19.32
SINGAPORE	73	877	38.13
NETHERLANDS	71	2221	82.26

Tabulation of the top 20 contributing countries in AI for breast cancer detection and survival prediction (Note that the countries are ranked based on the number of publications).

Most preferred periodicals

The number of publications in terms of Bradford law called the core sources the nucleus of journals, mainly devoted to the given research area. It appears from Supplementary Figure S3 that the top ten journals, as tabulated in Table 3, form the core of journals publishing about a third of the documents of the entire collection. The leading ten relevant periodicals that published one or more articles included in our bibliographic collection are tabulated in Supplementary Table S4. It is noteworthy that PLOS ONE, with 96 articles, is the most preferred publishing venue, followed by Computers in Biology and Medicine and Expert Systems With Application with 86 and 81 articles. In terms of the H-index, which is a journals number of published articles (h), each of which has been cited by other papers at least h time, Expert System with Applications with an h-index of 36 and with amazingly 4230 total citations is the most leading journal, followed by IEEE Transactions On Medical Imaging (h-index = 32, TC = 4223). Artificial Intelligence in Medicine (H-index = 28, TC = 2837), Computers in Biology and Medicine (H-index = 28, TC = 2147) and BMC Bioinformatics (H -index = 24, TC = 3114) being other most prominent journals publishing in the area of AI in breast cancer detection and survival predictions.

Table 3

Sources	Articles	H-index	Total Citations
PLOS ONE	96	26	2242
Computers In Biology And Medicine	86	28	2147
Expert Systems With Applications	81	36	4230
IEEE Access	80	13	627
Scientific Reports	77	17	1736
BMC Bioinformatics	72	24	3114
Computer Methods And Programs In Biomedicine	66	23	1615
Artificial Intelligence In Medicine	64	28	2837
Neurocomputing	62	25	2529
IEEE Transactions On Medical Imaging	56	32	4223

Top 10 preferred periodicals for AI in breast cancer detection and survival prediction research from the year 2000 to 2021 (The journals are ranked based on the H-index).

Highly cited research publications in AI for breast cancer detection and survival predictions

The topmost ten highly local cited (Local citation measures the impact of documents in the analyzed collection) research publications within AI for the given research area published between 2000 to 2021 are tabulated in Table 4. For example, Delen D 2005 (30) published an article titled “Predicting breast cancer survivability: a comparison of three data mining methods” published in “AI in Medicine” is the most locally cited article with 65 local citations and 539 global citations, respectively. Akay MF 2009 (31), with the article entitled “Support vector machines (SVM) combined with feature selection for breast cancer diagnosis” published in Expert System and applications, was the second most influential paper with 64 local citations and 367 global citations. Also, Zheng B 2014 (32) published an article entitled “Breast cancer diagnosis based on feature extraction using a hybrid of K-means and SVM algorithms” that got 58 local citations and 214 global citations. Finally, Kooi T 2017 (33) published an article entitled “Large scale DL for computer-aided detection of mammographic lesions” with 55 local and 387 global citations. Therefore, as shown in Table 4, these authors are the most influential authors contributing to AI for breast cancer detection and survival prediction research from 2000 to 2021.

Table 4

Document	Journal	DOI	Year	Local Citations	Global Citations
Delen, Walker, and Kadam, 2005	Artif Intell Med	10.1016/j.artmed.2004.07.002	2005	65	539
Akay, 2009	Expert Syst Appl	10.1016/j.eswa.2008.01.009	2009	64	367
Zheng, Yoon, and Lam, 2014	Expert Syst Appl	10.1016/j.eswa.2013.08.044	2014	58	214
Kooi et al., 2017	Med Image Anal	10.1016/j.media.2016.07.007	2017	55	387
Arevalo et al., 2016	Comput Meth Prog Bio	10.1016/j.cmpb.2015.12.014	2016	48	172
Setiono, 2000	Artif Intell Med	10.1016/S0933-3657(99)00041-X	2000	46	140
Karabatak and Ince, 2009	Expert Syst Appl	10.1016/j.eswa.2008.02.064	2009	44	236
Araújo et al., 2017	Plos One	10.1371/journal.pone.0177544	2017	44	243
Cheng et al., 2006	Pattern Recogn	10.1016/j.patcog.2005.07.006	2006	40	303
Dheeba et al., 2014	J Biomed Inform	10.1016/j.jbi.2014.01.010	2014	39	170

List of top 10 highly locally cited articles within AI for breast cancer detection and survival prediction research from 2000 to 2021.

Conceptual knowledge structure analysis

Keyword analysis

In the current section, we apply the keyword analysis and keyword co-occurrences to analyze the research trends and developments in AI for breast cancer detection and survival predictions to display the research gaps in the literature and detect potential future research trends in AI for breast cancer detection and survival prediction field. The top fifteen keywords are highlighted in Supplementary Figure S4; with 805 occurrences, the keyword “breast cancer” is the most frequently occurring keyword, followed by ML (282), classification (281), DL (276), and feature selection (163). Furthermore, the correlation between AI and Breast cancer diagnosis and prognosis research can be mapped using the word growth graph shown in Supplementary Figure S5. As observed from the word growth graph, the occurrence per year of the main keywords, which are all the tools of AI for the earlier diagnosis of breast cancer, have grown progressively over time, namely breast cancer, DL, ML, feature selection, and classification. However, some of them, like “breast cancer, classification, ML, and DL,” grew more dynamically than other keywords. For example, in terms of cumulate occurrence in 2000, keywords breast cancer, machine learning, classification, feature selection, and deep learning were zero, one, three, one, and zero, respectively. Whereas in the year 2021, the keywords with the highest increase in occurrences from the year 2000 to 2021 were: Breast cancer (777), ML (275), classification (274), DL (258), and feature selection (162).

In addition to the author’s keyword analysis, the authors’ keywords co-occurrences were analyzed using biblioshiny. The Co-occurrence network can enable us to understand the topics covered by a research field and define the most critical and recent fronts (issues). It could also help us understand the evolution of the issues over time. The outcome of the Co-occurrence network study is presented in Figure 3. In Figure 3, the node size (keyword) represented by a dot in the network displays the number of occurrences (keywords). For instance, Breast cancer is the maximum size node, confirming that breast cancer is the most frequent keyword. In this regard, we can observe from Figure 3 that the author’s keywords are DL, ML, and classification, the highest frequency of occurrence after breast cancer. Likewise, the width of edges linking other nodes shows the occurrence of keywords employed concurrently in the research publications present in our metadata. In this context, we observe that the author keywords “breast cancer and DL” followed by “breast cancer and ML,” “breast cancer and classification,” “breast cancer and convolutional neural network (CNN),” and “breast cancer and computer-aided diagnosis (CAD)” have the most co-occurrences in current bibliometric literature.

Figure 3

Keywords evolution trends

Applying a clustering algorithm to the keywords network makes it possible to highlight different themes of a given domain. Each cluster/theme can be represented on a particular plot, known as a strategic or thematic map (27). In a thematic map, each bubble represents a network cluster. The bubble name is the word belonging to the cluster with the higher occurrence value. The bubble size is proportional to the cluster word occurrences, and the bubble position is set according to the cluster callon centrality and density. The callon centrality can be read as the importance of the theme in the entire research field, and callon density can be read as a measure of the theme’s development. Therefore, thematic maps were constructed to reveal the evolution of the keyword trends, as shown in Figure 4. The thematic map consists of four quadrants: The first quadrant from the right top corner signifies the thematic keywords belonging to motor themes, representing well-developed themes related to the Application of AI in breast cancer diagnosis and prognosis research.

Figure 4

The second quadrant represents the niche themes, which represent themes that have good internal development. The third quadrant represents the thematic keyword belonging to weakly developed, emerging, or declining themes. Finally, the fourth quadrants represent thematic keywords belonging to basic and transversal themes with weak internal development. For example, in Figure 4, the thematic analysis of the data obtained from 2000-2021, we observed that breast cancer classification and machine learning are both well developed and essential for the conceptual structure of the research field (AI for Breast Cancer Diagnosis and Prognosis). On the other hand, mammography, CAD, and mammogram are the themes that are important but less developed as compared to themes of the first quadrant (Motor themes). The themes such as feature extraction, cancer, and SVM have good internal development but unimportant external ties with the other themes, so they have a marginal role in the given scientific field. It is worth mentioning that the primary/transversal themes and the motor themes are considered those that support the development and strengthening of an area of knowledge (AI for breast cancer diagnosis and prognosis) due to their centrality and density.

On the other hand, DL, CNN, and Transfer Learning (TL) represent the emerging or declining themes with a weak internal development degree and are marginally crucial for developing the given scientific field. Next, the thematic evolution of the keywords from 2000 to 2021 is analyzed based on the keyword thematic map and Sankey diagram shown in Supplementary Figure S6 and Figures 5A–D, respectively. According to the Sankey diagram and keywords thematic map as shown in Supplementary Figure S6 and Figures 5A–D, we observe that from 2000 to 2015, studies were more focused on applying ML tools to detect metastatic breast cancer masses from ultrasound breast images. However, during the last five to six years, the implementation of DL techniques to improve the accuracy of detecting suspicious cancerous breast masses using ultrasound or MRI images of breast masses has paved the way for earlier detection of breast cancer. Moreover, studies have shown that the role of Natural Language Processing (NLP) has great potential in predicting metastatic breast cancer recurrence.

Figure 5

Multicorrespondence analysis and clustering map of words

Similarly to the network analysis, we applied the factorial analysis (data reduction technique) to study the sub-topics related to the implementation of AI in breast cancer detection and survival prediction research, as represented in Figure 6. The factorial analysis was performed using the multiple correspondence analyses as the dimensionality reduction technique and hierarchical clustering as the clustering algorithm to group related terms close to each other. Through the factorial analysis, the nodes with the same color constitute a cluster that depicts their central research theme (main topic) inferred from their respective sub-topics (nodes) within a given cluster. Further, the association between two nodes is dependent on proximity between the nodes. The closer the two nodes’ proximity, the more significant the articles treat them together. Nodes with lower proximity are pulled together while nodes with high proximity are distant, thereby attaining discrete clustering among keywords. The map’s origin for each cluster in the conceptual structure map represents the average position of all column profiles and, therefore, represents the center of the research field.

Figure 6

The conceptual structure analysis using factorial analysis reveals that the two subfields were identified in the scientific field of AI for breast cancer detection and survival predictions. The two main subfields are as follows:

1. Red cluster grouping together author keywords: breast cancer, CAD, neural network (NN), data mining (DM), CNN, TL, DL, mammography, mammogram, SVM, classification, ML, and feature selection. The factorial analysis shows that the keyword “breast cancer” occupies a more central position in the red cluster. Thus, we can conclude that breast cancer is the red cluster’s most common and significant topic.

2. Blue cluster grouping the author keywords: SVM, breast cancer diagnosis, SVM, cancer, and feature extraction. The factorial analysis shows that the keyword “cancer” occupies a more central position in the blue cluster. Thus, we can conclude that cancer is the most common and significant keyword in the blue cluster.

Multicorrespondence analysis and clustering most contributing documents

The graphical map shown in Supplementary Figure S7 allows us to identify the link between the topics and the related documents. The map plots the documents associated with the highest total contribution. The total contributions measure each document’s weight in the information summarized by the two axes. The colors represent the clusters to which each record belongs. The most contributing documents related to the blue and the red cluster are shown in Supplementary Figure S7 and tabulated in Table 5. We can observe from the data available from the red cluster that the article published by Chougrad H, 2018 (34), entitled “Deep Convolutional Neural Networks (DCNN) for breast cancer screening” published in Compt Meth Prog Bio, is the most contributing paper followed closely in the second position by Masud M, 2020 (35) entitled “CNN-based models for diagnosis of breast cancer” published in Neural Computing Application.” In the same context, the article authored by Murtaza G, 2020 (36), entitled “Breast Cancer Multi-classification through Deep Neural Network (DNN) and Hierarchical Classification Approach,” published in Multimedia Tools and Applications, is the third most contributing paper. Finally, the article “MitosisNet: End-to-End Mitotic Cell Detection by Multi-Task Learning,” published in IEEE Access and authored by Alom MZ, 2020 (37), is the fourth most contributing document on the associated topics with the red cluster. The article entitled “Development of an intelligent CAD system for mass detection in mammographic images,” published in IET Image Processing, authored by Andreadis T in 2020 (38), is the most contributing paper on the topics related to the blue cluster. In addition, the articles written by Salama WM, 2020 (39) and Eltrass AS, 2020 (40) were the second and third most contributing paper in the area of research related to the blue cluster.

Table 5

Cluster	Documents	Article tile	Journal	Contribution
Red (I)	Chougrad, Zouaki and Alheyane, 2018	Deep Convolutional Neural Networks for breast cancer screening	Computer Methods and Programs in Biomedicine	1.38
	Masud, Eldin Rashed, and Hossain, 2020	Convolutional neural network-based models for diagnosis of breast cancer	Neural Computing Application	1.02
	Murtaza, Shuib, Mujtaba, et al., 2020	Breast Cancer Multi-classification through Deep Neural Network and Hierarchical Classification Approach	Multimedia Tools and Applications	1.02
	Alom et al., 2020	MitosisNet: End-to-End Mitotic Cell Detection by Multi-Task Learning	IEEE Access	1.01
Blue (II)	Andreadis et al., 2020	Development of an intelligent CAD system for mass detection in mammographic images	IET Image Processing	4.23
	Salama, Elbagoury, and Aly, 2020	Novel breast cancer classification framework based on deep learning	IET Image Processing	4.16
	Eltrass and Salama, 2020	Fully automated scheme for computer-aided detection and breast cancer diagnosis using digitized mammograms	IET Image Processing	3.82

Highly contributing Articles by clusters obtained using Multicorrespondence Analysis.

Multicorrespondence analysis and clustering most cited documents

The graphical map in Supplementary Figure S8 allows us to identify the link between the topics and the cited documents. The graphical map plots the documents associated with the highest global citations. The colors represent the clusters to which each document belongs. The most cited papers related to the blue and the red cluster are shown in Supplementary Figure S8 and tabulated in Supplementary Table S5. We can observe from the data available from the red cluster that the article published by Sirinukunwattana K, 2016 (41) entitled “Locality sensitive deep learning for detection and classification of nuclei in routine colon cancer histology images” and published in IEEE Transactions on Medical Imaging is the most cited paper (557 Citations) in deep learning a subtopic associated with the red cluster. The documents authored by Delen D, 2005 (30) and Tang J, 2009 (42), are the second with 539 and the third with 443 citations, the most globally cited papers associated with subtopics of the red cluster. In the blue cluster, the article “SVM combined with feature selection for breast cancer diagnosis,” published in Expert systems with applications, authored by Akay MF, 2009 (31), is the most cited paper with 367 citations related to topics associated with the blue cluster. In addition, the articles authored by Chen HL, 2011 (43) and Stoean R, 2013 (44) were the second and third most cited documents in research related to the blue cluster.

Intellectual knowledge structure analysis

Co-citation analysis

Co-citation analysis (28) is a critical citation analysis technique in bibliometrics to show a relationship between nodes representing the author or documents (Representation of an Intellectual structure of a given research field). Here we talk about co-citation of two papers or authors when a third document or author cites both. The co-cited documents are represented as nodes, and the edges connecting the co-cited documents represent the instances of co-citation. Here the node size means the document occurrence, i.e., a paper with higher occurrence will have a correspondingly larger node size and vice versa. Moreover, the edge size is proportional to the document’s co-occurrence, i.e., records with higher co-occurrence will have a thicker edge size and vice versa. As per Figure 7, we can observe that the research papers by Simonyan K, 2014 (45), Kaiming HE, 2016 (46), Krizhevsky A, 2012 (47), Lecun Y, 2015 (48), Ronneberger O, 2015 (49), Spanhol FA, 2016 (50), Litjens G, 2017 (51), Bray F, 2018 (52), and Cireşan DC, 2013 (53) have been cited by other documents as well as co-cited by many source documents (documents in the dataset). Moreover, Breiman I, 2001 (54), Guyon I, 2002 (55), Haralick RM, 1973 (56), Akay MF, 2009 (31), Cortes C, 1995 (57), Delen D, 2005 (30) and Pena-Reyes CA 1999 (58) have been co-cited by other source documents. The color of the nodes in the co-citation network represents the research field to which the records belong. For example, the Red color nodes depict research in the DCNN for image classification to diagnose cancer. The blue nodes represent documents related to different ML algorithms for breast cancer diagnosis.

Figure 7

Historiography analysis

When examined over time, co-citation analysis helps detect a paradigm shift (a fundamental change in approach or underlying assumptions) and school of thought related to a particular research field (29). In Supplementary Figure S9, each historical path represents a research topic and its core authors and documents. Each node in Supplementary Figure S9 represents a document (included in the analyzed collection) cited by other documents. Each edge represents a direct citation, and nodes and edges are plotted on an oriented graph where the horizontal axis represents the publication years. Here, the blue color research path represents a fundamental change in approach and school of thought related to breast cancer diagnosis and the prediction of breast cancer survivability research using AI.

From 2000 to 2015, the focus was on detecting cancer and predicting survivability using a basic ML algorithm (30–32, 43, 59–63). After that, however, the emphasis has been on using DL networks in breast cancer diagnosis and prognosis research (64). The light yellow color research path represents the automated detection and classification of masses in the mammogram. From 2000 to 2010, the focus was on using CAD for breast cancer (65, 66)). After that, however, the focus shifted to heuristic and CNN for the CAD of breast cancer. The purple-colored research path represents breast cancer diagnosis using microscopic biopsy images. From 2000 to 2015, the purple-colored research path focused on diagnosing breast cancer using the computer-aided analysis of biopsy images. After that, however, the focus shifted to CNN to diagnose breast cancer using histological images (67).

Similarly, the light red color research path represents classifying and detecting lesions in a mammogram using DL techniques. The red-colored research path originated in 2015 and continues till 2021 (68–71). Lastly, the light blue research path represents the field of breast cancer classification’s DL and TL. Although the light blue research path originated in 2016 (72), the primary contributing authors are continuously publishing in DL and TL for the diagnosis and prognosis of breast cancer (73–78).

Social knowledge structure analysis

Authors’ collaboration network analysis

The author’s collaboration network analysis reveals how authors interact with each other. We applied a threshold of five papers per author and represented the global collaboration of authors worldwide. Figure 8 shows the partnership of the eight most contributing authors among the total authors in the dataset. Out of the selected fifty authors, eight authors collaborated strongly with the other authors in the dataset and had a minimum of five publications together. The thickness of the edges represents the association between the authors, and the node’s size represents the number of articles they co-authored together. For example, Wang S, Zhang Y, and Zhang X in the blue-colored research path published more papers together than other authors in the dataset. Similarly, Wang J, Li Y, and Li L in the red-colored research path published more articles than other authors in the dataset in the red-colored research field. Lastly, Ma Y and Yang Z in the green-colored research field published more articles together in the red-colored research field than the other authors in the dataset publishing article in the green-colored research field.

Figure 8

Institution collaboration network analysis

The Institution collaboration network analysis reveals how institutions interact with each other. We applied a threshold of two or more edges and represented the global collaboration of institutions worldwide. The thickness of the edges represents the association between the institutions, and the node’s size represents the number of articles they collaborated on. Among the total institutions listed in the dataset, Figure 9 shows the collaboration of the most collaborating institution. For example, the King Abdulaziz University of Saudi Arabia and the University of Leicester University had the maximum number of collaborated research in AI for breast cancer diagnosis and prognosis. Stanford University collaborated extensively with Radboud University and Tsinghua University in the same context.

Figure 9

Collaboration world map analysis

As shown in Figure 10, the country collaboration network analysis reveals how different countries interact. We applied a threshold of five or more edges and represented countries’ collaboration worldwide. For example, from Supplementary Table S4, we observe that China collaborated strongly with the USA with 77 partnerships, 26 with the UK, and 10 with India in the research field of AI for breast cancer diagnosis and prognosis. In addition, the USA strongly collaborated with the UK with 20 partnerships, 13 with Germany, 13 with India, 12 with Saudi Arabia, and 11 with Korea. Concurrently, Pakistan collaborated with Saudi Arabia, the UK, and Germany in AI for breast cancer diagnosis and prognosis.

Figure 10

Discussion

AI is perpetually changing the human race’s way of doing things and has been employed in many fields, including agriculture, the Internet of things (IoT), manufacturing, and intelligent healthcare. For example, since AI was introduced to detect and classify breast cancer and breast cancer patients’ survivability prediction, many academicians, scientists, and researchers have performed landmark experiments to employ different DL-based technologies for breast cancer detection and survival prediction. However, there was still a lack of a systematic evaluation of the application of DL in breast cancer diagnosis and prognosis from a bibliometric perspective. In particular, the existing literature did not conclusively answer the six questions well, including 1) What are the publishing and citation trends of the research publication in AI for breast cancer detection and survival prediction, 2) Who are the most contributing authors, journals, organizations, and countries in AI for breast cancer diagnosis and prognosis, 3) What are the publication patterns and most frequently used keywords of the articles published in AI for Breast Cancer diagnosis and prognosis, 4) What are the collaboration networks of AI research in breast cancer diagnosis and prognosis, 5) What are the thematic trends of the Application of AI in breast cancer diagnosis and prognosis research and development, and 6) What are the main open areas of challenges and the corresponding solutions for future research work in AI for breast cancer research. To address the gap in the knowledge structure in AI for Breast cancer diagnosis and prognosis, the current data and the related systematic bibliometric review methods to address the field of research are discussed. The present study depicts the research hotspots trends, publication patterns in different countries and journals, the author’s contribution and collaboration, and collaborations between countries and their institutions on AI for breast cancer diagnosis and prognosis research.

China is most productive in publishing research articles on AI for breast cancer diagnosis and prognosis research, followed closely by USA and India, respectively. While the USA has the most significant global influence based on the total citation indicators, and Netherland, in terms of average article citation, is the most influential country in research regarding the implementation of AI in breast cancer diagnosis and prognosis research. Furthermore, China is strongly collaborative with the USA, followed by the UK. Stanford University and National Taiwan University are the most relevant institutions in AI for breast cancer diagnosis and prognosis in the past two decades, from 2001 to 2021. The PLOS One is the most preferred periodical for researchers publishing articles on AI for breast cancer diagnosis and prognosis between the years 2000 to 2021. However, the journal “Expert Systems with Applications,” followed by the IEEE Transaction on Medical Imaging Journal, is the most influential AI in breast cancer detection and survival predictions research.

As per our bibliometric analysis, Zhang Y is the most contributing author and a prolific author publishing regularly in AI for breast cancer research. On the other hand, Chen H is one of the most influential authors, with 1302 citations and an H-index of 13. In 2017, Chen H and his team proposed a novel approach (Deep Contour-Aware Networks) for object instance segmentation from histopathological images (79). The proposed method won two histological object segmentation challenges: the 2015 MICCAI Nuclei Segmentation Challenge and the 2015 MICCAI Gland Segmentation Challenge, significantly surpassing all available techniques. Furthermore, Ramón Díaz-Uriarte and Sara Alvarez de Andrés, 2006 applied machine learning algorithms for gene selection and class prediction with microarray data (80) is the most globally cited article on AI for breast cancer diagnosis and prognosis research from 2000 to 2021. Delen et al., 2006 [30] compared three DM techniques for predicting breast cancer survivability, and as per the articles in our dataset collected from 2001 to 2022, their work is one of the most influential research (highly locally cited articles) in breast cancer research using AI techniques.

The keywords of a publication signify the main focus research areas, and the rate of recurrence of the keywords and their co-occurrences suggest the topics focused on that particular area of research. Accordingly, we found that “breast cancer,” “ML,” “classification,” “DL,” and “feature selection” are the most frequently occurring keywords based on keyword analysis. Analyzing the most relevant word data with that of top locally and globally cited literature offers a strong association between breast cancer and AI technologies, namely ML and DL, as these keywords are the most regularly used keywords in literature along with the most repeatedly mapped subject areas in articles present in our dataset. The current observation reveals that the prime focus of the researchers belonging to the medical imaging community is on solving medical imaging challenges in implementing AI techniques, namely DL and ML, for breast cancer research, especially concerning improving the accuracy of breast cancer screening and prognosis prediction of cancer patients. [34-40].

Morphological attributes of breast masses are crucial for classifying malignant masses based on texture and morphological characteristics of the breast images from benign tissues. Studies have shed light on using AI systems to extract features from breast ultrasound images. In a study by Hsu Sm et al., where texture attributes (namely, variance), morphological features, namely, a standard deviation of the shortest distance) and the nakagami parameters were combined to create a set of physical characteristics from the ultrasound images to build a classification model using fuzzy c-means (FCM) clustering algorithm that achieved a classification accuracy of 89.4% to discriminate between benign and malignant breast tissues (81). Zhang et al., in their study, developed a two-layer DL architecture by combining feature learning and selection techniques to extract Shear-Wave Elastography (SWE) features that performed better than the model build using the statistical features with an accuracy of 93.4% and an AUC value of 0.947, respectively (82). Furthermore, studies have shown that CAD systems, when employed to analyze the ultrasound features, enhance the diagnostic performance of inexperienced and experienced physicians (83, 84).

Moreover, the most crucial part of various diagnostic systems and human breast cancer diagnosis is the ability to classify benign breast masses from malignant breast tissues. In this context, to allow radiologists and physicians to reach a reliable conclusion in a short time regarding suspicious breast masses, AI systems have been developed gradually during the last two decades to classify benign and malignant breast masses. Several studies have used different deep learning architectures to classify malignant and benign breast lesions based on breast ultrasound images. To discuss a few DL-based studies, namely Becker AS et al., in 2018 (85) compared the performance of DL-based software for classifying malignant from benign breast tissues with three subjects with variable expertise (a trained medical student, a resident, and an experienced radiologist) in screening breast cancer using breast ultrasound images. The finding was encouraging as the DL software trained using a few hundred samples (553 benign and 84 malignant) showed comparable accuracy in classifying malignant from benign breast tissues compared to the experienced radiologist.

Moreover, the performance of the CNN-based system was better than the medical student trained using the same training data (n= 445, i.e., 70% of the total data). These findings showed that DL-based models could mimic a human decision-making process. Furthermore, in another study by Cirtisis A et al., in 2019 (86), the dCNN method achieved a classification accuracy of 95.3%, which was better than 94.1% obtained by a radiologist on the external dataset comprising ultrasound images of breast lesions. These studies have shown that AI-based tools can shorten the diagnosis time of experienced doctors (radiologists) and enhance the diagnostic capability of inexperienced doctors. Moreover, our claims of the correlation between breast cancer and AI tools can also be interpreted from the cumulative occurrence word growth graph of keywords from 2000 to 2021. We can conclude from the observation made from the word growth graph that a strong correlation between the keywords, namely “breast cancer,” “ML,” “classification,” “DL,” and “feature selection,” exists. Moreover, due to the increasing implementation of AI, particularly DL in breast masses medical image analysis for the detection of cancer, these keywords form a significant portion of the trending topics in AI for the earlier detection and survival prediction of breast cancer and breast cancer patients, respectively, during the last five years (3, 32, 33, 39, 50, 67–72, 74, 76–78, 87–90).

The conceptual structure map obtained using the factorial analysis reveals that the last two decades have shed light on AI sub-topics: CNN, TL, DL, NN, SVM, classification, ML, and feature selection. While the keywords, namely, CAD, mammography, and mammogram, represent sub-topics related to breast cancer diagnosis and detection. Consequently, we can say that the red cluster contains keywords that highlight AI techniques’ application in breast cancer diagnosis and prognosis. Moreover, with its fast computing capability, and good result reproducibility with minimum efforts, AI has shown great potential in providing fact-based and helpful information to doctors in the diagnosis of breast cancer, thereby reducing the load of medical practitioners and the amount of incorrect breast cancer analysis (91, 92). Intuitively, the high number of quality publications published related to topics in the red cluster as compared to the blue cluster can be dedicated to the increasing role of ML and DL techniques, namely, CNN (34-35, 47, 50, 68, 70, 73, and 116), NN (16, 21, 36, 53, 60–63, 93), SVM (31, 32, 43, 44, 55), feature selection (31, 43, 44), and classification (50, 59, 63, 67–69, 71, 73, 75, 79, 90) in medical image analysis task.

TL is based on applying established ML and DL approaches that implement previously learned knowledge to solve novel problems more accurately and effectively (94, 95). Hyunh et al. first applied the TL technique in 2016 (96) for breast cancer imaging, using the well-defined CNN models: ResNet, GoogLeNet, AlexNet, VGGNet, and Inception, to solve image classification tasks that were trained on natural image database, ImageNet (97). Next, Yap et al., 2018 (98) proposed implementing a deep neural learning approach for breast cancer diagnosis —with a pre-trained CN, AlexNet, using three different methods— a U-Net model, a transfer learning method, and a patch-based LeNet approach. Later, Byra et al. in 2019 (99) developed a neural TL methodology for classifying breast lesions using ultrasound images. Succeeding the previous works, many studies were published in implementing TL techniques for breast detection using an ultrasound imaging approach (100, 101). Though TL approaches have continually been improving in the context of breast ultrasound analyses for breast cancer detection, there is always room for improvement (102, 103).

The CAD system for breast cancer diagnosis and prognosis has been extensively implemented (104). Relevant studies have shown that CAD systems are helpful in refining descriptions of the breast lesion and enhancing the consistency of the attributes of the breast masses among ultrasound examiners, thereby helping in the decision-making (83, 84). Recently, the implementation of DL in the CAD system has shown great potential in optimizing resource allocation, relieving doctors’ workload, and thus significantly improving the detection and prognosis of breast cancer (33, 93, 105, 106). Besides, DL-based CAD systems are contributing significantly to the fields of contrast-enhanced mammography, ultrasound and Magnetic Resonance Imaging (MRI) (107, 108), ultrasound elastography (109), and digital breast tomosynthesis (88, 110). Thus, with the advancement of AI expertise, radiologists are confident of achieving more accurate classification and thereby achieving early detection, timely diagnosis, and apt treatment of breast cancer, thereby benefiting most breast cancer patients.

Further, the conceptual knowledge structure was evaluated using the co-occurrence network. Therefore, through the co-occurrence network of the author’s keyword, we determine that on recent fronts, “breast cancer and DL” (33, 39, 71, 72, 75, 77, 78, 87–90), “breast cancer and ML,” (3, 31, 32, 43, 44, 55), “breast cancer and classification,” (50, 59, 63, 67–69, 71, 73, 75, 79, 90), “breast cancer and CNN,” (34, 35, 47, 50, 67, 69, 73, 90), and “breast cancer and CAD” (7, 8, 10, 33, 42, 65, 90), with the highest total link strength depicts the multi-faceted implementation of AI in breast cancer detection and survival prediction research areas during the years 2020- 2021. Moreover, as per the analysis of the Sankey diagram and the thematic evolution of keywords from 2000 to 2021, we understand the following:

1. From 2000 to 2010, the motor theme focused more on keywords mammography (4, 6, 8, 10, and 42), and ML-related topics (31, 43, 44, 50, 59, 60, 63, 67–69, 71, 73, 75, 79, 90) for breast cancer diagnosis and prognosis. The researchers investigated several ML methods for automating mammogram image classification during this period. The major limitation of the conventional ML studies is the detection of breast masses which vary in size, making it challenging for the researcher to detect and classify suspicious malignant breast masses from benign breast masses (111, 112). Therefore, detecting suspicious breast masses was still an open challenge for future cancer detection and prognosis research studies.

2. During the last five to six years, the basic and the transversal themes show that keywords ML (31, 43, 44, 50, 59, 60, 63, 67–69, 71, 73, 75, 79, 90), DM (30), SVM (31, 32, 43, 44, 55), feature selection (31, 43, 44), and classification (50, 59, 63, 67–69, 71, 73, 75, 79, 90) have merged into a single cluster, namely breast cancer. Moreover, DL (33, 39, 71, 72, 75, 77, 78, 87, 88, 90) and feature extraction (32) have also evolved as the primary themes in the AI field for the diagnosis and prognosis of breast cancer in recent years (2015 to 2021). However, these fields are essential for applying AI in breast cancer diagnosis and prognosis research but are not well developed, and it is far from the goal of being fully integrated into the work of clinicians and large-scale application in the world. Still, we believe that with the progress of research in AI methodology, doctors will be in a position to achieve earlier detection of breast cancer with higher accuracy and precision.

3. NLP, another emerging area of research in recent years, has a potential role in harvesting important clinical attributes unexplored within electronic medical registers. Therefore, by developing the NLP system, researchers in the coming years can use the information present in an electronic record on cancer outcomes and treatment to find individual patient timelines of metastatic breast cancer relapse (113, 114).

As per the co-citation analysis, we can say that documents by Simonyan K, 2014 (45), Krizhevsky A, 2012 (47), and Lecun Y, 2015 (48) have a higher occurrence and co-occurrence, proving that these research articles are landmark articles in applying AI to Breast cancer diagnosis. Furthermore, the historiography analysis helps detect a paradigm shift and school of thought related to AI in breast cancer diagnosis and prognosis research. Here from the historical path analysis, we observe that during the last five to six years, the focus has been on using deep learning (64, 67–72) and transfer learning techniques (75, 77, 87, 115, 118) for an image-based detection of breast cancer and survivability prediction research.

Finally, the social knowledge structure analysis shows that authors Zhang Y& Zhang X, Zhang Y & Wang S, Wang J, Li Y, Li L, and Ma Y & Yang Z collaborated and published more papers than other authors in the dataset. Similarly, the institution collaboration network analysis reveals that the King Abdulaziz University of Saudi Arabia and the University of Leicester University had the maximum number of collaborated research in AI for breast cancer diagnosis and prognosis. In addition, Stanford University collaborated extensively with Radboud University and Tsinghua University in the same context. Finally, as per the world map collaboration analysis, we observe that the developed nations, namely China, the USA, India, the UK, and Saudi Arabia, are pivotal in promoting collaborative research on AI for breast cancer diagnosis and prognosis research through their constant search for collaboration with other countries. However, we observed that institutions in developed countries seldom take the initiative to collaborate with institutions in developing and underdeveloped economies. Instead, the developed nations tend to select equally good or better institutions than themselves as collaborators.

However, these DL and TL techniques have not been declared primary clinical protocols for clinicians to detect breast cancer and cancer patients’ survivability. Thus, the scientific community must collaborate globally to undertake the necessary medical device regulation to use deep learning technology in health care. Therefore, the current systematic bibliometric review could be a valuable resource for beginners who wish to apply DL and TL techniques for breast cancer classification, detection, and survivability through different medical imaging modalities.

Open challenges in AI for breast cancer diagnosis and prognosis

As per the evolution of the field of AI and its application in breast cancer diagnosis and prognosis has evolved, we observe from the thematic map that during the last five to six years, the basic and the transversal themes show that keyword, DL, and TL have evolved as the primary themes in the AI field for the diagnosis and prognosis of breast cancer in recent years (2015 to 2021). However, although DL and TL themes are essential for applying AI in breast cancer diagnosis and prognosis research (70, 71, 74, 76, 88, 89), these fields have not developed enough to be used as clinically proven technology to be used by clinicians for earlier detection of cancer and cancer patient survivability predictions using histopathological images and mammograms (90, 116). Therefore, efforts have to be made by the scientific community globally to collaborate efficiently to implement DL technologies to improve the performance of breast cancer classification and detection performance. Hence, these DL techniques can be used as a primary diagnostic tool for the detection of breast cancer and survivability prediction of breast cancer patients with greater accuracy and precision.

Moreover, we observed that the developed nations’ institutions seldom take the initiative to cooperate with institutions in developing and underdeveloped countries. Instead, the developed nations tend to select equal or better institutions with infrastructure and intellects than themselves as collaborators. Therefore, a country with better infrastructure and economy should collaborate with prolific intellectuals and their affiliated institutions from developing and underdeveloped countries with funded projects to try and utilize the current technology to establish a worldwide AI-based breast cancer healthcare ecosystem. The AI-based breast cancer healthcare ecosystem will allow institutions from underdeveloped countries to significantly implement advanced DL techniques in breast cancer diagnosis and prognosis.

Clinical and image data should be shared. However, data that is demonstrative of typical breast cancer patients, annotated, structured, and ready to be used is inadequate and available in only a few institutions. Therefore new imaging repositories, such as the Health Data Research Innovation Gateway, must be set up to address this data gap. In addition, setting up new image repositories is vital for developing a data ecosystem to meet the demand for developing a novel algorithm for the earlier detection and treatment response prediction of breast cancer.

Further, it is essential to bring scientific fields together, which means a new multidisciplinary team, including clinical scientists, informaticians, and clinicians needs to be trained and developed to incorporate AI analysis into breast cancer care decisions (117).

Limitations

Our bibliometric review has some limitations. First, we included publications available only in the English language. Secondly, we did not include electronic preprints studies published in an online open-access repository, the ArXiv. We might have skipped several publications related to AI and Breast cancer diagnosis and prognosis research; nevertheless, these electronic preprints in the online repositories are not peer-reviewed articles. Third, we only extracted and analyzed data from WOS and Scopus data from January 2000 to October 2021. So we might have missed many articles linked to AI and Breast cancer diagnosis and prognosis research published between the years November 2021 to January 2022.

Conclusion

DL, feature extraction, and TL for breast cancer diagnosis have become basic and transversal themes in the last five to six years. However, these fields are not well developed enough to be used by clinicians for regular cancer detection and prognosis prediction. Therefore, there is urgent to convert these basic themes to motor themes and append these techniques to clinical practices as a breast cancer diagnostic or prognostic tool. Therefore, the current systematic bibliometric review could be a valuable resource for beginners applying AI to researchers on DL-based breast cancer classification through different medical imaging modalities.

Funding

This project was funded by the Deanship of Scientific Research (DSR), King Abdulaziz University, Jeddah, under grant no. (D:830-1021-1443). The authors, therefore, gratefully acknowledge DSR's technical and financial support.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: Web of Science and Scopus.

Author contributions

All authors made substantial intellectual involvement in the present study to meet the requirements as authors. First, AS and TK apprehended the study’s design. Second, AS and TK collected the data, and AS performed the research and analyzed the data. Third, TK drafted the materials and methodology and edited the figures. Fourth, AS drafted the abstract, introduction, result, and discussion. Finally, AS and TK edited the manuscript. All authors agree to be accountable for the content of the work

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.854927/full#supplementary-material

Supplementary Figure S1

Study Selection workflow.

Supplementary Figure S2

Most relevant affiliation in AI research for breast cancer detection and prognosis prediction.

Supplementary Figure S3

Source clustering through Bradford’s Law.

Supplementary Figure S4

Top fifteen keywords in AI for breast cancer detection and survival prediction research from 2000 to 2021.

Supplementary Figure S5

Keywords growth curve from 2000 to 2022.

Supplementary Figure S6

Sankey diagram based on keyword thematic evolution from 2000 to 2020.

Supplementary Figure S7

Factorial map of the documents in the red and blue clusters with the highest contributions.

Supplementary Figure S8

Factorial map of the documents in the red and blue clusters with the highest citations.

Supplementary Figure S9

Historical direct citation network analysis from 2000 to 2021.

References

1
SungHFerlayJSiegelRLLaversanneMSoerjomataramIJemalAet al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin (2021) 71:209–49. doi: 10.3322/caac.21660
- CrossRef
- Google Scholar
2
McKinneySMSieniekMGodboleVGodwinJAntropovaNAshrafianHet al. International evaluation of an AI system for breast cancer screening. Nature (2020) 577:89–94. doi: 10.1038/s41586-019-1799-6
- CrossRef
- Google Scholar
3
LiJZhouZDongJFuYLiYLuanZet al. Predicting breast cancer 5-year survival using machine learning: A systematic review. PloS One (2021) 16:e0250370. doi: 10.1371/journal.pone.0250370
- CrossRef
- Google Scholar
4
LeeCHDershawDDKopansDEvansPMonseesBMonticcioloDet al. Breast cancer screening with imaging: Recommendations from the society of breast imaging and the ACR on the use of mammography, breast MRI, breast ultrasound, and other technologies for the detection of clinically occult breast cancer. J Am Coll Radiol (2010) 7:18–27. doi: 10.1016/j.jacr.2009.09.022
- CrossRef
- Google Scholar
5
OeffingerKCFonthamETHEtzioniRHerzigAMichaelsonJSShihY-CTet al. Breast cancer screening for women at average risk. JAMA (2015) 314:1599. doi: 10.1001/jama.2015.12783
- CrossRef
- Google Scholar
6
ElmoreJGJacksonSLAbrahamLMigliorettiDLCarneyPAGellerBMet al. Variability in interpretive performance at screening mammography and radiologists’ characteristics associated with accuracy. Radiology (2009) 253:641–51. doi: 10.1148/radiol.2533082308
- CrossRef
- Google Scholar
7
LehmanCDWellmanRDBuistDSMKerlikowskeKTostesonANAMigliorettiDL. Diagnostic accuracy of digital screening mammography with and without computer-aided detection. JAMA Intern Med (2015) 175:1828. doi: 10.1001/jamainternmed.2015.5231
- CrossRef
- Google Scholar
8
GilbertFJAstleySMGillanMGCAgbajeOFWallisMGJamesJet al. Single reading with computer-aided detection for screening mammography. N Engl J Med (2008) 359:1675–84. doi: 10.1056/NEJMoa0803545
- CrossRef
- Google Scholar
9
GigerMLChanH-PBooneJ. Anniversary paper: History and status of CAD and quantitative image analysis: The role of medical physics and AAPM. Med Phys (2008) 35:5799–820. doi: 10.1118/1.3013555
- CrossRef
- Google Scholar
10
FentonJJTaplinSHCarneyPAAbrahamLSicklesEAD’OrsiCet al. Influence of computer-aided detection on performance of screening mammography. N Engl J Med (2007) 356:1399–409. doi: 10.1056/NEJMoa066099
- CrossRef
- Google Scholar
11
KohliAJhaS. Why CAD failed in mammography. J Am Coll Radiol (2018) 15:535–7. doi: 10.1016/j.jacr.2017.12.029
- CrossRef
- Google Scholar
12
ShawahnaASaitSMEl-MalehA. FPGA-based accelerators of deep learning networks for learning and classification: A review. IEEE Access (2019) 7:7823–59. doi: 10.1109/ACCESS.2018.2890150
- CrossRef
- Google Scholar
13
ShenDWuGSukH-I. Deep learning in medical image analysis. Annu Rev BioMed Eng (2017) 19:221–48. doi: 10.1146/annurev-bioeng-071516-044442
- CrossRef
- Google Scholar
14
SuzukiK. Overview of deep learning in medical imaging. Radiol Phys Technol (2017) 10:257–73. doi: 10.1007/s12194-017-0406-5
- CrossRef
- Google Scholar
15
GulshanVPengLCoramMStumpeMCWuDNarayanaswamyAet al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA (2016) 316:2402. doi: 10.1001/jama.2016.17216
- CrossRef
- Google Scholar
16
EstevaAKuprelBNovoaRAKoJSwetterSMBlauHMet al. Dermatologist-level classification of skin cancer with deep neural networks. Nature (2017) 542:115–8. doi: 10.1038/nature21056
- CrossRef
- Google Scholar
17
De FauwJLedsamJRRomera-ParedesBNikolovSTomasevNBlackwellSet al. Clinically applicable deep learning for diagnosis and referral in retinal disease. Nat Med (2018) 24:1342–50. doi: 10.1038/s41591-018-0107-6
- CrossRef
- Google Scholar
18
ArdilaDKiralyAPBharadwajSChoiBReicherJJPengLet al. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat Med (2019) 25:954–61. doi: 10.1038/s41591-019-0447-x
- CrossRef
- Google Scholar
19
TopolEJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med (2019) 25:44–56. doi: 10.1038/s41591-018-0300-7
- CrossRef
- Google Scholar
20
Rodriguez-RuizALångKGubern-MeridaABroedersMGennaroGClauserPet al. Stand-alone artificial intelligence for breast cancer detection in mammography: Comparison with 101 radiologists. J Natl Cancer Inst (2019) 111:916–22. doi: 10.1093/jnci/djy222
- CrossRef
- Google Scholar
21
WuNPhangJParkJShenYHuangZZorinMet al. Deep neural networks improve radiologists’ performance in breast cancer screening. IEEE Trans Med Imaging (2020) 39:1184–94. doi: 10.1109/TMI.2019.2945514
- CrossRef
- Google Scholar
22
GulerATWaaijerCJFPalmbladM. Scientific workflows for bibliometrics. Scientometrics (2016) 107:385–98. doi: 10.1007/s11192-016-1885-6
- CrossRef
- Google Scholar
23
AhmadvandAKavanaghDClarkMDrennanJNissenL. Trends and visibility of “Digital health” as a keyword in articles by JMIR publications in the new millennium: Bibliographic-bibliometric analysis. J Med Internet Res (2019) 21:e10477. doi: 10.2196/10477
- CrossRef
- Google Scholar
24
TajFKleinMCAvan HalterenA. Digital health behavior change technology: Bibliometric and scoping review of two decades of research. JMIR mHealth uHealth (2019) 7:e13311. doi: 10.2196/13311
- CrossRef
- Google Scholar
25
PengCHeMCutronaSLKiefeCILiuFWangZ. Theme trends and knowledge structure on mobile health apps: Bibliometric analysis. JMIR mHealth uHealth (2020) 8:e18212. doi: 10.2196/18212
- CrossRef
- Google Scholar
26
AriaMCuccurulloC. bibliometrix : An r-tool for comprehensive science mapping analysis. J Informetr (2017) 11:959–75. doi: 10.1016/j.joi.2017.08.007
- CrossRef
- Google Scholar
27
CoboMJLópez-HerreraAGHerrera-ViedmaEHerreraF. Science mapping software tools: Review, analysis, and cooperative study among tools. J Am Soc Inf Sci Technol (2011) 62:1382–402. doi: 10.1002/asi.21525
- CrossRef
- Google Scholar
28
SmallH. Co-Citation in the scientific literature: A new measure of the relationship between two documents. J Am Soc Inf Sci (1973) 24:265–9. doi: 10.1002/asi.4630240406
- CrossRef
- Google Scholar
29
GarfieldE. Historiographic mapping of knowledge domains literature. J Inf Sci (2004) 30:119–45. doi: 10.1177/0165551504042802
- CrossRef
- Google Scholar
30
DelenDWalkerGKadamA. Predicting breast cancer survivability: a comparison of three data mining methods. Artif Intell Med (2005) 34:113–27. doi: 10.1016/j.artmed.2004.07.002
- CrossRef
- Google Scholar
31
AkayMF. Support vector machines combined with feature selection for breast cancer diagnosis. Expert Syst Appl (2009) 36:3240–7. doi: 10.1016/j.eswa.2008.01.009
- CrossRef
- Google Scholar
32
ZhengBYoonSWLamSS. Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms. Expert Syst Appl (2014) 41:1476–82. doi: 10.1016/j.eswa.2013.08.044
- CrossRef
- Google Scholar
33
KooiTLitjensGvan GinnekenBGubern-MéridaASánchezCIMannRet al. Large Scale deep learning for computer aided detection of mammographic lesions. Med Image Anal (2017) 35:303–12. doi: 10.1016/j.media.2016.07.007
- CrossRef
- Google Scholar
34
ChougradHZouakiHAlheyaneO. Deep convolutional neural networks for breast cancer screening. Comput Methods Programs BioMed (2018) 157:19–30. doi: 10.1016/j.cmpb.2018.01.011
- CrossRef
- Google Scholar
35
MasudMEldin RashedAEHossainMS. Convolutional neural network-based models for diagnosis of breast cancer. Neural Comput Appl (2020) 34(14):11383–94. doi: 10.1007/s00521-020-05394-5
- CrossRef
- Google Scholar
36
MurtazaGShuibLMujtabaGRazaG. Breast cancer multi-classification through deep neural network and hierarchical classification approach. Multimed Tools Appl (2020) 79:15481–511. doi: 10.1007/s11042-019-7525-4
- CrossRef
- Google Scholar
37
AlomMZAspirasTTahaTMBowenTJAsariVK. MitosisNet: End-to-End mitotic cell detection by multi-task learning. IEEE Access (2020) 8:68695–710. doi: 10.1109/ACCESS.2020.2983995
- CrossRef
- Google Scholar
38
AndreadisTEmmanouilidisCGoumasSKoulouriotisD. Development of an intelligent CAD system for mass detection in mammographic images. IET Image Process (2020) 14:1960–6. doi: 10.1049/iet-ipr.2019.1295
- CrossRef
- Google Scholar
39
SalamaWMElbagouryAMAlyMH. Novel breast cancer classification framework based on deep learning. IET Image Process (2020) 14:3254–9. doi: 10.1049/iet-ipr.2020.0122
- CrossRef
- Google Scholar
40
EltrassASSalamaMS. Fully automated scheme for computer-aided detection and breast cancer diagnosis using digitised mammograms. IET Image Process (2020) 14:495–505. doi: 10.1049/iet-ipr.2018.5953
- CrossRef
- Google Scholar
41
SirinukunwattanaKRazaSEATsangY-WSneadDRJCreeIARajpootNM. Locality sensitive deep learning for detection and classification of nuclei in routine colon cancer histology images. IEEE Trans Med Imaging (2016) 35:1196–206. doi: 10.1109/TMI.2016.2525803
- CrossRef
- Google Scholar
42
TangJRangayyanRMXuJEl NaqaIYangY. Computer-aided detection and diagnosis of breast cancer with mammography: recent advances. IEEE Trans Inf Technol BioMed (2009) 13:236–51. doi: 10.1109/TITB.2008.2009441
- CrossRef
- Google Scholar
43
ChenH-LYangBLiuJLiuD-Y. A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst Appl (2011) 38:9014–22. doi: 10.1016/j.eswa.2011.01.120
- CrossRef
- Google Scholar
44
StoeanRStoeanC. Modeling medical decision making by support vector machines, explaining by rules of evolutionary algorithms with feature selection. Expert Syst Appl (2013) 40:2677–86. doi: 10.1016/j.eswa.2012.11.007
- CrossRef
- Google Scholar
45
SimonyanKZissermanA. Very deep convolutional networks for Large-scale image recognition. arXiv (2014). 14091556.
- Google Scholar
46
HeKZhangXRenSSunJ. Deep residual learning for image recognition. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit (2016), 770–8. doi: 10.1109/CVPR.2016.90. 2016-Decem.
- CrossRef
- Google Scholar
47
KrizhevskyASutskeverIHintonGE. ImageNet classification with deep convolutional neural networks. Commun ACM (2017) 60:84–90. doi: 10.1145/3065386
- CrossRef
- Google Scholar
48
LeCunYBengioYHintonG. Deep learning. Nature (2015) 521:436–44. doi: 10.1038/nature14539
- CrossRef
- Google Scholar
49
RonnebergerOFischerPBroxT. U-net: Convolutional networks for biomedical image segmentation. InInternational Conference on Medical image computing and computer-assisted intervention2015 Oct 5 (pp. 234–241). Springer, Cham.
- Google Scholar
50
SpanholFAOliveiraLSPetitjeanCHeutteL. Breast cancer histopathological image classification using convolutional neural networks. Int Joint Conf Neural Networks (IJCNN) (IEEE) (2016), 2560–7. doi: 10.1109/IJCNN.2016.7727519
- CrossRef
- Google Scholar
51
LitjensGKooiTBejnordiBESetioAAACiompiFGhafoorianMet al. A survey on deep learning in medical image analysis. Med Image Anal (2017) 42:60–88. doi: 10.1016/j.media.2017.07.005
- CrossRef
- Google Scholar
52
BrayFFerlayJSoerjomataramISiegelRLTorreLAJemalA. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin (2018) 68:394–424. doi: 10.3322/caac.21492
- CrossRef
- Google Scholar
53
CireşanDCGiustiAGambardellaLMSchmidhuberJ. Mitosis detection in breast cancer histology images with deep neural networks. InInternational conference on medical image computing and computer-assisted intervention. 2013 Sep 22 (pp. 411–418). Springer, Berlin, Heidelberg.
- Google Scholar
54
BreimanL. Random forests. Mach Learn (2001) 45:5–32. doi: 10.1023/A:1010933404324
- CrossRef
- Google Scholar
55
GuyonIWestonJBarnhillSVapnikV. Gene selection for cancer classification using support vector machines. Mach Learn (2002) 46:389–422. doi: 10.1023/A:1012487302797
- CrossRef
- Google Scholar
56
HaralickRMShanmugamKDinsteinI. Textural features for image classification. IEEE Trans Syst Man Cybern (1973) SMC-3:610–21. doi: 10.1109/TSMC.1973.4309314
- CrossRef
- Google Scholar
57
CortesCVapnikV. Support-vector networks. Mach Learn (1995) 20:273–97. doi: 10.1007/BF00994018
- CrossRef
- Google Scholar
58
Peña-ReyesCASipperM. A fuzzy-genetic approach to breast cancer diagnosis. Artif Intell Med (1999) 17:131–55. doi: 10.1016/S0933-3657(99)00019-6
- CrossRef
- Google Scholar
59
SetionoR. Generating concise and accurate classification rules for breast cancer diagnosis. Artif Intell Med (2000) 18:205–19. doi: 10.1016/S0933-3657(99)00041-X
- CrossRef
- Google Scholar
60
AbbassHA. An evolutionary artificial neural networks approach for breast cancer diagnosis. Artif Intell Med (2002) 25:265–81. doi: 10.1016/S0933-3657(02)00028-3
- CrossRef
- Google Scholar
61
Jerez-AragonésJMGómez-RuizJARamos-JiménezGMuñoz-PérezJAlba-ConejoE. An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Applications. Artif Intell Med (2003) 27:45–63. doi: 10.1016/s0933-3657(02)00086-6
- CrossRef
- Google Scholar
62
KarabatakMInceMC. An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Appl (2009) 36:3465–9. doi: 10.1016/j.eswa.2008.02.064
- CrossRef
- Google Scholar
63
Marcano-CedeñoAQuintanilla-DomínguezJAndinaD. WBCD breast cancer database classification applying artificial metaplasticity neural network. Expert Syst Appl (2011) 38:9573–9. doi: 10.1016/j.eswa.2011.01.167
- CrossRef
- Google Scholar
64
Abdel-ZaherAMEldeibAM. Breast cancer classification using deep belief networks. Expert Syst Appl (2016) 46:139–44. doi: 10.1016/j.eswa.2015.10.015
- CrossRef
- Google Scholar
65
ChengHDCaiXChenXHuLLouX. Computer-aided detection and classification of microcalcifications in mammograms: a survey. Pattern Recognit (2003) 36:2967–91. doi: 10.1016/S0031-3203(03)00192-4
- CrossRef
- Google Scholar
66
ChengHDShiXJMinRHuLMCaiXPDuHN. Approaches for automated detection and classification of masses in mammograms. Pattern Recognit (2006) 39:646–68. doi: 10.1016/j.patcog.2005.07.006
- CrossRef
- Google Scholar
67
AraújoTArestaGCastroERoucoJAguiarPEloyCet al. Classification of breast cancer histology images using convolutional neural networks. PloS One (2017) 12:e0177544. doi: 10.1371/journal.pone.0177544
- CrossRef
- Google Scholar
68
JiaoZGaoXWangYLiJ. A deep feature based framework for breast masses classification. Neurocomputing (2016) 197:221–31. doi: 10.1016/j.neucom.2016.02.060
- CrossRef
- Google Scholar
69
ArevaloJGonzálezFARamos-PollánROliveiraJLGuevara LopezMA. Representation learning for mammography mass lesion classification with convolutional neural networks. Comput Methods Programs BioMed (2016) 127:248–57. doi: 10.1016/j.cmpb.2015.12.014
- CrossRef
- Google Scholar
70
Al-masniMAAl-antariMAParkJ-MGiGKimT-YRiveraPet al. Simultaneous detection and classification of breast masses in digital mammograms via a deep learning YOLO-based CAD system. Comput Methods Programs BioMed (2018) 157:85–94. doi: 10.1016/j.cmpb.2018.01.017
- CrossRef
- Google Scholar
71
KallenbergMPetersenKNielsenMNgAYDiaoPIgelCet al. Unsupervised deep learning applied to breast density segmentation and mammographic risk scoring. IEEE Trans Med Imaging (2016) 35:1322–31. doi: 10.1109/TMI.2016.2532122
- CrossRef
- Google Scholar
72
TingFFTanYJSimKS. Convolutional neural network improvement for breast cancer classification. Expert Syst Appl (2019) 120:103–15. doi: 10.1016/j.eswa.2018.11.008
- CrossRef
- Google Scholar
73
CelikYTaloMYildirimOKarabatakMAcharyaUR. Automated invasive ductal carcinoma detection based using deep transfer learning with whole-slide images. Pattern Recognit Lett (2020) 133:232–9. doi: 10.1016/j.patrec.2020.03.011
- CrossRef
- Google Scholar
74
MurtazaGShuibLAbdul WahabAWMujtabaGMujtabaGNwekeHFet al. Deep learning-based breast cancer classification through medical imaging modalities: state of the art and research challenges. Artif Intell Rev (2020) 53:1655–720. doi: 10.1007/s10462-019-09716-5
- CrossRef
- Google Scholar
75
ChougradHZouakiHAlheyaneO. Multi-label transfer learning for the early diagnosis of breast cancer. Neurocomputing (2020) 392:168–80. doi: 10.1016/j.neucom.2019.01.112
- CrossRef
- Google Scholar
76
AgarwalRDíazOYapMHLladóXMartíR. Deep learning for mass detection in full field digital mammograms. Comput Biol Med (2020) 121:103774. doi: 10.1016/j.compbiomed.2020.103774
- CrossRef
- Google Scholar
77
BenhammouYAchchabBHerreraFTabikS. BreakHis based breast cancer automatic diagnosis using deep learning: Taxonomy, survey and insights. Neurocomputing (2020) 375:9–24. doi: 10.1016/j.neucom.2019.09.044
- CrossRef
- Google Scholar
78
KumarASinghSKSaxenaSLakshmananKSangaiahAKChauhanHet al. Deep feature learning for histopathological image classification of canine mammary tumors and human breast cancer. Inf Sci (Ny) (2020) 508:405–21. doi: 10.1016/j.ins.2019.08.072
- CrossRef
- Google Scholar
79
ChenHQiXYuLDouQQinJHengP-A. DCAN: Deep contour-aware networks for object instance segmentation from histology images. Med Image Anal (2017) 36:135–46. doi: 10.1016/j.media.2016.11.004
- CrossRef
- Google Scholar
80
Díaz-UriarteRAlvarez de AndrésS. Gene selection and classification of microarray data using random forest. BMC Bioinf (2006) 7:3. doi: 10.1186/1471-2105-7-3
- CrossRef
- Google Scholar
81
HsuSMKuoWHKuoFCLiaoYY. Breast tumor classification using different features of quantitative ultrasound parametric images. Int J Comput Assist Radiol Surg (2019) 14(4):623–33. doi: 10.1007/s11548-018-01908-8
- CrossRef
- Google Scholar
82
ZhangQXiaoYDaiWSuoJWangCShiJet al. Deep learning based classification of breast tumors with shear-wave elastography. Ultrasonics (2016) 72:150–7. doi: 10.1016/j.ultras.2016.08.004
- CrossRef
- Google Scholar
83
ParkHJKimSMLa YunBJangMKimBJangJYet al. A computer-aided diagnosis system using artificial intelligence for the diagnosis and characterization of breast masses on ultrasound: Added value for the inexperienced breast radiologist. Med (Baltimore) (2019) 98(3):e14146. doi: 10.1097/MD.0000000000014146
- CrossRef
- Google Scholar
84
ChoiJHKangBJBaekJELeeHSKimSH. Application of computer-aided diagnosis in breast ultrasound interpretation: Improvements in diagnostic performance according to reader experience. Ultrasonography (2018) 37(3):217–25. doi: 10.14366/usg.17046
- CrossRef
- Google Scholar
85
BeckerASMuellerMStoffelEMarconMGhafoorSBossA. Classification of breast cancer in ultrasound imaging using a generic deep learning analysis software: A pilot study. Br J Radiol (2018) 91(1083):20170576. doi: 10.1259/bjr.20170576
- CrossRef
- Google Scholar
86
CiritsisARossiCEberhardMMarconMBeckerASBossA. Automatic classification of ultrasound breast lesions using a DCNN mimicking human decision-making. Eur Radiol (2019) 29(10):5458–68. doi: 10.1007/s00330-019-06118-7
- CrossRef
- Google Scholar
87
AlzubaidiLAl-ShammaOFadhelMAFarhanLZhangJDuanY. Optimizing the performance of breast cancer classification by employing the same domain transfer learning from hybrid deep convolutional neural network model. Electronics (2020) 9:445. doi: 10.3390/electronics9030445
- CrossRef
- Google Scholar
88
LotterWDiabARHaslamBKimJGGrisotGWuEet al. Robust breast cancer detection in mammography and digital breast tomosynthesis using an annotation-efficient deep learning approach. Nat Med (2021) 27(2):244–9. doi: 10.1038/s41591-020-01174-9
- CrossRef
- Google Scholar
89
BhattCKumarIVijayakumarVSinghKUKumarA. The state of the art of deep learning models in medical science and their challenges. Multimed Syst (2021) 27:599–613. doi: 10.1007/s00530-020-00694-1
- CrossRef
- Google Scholar
90
FreemanKGeppertJStintonCTodkillDJohnsonSClarkeAet al. Use of artificial intelligence for image analysis in breast cancer screening programmes: systematic review of test accuracy. BMJ (2021) n1872. doi: 10.1136/bmj.n1872
- CrossRef
- Google Scholar
91
LeiYMYinMYuMHYuJZengSELvWZet al. Artificial intelligence in medical imaging of the breast. Front Oncol (2021) 11:600557. doi: 10.3389/fonc.2021.600557
- CrossRef
- Google Scholar
92
MorganMBMatesJL. Applications of artificial intelligence in breast imaging. Radiol Clin North Am (2021) 59(1):139–48. doi: 10.1016/j.rcl.2020.08.007
- CrossRef
- Google Scholar
93
MohamedAALuoYPengHJankowitzRCWuS. Understanding clinical mammographic breast density assessment: A deep learning perspective. J Digit Imaging (2018) 31(4):387–92. doi: 10.1016/j.media.2018.12.006
- CrossRef
- Google Scholar
94
PanSJYangQ. A survey on transfer learning. IEEE Trans knowledge Data Eng (2009) 22(10):1345–59. doi: 10.1109/TKDE.2009.191
- CrossRef
- Google Scholar
95
WeissKKhoshgoftaarTMWangD. A survey of transfer learning. J Big Data (2016) 3:9. doi: 10.1186/s40537-016-0043-6
- CrossRef
- Google Scholar
96
HuynhBDrukkerKGigerM. MO-DE-207B-06: Computer-aided diagnosis of breast ultrasound images using transfer learning from deep convolutional neural networks. Int J Med Phys Res Prac (2016) 43:3705–5. doi: 10.1118/1.4957255
- CrossRef
- Google Scholar
97
AlomMZTahaTYakopcicCWestbergSHasanMEsesnBet al. The history began from AlexNet: A comprehensive survey on deep learning approaches. arXiv (2018). arXiv:abs/1803.01164.
- Google Scholar
98
YapMHPonsGMartiJGanauSSentisMZwiggelaarRet al. Automated breast ultrasound lesions detection using convolutional neural networks. IEEE J Biomed Health Inform (2018) 22:1218–26. doi: 10.1109/JBHI.2017.2731873
- CrossRef
- Google Scholar
99
ByraMGalperinMOjeda-FournierHOlsonLO’BoyleMComstockCet al. Breast mass classification in sonography with transfer learning using a deep convolutional neural network and color conversion. Med Phys (2019) 46:746–55. doi: 10.1002/mp.13361
- CrossRef
- Google Scholar
100
ByraMSznajderTKorzinekDPiotrzkowska-WroblewskaHDobruch-SobczakKNowickiAet al. Impact of ultrasound image reconstruction method on breast lesion classification with deep learning. arXiv (2018). arXiv:abs/1804.02119.
- Google Scholar
101
YapMHGoyalMOsmanFMMartíRDentonEJuetteAet al. Breast ultrasound lesions recognition: End-to-end deep learning approaches. J Med Imaging (2019) 6(1):011007. doi: 10.1117/1.JMI.6.1.011007
- CrossRef
- Google Scholar
102
AyanaGDeseKChoeS-w. Transfer learning in breast cancer diagnoses via ultrasound imaging. Cancers (2021) 13(4):738. doi: 10.3390/cancers13040738
- CrossRef
- Google Scholar
103
TariqMIqbalSAyeshaHAbbasIAhmadKTNiaziMFK. Medical image based breast cancer diagnosis: State of the art and future directions. Expert Syst Appl (2021) 167:114095.
- Google Scholar
104
JalalianAMashohorSMahmudRKarasfiBSaripanMIBRamliARB. Foundation and methodologies in computer-aided diagnosis systems for breast cancer detection. EXCLI J (2017) 16:113–37. doi: 10.17179/excli2016-70
- CrossRef
- Google Scholar
105
Rodríguez-RuizAKrupinskiEMordangJ-JSchillingKHeywang-KöbrunnerSHSechopoulosIet al. Detection of breast cancer with mammography: Effect of an artificial intelligence support system. Radiol290:305–14. doi: 10.1148/radiol.2018181371
- CrossRef
- Google Scholar
106
KimJKimHJKimCKimWH. Artificial intelligence in breast ultrasonography. Ultrasonography (2021) 40(2):183–90. doi: 10.14366/usg.20117
- CrossRef
- Google Scholar
107
AdachiMFujiokaTMoriMKubotaKKikuchiYXiaotongWet al. Detection and diagnosis of breast cancer using artificial intelligence based assessment of maximum intensity projection dynamic contrast-enhanced magnetic resonance images. Diag (Basel) (2020) 10(5):330. doi: 10.3390/diagnostics10050330
- CrossRef
- Google Scholar
108
DalmisMUGubern-MeridaAVreemannSBultPKarssemeijerNMannRet al. Artificial intelligence-based classification of breast lesions imaged with a multiparametric breast MRI protocol with ultrafast DCE-MRI, T2, and DWI. Invest Radiol (2019) 54(6):325–32. doi: 10.1097/RLI.0000000000000544
- CrossRef
- Google Scholar
109
ZhangQSongSXiaoYChenSShiJZhengH. Dual-mode artificially-intelligent diagnosis of breast tumors in shear-wave elastography and b-mode ultrasound using deep polynomial networks. Med Eng Phys (2019) 64:1–6. doi: 10.1016/j.medengphy.2018.12.005
- CrossRef
- Google Scholar
110
SechopoulosITeuwenJMannR. Artificial intelligence for breast cancer detection in mammography and digital breast tomosynthesis: State of the art. Semin Cancer Biol (2020) 72:214–25. doi: 10.1016/j.semcancer.2020.06.002
- CrossRef
- Google Scholar
111
GardeziSJFayeISanchezBJKamelNHussainM. Mammogram classification using dynamic time warping. Multimed Tools Appl (2017) 77(3):3941–62. doi: 10.1007/s11042-016-4328-8
- CrossRef
- Google Scholar
112
MichaelsonJSatijaSMooreRWeberGHalpernEGarlandAet al. Estimates of the sizes at which breast cancers become detectable on mammographic and clinical grounds. J Womens Health (2003) 5(1):3–10. doi: 10.1097/00130747-200302000-00002
- CrossRef
- Google Scholar
113
YimWYetisgenMHarrisWPKwanSW. Natural language processing in oncology: A review. JAMA Oncol (2016) 2(6):797–804. doi: 10.1001/jamaoncol.2016.0213
- CrossRef
- Google Scholar
114
BanerjeeIBozkurtSCaswell-JinJLKurianAWRubinDL. Natural language processing approaches to detect the timeline of metastatic recurrence of breast cancer. JCO Clin Cancer Inf (2019) 3:1–12. doi: 10.1200/CCI.19.00034
- CrossRef
- Google Scholar
115
ShenLMargoliesLRRothsteinJHFluderEMcBrideRSiehW. Deep learning to improve breast cancer detection on screening mammography. Sci Rep (2019) 9:12495. doi: 10.1038/s41598-019-48995-4
- CrossRef
- Google Scholar
116
Al-antariMAHanS-MKimT-S. Evaluation of deep learning detection and classification towards computer-aided diagnosis of breast lesions in digital X-ray mammograms. Comput Methods Programs BioMed (2020) 196:105584. doi: 10.1016/j.cmpb.2020.105584
- CrossRef
- Google Scholar
117
HickmanSEBaxterGCGilbertFJ. Adoption of artificial intelligence in breast imaging: evaluation, ethical constraints and limitations. Br J Cancer (2021) 125:15–22. doi: 10.1038/s41416-021-01333-w
- CrossRef
- Google Scholar
118
KhanSIslamNJanZUd DinIRodriguesJJPC. A novel deep learning based framework for the detection and classification of breast cancer using transfer learning. Pattern Recognit Lett (2019) 125:1–6. doi: 10.1016/j.patrec.2019.03.022
- CrossRef
- Google Scholar

Summary

Keywords

artificial intelligence, breast cancer, diagnosis and prognosis, Bibliometrix analysis, knowledge structures

Citation

Syed AH and Khan T (2022) Evolution of research trends in artificial intelligence for breast cancer diagnosis and prognosis over the past two decades: A bibliometric analysis. Front. Oncol. 12:854927. doi: 10.3389/fonc.2022.854927

Received

14 January 2022

Accepted

30 August 2022

Published

23 September 2022

Volume

12 - 2022

Edited by

Siuly Siuly, Victoria University, Australia

Reviewed by

Azin Nahvijou, Tehran University of Medical Science, Iran; Carlos Luis Parra-Calderón, Andusian Health Service, Spain

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Asif Hassan Syed, shassan1@kau.edu.sa

This article was submitted to Breast Cancer, a section of the journal Frontiers in Oncology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

ORIGINAL RESEARCH article

Evolution of research trends in artificial intelligence for breast cancer diagnosis and prognosis over the past two decades: A bibliometric analysis

Abstract

Introduction

Materials and methods

Methodology and data sources

Pre-planning

Data collection

Data refinement

Data extraction

Bibliometric data analysis

Results

Annual scientific production

Most relevant authors

Most relevant organizations

Country scientific production

Most preferred periodicals

Highly cited research publications in AI for breast cancer detection and survival predictions

Conceptual knowledge structure analysis

Keyword analysis

Keywords evolution trends

Multicorrespondence analysis and clustering map of words

Multicorrespondence analysis and clustering most contributing documents

Multicorrespondence analysis and clustering most cited documents

Intellectual knowledge structure analysis

Co-citation analysis

Historiography analysis

Social knowledge structure analysis

Authors’ collaboration network analysis

Institution collaboration network analysis

Collaboration world map analysis

Discussion

Open challenges in AI for breast cancer diagnosis and prognosis

Limitations

Conclusion

Funding

Publisher’s note

Statements

Data availability statement

Author contributions

Conflict of interest

Supplementary material

References

Summary

Outline

Figures

Cite article

Share article

Article metrics