Your new experience awaits. Try the new design now and help us make it even better

SYSTEMATIC REVIEW article

Front. Public Health, 23 June 2025

Sec. Digital Public Health

Volume 13 - 2025 | https://doi.org/10.3389/fpubh.2025.1609615

This article is part of the Research TopicAdvancing Public Health through Generative Artificial Intelligence: A Focus on Digital Well-Being and the Economy of AttentionView all 6 articles

Artificial intelligence in early warning systems for infectious disease surveillance: a systematic review

  • 1Department of Health Data Science and Biostatistics, University of Texas Southwestern Medical Center, Dallas, TX, United States
  • 2Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, United States

Introduction: Infectious diseases pose a significant global health threat, exacerbated by factors like globalization and climate change. Artificial intelligence (AI) offers promising tools to enhance crucial early warning systems (EWS) for disease surveillance. This systematic review evaluates the current landscape of AI applications in EWS, identifying key techniques, data sources, benefits, and challenges.

Methods: Following PRISMA guidelines, a systematic search of Semantic Scholar (2018-onward) was conducted. After screening 600 records and removing duplicates and non-relevant articles, the search yielded 67 relevant studies for review.

Results: Key findings reveal the prevalent use of machine learning (ML), deep learning (DL), and natural language processing (NLP), which often integrate diverse data sources (e.g., epidemiological, web, climate, wastewater). The major benefits identified include earlier outbreak detection and improved prediction accuracy. However, significant challenges persist regarding data quality and bias, model transparency (the “black box” issue), system integration difficulties, and ethical considerations such as privacy and equity.

Discussion: AI demonstrates considerable potential to strengthen infectious disease EWS. Realizing this potential, however, requires concerted efforts to address data limitations, enhance model explainability, ensure ethical implementation, improve infrastructure, and foster collaboration between AI developers and public health experts.

1 Introduction

Protecting global populations from the continuing threat of infectious diseases is an important concern in an increasingly interconnected world (13). Several factors increase the risk of pandemics, including the rising frequency of zoonotic spillovers (4), the growing challenge of antimicrobial resistance (5), widespread globalization (2), rapid urbanization (1), and the effects of climate change (6). These factors show the urgent need for strong surveillance and preparedness strategies (7, 8). In this context, the ability to rapidly detect and effectively respond to infectious disease outbreaks at their earliest stages is urgent (9, 10).

Artificial intelligence (AI) has emerged as a powerful tool in public health, offering new possibilities to improve infectious disease surveillance and early warning systems (1, 11, 12). Its potential to transform early outbreak detection, refine epidemiological models, and optimize healthcare responses has received growing attention (1316). By using advanced algorithms to process and analyze large datasets from diverse sources, AI can identify patterns and detect anomalies that may signal emerging public health threats (1, 1719).

This review critically examines the current state of AI applications in early warning systems (EWS) for infectious disease surveillance. It addresses the following key questions:

1. What are the primary artificial intelligence techniques and methodologies currently employed in early warning systems for infectious disease surveillance?

2. What types of data sources are predominantly utilized by these AI-driven systems?

3. What are the main reported benefits and advantages of applying AI in this domain?

4. What are the key limitations, challenges, and ethical considerations identified in the literature regarding the use of AI for infectious disease surveillance?

5. What are the emerging trends and future directions for the development and application of AI in this field?

2 Methods

This systematic review was conducted following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (20) to ensure a transparent and repeatable process. The search strategy, detailed below, was specifically designed to retrieve studies relevant to the primary research questions presented in the Introduction.

2.1 Search strategy

The review began by defining four main search topics to capture the breadth of relevant literature: (1) the application of artificial intelligence in early warning systems for the detection and management of infectious diseases; (2) the use of machine learning techniques to enhance early warning systems for infectious disease surveillance; (3) the role of deep learning methods in developing early warning systems for infectious diseases; and (4) the use of AI-driven early warning systems for infectious disease outbreaks. These topics were chosen to cover the key technologies and applications in the field.

Search queries were generated based on these topics and executed on Semantic Scholar (the primary database used, n = 1 in the PRISMA diagram). For each topic, three types of queries were developed: a broad query, a focused query using the “+” operator, and a related query including additional supporting terms. This multi-strategy approach was designed to balance the retrieval of a wide range of studies with the identification of highly relevant results. Table 1 presents the full set of search queries.

Table 1
www.frontiersin.org

Table 1. Search topics and query strategies for the systematic review.

We selected Semantic Scholar as the primary data source for this review due to its extensive reach and advanced search functionalities. It is a free, AI-driven search engine indexing over 200 million academic papers, utilizing machine learning to identify relevant literature beyond simple keyword matching. Its foundation on comprehensive knowledge graphs, including the Microsoft Academic Knowledge Graph and Springer Nature's SciGraph, connected with direct partnerships with over 50 publishers and data providers, guarantees broad coverage of academic content (21).

2.2 Inclusion and exclusion criteria

To guarantee the relevance and focus of this review, specific inclusion and exclusion criteria were applied during the study selection process. Studies were included if they directly addressed one or more of the four research topics concerning AI, machine learning, deep learning, and early warning systems for human infectious disease surveillance.

Only journal articles, conference papers, or studies published in English from the year 2018 onward were included. Studies were excluded if they were identified as editorials, commentaries, or abstracts only; if they did not focus on human infectious diseases or early warning systems; or if they were not published in English. A total of 223 reports were excluded during the full-text assessment stage, primarily because they were not directly relevant to the research questions.

2.3 Study selection process

The initial search retrieved ~600 records. After removing 303 duplicate records during the identification phase, 297 unique records remained for screening.

All 297 records were screened based on their titles, abstracts. No records were excluded at this initial screening stage. The full texts of all 297 records were then retrieved and assessed for eligibility according to the inclusion and exclusion criteria described in Section 2.2. During the full-text assessment, 223 reports were excluded, mainly because they did not meet the relevance criteria. This process resulted in 67 studies being included in the final review, as shown in the PRISMA flow diagram (Figure 1).

Figure 1
www.frontiersin.org

Figure 1. PRISMA flow diagram illustrating the selection process for studies included in the review.

3 Results

Following the systematic search and screening process detailed in the Methods section, a final set of 67 studies met the inclusion criteria and were included in this review (see PRISMA flow diagram, Figure 1). A summary of the key characteristics of the 67 included studies is presented in Table 2. This summary highlights aspects central to our research questions, including the primary AI/ML techniques employed (Research Question 1), the types of data sources used (Research Question 2), the main application areas of the AI systems, and the specific infectious diseases addressed.

Table 2
www.frontiersin.org

Table 2. Characteristics of included studies.

3.1 The growing need for early warning systems

The history of global health clearly shows the damaging impact of pandemics and epidemics, which have caused significant loss of life and lasting social and economic disruption (1, 2). Events such as the 1918 influenza pandemic, the 2003 Severe Acute Respiratory Syndrome (SARS) outbreak, and the recent COVID-19 pandemic serve as important reminders of the serious threat posed by infectious diseases (2, 8). These experiences show the importance of learning from the past and continuously improving our ability to detect and respond to outbreaks more effectively, informed by reviews of early warning system (EWS) effectiveness and global development experiences (7, 22, 23).

In addition to these historical lessons, several current global trends are increasing both the risk and speed of infectious disease outbreaks (1, 24). The rising number of zoonotic spillovers (diseases jumping from animals to humans), driven by factors such as deforestation and habitat loss, contributes to the emergence of new diseases, demanding integrated One Health approaches (2, 4). At the same time, the growing challenge of antimicrobial resistance is making the treatment of common bacterial infections more difficult, a problem that AI is being used to address (5, 25).

Globalization, driven by international travel and trade, enables pathogens to cross borders rapidly, increasing the potential for worldwide spread (1, 2). Urbanization leads to higher population densities, creating environments where diseases can spread more easily (1, 24). Furthermore, climate change is changing disease patterns and migration routes by expanding the habitats of disease vectors such as mosquitoes, making prediction and control more difficult and driving the need for climate-informed early warning systems (6, 26, 27). Together, these factors create a more complex and unstable environment for disease emergence and spread (1, 24).

In this changing situation, early warning systems (EWS) for epidemics are essential tools for preventing the rapid spread of infectious diseases and reducing their impact on public health (4, 12). These systems act as proactive defenses, allowing faster and more targeted responses to protect communities and save lives (16, 23). The ability to detect and understand outbreaks in their early stages is important for implementing timely interventions, such as quarantine measures, vaccination campaigns, and public education efforts, which can significantly change the course of an outbreak and lessen its overall burden (2, 13).

Early warnings offer a valuable window of opportunity to control an outbreak before it overwhelms healthcare systems and spreads further (10, 28). This emphasizes the importance of rapid, informed decision-making based on accurate and timely data–a challenge that modern technologies, particularly artificial intelligence, aim to address (29, 30).

3.2 How AI powers early warning systems

Addressing the challenges of modern disease surveillance requires tools capable of handling large and varied information; artificial intelligence (AI) offers such capabilities (1, 5). AI has become a powerful tool for processing and analyzing large datasets from diverse sources for infectious disease surveillance, operating at scales far beyond human capacity (16, 19, 31). It can analyze information from sources such as medical records, social media posts, news reports, and environmental monitoring devices (9, 10, 32). By analyzing these large volumes of varied data, AI applications in public health offer a more complete and timely understanding of disease dynamics (8, 33, 34).

AI detects early warning signals of infectious disease outbreaks through several mechanisms. It can identify anomalies–deviations from expected patterns–that may signal emerging public health threats (18, 19). AI algorithms are also capable of finding patterns in data that suggest the onset of a disease outbreak, allowing faster recognition of potential threats (1, 17, 35). For example, AI might detect an unusual spike in online searches for specific symptoms combined with increased social media posts about illness in a particular city, potentially indicating an outbreak days before official case counts rise (10, 12). Machine learning models are essential for finding correlations within large datasets that may indicate emerging outbreaks, enabling timely interventions (17, 36, 37).

Furthermore, AI is used in predictive modeling. AI-driven predictive analytics have played an important role in monitoring epidemiological trends, enabling public health officials to better anticipate and respond to potential outbreaks (16, 17, 35, 36). By creating predictive models, AI improves efforts in contact tracing and surveillance, helping to understand and control the spread of infectious diseases (11, 18, 38). Using historical data, environmental factors, and real-time surveillance information, machine learning models can forecast the spread and impact of infectious diseases with increasing accuracy (14, 27, 30, 39), enabling proactive resource allocation and more targeted public health measures (40).

The integration of AI into early warning systems significantly improves the speed and efficiency of outbreak detection and prediction compared to traditional methods (12, 38). By rapidly processing large amounts of data, AI can identify potential outbreaks much faster than conventional systems relying on manual data collection and analysis (1, 10, 41). This increased speed and efficiency support more timely and effective public health responses (9, 42).

However, it is important to recognize that AI serves as a valuable tool that supports and enhances, rather than replaces, traditional epidemiological methods and public health infrastructure (8, 12, 43). AI systems work alongside human-led efforts, providing new insights that help health professionals make better-informed decisions during outbreaks (31, 43, 44). The specific computational techniques that enable these functions are explored in the following section.

3.3 Artificial intelligence techniques and methodologies employed in early warning systems

Addressing the first review question, this section details the primary artificial intelligence techniques and methodologies employed in early warning systems for infectious disease surveillance, based on the reviewed literature. For readers interested in more detailed descriptions of these techniques, including their core principles and typical applications in the context of infectious disease EWS, please refer to Appendix A.

3.3.1 Natural language processing (NLP)

Natural language processing (NLP) is important to analyze unstructured text data to detect early signals of infectious disease outbreaks (17, 32). AI systems use NLP to process large amounts of open-source data, including news reports and social media posts, to identify early warning signs of potential epidemics (12). NLP techniques can analyze user-generated content, detecting mentions of symptoms, self-reported illnesses, and concerns about disease spread in specific geographic areas, thus providing valuable real-time intelligence (19). Moreover, NLP has been applied within tools like medical chatbots (45) and to electronic medical record data to identify and characterize a broad range of symptoms associated with infectious diseases, improving the detail and speed of surveillance compared to structured data alone. By extracting relevant information from large amounts of online textual data, NLP enhances early outbreak detection, often identifying signals before official health notifications are released.

3.3.2 Machine learning (ML)

Machine learning (ML) algorithms are widely used for pattern recognition, classification, and prediction in infectious disease surveillance (25, 37). These algorithms analyze structured and unstructured data from various sources to detect early warning signals of outbreaks (30). A variety of ML techniques are applied in epidemic and pandemic early warning systems. Among the most frequently employed are classification algorithms such as Support Vector Machines (SVM) and tree-based methods including Decision Trees (4648), along with instance-based models like K-Nearest Neighbor (KNN) (37, 48), linear models like Logistic Regression (46, 48), and probabilistic classifiers such as Naive Bayes (46, 49). Several studies have shown that comparing or combining multiple ML techniques often improves the prediction of infectious disease incidence and trends, demonstrating their potential for forecasting disease dynamics (37, 48). Ensemble methods, a powerful ML extension, are discussed further in Section 3.3.5.

These algorithms are categorized into supervised, unsupervised, and reinforcement learning, serving distinct roles in early warning systems (EWS) for infectious diseases.

Supervised learning is the most commonly applied paradigm in the reviewed literature. These algorithms train on labeled data, where each instance is associated with a known outcome, enabling the model to predict outcomes for new, unseen inputs. In infectious disease surveillance, supervised learning is widely used to predict the likelihood or timing of future outbreaks based on epidemiological histories, climate patterns, and population mobility data; to classify cases or regions into predefined risk categories (e.g., high-risk vs. low-risk); and to support disease diagnosis using symptomatic data or medical imagery. Techniques such as SVM, Decision Trees, Logistic Regression, and Naive Bayes are frequently used within this framework.

Unsupervised learning, by contrast, works on unlabeled data and seeks to discover hidden structures, anomalies, or groupings without predefined outcomes. In EWS applications, it is commonly employed for anomaly detection (e.g., identifying unexpected spikes in symptom-related social media activity), clustering cases or outbreaks to uncover transmission dynamics, and topic modeling of text data to detect emerging public health concerns or novel symptom profiles.

Reinforcement learning (RL) involves an agent that interacts with an environment to learn optimal decision strategies through trial and error, aiming to maximize a cumulative reward over time. Although RL is less frequently applied in operational EWS compared to the other methods, it holds considerable potential. Notable applications include optimizing public health interventions, such as determining when and where to deploy vaccines or allocate resources, and developing adaptive control policies that respond quickly to evolving surveillance data. These applications, however, are more complex to implement and remain largely at the exploratory stage.

These learning paradigms provide an adaptable and growing toolkit for enhancing infectious disease EWS across a range of predictive and decision-support tasks.

3.3.3 Deep learning (DL)

Deep learning (DL) techniques, a subset of ML utilizing neural networks, are increasingly recognized for their ability to handle complex, high-dimensional data in surveillance tasks (11, 41). Common architectures include Recurrent Neural Networks (RNNs), particularly Long Short-Term Memory (LSTM) networks, and Transformer models like Bidirectional Encoder Representations from Transformers (BERT) (50). DL models have shown great success in improving diagnostic accuracy and forecasting outbreaks by analyzing large datasets and recognizing complex patterns (3, 51). LSTM networks are especially well-suited for modeling time-based disease trends because they can retain information over long sequences (39, 52, 53). Transformer models like BERT are used for the classification and prioritization of textual information, such as news articles or social media posts, allowing faster and more efficient identification of outbreak-relevant data (12, 50). Other DL architectures found in the reviewed literature include Convolutional Neural Networks (CNNs), often applied to image data but also used in prediction models (49, 54), and custom networks like the Self-Excitation Attention Residual Network (SEAR) designed for influenza surveillance (13).

3.3.4 Time series analysis

Time series analysis methods are frequently used to predict disease incidence and trends based on historical data patterns (37, 55). Statistical approaches, such as the Auto-Regressive Integrated Moving Average (ARIMA) model and its seasonal variants (SARIMA), consider trends, seasonality, and random fluctuations in time series data (39). These models are useful for tasks like predicting seasonal flu peaks or modeling reported case counts over time. Deep learning methods, particularly LSTMs (discussed in Section 3.3.3), are also commonly applied to time series forecasting in this domain (30, 52), alongside other neural network approaches (42).

3.3.5 Ensemble learning

Ensemble learning techniques aim to improve the accuracy, robustness, and reliability of predictions by combining the outputs of multiple individual models (18). Common ensemble methods include Random Forest, which aggregates predictions from multiple decision trees (40, 56, 57), and various Boosting algorithms (such as Gradient Boosting, XGBoost, CatBoost, and LightGBM) that build models sequentially, with each new model correcting errors made by previous ones (13, 52, 56, 57). Another powerful ensemble technique is Stacking, where predictions from several different base models (e.g., ARIMA, SVM, LSTM) are used as inputs for a higher-level meta-learner to produce the final output (39). These ensemble methods often outperform single models by reducing variance and bias, leading to more reliable predictions for complex tasks like outbreak risk forecasting.

3.3.6 Hybrid models

Finally, hybrid models that integrate different AI techniques or combine AI with statistical methods or domain-specific models are increasingly being explored to enhance early warning systems (9). By combining methods, hybrid approaches aim to leverage the different strengths of each component–balancing the interpretability of statistical models with the predictive power of deep learning algorithms or integrating physical models with machine learning (58). Examples include combining CNNs with Structural Equation Models (SEM) (54), using stacking ensembles as described above (39), or integrating Internet of Things (IoT) data streams with AI analysis platforms (59). These approaches create more powerful and flexible systems for infectious disease surveillance and prediction.

3.4 Data sources for AI-driven surveillance

To answer the second research question concerning data utilization, this section outlines the different data sources mainly used by these AI-driven systems. As shown in Table 3, common data sources for AI-powered infectious disease surveillance include news reports, social media platforms, electronic health records, environmental monitoring data, and official health notifications.

Table 3
www.frontiersin.org

Table 3. Common data sources for AI-powered infectious disease surveillance.

3.4.1 Digital and publicly available data

Open-source internet data provides a rich source of early outbreak signals, including news reports, social media activity, blogs, and health forums (12, 60). The internet serves as an extensive, real-time repository where public concerns, discussions about symptoms, and early reports of illness can emerge before official health notifications (10). Analyzing trends in social media posts [e.g., from platforms like Twitter (32, 50)], internet searches for health-related terms [e.g., using Baidu Index or Google Trends (61)], and online news articles can offer leading indicators of disease outbreaks.

3.4.2 Health system and personal health data

Traditional epidemiological data from official health notifications and reports issued by organizations such as the World Health Organization (WHO) and national health agencies (e.g., the Centers for Disease Control and Prevention, CDC) are important to train and validate AI models (48). This includes specific datasets such as influenza-like illness (ILI) reports (13, 61), mandatory case reporting for diseases like dengue (27, 52), and hospital records, including electronic health records (EHR), laboratory information systems (LIS), and picture archiving and communication systems (PACS) (46, 62). Additionally, broader public health surveillance system data is frequently used (40). The emerging role of mobile health (mHealth) technologies and wearable device data offers a continuous stream of physiological indicators suitable for surveillance (29, 34, 63), although practical applications are still developing (17, 64).

3.4.3 Environmental and contextual data

Environmental and climatic data, including temperature, humidity, and precipitation patterns, are important factors influencing the transmission of many infectious diseases, particularly vector-borne illnesses (6, 26, 27, 53). Travel data and human mobility patterns provide valuable insights into tracking and predicting the geographical spread of infectious diseases across regions (14, 15). Genomic data also plays an important role in identifying specific pathogens, tracking their evolution, and understanding potential changes in their characteristics (5, 8, 24). Finally, wastewater surveillance has emerged as a novel and unbiased data source for monitoring infection levels at the community level, capturing even asymptomatic cases (2, 22, 58).

3.4.4 Integration of data sources

The integration of different data sources empowers AI-powered systems to achieve a more comprehensive and timely understanding of infectious disease threats (9). The ability to combine and analyze these varied datasets, often referred to as multi-source or multi-channel surveillance (7), is an important strength of AI in this domain (18, 38). Combining different data streams often provides a more robust and earlier signal than any single source alone (57), creating a synergistic effect for outbreak detection and prediction.

However, integrating such diverse datasets presents significant challenges, notably data heterogeneity, where information from various origins (e.g., structured climate data, unstructured social media text, epidemiological case counts) differs in format, scale, temporality, and reliability. Addressing these challenges is important for the effective application of AI in EWS.

For instance, in the surveillance of respiratory illnesses like influenza or COVID-19, many studies attempt to combine meteorological data (e.g., temperature, humidity) with indicators derived from web sources such as social media posts (for symptom mentions or public sentiment) or search query trends, alongside official epidemiological reports (18, 30, 50).

Successfully harmonizing these disparate data types typically involves several key steps.

3.4.4.1 Temporal alignment

Datasets are often collected at different frequencies. A common approach is to aggregate them into consistent time units, such as daily or weekly summaries. For example, daily climate readings might be aligned with weekly aggregated social media sentiment scores and official case counts.

3.4.4.2 Spatial aggregation

Information needs to be linked to common geographical units (e.g., city, county, or specific health districts). This might involve averaging climate data over a region or linking geolocated social media posts to defined administrative boundaries.

3.4.4.3 Feature engineering

Raw data often requires transformation into formats suitable for AI model input. For social media, this could involve using natural language processing (NLP) to extract sentiment scores, topic frequencies, or mentions of specific symptoms. Climate variables might be used directly or transformed into anomaly indices (e.g., deviations from seasonal norms).

3.4.4.4 Normalization and scaling

To prevent features with larger numerical values from disproportionately influencing model training, numerical data from different sources (e.g., temperature values ranging from −10 to 40°C and sentiment scores from −1 to 1) are typically normalized or scaled to a common range (e.g., 0 to 1 or z-scores).

Beyond harmonization, addressing the heterogeneity of combined data often relies on robust preprocessing pipelines to handle missing values and outliers, and the strategic selection of AI models. For example, ensemble methods like Random Forests have shown efficacy in managing complex datasets with a mix of structured (e.g., climate data) and unstructured (e.g., text-derived features) data (30). Furthermore, multimodal deep learning architectures are increasingly being explored for their capacity to learn joint representations from different data modalities simultaneously, offering a sophisticated approach to leveraging heterogeneous information for improved prediction accuracy in EWS.

3.5 Benefits of AI in infectious disease surveillance

This section addresses the third research question by summarizing the main reported benefits of applying artificial intelligence (AI) in infectious disease surveillance, based on the reviewed literature.

One of the key advantages of using AI in this field is its ability to enable earlier and faster detection of outbreaks compared to traditional surveillance systems (10, 12, 38). As noted in Section 3.1, speed is critical for effective response, and AI can reduce the delays associated with conventional methods by identifying epidemic signals much earlier (1, 23). A famous example is the BlueDot platform, which detected early signs of the COVID-19 outbreak before official reports were released (12).

This speed advantage is partly due to AI's ability to efficiently process and analyze large volumes of diverse data relevant to public health surveillance, as discussed in Section 3.4 (1, 16, 31). AI systems can handle data from sources such as medical records, laboratory results, social media, and environmental sensors (14, 18), extracting meaningful insights from information that would be too large or complex for human analysts to manage.

By processing this wide range of data using the techniques described in Section 3.3, AI can also improve the accuracy and precision of outbreak prediction and forecasting (13, 53). This leads to better-informed public health decision-making. Predictive analytics powered by AI have been important for monitoring epidemiological trends, allowing more accurate anticipation and faster responses to potential outbreaks (27, 30, 39). Studies show that AI and machine learning (ML) models often achieve higher performance metrics, such as accuracy, sensitivity, and lower error rates, compared to baseline or single-model approaches (40, 48, 52, 54).

Improved predictions also help optimize resource allocation and strengthen pandemic preparedness (6, 14). AI tools can analyze population health data to predict disease risk and spread, guiding the efficient distribution of resources such as hospital beds, medical supplies, and healthcare workers to areas of greatest need (15, 36). Timely and accurate predictions allow public health authorities to implement proactive measures, identify high-risk regions, and reduce the impact of outbreaks (27, 37, 48).

Furthermore, AI shows potential to address challenges in resource-constrained settings. In low-income countries, where human resources for traditional surveillance are limited, AI can automate processes and offer cost-effective solutions (12, 27). Technologies such as Edge AI can enable local analysis where centralized infrastructure is unavailable (63), and AI-powered point-of-care diagnostics can improve access to timely information (59). AI may also help overcome issues like data censorship by identifying signals from alternative sources, offering a more objective view of disease activity. However, the effectiveness of early warning systems can vary significantly between high- and low-resource settings (23).

3.6 Limitations and challenges of AI in infectious disease surveillance

Addressing the fourth review question, this section discusses the key limitations, challenges, and ethical considerations identified in the literature regarding the use of artificial intelligence (AI) for infectious disease surveillance.

3.6.1 Data quality and biases

Despite its advantages, the use of AI in this field has important limitations. One major concern is the quality, completeness, and consistency of the data used to train and operate AI models (9, 63). Inaccurate, fragmented, or missing data can lead to unreliable outputs and poor model performance (19). Moreover, biases present in the data–such as underrepresentation of certain demographic or linguistic groups or lack of properly encoded information–can result in AI models that perform poorly for those groups, potentially worsening health inequities (11, 14, 32). Ensuring that datasets are comprehensive and representative is therefore critical to avoid biased outcomes (5, 17). Some models may also fail validation when applied to new datasets, showing issues with generalizability and potential overfitting (31, 51).

3.6.2 Lack of transparency and understandability (“black box” problem)

Another critical issue is the lack of transparency in many advanced AI models, particularly deep learning algorithms, often referred to as the “black box” problem (9, 41). These systems often generate results without a clear explanation of how conclusions were reached (25, 65). This lack of understandability makes it difficult for public health professionals and clinicians to confirm, trust, or troubleshoot model outputs (31). Ongoing research in explainable AI (XAI) aims to address this challenge (41, 44, 56). Without understanding the reasoning behind AI predictions, it becomes difficult to correct errors or explain decisions based on them.

3.6.3 The necessity of human expertise and oversight

While AI tools can process and analyze data at scale, human oversight remains essential for interpreting results and making appropriate public health decisions (14, 43). AI systems are best used as complementary tools alongside traditional epidemiological methods, rather than as replacements (8, 12). Human expertise is necessary to validate findings, evaluate anomalies, contextualize AI outputs, and ensure that insights are appropriately applied in complex real-world situations (1, 31).

3.6.4 Challenges in integrating AI into existing infrastructure

Integrating AI into existing public health infrastructures presents significant technical and organizational challenges (1, 44). These include issues with the interoperability of different systems, where varying data formats and protocols restrict the seamless exchange of information needed for comprehensive AI analysis (19, 63). Other challenges involve inconsistent data-sharing protocols (7), the need for robust local data infrastructure (36), and limited workforce training or expertise to utilize AI tools effectively (1). Maintaining data security and confidentiality while ensuring data availability for real-time processing is also a significant operational concern (63). Difficulties in enabling distributed, collaborative decision-making across different platforms or institutions have also been noted (28).

3.6.5 Ethical considerations

The use of AI in infectious disease surveillance raises numerous ethical challenges (15, 16, 65). Data privacy and security are top concerns, given the sensitivity of personal health information (8, 11). This requires robust governance frameworks and privacy-preserving techniques (14). Technologies such as federated learning, which allow model training on decentralized data without sharing raw information, are being explored to mitigate these risks (29, 63). Questions remain regarding who controls health data, how consent is obtained [especially when using public data sources like social media (50)], and how to ensure responsible data use (11).

Additionally, AI systems may continue or even worsen existing social inequities when trained on biased data, resulting in unfair treatment or exclusion of disadvantaged groups (25, 32, 44). The issue of accountability is also important: when AI systems support public health decisions, it is important to define who is responsible, particularly when errors occur (11). Finally, ensuring equitable access to AI technologies and their benefits for all populations is essential to avoid large global health disparities (36, 44).

3.7 Existing and proposed AI-based early warning systems for infectious diseases

Despite the challenges mentioned in the previous section, several AI-based early warning systems have been developed and deployed, demonstrating the practical application of these technologies (38, 55). These systems vary in their approaches, data sources, and specific techniques employed. To better understand the operational characteristics of some of the AI-enhanced EWS, Table 4 provides a comparative summary of the three main systems (i.e., HealthMap, BlueDot, and EPIWATCH) showing their input data, AI approaches, output features, and aspects related to latency.

Table 4
www.frontiersin.org

Table 4. Comparison of selected AI-based early warning systems.

3.7.1 Systems primarily using web and public data sources

Several systems focus on using open-source internet data for early detection of signals. HealthMap is a fully automated system that monitors health events, including infectious diseases, by using natural language processing (NLP) to analyze real-time data from web sources such as news reports and health forums, providing a global view (12). EPIWATCH is another AI-based system that generates automated early warnings by analyzing open-source data with techniques like NLP and named entity recognition (NER) (12). It has been used, for example, to study the effects of conflicts on disease epidemiology and to track respiratory illness trends (66).

Epitweetr, developed by the European Center for Disease Prevention and Control (ECDC), specifically monitors tweets related to infectious diseases, allowing filtering by location and time (12). An earlier example, Google Flu Trends, attempted to predict influenza prevalence using search query data (12). While pioneering, it faced challenges related to accuracy, including matching noise instead of true signals (overfitting) and seasonal biases. Other research continues to explore the utility of social media platforms like Twitter (50) and search engine query data (10, 61) for surveillance.

3.7.2 Systems integrating diverse data sources

Other systems aim to integrate a wider variety of data sources to improve predictions and address limitations such as reporting delays. BlueDot, a well-known Canadian platform, gained attention for its early detection of the COVID-19 outbreak (12). It uses AI to analyze diverse global data, including airline ticketing data and official reports, demonstrating how combining non-traditional sources can potentially overcome delays or censorship in official reporting.

The Global Biosurveillance Portal (GBSP) is a web-based system that integrates data from multiple web applications and government sources for timely responses, using AI-based predictive analysis (12). The Metabiota Epidemic Tracker uses big data analytics and cloud computing to simulate epidemic events and conduct risk analysis across numerous pathogens (12).

Beyond these named platforms, many studies describe frameworks or models that integrate multiple data streams. These include systems that combine: (1) community-level or hospital surveillance data (such as influenza-like illness reports, case notifications, and electronic health records) with external factors like weather or web data (4, 40, 53, 61, 62); (2) vector surveillance data with climate parameters and case data, especially for diseases like dengue (26, 27, 52); (3) food safety surveillance data with human case data for foodborne illnesses (57); (4) data from networked point-of-care testing devices using Internet of Things (IoT) technology (59); (5) wastewater-based epidemiology data with hydraulic modeling (58); and (6) inputs from social networks, public reports, and direct citizen participation (60).

Many proposed systems emphasize multi-source, multi-channel, or multi-point trigger approaches to improve sensitivity and robustness (7).

These examples show the diverse approaches and data sources used by existing and proposed AI-powered early warning systems. While some systems show significant promise, ongoing development and refinement are important for improving their accuracy, reliability, and acceptance by public health authorities (9, 22, 67, 68).

3.8 AI applications in specific infectious diseases

The practical application and impact of artificial intelligence (AI) in early warning and disease management have been demonstrated across several major infectious disease outbreaks and surveillance efforts, as illustrated by the following examples.

3.8.1 COVID-19

During the recent pandemic, AI played several important roles (8, 41). Platforms like BlueDot provided early detection of the outbreak, demonstrating the benefits discussed in previous sections (12). AI-supported radiology tools aided diagnosis through the automated analysis of medical images such as CT scans (51). Social media and search query data were analyzed using AI to track the virus's spread and monitor public sentiment (33). AI models also predicted COVID-19 spread based on mobility data and other factors (1). Furthermore, tools like EPIWATCH analyzed the impact of global events on disease epidemiology (66), and other AI models were developed using COVID-19 case data to improve forecasting and understand transmission patterns (11, 39, 61).

3.8.2 Influenza

Influenza surveillance has also benefited significantly from AI applications. Google Flu Trends represented an early attempt to predict influenza activity using search queries, illustrating both the potential and limitations related to data quality and accuracy, as discussed in previous sections (12). More recently, tools like EPIWATCH have included components for predicting flu season severity (66). Deep learning models, including custom attention-based networks, have been developed specifically for influenza surveillance using influenza-like illness (ILI) data, demonstrating strong early warning performance in some settings (13, 61). AI-driven analysis of search queries and other data sources continues to be explored for forecasting influenza trends (30, 42).

3.8.3 Ebola

During the 2014 West Africa outbreak, AI algorithms were applied to analyze large datasets to track virus spread and predict potential hotspots, reportedly aiding more efficient resource allocation and containment efforts. Platforms such as HealthMap, BlueDot, and Metabiota included Ebola in their monitoring activities (12). While detailed examples specific to Ebola were less prominent in the reviewed literature compared to COVID-19 or influenza, the event shows the potential for predictive models in outbreak response.

3.8.4 Dengue fever

Given its significant global burden, dengue fever is another area where AI and machine learning (ML) models are actively being developed and applied. Research focuses on forecasting dengue cases or outbreaks using epidemiological surveillance data combined with climate or meteorological variables (26, 27, 69). AI approaches, including spatiotemporal models, are being designed specifically for dengue early warning systems (52). Systems may incorporate vector surveillance data alongside case and climate information (26), or integrate data from platforms involving citizen participation (60). The goal is to provide timely predictions to support public health interventions and vector control efforts (6, 48).

3.8.5 Other infectious diseases

AI and ML techniques are also being applied to a growing range of other infectious diseases beyond the major examples above. In response to the monkeypox outbreaks, researchers have developed machine learning models for diagnosis, using either clinical symptom data–sometimes incorporating explainable AI (XAI) techniques to improve trust and transparency (56)–or analyzing image data of lesions to aid detection (49).

Applications in tuberculosis (TB) surveillance include using machine learning to predict the risk of relapse in patients and exploring the transferability of predictive models trained on other respiratory illnesses, such as COVID-19, to forecast TB case numbers (25, 39, 54). For foodborne illnesses, tree-based machine learning algorithms have been applied to integrated food safety surveillance data and human case reports to predict the spatiotemporal patterns of salmonellosis outbreaks (57).

Nosocomial (hospital-acquired) infections represent another area where machine learning methods are used on hospital surveillance data, incorporating factors such as antibiotic use and operational metrics to assess risks and predict infection incidence (40). Furthermore, AI models are being developed for broader categories, such as classifying various vector-borne diseases based on symptomatology (47) or using neural networks for early warning across multiple notifiable respiratory and digestive tract diseases (42, 48).

Specific applications also include surveillance systems for outbreaks like Lassa fever (55) and predicting trends for diseases such as hand, foot, and mouth disease (53).

3.9 Future trends and advancements in AI for early detection of infectious disease outbreaks

Finally, addressing the fifth research question, this section explores the emerging trends and future directions for the development and application of artificial intelligence (AI) in this field, as suggested by the reviewed literature.

3.9.1 Data integration and algorithm improvement

Future developments will involve enhanced integration of diverse data sources, including real-time streams from social media, wearable devices, environmental sensors, wastewater monitoring, and genomic sequencing (16, 50, 59). There is an increasing emphasis on combining multi-sectoral data under frameworks such as One Health (4), and integrating clinical information with external factors like climate patterns or web searches (22, 53).

At the same time, AI models will continue to evolve, with the development of more advanced and accurate algorithms. This includes further refinement of deep learning models and ensemble techniques, aimed at improving predictive capabilities (5, 38, 46). These advancements are expected to enhance the reliability of AI systems and address challenges related to data quality and completeness (9). Continuous model updating and optimization through iterative feedback and expert judgment will be essential for maintaining model relevance and performance (13).

3.9.2 Enhanced transparency and privacy

Addressing ethical concerns will remain important. A growing focus on explainable AI (XAI) is anticipated to improve transparency and trust in AI-driven systems (25, 41). XAI directly tackles the “black box” challenge discussed in Section 3.6.2, allowing stakeholders to better understand the reasoning behind AI predictions (5, 50, 56).

Several specific XAI techniques are gaining importance for their utility in interpreting complex models, thereby enhancing practical relevance. For example, Local Interpretable Model-agnostic Explanations (LIME) offers a method to explain the predictions of any machine learning model by approximating its behavior with a simpler, interpretable model (e.g., a linear model) locally around a specific instance being predicted (70). In the EWS context, LIME could thus help public health officials understand why an otherwise opaque AI system flagged a particular region or time point as high-risk for an outbreak.

Another widely adopted technique is SHapley Additive exPlanations (SHAP), which utilizes a game theory approach, specifically Shapley values, to assign an importance value to each feature for a particular prediction (71). This value indicates the feature's contribution to the model's output, allowing for a quantitative assessment of how different data inputs (e.g., specific symptoms reported on social media, recent mobility patterns, or prevailing climate conditions) contributed to an AI model's forecast of increased disease incidence.

Furthermore, particularly for deep learning models such as Transformers and some Convolutional Neural Networks (CNNs), Attention Visualization provides valuable insights by allowing for the inspection of internal attention mechanisms. These visualizations can reveal which parts of the input data the model focused on most when making a decision (72). For instance, in an EWS processing news articles or social media posts, attention weights could identify specific keywords or phrases that triggered an alert; similarly, in time-series forecasting, such visualizations could identify which historical data points were most influential for a given prediction.

The adoption and further development of these and other XAI methods are crucial for building trust and facilitating the practical deployment of AI in critical public health decision-making processes. Ongoing research continues to refine these techniques and develop new ones tailored to the complexities of AI in healthcare and infectious disease surveillance.

In parallel, emerging privacy-enhancing technologies such as federated learning (29, 63), which allows AI models to be trained on decentralized data without sharing raw sensitive information, will be equally important.

The promise of federated learning (FL) in the EWS context primarily derives from its ability to train robust AI models collaboratively across multiple entities (e.g., hospitals, clinics, individual wearable devices) without the need to centralize sensitive raw patient data, thus directly addressing critical data privacy and security concerns (29, 63). The reviewed literature describes conceptual architectures for FL in health surveillance. For instance, Javed et al. introduce a framework leveraging FL with data from wearable health monitoring “gages” for the early diagnosis of infectious diseases like COVID-19, dengue, and tuberculosis, emphasizing its potential for lower power consumption on distributed devices (29). Tian et al. (28) propose an FL-based “alliance monitoring” module specifically for medical institutions as part of a broader blockchain-enabled EWS, facilitating secure inter-institutional data sharing and collaborative model building. These architectures typically involve local model training on decentralized datasets, with only aggregated parameters or model updates being shared, often via a central server (though serverless peer-to-peer models are also conceived), to build a more generalized global model.

Regarding technical feasibility, while FL offers significant advantages for privacy and access to diverse data, its practical implementation in EWS is subject to several considerations identified in the literature. Benefits include the potential for more accurate and generalizable models from varied data sources (29) and reduced latency when combined with edge computing (63). However, significant challenges persist, including managing statistical heterogeneity (non-IID data) across different participating nodes, the communication overhead required for transmitting model updates, ensuring the security and privacy of the model updates themselves against inference attacks, the computational demands on local devices or institutions, and the complexities of system interoperability and integration into existing public health infrastructures (63).

While comprehensive case studies detailing the large-scale deployment and empirical performance of FL-based EWS for specific infectious diseases were still emerging within our review period (2018-onward), the reviewed literature strongly advocates for its potential and outlines numerous proposed applications. Beyond specific disease mentions by Javed et al. (29), the general applicability of FL is highlighted for scenarios requiring collaborative analysis of distributed health data while preserving privacy, which is fundamental for effective and equitable EWS. Overcoming the identified technical and logistical hurdles is crucial for transitioning FL from a promising concept to a widely adopted, impactful technology in routine public health surveillance for infectious diseases.

Blockchain technology is also being explored for secure data sharing and smart contract applications (22, 28). These technologies are expected to support collaborative AI development while safeguarding data privacy and security.

3.9.3 Broadening scope: novel threats and personalization

Advancements are anticipated in the real-time monitoring and early detection of novel and emerging infectious diseases (EIDs) (7, 8, 24). AI systems will increasingly focus on detecting unusual symptom clusters through syndromic surveillance, potentially identifying new or unexpected infectious threats before their causes are fully understood (23).

At the same time, the application of AI in personalized and precision public health is expected to expand (3, 31). Future strategies could involve customizing warnings or preventive advice based on individual risk profiles derived from data from wearable devices, genomic information, or particular clinical factors (17, 43, 44).

3.9.4 Global collaboration and standardization

Increased global collaboration and data sharing will be essential to enhance pandemic preparedness (16, 25). Developing standardized AI tools and data protocols will facilitate more effective global disease surveillance and response (8), helping to overcome integration challenges and reducing data biases described in previous sections.

Establishing cross-sectoral partnerships among public health agencies, healthcare providers, academic institutions, and technology developers will be critical for sharing expertise, co-developing solutions, and fostering innovation (5, 19, 32).

4 Limitations of this review

This systematic review, while aiming to provide a comprehensive overview of the recent landscape of AI applications in EWS for infectious diseases, is subject to several methodological limitations that should be considered when interpreting its findings.

Firstly, the scope of our literature retrieval was primarily based on the Semantic Scholar database. Although Semantic Scholar is an extensive, updated, AI-driven platform indexing an extensive number of academic papers, the dependence on a single primary database, despite our structured search strategy, may mean that some relevant studies indexed exclusively in other databases (e.g., Web of Science, Scopus, PubMed Central for specific biomedical aspects) might have been missed. This could potentially introduce a degree of selection bias.

Secondly, our review included a language restriction, focusing only on studies published in English. This was a practical decision to ensure consistent interpretation and data, but it inevitably excludes research published in other languages. Therefore, valuable insights and AI applications developed or reported in non-English literature, particularly from regions where English is not the primary language of scientific publication, may not be represented in our synthesis, potentially skewing the geographical representation of research activities.

Thirdly, as with most systematic reviews, there is a potential for publication bias. Studies reporting positive, novel, or statistically significant findings are often more likely to be published than those with null, negative, or inconclusive results. This could lead to an overrepresentation of successful AI applications or an underestimation of the challenges and failures in the field of AI for EWS.

Fourthly, the timeframe for our search (2018 onwards) was chosen to focus on recent advancements in this rapidly growing field. While this provides a contemporary overview, it means that foundational or earlier relevant studies published before 2018 were not included in this specific review.

Finally, the significant heterogeneity observed across the 67 included studies in terms of AI methodologies, specific diseases, datasets, and evaluation metrics made it challenging to conduct a direct quantitative comparison or meta-analysis of the performance of different AI approaches. Our review, therefore, primarily provides a qualitative synthesis and mapping of the reported landscape.

5 Discussion

Historical pandemics and contemporary factors such as globalization, climate change, and zoonotic spillover (diseases transmitted from animals to humans) show the urgent need to enhance global preparedness against infectious diseases (1, 2, 4, 6). In response, this systematic review evaluated the current state of the use of AI in early warning systems (EWS) for infectious disease surveillance, summarizing findings from 67 relevant studies. Specifically, it addressed five research questions related to: primary AI methods, data sources, perceived benefits, significant challenges, and future trends. One consideration is that our review was based on literature retrieved from Semantic Scholar. Although this database covers a broad spectrum of scientific publications, it may not include all relevant studies indexed in other sources such as Web of Science or Scopus. However, given its integration of diverse publication sources and strong coverage of peer-reviewed literature, we believe this approach was appropriate for the scope and objectives of our review. Overall, the findings suggest that AI has the potential to transform infectious disease surveillance from reactive approaches into proactive, data-driven predictions, although several technical, practical, and ethical barriers still limit its general implementation.

Regarding the first research question on primary AI methods, the reviewed studies show a clear shift from traditional statistical methods toward more advanced machine learning (ML) and deep learning (DL) techniques. ML classifiers, such as support vector machines (SVM), logistic regression, and k-nearest neighbors (KNN), remain popular for disease prediction and classification tasks (46, 48). Ensemble methods, particularly Random Forests, consistently achieve strong performance in predicting hospital-acquired infections (40), forecasting communicable diseases (48), and identifying foodborne illness outbreaks (57). Additionally, DL models that capture temporal patterns, such as Long Short-Term Memory (LSTM) networks, have proven particularly effective for forecasting diseases like influenza and Dengue fever (52, 53). Innovations in customized DL architectures, demonstrated by attention-based SEAR networks for influenza surveillance (13), further illustrate the evolution of the field. Meanwhile, natural language processing (NLP) techniques have become important for extracting insights from unstructured text in news articles, social media, and clinical reports, enabling real-time tracking of public sentiment and symptom reporting (10, 12, 32). Hybrid approaches, combining multiple algorithms or integrating AI with traditional epidemiological models, are increasingly adopted to improve overall predictive accuracy and system robustness (39, 58).

In addressing the second research question about data sources, modern AI-driven EWS emphasize the integration of diverse and large-scale datasets. While traditional epidemiological sources–such as case reports, influenza-like illness (ILI) counts, and hospital records–remain important (13, 62), the full potential of AI appears through combining these traditional sources with non-traditional datasets. Web-based data, including news articles, social media platforms [e.g., Twitter (50)], and search engine queries [e.g., Baidu Index (61)], offer real-time indicators of emerging health concerns. Environmental and climatic datasets are important for forecasting vector-borne illnesses like Dengue fever (26, 27), as well as other weather-sensitive diseases (6). Emerging data sources, such as wastewater surveillance, provide unbiased, community-level indicators of disease activity (22, 58), and genomic sequencing enables precise identification and tracking of pathogens (8, 24). Mobile health technologies and wearable devices offer future potential for personalized health monitoring, although their integration into broader public health surveillance remains limited at this time (29, 34, 63). The integration of such diverse data enhances predictive accuracy but also introduces substantial challenges related to data quality, consistency, and interoperability (9, 63).

Furthermore, a significant overarching challenge implicitly linked to data quality, model transparency, and ethical considerations is the generalizability of AI models, particularly in the context of cross-region or cross-population applications. Our review notes that issues with model generalizability and the risk of poor performance when applied to new, distinct datasets are recognized limitations in the field (as discussed in Section 3.6.1). While the concept of transfer learning between different diseases was noted in some reviewed literature (Section 3.8.5), a deep, specific exploration into the methodologies, comparative effectiveness, and challenges of cross-region transfer learning (e.g., adapting models developed in one continent for robust application in another with different demographic, environmental, or healthcare system characteristics) was beyond the defined scope of our primary research questions. Our review aimed to provide a broad assessment of the current landscape of AI techniques, data sources, reported benefits, and broadly identified challenges within EWS for infectious diseases. The complexities of developing, validating, and implementing effective and equitable cross-region transfer learning strategies represent a substantial and critical research domain in their own right.

Regarding the third research question (benefits), AI-based early warning systems primarily enhance the time and accuracy of outbreak detection. Multiple studies and real-world systems, such as BlueDot's early identification of COVID-19 (12), illustrate how AI can detect outbreaks sooner than traditional surveillance methods (10, 38), thus enabling faster and more effective public health interventions (23). Improved predictive accuracy further supports health authorities in allocating resources and responding effectively to outbreaks (14, 27, 36, 48, 52, 54). Additionally, AI-driven automation of data processing may offer cost savings, particularly in resource-limited settings (12), although equitable access to these advanced technologies remains an important concern (23).

Despite these clear benefits, the fourth research question identifies substantial limitations. The quality, completeness, and representativeness of input data determine AI performance; thus, poor data quality inevitably leads to unreliable predictions (“garbage in, garbage out”) (9). Biases inherent in data collection processes–such as underreporting or limited digital access–can result in biased AI outputs that intensify existing health inequities (11, 14, 32). The “black box” nature of complex DL models, characterized by their lack of transparency, also represents a significant barrier to clinician and public health official adoption (31, 41). While Explainable AI (XAI) methods are emerging to address this challenge, they remain underdeveloped (5, 56). Additional challenges include technical difficulties in integrating AI systems into existing public health infrastructure, along with complex ethical considerations around privacy, consent, fairness, accountability, and potential misuse of data (11, 14, 19, 32, 50, 63). Finally, human expertise continues to be essential for interpreting AI-generated insights and making important public health decisions (8, 43).

Considering future trends (fifth research question), the field is moving toward integrating diverse datasets, developing more sophisticated, transparent algorithms, and adopting privacy-preserving technologies such as federated learning and blockchain (3, 5, 16, 28, 29, 41). However, achieving these goals will require global collaboration, standardized data practices, sustained investment in infrastructure and workforce training, and clear ethical frameworks to guide responsible AI development and deployment (1, 8, 14, 19).

6 Conclusion

This systematic review shows the growing importance and rapid development of artificial intelligence (AI) in early warning systems for infectious diseases. AI methods have the potential to greatly improve the speed, accuracy, and effectiveness of outbreak detection and prediction. By analyzing large and varied data sources, ranging from traditional health records to digital media, environmental measurements, and wastewater surveillance, AI can provide earlier and more precise warnings. This advantage has been clearly demonstrated for diseases such as COVID-19, influenza, and Dengue fever.

However, significant challenges remain, preventing AI from being widely implemented. Issues related to data quality, missing or biased data, and transparency in complex AI (“black box”) models must be carefully addressed. The need to explain how AI reaches its conclusions (“explainable AI”) is necessary to build trust among healthcare professionals and public health authorities. Additionally, there are technical difficulties in combining and managing large datasets, and ethical concerns about privacy, fairness, and accountability. It is also important to ensure that AI systems support human decision-making rather than replace it.

While AI offers great promise for improving infectious disease surveillance and global health preparedness, achieving these benefits requires a coordinated effort. Continued investment in developing transparent, fair, and ethical AI technologies is needed, along with improvements in data management, training of health workers, and international cooperation.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

IV-M: Conceptualization, Data curation, Formal analysis, Visualization, Writing – original draft, Writing – review & editing. GX: Conceptualization, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing. YX: Conceptualization, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was partially supported by the National Institutes of Health [P50CA70907, R35GM136375, R01GM140012, R01GM141519, U01CA249245, and U01AI169298], and the Cancer Prevention and Research Institute of Texas [RP230330 and RP240521].

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2025.1609615/full#supplementary-material

References

1. Panah HR. Early detecting of infectious disease outbreaks: AI potentials for public health systems. Rangahau Aranga: AUT Grad Rev. (2023) 2. doi: 10.24135/rangahau-aranga.v2i3.180

Crossref Full Text | Google Scholar

2. Oeschger T, McCloskey D, Buchmann RM, Choubal AM, Boza J, Mehta S, et al. Early warning diagnostics for emerging infectious diseases in developing into late-stage pandemics. Acc Chem Res. (2021) 54:3656–66. doi: 10.1021/acs.accounts.1c00383

PubMed Abstract | Crossref Full Text | Google Scholar

3. Srivastava V, Kumar R, Wani MY, Robinson K, Ahmad A. Role of artificial intelligence in early diagnosis and treatment of infectious diseases. Infect Dis. (2024) 57:1–26. doi: 10.1080/23744235.2024.2425712

PubMed Abstract | Crossref Full Text | Google Scholar

4. Zhou C, Wang S, Wang CX, Qiang N, Xiu L, Hu Q, et al. Integrated surveillance and early warning system of emerging infectious diseases in China at community level: current status, gaps and perspectives. Sci One Health. (2024) 4:100102. doi: 10.1016/j.soh.2024.100102

PubMed Abstract | Crossref Full Text | Google Scholar

5. Wong F, de la Fuente-Nunez C, Collins JJ. Leveraging artificial intelligence in the fight against infectious diseases. Science. (2023) 381:164–70. doi: 10.1126/science.adh1114

PubMed Abstract | Crossref Full Text | Google Scholar

6. Morin C, Semenza J, Trtanj J, Glass G, Boyer C, Ebi K. Unexplored opportunities: use of climate- and weather-driven early warning systems to reduce the burden of infectious diseases. Curr Environ Health Rep. (2018) 5:430–8. doi: 10.1007/s40572-018-0221-0

PubMed Abstract | Crossref Full Text | Google Scholar

7. Sun H, Hu W, Wei Y, Hao Y. Drawing on the development experiences of infectious disease surveillance systems around the world. China CDC Weekly. (2024) 6:1065–74. doi: 10.46234/ccdcw2024.220

PubMed Abstract | Crossref Full Text | Google Scholar

8. Parums D. Editorial: Infectious disease surveillance using artificial intelligence (AI) and its role in epidemic and pandemic preparedness. Med Sci Monit. (2023) 29:e941209-1–4. doi: 10.12659/MSM.941209

PubMed Abstract | Crossref Full Text | Google Scholar

9. Morr CE, Ozdemir D, Asdaah Y, Saab A, El-Lahib Y, Sokhn E. AI-based epidemic and pandemic early warning systems: a systematic scoping review. Health Informatics J. (2024) 3:14604582241275844. doi: 10.1177/14604582241275844

PubMed Abstract | Crossref Full Text | Google Scholar

10. Arslan J, Benke K. Artificial intelligence and telehealth may provide early warning of epidemics. Front Artif Intell. (2021) 4:556848. doi: 10.3389/frai.2021.556848

PubMed Abstract | Crossref Full Text | Google Scholar

11. Li C, Ye G, Jiang Y, Wang Z, Yu H, Yang M. Artificial intelligence in battling infectious diseases: a transformative role. J Med Virol. (2024) 96:29355. doi: 10.1002/jmv.29355

PubMed Abstract | Crossref Full Text | Google Scholar

12. Macintyre C, Chen X, Kunasekaran M, Quigley A, Lim S, Stone H, et al. Artificial intelligence in public health: the potential of epidemic early warning systems. J Int Med Res. (2023) 51:3000605231159335. doi: 10.1177/03000605231159335

PubMed Abstract | Crossref Full Text | Google Scholar

13. Yang L, Yang J, He Y, Zhang M, Han X, Hu X, et al. Enhancing infectious diseases early warning: a deep learning approach for influenza surveillance in China. Prev Med Rep. (2024) 43:102761. doi: 10.1016/j.pmedr.2024.102761

PubMed Abstract | Crossref Full Text | Google Scholar

14. Mckee M, Rosenbacke R, Stuckler D. The power of artificial intelligence for managing pandemics: a primer for public health professionals. Int J Health Plann Manage. (2024) 40:257–70. doi: 10.1002/hpm.3864

PubMed Abstract | Crossref Full Text | Google Scholar

15. Isiaka AB, Anakwenze VN, Ilodinso CR, Anaukwu CG, Ezeokoli CMV, Noi SM, et al. Harnessing artificial intelligence for early detection and management of infectious disease outbreaks. Int J Innov Res Dev. (2024) 13:52–65. doi: 10.24940/ijird/2024/v13/i2/FEB24016

Crossref Full Text | Google Scholar

16. James C, Ezeh CJ, Anioke SC, Oyewole S, David MG. The role of predictive analytics in enhancing public health surveillance: proactive and data-driven interventions. World J Adv Res Rev. (2024) 24:3059–077. doi: 10.30574/wjarr.2024.24.3.3909

Crossref Full Text | Google Scholar

17. Singh J, Dhiman G. A review on predictive analytics for early disease detection in neonatal healthcare using artificial intelligence. J Neonatal Surg. (2025) 14:831–42. doi: 10.52783/jns.v14.2158

Crossref Full Text | Google Scholar

18. Ekundayo F. Using machine learning to predict disease outbreaks and enhance public health surveillance. World J Adv Res Rev. (2024) 24:794–811. doi: 10.30574/wjarr.2024.24.3.3732

Crossref Full Text | Google Scholar

19. Eze CE, Igwama GT, Nwankwo EI, Emeihe EV. AI-driven health data analytics for early detection of infectious diseases: a conceptual exploration of U.S. public health strategies. Compr Res Rev Sci Technol. (2024) 2:74–82.

Google Scholar

20. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. (2021) 372:n71. doi: 10.1136/bmj.n71

PubMed Abstract | Crossref Full Text | Google Scholar

21. Ammar W, Groeneveld D, Bhagavatula C, Beltagy I, Crawford M, Downey D, et al. Construction of the literature graph in semantic scholar. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. New Orleans, LA: ACL (2018). p. 84–91. doi: 10.18653/v1/N18-3011

PubMed Abstract | Crossref Full Text | Google Scholar

22. Li Z, Meng F, Wu B, Kong D, Geng M, Qiu X, et al. Reviewing the progress of infectious disease early warning systems and planning for the future. BMC Public Health. (2024) 24:3080. doi: 10.1186/s12889-024-20537-2

PubMed Abstract | Crossref Full Text | Google Scholar

23. Meckawy R, Stuckler D, Mehta A, AL-Ahdal T, Doebbeling B. Effectiveness of early warning systems in the detection of infectious diseases outbreaks: a systematic review. BMC Public Health. (2022) 22:2216. doi: 10.1186/s12889-022-14625-4

PubMed Abstract | Crossref Full Text | Google Scholar

24. Bernasconi A, Chiara M, Alfonsi T, Ceri S. CEUR-WS. SENSIBLE: Implementing Data-driven Early Warning Systems for Future Viral Epidemics. (2024). p. 18–25. Available online at: https://re.public.polimi.it/handle/11311/1266723

Google Scholar

25. Towfek SK, Elkanzi M. A review on the role of machine learning in predicting the spread of infectious diseases. metaheuristic optimization review. Metaheuristic Optim Rev. (2024) 02:14–27. doi: 10.54216/MOR.020102

Crossref Full Text | Google Scholar

26. Nasir M, Aldillah Wulandhari S, Tenrisau D, Haris Ibrahim M, Rahastri A, Sa'adatar Rohmah N, et al. Machine learning approach to predict the dengue cases based on climate factors. Window Health: J Kesehatan. (2024) 7:203–14. doi: 10.33096/woh.vi.1428

Crossref Full Text | Google Scholar

27. Roster K, Connaughton C, Rodrigues F. Machine learning based forecast of dengue fever in Brazilian cities using epidemiological and meteorological variables. Am J Epidemiol. (2022) 191:1803–12. doi: 10.1093/aje/kwac090

PubMed Abstract | Crossref Full Text | Google Scholar

28. Tian Y, Wan-jun Y, Zhang M, Zhang M, Tang JY. The application of blockchain technology in the early warning and monitoring of infectious diseases. In: 2020 5th International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBM). Okinawa: IEEE (2020). p. 229–33. doi: 10.1109/ICIIBMS50712.2020.9336418

Crossref Full Text | Google Scholar

29. Javed I, Iqbal U, Bilal M, Shahzad B, Chung TS, Attique M. Next generation infectious diseases monitoring gages via incremental federated learning: current trends and future possibilities. Comput Intell Neurosci. (2023) 2023:1102715. doi: 10.1155/2023/1102715

PubMed Abstract | Crossref Full Text | Google Scholar

30. Manshi. Predicting disease outbreaks using machine learning models on public health data. Int J Multidiscip Res. (2024) 6. doi: 10.36948/ijfmr.2024.v06i05.28999

Crossref Full Text | Google Scholar

31. Peterson E. Machine learning, predictive analytics, and clinical practice: can the past inform the present? JAMA. (2019) 322:2283–4. doi: 10.1001/jama.2019.17831

PubMed Abstract | Crossref Full Text | Google Scholar

32. Flores L, Kim S, Young SD. Addressing bias in artificial intelligence for public health surveillance. J Med Ethics. (2023) 50:190–4. doi: 10.1136/jme-2022-108875

PubMed Abstract | Crossref Full Text | Google Scholar

33. Raja S, Sukanya N. Artificial intelligence in public health surveillance for monitoring the infectious diseases. In: 2024 International Conference on Artificial Intelligence and Quantum Computation-Based Sensor Application (ICAIQSA). Nagpur: IEEE (2024). p. 1–4. doi: 10.1109/ICAIQSA64000.2024.10882310

Crossref Full Text | Google Scholar

34. Olaboye JA, Maha CC, Kolawole TO, Abdul S. Innovations in real-time infectious disease surveillance using AI and mobile data. Int Med Sci Res J. (2024) 4:647–67. doi: 10.51594/imsrj.v4i6.1190

PubMed Abstract | Crossref Full Text | Google Scholar

35. Jaswal S, Sharma S, Parihar N. Artificial intelligence-driven predictive analytics for early detection of chronic diseases in primary healthcare. In: International Conference on Signals and Electronic Systems. Chennai: IEEE (2024). p. 1–6. doi: 10.1109/ICSES63760.2024.10910314

Crossref Full Text | Google Scholar

36. Nwankwo EI, Emeihe EV, Ajegbile MD, Olaboye JA, Maha CC. Artificial intelligence in predictive analytics for epidemic outbreaks in rural populations. Int Med Sci Res J. (2024) 7:78–94. doi: 10.53771/ijlsra.2024.7.1.0062

Crossref Full Text | Google Scholar

37. Natrayan L, Kamal M, Manivannan KK, Sunil G. Machine learning and data mining approaches for infectious disease surveillance and outbreak management in healthcare. In: 2024 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC). Bhubaneswar: IEEE (2024). p. 1–7. doi: 10.1109/ASSIC60049.2024.10507990

Crossref Full Text | Google Scholar

38. Zhang Y, Shi W, Han C, Luo Z. The research of artificial intelligence assisted real-time monitoring and response system for infectious disease outbreaks. In: 2024 International Conference on Integrated Intelligence and Communication Systems (ICIICS). Kalaburagi: IEEE (2024). p. 1–5. doi: 10.1109/ICIICS63763.2024.10860203

PubMed Abstract | Crossref Full Text | Google Scholar

39. Cheng Y, Bai Y, Yang J, Tan X, Xu T, Cheng R. Analysis and prediction of infectious diseases based on spatial visualization and machine learning. Sci Rep. (2024) 14:28659. doi: 10.1038/s41598-024-80058-1

PubMed Abstract | Crossref Full Text | Google Scholar

40. Chen Y, Zhang Y, Nie S, Ning J, Wang Q, Yuan H, et al. Risk assessment and prediction of nosocomial infections based on surveillance data using machine learning methods. BMC Public Health. (2024) 24:1780. doi: 10.1186/s12889-024-19096-3

PubMed Abstract | Crossref Full Text | Google Scholar

41. Addaali B, Latif R, Saddik A. Addressing the spread of infectious diseases in the era of explainable AI. In: World Conference on Complex Systems. Mohammedia: IEEE (2024). p. 1–6. doi: 10.1109/WCCS62745.2024.10765539

Crossref Full Text | Google Scholar

42. Guo Z, He K, Xiao D. Early warning of some notifiable infectious diseases in China by the artificial neural network. R Soc Open Sci. (2020) 7:191420. doi: 10.1098/rsos.191420

PubMed Abstract | Crossref Full Text | Google Scholar

43. Langford BJ, Branch-Elliman W, Nori P, Marra AR, Bearman G. Confronting the disruption of the infectious diseases workforce by artificial intelligence: what this means for us and what we can do about it. Open Forum Infect Dis. (2024) 11:ofae053. doi: 10.1093/ofid/ofae053

PubMed Abstract | Crossref Full Text | Google Scholar

44. Ibiam VA, Omale LE, Taiwo O. The role of Artificial Intelligence models in clinical decision support for infectious disease diagnosis and personalized treatment planning. Int J Sci Res Arch. (2025) 14:1337–47. doi: 10.30574/ijsra.2025.14.3.0769

Crossref Full Text | Google Scholar

45. Mandepudi P, Sindhu G, Rohan SS, Reddy DLC, M P. AI-driven medical chatbot for early disease detection. Int J Res Appl Sci Eng Technol. (2025) 13:2324–9. doi: 10.22214/ijraset.2025.67802

Crossref Full Text | Google Scholar

46. Li B, Ding S, Song G, Li J, Zhang Q. Computer-aided diagnosis and clinical trials of cardiovascular diseases based on artificial intelligence technologies for risk-early warning model. J Med Syst. (2019) 43:1–10. doi: 10.1007/s10916-019-1346-x

PubMed Abstract | Crossref Full Text | Google Scholar

47. Abinaya N, Harikrishnan VS, Santhiya S, Jayadharshini P, Rathika P, Nishanandhini A, et al. Optimizing tabular vector-borne disease surveillance with machine learning classification techniques. In: 2024 2nd International Conference on Advances in Computation, Communication and Information Technology (ICAICCIT), Vol. 1. Delhi: IEEE (2024). p. 918–22. doi: 10.1109/ICAICCIT64383.2024.10912396

Crossref Full Text | Google Scholar

48. Go VDM. Communicable disease surveillance through predictive analysis: a comparative analysis of prediction models. HCMCOU J Sci Eng Technol. (2023) 13:45–54. doi: 10.46223/HCMCOUJS.tech.en.13.2.2944.2023

Crossref Full Text | Google Scholar

49. Gairola AK, Kumar V. Monkeypox disease diagnosis using machine learning approach. In: International Computer Science Conference. Noida: IEEE (2022). p. 423–427. doi: 10.1109/ICSC56524.2022.10009135

Crossref Full Text | Google Scholar

50. Kashmar N, Ozdemir D, Yousuf I, Dabboussi AH, Saab A, Salem-Sokhn E, et al. Enhancing epidemic early warning systems with social media data. In: 2025 International Conference on Control, Automation, and Instrumentation (IC2AI). Beirut: IEEE (2025). p. 1–5. doi: 10.1109/IC2AI62984.2025.10932178

Crossref Full Text | Google Scholar

51. Chu WT, Reza SMS, Anibal JT, Landa AJ, Crozier I, Bagci U, et al. Artificial intelligence and infectious disease imaging. J Infect Dis. (2023) 228(Supplement 4):S322–36. doi: 10.1093/infdis/jiad158

PubMed Abstract | Crossref Full Text | Google Scholar

52. Ningrum DNA Li YC, Hsu CY, Muhtar MS, Suhito H. Artificial intelligence approach for severe dengue early warning system. Stud Health Technol Inform. (2024) 310:881–5. doi: 10.3233/SHTI231091

PubMed Abstract | Crossref Full Text | Google Scholar

53. Wang M, Lee C, Wang W, Yang Y, Yang C. Early warning of infectious diseases in hospitals based on multi-self-regression deep neural network. J Healthc Eng. (2022) 2022:8990907. doi: 10.1155/2022/8990907

PubMed Abstract | Crossref Full Text | Google Scholar

54. Haval DAM, Ikhar S. Using a convolutional neural network for early infectious disease detection during public health emergencies. South East Eur J Public Health. (2024) 1:387–91. doi: 10.70135/seejph.vi.949

PubMed Abstract | Crossref Full Text | Google Scholar

55. Wattamwar A, Akwafuo SE, Mistry V. Data-driven real-time surveillance system for tracking disease outbreaks: a case study of lassa fever outbreak. In: 2024 IEEE 12th International Conference on Healthcare Informatics (ICHI). Orlando, FL: IEEE (2024). p. 344–9. doi: 10.1109/ICHI61247.2024.00051

Crossref Full Text | Google Scholar

56. Setegn GM, Dejene BE. Explainable AI for symptom-based detection of monkeypox: a machine learning approach. BMC Infect Dis. (2025) 25:419. doi: 10.1186/s12879-025-10738-4

PubMed Abstract | Crossref Full Text | Google Scholar

57. Garcia-Vozmediano A, Maurella C, Ceballos LA, Crescio E, Meo R, Martelli W, et al. Machine learning approach as an early warning system to prevent foodborne Salmonella outbreaks in northwestern Italy. Vet Res. (2024) 55:72. doi: 10.1186/s13567-024-01323-9

PubMed Abstract | Crossref Full Text | Google Scholar

58. Zehnder C, Béen F, Vojinovic Z, Savić DA, Torres A, Mark O, et al. Machine learning for detecting virus infection hotspots via wastewater-based epidemiology: the case of SARS-CoV-2 RNA. GeoHealth. (2023) 7:e2023GH000866. doi: 10.1029/2023GH000866

PubMed Abstract | Crossref Full Text | Google Scholar

59. Fu Y, Liu Y, Song W, Yang D, Wu W, Lin J, et al. Early monitoring-to-warning Internet of Things system for emerging infectious diseases via networking of light-triggered point-of-care testing devices. Exploration. (2023) 3:20230028. doi: 10.1002/EXP.20230028

PubMed Abstract | Crossref Full Text | Google Scholar

60. Apolinario-Arzube Ó, García-Díaz JA, Pinto S, Luna-Aveiga H, Medina-Moreira JJ, Gómez-Berbis JM, et al. CollaborativeHealth: smart technologies to surveil outbreaks of infectious diseases through direct and indirect citizen participation. In: Applied Informatics and Cybernetics in Intelligent Systems: Proceedings of the 9th Computer Science On-line Conference 2020, Volume 39. Cham: Springer (2020). p. 177–90. doi: 10.1007/978-3-030-51974-2_15

Crossref Full Text | Google Scholar

61. Zhang T, Yang L, Fan Z, Hu X, Yang J, Luo Y, et al. Comparison between threshold method and artificial intelligence approaches for early warning of respiratory infectious diseases — Weifang City, Shandong Province, China, 2020–2023. China CDC Weekly. (2024) 6:635–41. doi: 10.46234/ccdcw2024.119

PubMed Abstract | Crossref Full Text | Google Scholar

62. Zhang L, Li MY, Zhi C, Zhu M, Ma H. Identification of early warning signals of infectious diseases in hospitals by integrating clinical treatment and disease prevention. Curr Med Sci. (2024) 44:273–80. doi: 10.1007/s11596-024-2850-x

PubMed Abstract | Crossref Full Text | Google Scholar

63. Badidi E. Edge AI for early detection of chronic diseases and the spread of infectious diseases: opportunities, challenges, and future directions. Future Internet. (2023) 15:370. doi: 10.3390/fi15110370

Crossref Full Text | Google Scholar

64. Tran N, Kretsch CM, LaValley C, Rashidi H. Machine learning and artificial intelligence for the diagnosis of infectious diseases in immunocompromised patients. Curr Opin Infect Dis. (2023) 36:235–42. doi: 10.1097/QCO.0000000000000935

PubMed Abstract | Crossref Full Text | Google Scholar

65. Bandal U. Predicting multiple diseases with machine learning and streamlit: enhancing healthcare digitally. Int J Sci Res Eng Manag. (2024) 8:1–4. doi: 10.55041/IJSREM30266

Crossref Full Text | Google Scholar

66. Quigley A, Honeyman MD, Stone MH, Dawson R, MacIntyre CR. EPIWATCH: an artificial intelligence early-warning system as a valuable tool in outbreak surveillance. Int J Infect Dis. (2025) 152:107579. doi: 10.1016/j.ijid.2024.107579

Crossref Full Text | Google Scholar

67. Hu S, Chen D, Cheng X. Research on early warning index system of infectious diseases. In: 2021 International Conference on Public Management and Intelligent Society (PMIS). (2021). p. 396–9. doi: 10.1109/PMIS52742.2021.00096

Crossref Full Text | Google Scholar

68. Hu S-n, Cheng X, Chen D. Comparative study on early warning methods of infectious diseases. In: E3S Web of Conferences. (2021).

Google Scholar

69. Mazhar B, Ali N, Manzoor F, Khan MK, Nasir M, Ramzan M. Development of data-driven machine learning models and their potential role in predicting dengue outbreak. J Vector Borne Dis. (2024) 61:1–4. doi: 10.4103/0972-9062.392264

PubMed Abstract | Crossref Full Text | Google Scholar

70. Nguyen HTT, Cao HQ, Nguyen KVT, Pham NDK. Evaluation of explainable artificial intelligence: shap, lime, and cam. In: Proceedings of the FPT AI Conference. Quy Nhon: FPT (2021). p. 1–6.

Google Scholar

71. Mosca E, Szigeti F, Tragianni S, Gallagher D, Groh G. SHAP-based explanation methods: a review for NLP interpretability. In: Proceedings of the 29th International Conference on Computational Linguistics. (2022). p. 4593–603.

Google Scholar

72. Dehimi NEH, Tolba Z. Attention mechanisms in deep learning: towards explainable artificial intelligence. In: 2024 6th International Conference on Pattern Analysis and Intelligent Systems (PAIS). EL OUED: IEEE (2024). p. 1–7. doi: 10.1109/PAIS62114.2024.10541203

Crossref Full Text | Google Scholar

73. Freifeld CC, Mandl KD, Reis BY, Brownstein JS. HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports. J Am Med Inform Assoc. (2008) 15:150–7. doi: 10.1197/jamia.M2544

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: artificial intelligence, public health, disease surveillance, early warning system (EWS), infectious disease, systematic review

Citation: Villanueva-Miranda I, Xiao G and Xie Y (2025) Artificial intelligence in early warning systems for infectious disease surveillance: a systematic review. Front. Public Health 13:1609615. doi: 10.3389/fpubh.2025.1609615

Received: 10 April 2025; Accepted: 02 June 2025;
Published: 23 June 2025.

Edited by:

Sandra Rojas-Berrio, National University of Colombia, Colombia

Reviewed by:

Manjit Kaur, SR University, India
Jorge Magalhães, Oswaldo Cruz Foundation (Fiocruz), Brazil
Tian Gan, East China University of Science and Technology, China

Copyright © 2025 Villanueva-Miranda, Xiao and Xie. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yang Xie, eWFuZy54aWVAdXRzb3V0aHdlc3Rlcm4uZWR1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.