REVIEW article

Front. Digit. Health, 16 January 2026

Sec. Health Technology Implementation

Volume 7 - 2025 | https://doi.org/10.3389/fdgth.2025.1678047

Federated multimodal AI for precision-equitable diabetes care

  • 1. Medical Service Dimension, China-Japan Union Hospital of Jilin University, Changchun, China

  • 2. Department of Hand and Foot Surgery, China-Japan Union Hospital of Jilin University, Changchun, China

  • 3. Department of Nursing, China-Japan Union Hospital of Jilin University, Changchun, China

Article metrics

View details

2,1k

Views

103

Downloads

Abstract

Type 2 diabetes mellitus (T2DM) constitutes a rapidly expanding global epidemic whose societal burden is amplified by deep-rooted health inequities. Socio-economic disadvantage, minority ethnicity, low health literacy, and limited access to nutritious food or timely care disproportionately expose under-insured populations to earlier onset, poorer glycaemic control, and higher rates of cardiovascular, renal, and neurocognitive complications. Artificial intelligence (AI) is emerging as a transformative counterforce, capable of mitigating these disparities across the entire care continuum. Early detection and risk prediction have progressed from static clinical scores to dynamic machine-learning (ML) models that integrate multimodal data—electronic health records, genomics, socio-environmental variables, and wearable-derived behavioural signatures—to yield earlier and more accurate identification of high-risk individuals. Complication surveillance is being revolutionised by AI systems that screen for diabetic retinopathy with near-specialist accuracy, forecast renal function decline, and detect pre-ulcerative foot lesions through image-based deep learning, enabling timely, targeted interventions. Convergence with continuous glucose monitoring (CGM) and wearable technologies supports real-time, AI-driven glycaemic forecasting and decision support, while telemedicine platforms extend these benefits to remote or resource-constrained settings. Nevertheless, widespread implementation faces challenges of data heterogeneity, algorithmic bias against minority groups, privacy risks, and the digital divide that could paradoxically widen inequities if left unaddressed. Future directions centre on multimodal large language models, digital-twin simulations for personalised policy testing, and human-in-the-loop governance frameworks that embed ethical oversight, trauma-informed care, and community co-design. Realising AI's societal promise demands coordinated action across patients, clinicians, technologists, and policymakers to ensure solutions are not only clinically effective but also equitable, culturally attuned, and economically sustainable.

1 Introduction

Diabetes's social burden is a complex issue with societal, psychological, and economic facets in addition to personal health consequences. T2DM, in particular, has become a global epidemic that disproportionately affects underinsured people, members of racial and ethnic minorities, and those from socioeconomically disadvantaged backgrounds (1) (Figure 1). Cardiovascular disorders, renal failure, and cognitive decline are among the complications of the disease that worsen its effects on society and put a tremendous burden on healthcare systems and economies around the world (2). Diabetes management and outcomes are greatly influenced by social determinants of health, which frequently reinforce health disparities. These factors include childhood environment, education, socioeconomic status, and access to healthcare (3). The lack of access to wholesome food, rigid healthcare schedules, and low health literacy are some of the obstacles that people from underprivileged backgrounds usually encounter when trying to manage their health effectively. These factors all work together to worsen glycemic control and increase the likelihood of complications (1, 3). The confluence of these elements emphasizes how urgently creative, scalable solutions are needed to alleviate the social burden of diabetes.

Figure 1

Panel A shows a world map indicating regional diabetes prevalence in 2024, totaling 588.7 million globally. Regions include NAC (56.2 million), EUR (65.6 million), AFR (24.6 million), SACA (35.4 million), MENA (84.7 million), SEA (106.9 million), and WP (215.4 million). Panel B is a graph showing a rise in the number of articles and cumulative citations in the field of Artificial Intelligence and Diabetes from 2013 to 2023, with articles on the y-axis and cumulative citations on the x-axis.

Prevalence rate of diabetes in the world and AI issuing trend. (A) Distribution of diabetes patients worldwide in 2024. (B) Research papers and citation situations in the field of “artificial intelligence and diabetes” from 2013 to 2023.

Although this is a narrative review rather than a systematic review, we adhered to a structured approach for literature selection to ensure comprehensiveness and relevance. We conducted a comprehensive search of PubMed, Web of Science, and Embase databases from inception until October 2025, using a combination of keywords and MeSH terms related to “artificial intelligence,” “machine learning,” “diabetes mellitus,” “diabetic complications,” “federated learning,” “digital twin,” and “health equity.”

Studies were included if they: (1) focused on AI/ML applications in diabetes care, including risk prediction, complication monitoring, or personalized interventions; (2) were published in peer-reviewed journals in English; (3) involved human subjects or real-world data; and (4) reported measurable outcomes (e.g., AUC, sensitivity, specificity, clinical impact).

Exclusion criteria included: (1) conference abstracts, editorials, or letters without original data; (2) studies limited to technical model development without clinical relevance; and (3) non-English publications.

After removing duplicates, titles and abstracts were screened by two independent reviewers (BB and HL), with disagreements resolved by consensus. Full texts of potentially relevant articles were assessed for eligibility. In addition, reference lists of key articles and recent high-quality reviews were manually searched to identify additional relevant studies. The final selection was based on thematic relevance, methodological quality, and representation of diverse populations and settings.

With previously unheard-of potential to lessen the disease's societal impact, AI marks a revolutionary shift in the treatment of diabetes. AI-driven technologies have shown great promise in lowering complications, increasing patient engagement, and improving glycemic control. Examples of these technologies include CGM, automated insulin delivery (AID) systems, and predictive analytics (4, 5). It has been demonstrated that AID systems help patients with type 1 and type 2 diabetes spend more time within target glucose ranges and lower their HbA1c levels, which lessens the strain of daily self-management (5, 6). Furthermore, AI-powered tools can offer real-time, personalized advice, improving treatment adherence and enabling patients to actively participate in their care (7). Nevertheless, there are certain difficulties in incorporating AI into diabetes care. Thorough monitoring and ethical considerations are required for issues like data privacy, algorithmic bias, and the danger of becoming overly dependent on technology (7, 8). In order to ensure that underprivileged groups profit from these developments, AI adoption must be equitable. The need for standardized, culturally relevant solutions is highlighted by the fact that, although AI-based diabetic retinopathy (DR) screening systems have demonstrated high negative predictive values, their sensitivity varies greatly (8).

AI's potential to alleviate the social burden of diabetes is further enhanced by its convergence with wearable technology and telemedicine. Continuous glycemic level and behavior monitoring is made possible by wearable technology, which also offers real-time feedback that can guide tailored interventions (2). AI in conjunction with social media platforms promotes a cooperative approach to diabetes management by facilitating peer-to-peer support and patient-provider interactions (2). To guarantee their accessibility and scalability, however, extensive clinical trials and cost-effectiveness evaluations are necessary for the broad adoption of these technologies (2). Social determinants of health, systemic barriers to care, and health equity must be given top priority when integrating AI into public health strategies. High-risk populations' ability to manage their diabetes can be improved by trauma-informed care models that take into consideration the psychological and emotional effects of childhood adversity (9). Healthcare professionals can customize interventions to match the specific requirements of various populations by using AI to comprehend and address the larger sociopolitical contexts that influence health behaviors (3).

Diabetes has a significant social impact that necessitates creative, comprehensive solutions. AI holds great promise for improving outcomes, reducing disparities, and lessening the disease's impact on society, making it a significant turning point in the treatment of diabetes. But for AI to be successfully incorporated into diabetes care, a well-rounded strategy that tackles moral, technical, and societal issues is needed. For AI-driven innovations to be fair, available, and efficient, cooperation between patients, healthcare professionals, technologists, and legislators is crucial. The healthcare industry can transform diabetes care and lessen its significant societal burden by embracing AI's potential while maintaining moral principles and putting patient-centered care first (8).

1.1 Fundamental concepts

A revolutionary approach to precision-equitable diabetes care is represented by the convergence of federated learning (FL) and multimodal artificial intelligence (AI), which tackles important issues with data privacy, interoperability, and tailored intervention (10). Diabetes mellitus (DM) is a rising global health burden that necessitates creative ways to reduce inequalities in care outcomes and access, especially for underprivileged groups (10). Traditional centralized AI models face limitations due to fragmented healthcare data and stringent privacy regulations, which hinder cross-institutional collaboration (11). Federated multimodal AI emerges as a robust framework to overcome these barriers by enabling decentralized model training across heterogeneous datasets while preserving patient confidentiality (12). FL facilitates collaborative model development without raw data exchange, aligning with privacy-preserving regulations such as HIPAA and PIPEDA (12). In diabetes care, FL has demonstrated efficacy in predicting complications and optimizing glycemic control using electronic health records from multi-province cohorts (11). Studies leveraging SecureBoost and vertical-FL have shown promise in integrating diverse clinical features across institutions sharing the same patient population but differing in feature spaces (12). This approach mitigates biases inherent in single-center datasets and enhances model generalizability (11). However, challenges persist in scenarios with label scarcity, necessitating semi-supervised FL techniques that combine labeled and unlabeled data (13). Innovations like Federated PU Learning address settings where some institutions possess only positive and unlabeled samples, while others contribute purely unlabeled data, enabling robust risk prediction even with sparse annotations (14). Multimodal AI integrates diverse data modalities—continuous glucose monitoring (CGM), insulin pump logs, activity trackers, and genomic markers—to capture the complex pathophysiology of diabetes (10, 14). Unlike unimodal models, multimodal frameworks leverage cross-modal alignment and co-learning to enhance predictive accuracy and clinical relevance (10, 15). While image-to-text translation models enhance diabetic retinopathy screening, time-series analysis of CGM data in conjunction with NLP-derived dietary logs can forecast glycemic variability (10, 14). By pretraining models on unlabeled multimodal pairs, recent developments in self-supervised learning (SSL) further reduce reliance on labeled data, enabling label-efficient fine-tuning for personalized care. Two major challenges are representation learning and fusion strategies (10, 15).

2 Main methodologies

2.1 Early screening and risk prediction

A revolutionary change in healthcare is represented by the incorporation of AI into risk prediction and early screening frameworks, especially when considering chronic illnesses and intricate clinical outcomes. Clinical decision-making has traditionally relied on traditional scoring systems, such as the Finnish Diabetes Risk Score (FINDRISC) for diabetes risk assessment. Nevertheless, these systems frequently depend on manually selected, static variables and are not flexible enough to adjust to the changing needs of patient data. By utilizing multimodal data sources, such as genetic information, wearable device data, and electronic health records (EHRs), ML-enhanced models have overcome these constraints and produced more precise and customized risk prediction tools (16, 17) (Figure 2).

Figure 2

Flowchart illustrating an AI application predicting the probability of type two diabetes mellitus (T2DM). Multimodal raw data inputs include EHR (diagnosis, treatment, examination), genomics (PRS+GWAS, SNP), and imaging (eyeground, CT, US). The data undergoes data quality control (QC) such as missing value interpolation, normalization, and privacy de-identification. The AI model processes this data, outputting a risk score with confidence intervals and personalized intervention advice shown through graphs and insulin pens.

Workflow of multimodal artificial intelligence for diabetes risk prediction.

2.1.1 Transitioning from traditional scoring systems to ML-enhanced predictive models

The Finnish Diabetes Risk Score (FINDRISC) and other traditional risk-scoring systems have long been used as fundamental instruments for diabetes stratification and prediction. These systems frequently include clinical factors chosen by univariate analysis or expert consensus, and they usually rely on traditional statistical methods that assume linear relationships between variables and outcomes (18). Although these models offer a structured framework for risk assessment, their intrinsic drawbacks, such as their incapacity to capture intricate, nonlinear interactions among predictors, frequently lead to subpar accuracy, especially in patient populations that are heterogeneous (19). Traditional scores are frequently developed on selective cohorts, limiting their generalizability to real-world clinical settings where patient demographics and comorbidities may vary significantly (20).

By addressing these issues through sophisticated computational methods, machine learning (ML) models, on the other hand, offer transformative potential. Neural networks, Random Forest (RF), gradient boosting, and other machine learning algorithms are excellent at spotting complex patterns in high-dimensional datasets, including nonlinear and interactive effects between variables (21, 22). When it comes to predicting diabetes-related outcomes, ensemble techniques like CatBoost have shown better discrimination and calibration than established scores like GRACE or conventional logistic regression (23, 24). These models maximize the informational yield from available datasets by utilizing a wider range of predictors, including clinical, behavioral, and demographic data (25, 26). Notably, ML approaches have been successfully applied to predict diabetic complications and health literacy levels, areas where traditional scores often falter due to oversimplification (27, 28). Although they are useful, traditional risk assessment tools rely on a small number of clinical and demographic variables that frequently lack the dynamic adaptability and granularity needed for personalized medicine (29). In contrast, ML-enhanced models use extensive datasets, such as imaging, clinical, demographic, and even socioeconomic factors, to find intricate patterns and relationships that conventional models might miss (30). ML algorithms like Artificial Neural Networks (ANN), Support Vector Machines (SVM), and K-nearest neighbors (KNN) have shown excellent performance in the context of hip fracture risk prediction, with accuracy levels ranging from 70.26% to 90% and Area Under the Receiver Operating Curve (AUC) values between 0.39 and 0.96 (31). The predictive power of these models is greatly increased by incorporating a variety of variables, such as geometric factors from imaging data and finite element analysis (32).

Beyond traditional risk factors, AI is being used in early screening to identify subtle biomarkers and imaging features that are difficult to detect using traditional techniques. ML models have been created to evaluate multi-target panels from blood tests in colorectal cancer (CRC) screening, with AUC values of up to 0.941 for CRC detection and 0.925 for the classification of colorectal adenoma (CRA) (33).

These models provide a non-invasive, affordable, and scalable approach to early detection, outperforming conventional biomarkers such as CEA and CA199 (34). With an AUC of 0.87 for identifying high-risk prostate cancer (PCa), ML models based on MRI radiomic features have also demonstrated encouraging results in PCa risk stratification, supporting individualized treatment planning. These developments demonstrate how AI can be used to enhance conventional scoring models by increasing their predictive accuracy and integrating a wider variety of data sources.

The capacity of ML-enhanced models to incorporate multimodal data—such as imaging, clinical, and behavioral factors—to produce a thorough risk assessment is one of their main innovations. ML algorithms like Random Forest (RF) and Adaptive Boost Machine (ABM) have been used to predict 90-day functional impairment in stroke risk prediction, with AUC values that are comparable to those of conventional logistic regression models (35). These models provide a more comprehensive evaluation of patient risk by utilizing a wide range of predictors, such as imaging data, clinical history, and lifestyle factors (36). By combining several algorithms to reduce overfitting and increase generalizability, stacking-integrated machine learning (SIML) techniques have further improved these models’ predictive performance (37). With an AUC of 0.877, sensitivity of 81.8%, and specificity of 81.9% in cervical cancer screening, SIML models have greatly outperformed conventional risk assessment instruments (38).

The use of AI in risk assessment and early screening also includes resource allocation and screening interval optimization. AI models like Mirai have been used in breast cancer screening to classify patients into risk groups according to their mammogram data. This allows for customized screening intervals, which lowers the incidence of advanced cancers by about 18 cases per 1,000 screened people (39). By concentrating on high-risk individuals, this strategy not only increases the effectiveness of screening programs but also lessens the strain on healthcare resources (40). Analyzing data from the Short Physical Performance Battery (SPPB), ML models have been developed to predict fall risk. These models have achieved AUC values of 0.78 and offer a scalable solution for identifying older adults who are at high risk of falling (41). These models offer a more precise evaluation of fall risk by combining a variety of predictors, such as handgrip strength, body mass index (BMI), and fall history (42).

Several drawbacks of conventional scoring models, such as their dependence on a small number of predictors and their incapacity to capture intricate relationships between variables, are also addressed by the incorporation of AI and ML in early screening and risk prediction. Conventional stroke risk scores frequently depend on a limited number of clinical predictors, which might not adequately account for the variety of stroke risk factors (43). However, to provide a more thorough evaluation of stroke risk, machine learning models can integrate a variety of predictors, such as imaging data, clinical history, and behavioral factors (44). By continuously adding new data, machine learning models can adjust to shifting risk profiles and gradually increase their predictive accuracy (45).

2.1.2 Multimodal data integration

An innovative strategy for addressing this worldwide health issue is the incorporation of AI into diabetes risk assessment and early screening. AI models can greatly improve the precision and effectiveness of diabetes detection and risk stratification by utilizing multimodal data, such as genetic information, wearable device outputs, and EHRs. The shortcomings of conventional diagnostic techniques, which frequently rely on a single data source and might overlook early signs of disease progression, are addressed by this multimodal approach. AI systems that have been trained on extensive datasets have shown impressive predictive powers. They were able to identify high-risk subgroups for Type 2 Diabetes with an AUC of 0.94 (46). This high accuracy highlights AI's potential to support early detection and preventive measures, which are essential for reducing the serious consequences of diabetes, such as retinopathy, nephropathy, and cardiovascular disease (46).

With its increased risk of T2DM, cardiovascular disease, and poor perinatal outcomes, gestational diabetes mellitus (GDM), a condition marked by glucose intolerance initially identified during pregnancy, poses serious risks to both mother and child (38, 47). Traditional screening techniques, like the oral glucose tolerance test (OGTT), are frequently laborious, time-consuming, and have a limited capacity to predict risk prior to the second trimester, despite the importance of early detection and intervention (43, 47). By combining various data sources, such as clinical, demographic, and multi-omics data, AI-driven models that use ML algorithms present a promising way to accurately and efficiently predict the risk of GDM (41, 48).

While multi-omics techniques offer a thorough understanding of the molecular alterations linked to GDM, ultrasound imaging provides real-time information on fetal and placental development, both of which are essential for early risk assessment (38, 49). The potential of imaging biomarkers in early diagnosis has been highlighted by studies showing that AI models that incorporate placental texture features from ultrasound images can successfully differentiate between pregnant women in good health and those with GDM (38, 49). It has been demonstrated that combining multi-omics data with clinical variables, such as plasma adiponectin levels and other metabolic markers, improves predictive accuracy (47, 50). In addition to improving the accuracy of GDM prediction, these integrated approaches give physicians practical advice for early intervention, such as pharmacological treatments or lifestyle changes, to reduce negative consequences (41, 47).

AI models are especially useful in settings with limited resources because they have shown a remarkable degree of versatility in using readily available clinical variables to predict GDM risk. Traditional risk stratification techniques have been outperformed by non-invasive models that use maternal age, mean arterial blood pressure, prior history of GDM, and ethnicity. These models have demonstrated high predictive performance (AUC: 0.82) (41, 50). These models remove the need for intricate laboratory testing, making them widely applicable in low- and middle-income nations with limited access to cutting-edge diagnostic equipment (41, 43). AI-driven models are useful in a variety of healthcare settings because they have been tuned to achieve high sensitivity and specificity with few input variables (41, 48). A study created 12 distinct machine learning models, four of which, with just 7–12 variables, had a sensitivity of 0.82 and a specificity of 0.72–0.74, indicating the possibility of scalable and effective GDM screening (48).

Beyond risk assessment, the creative application of AI in GDM prediction incorporates long-term health monitoring and tailored interventions. With the use of AI models, patients can be categorized according to their risk profiles, allowing for more focused interventions like medication, exercise regimens, and dietary changes (41). AI has also been used to forecast when GDM will progress to postpartum type 2 diabetes, providing a customized risk score that can direct long-term medical interventions (51). Prenatal fasting glucose levels, gestational age at diagnosis, and insulin therapy during pregnancy have all been found by machine learning algorithms to be significant predictors of postpartum dysglycemia, laying the groundwork for individualized postpartum care (51). These developments demonstrate AI's potential to lessen the long-term burden of diabetes and its associated complications in addition to improving immediate pregnancy outcomes.

Utilizing cutting-edge machine learning methods, like eXtreme Gradient Boosting (XGBoost), to combine various data types is another significant development in this field. When paired with demographic factors, Polygenic Risk Scores (PRS) from genome-wide association studies (GWAS) and Multi-image Risk Scores (MRS) from medical imaging data offer a comprehensive picture of a person's risk for diabetes (46). This integration allows for the identification of subtle patterns and correlations that traditional methods might overlook. The predictive ability of AI models 2 has been further improved by the use of medical imaging methods like electrocardiography (ECG) and abdominal ultrasonography to identify T2D-associated diseases like cardiovascular disease and nonalcoholic fatty liver disease (46). CGM outputs and other wearable device data provide real-time insights into metabolic health, facilitating dynamic risk assessment and individualized intervention strategies (52).

The diversity of diabetes risk factors among various populations is also addressed by the use of AI in diabetes screening. Due to variations in lifestyle, genetics, and environmental factors, traditional risk assessment instruments created for Western populations frequently perform poorly when used with Asian or other ethnic groups (53). However, by training on local datasets, AI models can be customized to target particular populations, increasing their clinical utility and predictive accuracy. When applied to Chinese populations, for example, ensemble machine learning techniques have been demonstrated to perform better than current non-invasive risk score systems, underscoring the significance of developing population-specific models (53). AI is a useful tool for global diabetes prevention initiatives because of its adaptability, especially in areas with inadequate healthcare resources.

AI-driven diabetes screening has the potential to improve patient engagement and adherence to preventive measures, which is another important benefit. Online platforms and other AI-based risk assessment tools offer easily navigable interfaces that enable people to evaluate their risk of diabetes and obtain tailored recommendations (46). With the help of these resources, patients can take the initiative to change their lifestyles, which are essential for preventing diabetes and include better eating and more exercise (52). AI models can help medical professionals spot high-risk patients early, allowing for prompt interventions and lessening the strain on healthcare systems. Clinicians can better monitor patient progress and customize treatment plans by using predictive models that forecast changes in HbA1c levels based on EHR data (52).

These models' clinical applicability is further improved by the application of explainable AI (XAI) techniques, which offer insights into the variables influencing risk predictions. Chest radiographs (CXRs) may be used for improved T2D screening, for example, as XAI has shown correlations between certain adiposity measures and high predictivity (54). In addition to fostering confidence in AI-driven diagnostics, this openness makes it easier to incorporate AI suggestions into clinical procedures. Further demonstrating AI models' scalability and potential for broad use is their capacity to handle massive amounts of data, including more than 270,000 CXRs and 160,000 patient records (54).

Recent advancements show how multi-modal data fusion, which integrates lifestyle, genomic, clinical, and imaging data, can increase prediction accuracy. Nevertheless, there are still challenges in aligning different features while lowering noise and positional shifts (55, 56). By using adaptive region-feature alignment and RoI jittering to address the positional misalignment between modalities, Aligned Area CNN (AR-CNN) increases robustness in object detection tasks (56). Similarly, Frequency-Aware Feature Fusion (FreqFusion) addresses intra-category inconsistency in dense predictions by harmonizing spatial and boundary features using adaptive low- and high-pass filters. This method can be used for imaging analysis associated with diabetes (57). These techniques emphasize the importance of dynamic feature calibration, where spatial and temporal dependencies are modeled to refine feature relevance, as demonstrated by video-based 3D detection systems that propagate scene features across frames (58).

The gap between algorithmic accuracy and clinical interpretability can be reduced by combining statistical models with explainable AI (XAI) frameworks. By tailoring feature selection to specific patient profiles, SHAP value analysis in the iCARE system maximizes diagnostic accuracy for early-stage diabetes prediction by measuring feature contributions (59). Trust-weighted fusion of multi-block features can withstand noise thanks to evidential deep learning (EMFF) and other complementary techniques that quantify uncertainty in adversarial environments. This is an essential capability for diabetes risk stratification using high-dimensional gene expression data (60). Hybrid optimization algorithms demonstrate how statistical optimization and artificial intelligence collaborate to reduce dimensionality by continuously enhancing gene signatures to attain higher classification accuracy (61).

However, for equitable implementation, disparities in data quality between populations must be addressed. The absence of standardized electronic health records (EHRs) in rural and underserved areas exacerbates biases in AI models trained on urban datasets. Large-scale, low-cost retinal imaging is used by solutions like cross-modal self-supervised learning to democratize access, while federated learning frameworks safeguard privacy by decentralizing model training (24, 62). A critical strategy for scalable diabetes screening in resource-constrained settings, ensemble approaches reduce overfitting in unbalanced datasets and achieve 93% accuracy with smaller feature sets (63).

Future directions should prioritize interdisciplinary collaboration to validate fusion techniques across multiple cohorts and standardize multimodal datasets. While integrating AlphaFold-predicted protein structures with EHRs (as in PredGO) may uncover new biomarkers for diabetic nephropathy, real-time edge computing may enable low-latency, contrast-adaptive monitoring of glycemic fluctuations (64, 65). Precision-equitable diabetes care can overcome present constraints by balancing AI's scalability with statistical rigor, providing tailored interventions that are both clinically actionable and widely available.

Explainable AI (XAI) is increasingly recognized as a paradigm-shifting approach in healthcare, particularly for precision-equitable diabetes care. XAI bridges the gap between complex AI-driven predictions and useful clinical insights. By incorporating multimodal data, such as wearable sensor outputs, imaging, and electronic health records (EHRs), XAI enhances the interpretability of federated learning models, which are crucial for collaborative yet privacy-preserving diabetes management across diverse populations (66). Federated learning enables multiple institutions to train a single AI model on various datasets without sharing raw data, addressing disparities in data availability and labeling methods (67). However, clinician trust is often diminished by these models' “black-box” nature. By employing techniques like SHAP (Shapley Additive Explanations) and Local Interpretable Model-Agnostic Explanations (LIME), which elucidate feature importance and decision pathways in glycemic control predictions or diabetic retinopathy screening, XAI tackles this problem (68, 69).

XAI is novel due to its dual capacity to strike a balance between cutting-edge AI and conventional statistical methods like logistic regression or decision trees, which are still essential for risk stratification in diabetes care due to their intrinsic transparency (70). While CNNs and other deep learning models outperform these methods in terms of accuracy, particularly for tasks like retinal image analysis, their opacity makes clinical adoption more challenging (71). XAI addresses this by reducing complicated neural network outputs into human-friendly rules or visualizations. Examples of these include counterfactual explanations that demonstrate how shifting HbA1c levels impact risk predictions or saliency maps that draw attention to microaneurysms in fundus images (72, 73). This synergy is crucial for precision equity because it reduces biases in marginalized populations and ensures that AI tools are both high-performing and accessible to clinicians with varying degrees of technical expertise (71, 74).

One major technological advantage in the treatment of diabetes is XAI's ability to integrate multimodal data fusion techniques, such as REMEDIS, which enhances model robustness by combining representations from multiple data sources. By integrating retinal scans with EHR-derived comorbidities, XAI can demonstrate how diabetic neuropathy detection is enhanced, providing clinicians with a thorough grasp of patient risk (68). XAI's role in federated learning ensures equitable model performance across organizations with diverse data demographics—a critical component of diabetes management globally (67, 74). XAI also makes it easier to comply with legal frameworks like GDPR, which require transparency in automated decision-making, by offering detailed explanations (69, 71).

When clinicians comprehend the model's reasoning, they move from skepticism to trust, which is one of the long-standing problems in diabetes care that XAI can help with (75). Generative counterfactual XAI frameworks can simulate how minor alterations in ECG morphology, a stand-in for diabetic autonomic neuropathy, impact AI predictions in order to conform to clinicians' causal reasoning (73). Similarly, Clinical decision support systems (CDSS) driven by XAI can compare evidence in favor of and against insulin titration, encouraging team decision-making and reducing therapeutic inertia (76, 77). New tools like HistoMapr-Breast demonstrate how XAI can enhance diabetes workflows by pre-identifying diagnostic regions in pathology slides or continuous glucose monitoring trends, despite the fact that they were developed for oncology (68).

XAI is a significant development for precision-equitable diabetes care by demystifying AI outputs, validating federated models across multiple cohorts, and emphasizing the complementary roles of AI and conventional statistics. Future studies should concentrate on improving multimodal data fusion for low-resource settings, standardizing XAI evaluation metrics in clinical settings, and expanding real-world validation of frameworks like LIME-SHAP hybrid models (69, 71). XAI can accelerate the shift from reactive diabetes management to proactive, customized interventions while maintaining equity and trust by placing equal emphasis on interpretability and performance.

Missing values, temporal misalignments, and inconsistent data formats are common problems with multimodal datasets. While digital health records might not include clinical biomarkers, EHRs might not have detailed lifestyle data (78, 79). Robust machine learning models cannot be trained with sparse datasets, so sophisticated imputation methods and FL frameworks are required to reduce bias (80). Scalability is limited by the substantial computational resources required to process high-dimensional data (81, 82). Additionally, privacy concerns arise when integrating sensitive health information across platforms, requiring stringent anonymization protocols (83). Although AI models are very good at identifying patterns, clinical translation is made more difficult by their “black-box” nature. As demonstrated by studies where ML-derived phenotypes were not easily actionable without additional validation, doctors may mistrust predictions that lack mechanistic explanations (79, 84). Efforts to enhance model interpretability—such as attention mechanisms in transformer architectures—are critical for fostering trust (85). External validity is called into question because the majority of multimodal studies concentrate on homogeneous cohorts. In rural areas with different comorbidities and access to healthcare, algorithms trained on urban populations may perform worse (86, 87). Cross-institutional collaborations and diverse dataset curation are essential to address this limitation. Further demonstrating AI models' scalability and potential for broad use is their capacity to handle massive amounts of data, including more than 270,000 CXRs and 160,000 patient records (54).

2.2 Complication surveillance & intervention

With previously unheard-of levels of precision, efficiency, and scalability, the application of AI to the monitoring and treatment of diabetes complications represents a revolutionary approach to managing these chronic conditions. Three crucial areas—diabetic retinopathy, diabetic nephropathy, and diabetic foot complications—are especially affected by AI-driven technologies, which use sophisticated computational techniques to improve early detection, prediction, and intervention techniques.

2.2.1 Diabetic retinopathy

A revolutionary development in medical technology is the application of AI to the monitoring and treatment of diabetic complications, especially DR. Early detection and intervention are essential to preventing irreversible vision loss because DR, a major cause of blindness in the working-age population, is frequently asymptomatic (88, 89). In addition to being time-consuming and resource-intensive, traditional diagnostic techniques that depend on ophthalmologists manually analyzing retinal images also have poor patient adherence rates—less than 50% of diabetics follow recommended yearly eye exams (90, 91). However, AI-driven systems provide a scalable, effective, and precise substitute that can address the rising global burden of DR by introducing specialty-level diagnostics into primary care settings (92, 93).

The creation of autonomous AI diagnostic systems, which have shown impressive diagnostic accuracy, is among the most important developments in this field. An AI system for DR detection, for example, exceeded pre-specified superiority endpoints in a pivotal trial with a sensitivity of 87.2% and specificity of 90.7% (94). This FDA-approved system represents a significant advancement in the use of AI in medicine as it is the first of its kind to be implemented in clinical practice (95). The AI system guarantees accurate disease-level output across a range of patient demographics, including differences in age, race, and ethnicity, by delivering real-time clinical decisions at the point of care along with immediate image quality feedback (96, 97). In environments with limited resources, where access to specialized healthcare is frequently restricted, such capabilities are especially important (98).

Novel machine learning architectures, such the Hierarchical Block Attention (HBA) and HBA-U-Net models, have improved AI-driven systems by honing picture segmentation by concentrating on pixel-level details and spatial correlations (99). These models have demonstrated diagnostic accuracies surpassing 99% in identifying vision-threatening DR (VTDR) when paired with hybrid Convolutional Neural Networks (CNNs) and Support Vector Machines (SVMs) (100). The diagnosis of diabetic macular edema (DME), a serious consequence of DR, is made possible by the combination of fundus imaging with optical coherence tomography (OCT), which further improves diagnostic accuracy (101, 102). These developments highlight AI's capacity to identify DR and track its development, enabling prompt interventions that can greatly enhance patient outcomes (103).

Real-world clinical settings have also validated the use of AI in DR screening. A study with a 92% specificity in Western Australian primary care practices showed that using AI systems for DR detection was feasible (104). To improve diagnostic reliability and decrease false positives, AI algorithms must be further refined. However, issues like low disease incidence rates and poor image quality were noted (105). Notwithstanding these obstacles, AI-driven systems' affordability and scalability make them a viable way to close the global DR screening gap, especially in areas with little access to ophthalmologists (106).

Along with its diagnostic potential, AI has demonstrated promise in tracking the progression of DR by analyzing microaneurysm (MA) turnover, a possible biomarker for DR risk. With high sensitivity and specificity, automated tools have been developed to locate MAs, align retinal images from multiple patient encounters, and estimate MA turnover rates (107). Clinicians can optimize patient care by using this capability to evaluate disease activity and customize treatment plans based on real-time data. By enabling patients to have retinal imaging in primary care settings and receive evaluations remotely, AI-driven systems can support telemedicine-based DR screening, removing logistical obstacles and enhancing screening protocol adherence.

It is impossible to overestimate the economic and societal effects of AI-driven DR monitoring and intervention. In addition to being a major cause of blindness, DR places a heavy financial strain on healthcare systems around the globe. For millions of people with diabetes, early detection and treatment of DR can improve their quality of life, lower healthcare costs, and prevent vision loss. AI-driven systems have the potential to completely transform DR management because they can provide high-efficiency, high-accuracy diagnostics at a reduced cost, especially in low- and middle-income nations where the prevalence of diabetes is disproportionately high.

2.2.2 Diabetic nephropathy

A revolutionary development in the treatment of chronic diseases is the incorporation of AI into the monitoring and treatment of diabetic complications, especially diabetic nephropathy (DN). One of the main causes of end-stage renal disease (ESRD) and a serious complication of T2DM, DN is linked to a high rate of morbidity and mortality (108). The intricate relationship between hyperglycemia, hypertension, and dyslipidemia in DN management calls for a multimodal strategy that AI is uniquely suited to provide (109). A I-driven models that use ML and deep learning (DL) techniques have shown great promise in anticipating the course of a disease, improving treatment plans, and facilitating early intervention to reduce complications (110). The ability of AI to forecast disease progression using time-series data and electronic medical records (EMRs) is one of the most promising uses of AI in DN. Recent research has demonstrated that by examining historical data and spotting minute patterns suggestive of early renal decline, AI models can forecast the onset of DN even before clinical symptoms like microalbuminuria manifest (111). When identifying high-risk patients who might benefit from intensive interventions, like statins and antihypertensive drugs, to postpone or stop the progression to ESRD, this predictive capability is especially helpful (112). By examining biomarkers like serum creatinine levels and the urinary albumin-to-creatinine ratio, AI models have been created to forecast the risk of cardiovascular events, which are strongly associated with DN (113). By averting expensive complications and hospital stays, these models not only improve clinical judgment but also lessen the financial burden related to DN (114). The integration of multimodal AI approaches to concurrently model renal, cardiovascular, and metabolic risks represents a transformative paradigm in precision medicine, addressing the interconnected pathophysiology of these conditions (115, 116).

In order to optimize individualized treatment plans for patients with DN, AI is also essential. Through the integration of data from various sources, such as electronic medical records, laboratory results, and patient-reported outcomes, artificial intelligence algorithms are able to determine the most effective treatment plans customized for each patient's unique profile (117). Based on real-time glucose monitoring and predictive analytics, AI-driven decision support systems can suggest changes to dietary regimens, medication dosages, and lifestyle choices (118). AI-powered mobile health (mHealth) apps further improve this individualized approach by giving patients ongoing feedback on their blood sugar levels, medication compliance, and physical activity, enabling them to actively manage their condition (119).

The early detection and monitoring of DN has also been transformed by the use of AI in imaging and diagnostic technologies. AI algorithms have been added to advanced imaging methods, such as femoral intima-media thickness (IMT) measurements and lower extremity arterial ultrasound, to evaluate the burden of peripheral atherosclerosis and forecast the risk of cardiovascular events (38). By giving doctors a more thorough picture of the patient's overall vascular health, these AI-enhanced imaging tools allow for earlier and more focused interventions (39). In order to increase the precision of DN diagnosis and staging and enable more accurate treatment planning, artificial intelligence has also been used in the analysis of renal biopsies and other histopathological data (41). Fundus photography and optical coherence tomography (OCT), which call for specific tools and skilled workers, are the main methods used by AI algorithms for DR detection (97, 120). Studies reveal that portable smartphone-based cameras achieve lower sensitivity compared to tabletop systems, particularly in low-resource settings where image quality is compromised by inadequate lighting or operator skill (121). Additionally, 28.2% of images in real-world screenings had artifacts that prevented AI from grading them, highlighting the necessity of strong preprocessing and quality control procedures (122).

While AI models like the Hierarchical Block Attention U-Net (HBA-U-Net) achieve 99.18% accuracy in controlled datasets, their performance diminishes in multiethnic cohorts or underrepresented populations (123, 124). AI systems trained on homogeneous datasets exhibit higher false-negative rates for darker retinal pigmentation or atypical lesion patterns, exacerbating health disparities (125, 126). The lack of standardized evaluation frameworks further complicates cross-study comparisons and clinical adoption (127).

The use of multimodal AI techniques to simultaneously model renal, cardiovascular, and metabolic risks is a groundbreaking paradigm in precision medicine that tackles the interconnected pathophysiology of these disorders (115, 116). Metabolic disorders such as type 2 diabetes (T2D), cardiovascular diseases (CVD), and chronic kidney disease (CKD) often coexist because they share risk factors such as hypertension, dyslipidemia, and systemic inflammation (128, 129). Traditional unimodal predictive models, which analyze clinical, imaging, or biomarker data independently, have limited prognostic accuracy and clinical utility due to their inability to capture the synergistic interactions between these systems (130, 131). The Kidney Failure Risk Equation (KFRE) ignores cardiovascular comorbidities that exacerbate the course of chronic kidney disease (CKD), despite using biochemical factors to predict renal outcomes (55). CVD risk stratification tools often ignore renal dysfunction, a critical component of cardiovascular mortality (128, 129). Multimodal AI frameworks overcome these limitations by integrating multiple data modalities, including wearable-derived metrics, omics (proteomics, metabolomics), imaging (retinal fundus photography, coronary CT angiography), and electronic health records (EMR), to generate comprehensive risk profiles (130, 132).

Because it can carry out cross-modal representation learning and dynamic feature fusion, multimodal AI is technologically innovative and can uncover latent biological patterns that single-modality models miss. Deep learning architectures like ResNeSt and Transformer-based models have demonstrated remarkable performance when integrating renal ultrasound images, spectral waveforms, and clinical data to diagnose renal artery stenosis (RAS), with accuracies exceeding 80% (133). By detecting microvascular changes indicative of systemic endothelial dysfunction, the combination of non-invasive clinical risk factors and retinal fundus photography improved the prediction of CVD (132). These developments improve feature representativeness even with small multimodal datasets by utilizing semi-supervised autoencoders and transfer learning strategies (134). Notably, multimodal models also address data imbalance problems, which are common in metabolic and renal cohorts, by employing ensemble learning and synthetic minority oversampling (SMOTE) to improve minority-class prediction (134, 135).

The clinical imperative for multimodal AI is highlighted by the need for early risk stratification and customized interventions. T2D patients with concurrent CKD and CVD exhibit distinct proteomic and metabolomic signatures that more accurately predict adverse outcomes than HbA1c alone (134, 136). Multimodal frameworks like MAIGGT, which combine clinical phenotypes and histopathological features, have achieved AUROCs of 0.92 for BRCA1/2 mutation screening, revealing microenvironmental markers linked to metabolic dysregulation (137). These models enable targeted therapies, such as SGLT2 inhibitors, which concurrently lower metabolic, cardiovascular, and renal risks (129, 136). Interpretability tools like SHAP values make modality-specific contributions clear, encouraging collaborative decision-making and fostering clinician trust (132, 137).

The results for patients with DN have been further enhanced by the incorporation of AI into multidisciplinary care models. AI-driven technologies can be used by multidisciplinary teams that include endocrinologists, nephrologists, dietitians, and diabetes educators to expedite care coordination and guarantee that patients receive prompt and efficient interventions (44). In order to lower the risk of complications and increase adherence to care protocols, I-powered platforms can automate follow-up appointment scheduling, monitor patient progress, and notify clinicians of deviations from treatment goals (43). Moreover, it has been demonstrated that AI-enabled patient empowerment devices, like blood pressure monitors, glucometers, and smart tablets, improve glycemic control and self-management practices in high-risk patients (126).

2.2.3 Diabetic foot

The creation of sensor-based remote patient monitoring (RPM) systems is one of the most exciting uses of AI in diabetic foot care. These systems use multimodal sensing technologies, like smart insoles, to continuously monitor temperature, activity levels, and plantar pressure (138, 139). These gadgets can offer patient-facing biofeedback by incorporating AI algorithms, warning users of persistently elevated plantar pressures or temperature imbalances that might be signs of pre-ulcerative conditions. In addition to empowering patients to actively offload, this proactive approach allows medical professionals to take early action, lowering the risk of ulceration and its aftereffects (140, 141). RPM systems support glycemic control, cardiovascular health, and general disease management by being in line with integrative foot care guidelines (142).

AI-driven image recognition technologies are transforming the evaluation and categorization of diabetic foot wounds in addition to sensor-based monitoring. Conventional wound assessment techniques, like the PEDIS index, mainly depend on clinicians' qualitative evaluations, which can be erratic and subjective (143). AI-powered technologies can identify, locate, and measure ulcers with high accuracy (up to 90%) by analyzing wound images using CNNs and deep learning algorithms (144). In addition to improving diagnostic accuracy, these systems help multidisciplinary care teams communicate with one another, which makes treatment planning and resource allocation more efficient. Patients can self-monitor their DFUs at home with AI-based wound monitoring tools, like the smartphone app “MyFootCare,” which offers insightful data on the course of healing and the effects of self-care routines (138). Despite the potential of these tools, issues like usability, data privacy, and the requirement for more extensive validation studies still exist (38).

The creation of customized therapeutic exercise regimens is another cutting-edge use of AI in diabetic foot care. In people with DPN, foot-ankle exercises have been demonstrated to increase muscle strength, range of motion, and gait biomechanics, lowering the risk of ulcers and improving functional outcomes (39, 41). AI-powered platforms, like the Sistema de Orientação ao Pé Diabético (SOPeD), provide personalized workout plans based on each person's physical capabilities and the severity of their condition. These programs, which can be accessed through mobile or web applications, use gamification techniques and real-time feedback to encourage self-management and adherence (43). The potential of these interventions as supplemental treatments in primary and secondary care settings has been highlighted by studies showing that they can result in notable improvements in foot pain, function, and kinematic outcomes (44).

AI is also being incorporated into decision support and predictive analytics for diabetic foot care. AI models can predict healing trajectories, optimize treatment protocols, and identify risk factors for DFU development by analyzing large datasets (145). In order to categorize people according to their risk of complications and suggest focused interventions, machine learning algorithms can evaluate patient data, including demographic, clinical, and imaging information (146). In addition to improving clinical decision-making, these predictive tools facilitate the application of evidence-based recommendations, like those offered by the International Working Group on the Diabetic Foot (IWGDF) (147). However, issues with data quality, algorithm transparency, and ethical considerations must be resolved for these systems to be deployed successfully (148) (Figure 3).

Figure 3

Diagram of a smart health monitoring system using a smart insole sensor for pressure, temperature, and gait data. It connects wirelessly to Edge AI, sending data via 5G or WiFi to a cloud platform for learning aggregation and model optimization. Real-time risk monitoring and patient alerts are accessed through an API, enabling remote diagnosis and treatment via a doctor's interface.

AI-driven closed-loop diabetic foot monitoring.

2.3 Real-Time monitoring & decision support

2.3.1 CGM time-series forecasting

A revolutionary method of managing diabetes, especially in the area of time-series glucose prediction, is the combination of AI with wearable technology and CGM devices. In order to optimize insulin dosage and avoid adverse glycemic events, real-time, personalized glucose forecasting is made possible by the combination of AI and CGM data. Because of their exceptional ability to handle sequential data, Transformer architectures and Long Short-Term Memory (LSTM) networks have attracted a lot of attention among the different AI models used for this purpose. LSTM networks—a specialized type of Recurrent Neural Networks (RNNs)—have shown impressive effectiveness in glucose prediction tasks. With a root mean square error (RMSE) reduction from 14.55 to 10.23 mg/dL when compared to standard stacked LSTM approaches in T1D cohorts, research has shown that LSTM models are more successful than traditional machine learning algorithms and even human specialists at predicting blood glucose levels over short time horizons (Shen and Kleinberg, 2025). However, the quality of the data greatly affects this benefit: In real-world use cases where signal integrity is compromised, false hypoglycemia/hyperglycemia alert rates can double due to CGM sensor noise increasing LSTM prediction error by up to 50%. Furthermore, cross-population validation reveals that when applied to older T2D patients with comorbid renal impairment, the majority of LSTM studies rely on data from young adults with T1D. This is explained by their ability to simulate the dynamic and non-linear character of glucose-insulin interactions, which are impacted by elements like insulin administration, food consumption, and physical activity (149). LSTM networks are appropriate for real-world applications where sensor errors and delays are common because they have been demonstrated to be resilient to noise in CGM data (150). A study introduces RL-DITR, a model-based reinforcement-learning framework that optimizes customized insulin dosage for in-patients with type 2 diabetes while also learning patient glucose dynamics (151) (Figure 4). Extensive validation across retrospective, prospective and proof-of-concept trials showed superior accuracy and safety compared with junior/intermediate physicians and near-parity with senior specialists. Seamless integration into the clinical workflow yielded rapid, safe glycemic improvement without increasing hypoglycemia, establishing RL-DITR as a practical, scalable AI decision-support tool for inpatient diabetes care (151).

Figure 4

Flowchart illustrating the construction and application of an AI-based insulin system. Left section depicts the construction process: electronic medical records inform a diabetes digital model, leading to an insulin decision system. Right section shows application: datasets are used in retrospective and prospective studies for doctor-AI interaction, real-time blood sugar monitoring, and clinical intervention, resulting in improved satisfaction.

AI-Driven Insulin Titration for T2DM.

Conversely, time-series prediction tasks, such as glucose forecasting, are increasingly being investigated using Transformer models, which have transformed natural language processing. Transformers can process long-range dependencies more efficiently than LSTMs because they use self-attention mechanisms to capture relationships between various time steps in the sequence. In glucose prediction, this feature enables better capture of long-cycle patterns, as demonstrated by the CGMformer models (152). In a mixed T1D/T2D cohort (n = 427), the LSTM model by Shao et al. was specifically validated for 30-min hypoglycemia prediction with >94% AUC across Chinese and European-American populations, highlighting complementary strengths in short-term accuracy vs. long-range dependency modeling (152). However, Transformers have some significant practical drawbacks in wearable applications: their self-attention mechanism raises dynamic energy by at least the same factor while increasing computational complexity quadratically with sequence length (153). Transformers have demonstrated potential in other fields, but their use in glucose prediction is still in its infancy, and there aren't many studies comparing them to LSTM models. However, early research indicates that Transformers might perform better in situations where complex, multi-scale temporal patterns need to be modeled (154). Transformers may be able to more easily incorporate extra contextual data than LSTMs, such as meal schedules and physical activity, improving prediction accuracy (154).

A number of issues with conventional glucose monitoring techniques are also resolved by integrating AI with CGM systems. The delay between actual plasma glucose levels and interstitial glucose measurements is one of the main problems, which can cause inaccurate real-time predictions. Rate-of-change metrics, variability indices, and other time-based features obtained from CGM data can be incorporated into AI models, especially those using sophisticated feature engineering techniques, to help alleviate this problem (155). Even without additional physiological metrics, these features allow for more precise and timely predictions by capturing the dynamism of glucose fluctuations (155). An important development in this area has been the creation of customized prediction models that are adapted to the glucose dynamics of specific patients. AI models can take into consideration distinct physiological reactions by utilizing patient-specific data, which increases prediction accuracy and dependability (156).

The models' computational efficiency is a crucial component of integrating AI with CGM systems, especially when used on wearable technology. Despite their effectiveness, LSTM networks can be computationally demanding, which presents problems for real-time applications on devices with limited resources. In order to address this problem, efforts to optimize these models—such as by using model compression techniques or lowering the number of parameters—have shown promise (156). Transformers are feasible for wearable applications because they have been modified for edge devices using methods like quantization and pruning, despite their computational complexity (154). One important step toward the development of closed-loop insulin delivery systems, sometimes known as the “artificial pancreas,” is the capability of wearable technology to make precise glucose predictions (119).

Beyond predicting blood sugar levels, AI is also used in CGM systems to detect hypoglycemic and hyperglycemic episodes, which are vital for avoiding serious complications in the treatment of diabetes. Research has shown that AI models, especially deep learning-based models, can accurately classify blood glucose levels into hypo-, normo-, and hyperglycemic ranges (155). While LSTM models perform well over longer prediction horizons, like 1 h, logistic regression models have demonstrated exceptional performance in predicting hypoglycemia within a 15-min horizon (155). These features are crucial for giving patients and medical professionals timely alerts so that proactive measures to preserve glycemic control can be taken (155).

AI and CGM-derived glycemic variability (GV) metrics provide a novel approach to enhancing risk assessment and enabling early, customized interventions in prediabetes—a critical window for preventing the onset of type 2 diabetes (157). CGM technology records dynamic glucose fluctuations and offers granular insights, in contrast to traditional metrics like HbA1c, which often overlook transient dysglycemia in prediabetic individuals (158, 159). AI-driven analytics can be used to systematically evaluate multidimensional GV parameters, such as coefficient of variation (%CV), time in range (TIR), time above/below range (TAR/TBR), and postprandial glucose responses (PPGR), to identify high-risk subgroups (160). Even in well-managed populations, studies show that elevated %CV (>33%) correlates with hypoglycemia risk and suboptimal glycemic control, and post-breakfast glucose excursions are early indicators of dysglycemia progression in prediabetes (161). These metrics are combined with clinical variables like insulin resistance and β-cell function by AI models like supervised learning and unsupervised clustering to stratify patients into various “glucotypes” and increase predictive accuracy (158, 162). The integration of CGM-derived glycemic variability (GV) metrics with AI represents a transformative approach to refining risk stratification and enabling early, personalized interventions in prediabetes—a critical window for preventing progression to T2DM (157).

The automated processing of CGM data via AI pipelines addresses the primary shortcomings of manual analysis, enabling scalable, real-time risk assessment. Machine learning algorithms have successfully predicted nocturnal hypoglycemia using state-of-the-art metrics such as the gradient variability coefficient (GVC), which simultaneously measures GV amplitude and hypoglycemia frequency (161). AI-powered meal detection algorithms improve postprandial glucose measurement accuracy by reducing reliance on self-reported logs, which are prone to error (163). These advancements particularly affect marginalized populations, where disparities in diabetes technology access exacerbate health inequalities (162). Notably, AI-enhanced CGM analytics have identified metabolomic signatures associated with GV, suggesting potential biomarkers for early intervention (164). Because longitudinal studies link GV to cardiometabolic outcomes like atherosclerosis and cardiovascular events in prediabetes, its prognostic value goes beyond glycemic control (158, 165).

The combination of CGM and AI may improve personalized prediabetes prevention strategies. Dynamic risk models that include TIR and PPGR could direct tailored lifestyle or medication interventions, such as treating post-breakfast hyperglycemia with dietary modifications (162, 166). Furthermore, there is mounting evidence that short-term GV metrics can be used to forecast long-term complications, offering a useful alternative to HbA1c in resource-constrained settings (159, 167). However, there are still problems, like validating AI models in large-scale trials and standardizing GV thresholds across different populations (160, 168). In order to understand the mechanistic connections between GV and T2DM pathogenesis and eventually enable precision medicine in the management of prediabetes, future research should give priority to integrated multi-omics approaches (158, 164). Clinicians can shift from reactive to proactive care by utilizing AI-driven CGM analytics, reducing the worldwide burden of type 2 diabetes in its early stages.

AI-processed CGV metrics represent a paradigm shift in the management of prediabetes by providing unparalleled granularity in risk assessment and intervention customization. From automated glucotype classification to metabolomic biomarker discovery, these developments bridge the gap between data-rich CGM outputs and useful clinical insights. As technology advances, integrating AI with real-time CGM data will be crucial to achieving early, equitable, and customized diabetes prevention (162, 166).

The integration of AI with CGM systems still faces a number of obstacles, despite the notable progress. The generalizability of AI models across diverse patient populations remains a critical challenge in diabetes management, particularly when addressing the distinct pathophysiological profiles of type 1 diabetes (T1D) and type 2 diabetes (T2D). Long short-term memory (LSTM) networks have shown strong performance in cross-population validation studies, making them a promising tool for hypoglycemia prediction. Their capacity to identify temporal dependencies in continuous glucose monitoring (CGM) data is a significant innovation. For mild hypoglycemia prediction in Chinese cohorts, they achieved area under the curve (AUC) values exceeding 97%, with only a slight 3% decrease when applied to European-American populations (152). This performance outperforms conventional machine learning techniques like random forests (RF) and support vector machines (SVM), which frequently have trouble with false alarm rates—a major obstacle to clinical adoption (152). The LSTM's superiority stems from its capacity to model complex glucose dynamics while maintaining high specificity (reducing false alarms by up to 42% compared to SVM/RF) across both T1D and T2D subgroups (152).

However, resolving the intrinsic constraints in training data diversity is essential to these models' translational potential. When used in global populations or for the management of type 2 diabetes, the majority of early hypoglycemia prediction algorithms may be biased because they were created using small, homogeneous cohorts of Western T1D patients (152). Due to variables that may not be sufficiently represented in training datasets, such as younger age, lower BMI, and higher use of premixed insulin formulations, Southeast Asian T2D patients have distinct risk profiles (152). Although continued validation in underrepresented populations is still crucial, the LSTM's demonstrated generalizability in a validation cohort of 427 patients suggests its adaptability to these heterogeneities (152).

The clinical utility of LSTMs is further improved by technological developments in model interpretability. To verify whether LSTM-based predictions are consistent with physiological principles—a crucial prerequisite for patient safety—tools such as SHapley Additive exPlanations (SHAP) have been used (169). Two LSTM variants with similar accuracy showed significant differences in clinical reliability in one study: only the physiologically consistent model (p-LSTM) correctly learned the inverse relationship between insulin administration and glucose levels, as confirmed by SHAP analysis (169). This emphasizes how crucial it is to incorporate explainability frameworks to guarantee that models generalize across populations both statistically and pathophysiologically (169).

Additional avenues for enhancing generalizability are provided by the combination of ensemble techniques and hybrid architectures. To overcome the drawbacks of single-model approaches, bidirectional LSTM layers combined with transformer encoders, for instance, can simultaneously capture short-term glucose fluctuations and long-term trend (170). Personalized glucose forecasting using such hybrid systems has shown promise, especially when metabolomic or immune response data are included to account for inter-individual variability in T1D progression (170). Similarly, using feature selection algorithms like GALGO, which cut input variables by 50% without compromising accuracy, sex-specific ensemble models for T2D prediction achieved AUCs of 0.96–0.98. This approach could be modified for population-specific tuning (63).

Despite these developments, there are still significant gaps. There is little research on how AI-driven insights translate into customized treatment modifications; instead, the majority of studies concentrate on glycemic prediction rather than therapeutic outcomes (171). Biases in model performance can be sustained by differences in dataset representation, such as the UK Biobank's underrepresentation of female and non-White participants (172). The most predictive factor for the risk of hypertension in T2D patients was sex assigned at birth, although its relative significance differed greatly between datasets with different demographic balances (172). Addressing these disparities requires not only larger, more diverse cohorts but also algorithmic innovations like federated learning to pool data across institutions while preserving privacy (172).

Three areas should be given priority in future directions: multi-omic integration (e.g., metabolomic markers for insulin resistance in T2Dor TCR sequencing for autoimmune risk stratification in T1D) (173). Additionally, real-world hybrid model validation in low-resource environments where CGM adoption is increasing, and standardization of outcome measures to facilitate comparisons between different studies (173). The success of zimislecel, an allogeneic stem cell-derived treatment that restored endogenous insulin production in T1D patients, illustrates the potential for combining AI with biomarker-driven interventions to customize diabetes care (174). Achieving equitable clinical impact will depend critically on AI models' ability to generalize across the whole range of diabetes subtypes, ethnicities, and comorbidities as the field advances toward precision medicine.

Although cross-population generalizability for diabetes management has advanced significantly with LSTM-based models, their ultimate usefulness will depend on thorough validation in practical contexts, improved interpretability frameworks, and proactive mitigation of dataset biases. The combination of wearable technology, immunotherapeutic advancements, and deep learning provides a roadmap for creating truly patient-centric solutions that cut across phenotypic and geographic barriers (175, 176). The models' ability to adjust to abrupt changes in glucose dynamics, such as those brought on by illness or medication changes, is also called into question by their reliance on historical CGM data for training (156).

A paradigm shift in diabetes care is being brought about by the combination of AI with wearable and CGM devices, which presents previously unheard-of possibilities for individualized, real-time glucose prediction. At the forefront of this change are Transformer models and LSTM networks, which have special advantages in managing sequential data. Transformers have the potential to capture intricate, long-range dependencies and may even outperform LSTMs in specific situations, even though LSTM models have proven their effectiveness in short-term glucose prediction. Realizing the full potential of AI in diabetes care will require further development of AI models as well as increases in generalizability and computational efficiency. These technologies have the potential to significantly contribute to the creation of closed-loop insulin delivery systems, which will ultimately enhance the lives of diabetics.

2.3.2 Real-Time decision support

A revolutionary development in the treatment of diabetes is the combination of wearable technology and CGM systems, especially when it comes to real-time decision support for hypoglycemia alerts and meal recommendations. People with diabetes can now make well-informed decisions regarding their insulin dosage, food choices, and physical activity levels thanks to this innovation, which uses the continuous, high-frequency data produced by CGM devices to provide actionable insights. These systems’ real-time functionality fills important gaps in conventional self-monitoring of blood glucose (SMBG) techniques, which frequently offer patchy and delayed data, raising the possibility of hypoglycemic episodes and less-than-ideal glycemic control (4, 177).

The ability to anticipate and prevent hypoglycemia through predictive low glucose alerts is one of the biggest technological benefits of combining CGM with wearables. Predictive low glucose alerts have been shown in studies to dramatically improve patient safety by reducing the frequency and duration of hypoglycemic episodes by up to 59% (178). Because they offer early warnings that enable prompt interventions, like carbohydrate consumption or insulin adjustments, these alerts are especially helpful for people with impaired hypoglycemia awareness or those who are at high risk of experiencing severe hypoglycemia (178, 179). By integrating these alerts with wearable technology, users can minimize disruptions to their routines by receiving notifications in real-time, even while they are involved in daily activities (177).

Apart from preventing hypoglycemia, the incorporation of CGM with wearable technology enables customized meal suggestions, which are essential for maximizing postprandial glucose regulation. When planning food intake, sophisticated algorithms, like those used by the DiaCompanion system, forecast postprandial blood glucose levels using real-time CGM data and offer customized dietary recommendations (180). This method lessens the burden of manual glucose monitoring and dietary planning, which can be especially difficult for people with T1D or GDM. It also helps people avoid hyperglycemic spikes (181). Integrating these suggestions with automated insulin dosing systems has the potential to enable more aggressive insulin adjustments upon meal detection, which can then be confirmed by positive CGM rate-of-change trends. This could lead to fully closed-loop insulin delivery (182).

Numerous studies have validated the practical use of these technologies and shown notable improvements in glycemic outcomes, such as HbA1c level reductions, longer time in range (TIR), and shorter hypoglycemic duration (181). In comparison to intermittently scanned CGM (isCGM), the use of real-time CGM with alert functionality has been demonstrated to increase TIR by 6.85 percentage points, underscoring the additional advantages of real-time data integration and decision support (179). Additionally, combining CGM data with other health indicators like electrodermal activity and heart rate variability improves the precision of hypoglycemia forecasts and offers a more comprehensive approach to diabetes care (183).

The effectiveness of RT-CGM in enhancing glycemic control in pregnant women with T1D and GDM has been shown by randomized controlled trials (RCTs). The CONCEPTT trial demonstrated how CGM can lower maternal hyperglycemia and improve neonatal outcomes, such as a decreased risk of neonatal intensive care unit (NICU) admissions and large-for-gestational-age (LGA) infants (184). Recent studies have explored the integration of AI with RT-CGM, expanding on these findings to provide real-time dietary interventions. One hour after meals, the DiaCompanion I smartphone app forecasts blood glucose levels using AI based on user-provided meal data. Through real-time meal modification, women with GDM can maintain glucose levels within the recommended range, improving glycemic control and reducing the need for frequent clinic visits (180). Because it tackles the dynamic and unique nature of glucose regulation during pregnancy, the use of AI in this context is especially novel. Complex CGM data can be analyzed by AI algorithms to spot trends and forecast glucose excursions, providing personalized dietary advice that takes into consideration variables like insulin sensitivity, physical activity, and meal composition (180). In addition to improving adherence to dietary interventions, this individualized approach gives patients the confidence to actively manage their condition. Additionally, by reducing the need for frequent in-person consultations, the integration of AI with RT-CGM can ease the strain on healthcare systems, which is particularly advantageous in settings with limited resources (185). The capacity of AI-integrated CGM systems to offer real-time feedback and decision support is one of their main technological advantages. AI systems can notify users when hyperglycemia or hypoglycemia is about to occur, enabling prompt actions like insulin administration or dietary changes. During pregnancy, when even brief episodes of hyperglycemia can have serious effects on fetal development, this preventative strategy is especially beneficial (180, 186). Additionally, even when patients are not physically present in the clinic, AI-driven CGM systems can enable remote monitoring, allowing medical professionals to track patients' blood sugar levels in real-time and take appropriate action (187). Additionally, there is potential for increasing the precision and dependability of glucose monitoring through the integration of AI with CGM. It has been demonstrated that the Dexcom G6 CGM system gives pregnant women accurate glucose readings across a number of sensor wear sites, with a mean absolute relative difference (MARD) of 10.3% when compared to reference values (184). These systems can further improve accuracy by correcting for sensor errors and supplying more trustworthy data for clinical decision-making when paired with AI algorithms. This is especially crucial during pregnancy, when careful glucose management is necessary to reduce the chance of negative consequences (184). The broad use of AI-integrated CGM systems still faces obstacles in spite of these developments. These include creating user-friendly interfaces, integrating these technologies into current healthcare workflows, and requiring strong clinical validation. More research is required to assess the long-term advantages and cost-effectiveness of these systems, especially in healthcare settings with a variety of patient populations (180). An important advancement in the control of blood sugar levels during pregnancy is the combination of AI with wearable and continuous glucose monitoring devices. These systems have the potential to enhance glycemic control, lower the risk of negative outcomes, and enable patients to actively manage their condition by offering real-time, customized dietary interventions.

2.3.3 Telemedicine

In environments with limited resources, like India and Africa, the combination of wearable and CGM technologies with telemedicine has become a game-changer for managing diabetes. Because of the severe infrastructure, cost, and accessibility issues with healthcare in these areas, cutting-edge approaches like telemedicine and CGM are essential to enhancing patient outcomes. In order to close gaps in care delivery, mobile health (mHealth) technologies have become essential in India due to the sharp increase in non-communicable diseases (NCDs), especially diabetes. Utilizing India's vast mobile communications network, mobile health platforms have demonstrated promise in risk assessment, early diabetes detection, and establishing connections between primary care doctors and community providers (188, 189). The viability and acceptance of such approaches in rural and remote areas are demonstrated by initiatives such as the Jodhpur School of Public Health's community-based strategy, which uses mobile health technology to survey households and triage people at high risk for diabetes (4, 189). HealthCubed provides a comprehensive solution for point-of-care testing with its integrated devices, which measure various physiological and biochemical parameters related to cardiovascular diseases. With cloud-based data storage and Bluetooth control, these gadgets are especially useful in rural and semi-urban areas with limited access to diagnostic labs (190, 191). By incorporating these technologies into telemedicine frameworks, diabetes management in marginalized populations is improved through real-time monitoring, tailored feedback, and prompt interventions.

The use of telemedicine and CGM has demonstrated promise in filling important gaps in diabetes care in Africa, where healthcare systems are frequently overburdened by conflicting health priorities. In low-resource settings, continuous glucose monitoring—which has historically been limited to high-income nations due to cost—could be extremely important, especially for monitoring serious conditions like pediatric hypoglycemia in hospitalized children (192). A study conducted in Malawi examined the appropriateness and acceptability of CGM among individuals with T1D at first-level hospitals, emphasizing its potential as a part of individualized diabetes care in settings with limited resources (193). Although the necessity of health education sessions to support its adoption was underlined, participants and healthcare providers reported that the use of CGM was appropriate and acceptable (193). By facilitating remote monitoring and real-time data sharing with healthcare providers, the integration of CGM with telemedicine platforms may further increase its usefulness and remove geographical barriers to care. To guarantee widespread adoption, however, issues like high temperatures, humidity, and restricted access to dependable electricity that can impair CGM sensor performance must be resolved (193).

Real-time data collection, remote monitoring, and customized interventions are just a few of the technological benefits that come with the combination of CGM and telemedicine. For people with type 2 diabetes, the Onduo Virtual Diabetes Clinic (VDC) offers comprehensive care through a mobile app, connected devices, remote lifestyle coaching, and live video consultations with endocrinologists. The viability of remotely prescribing and delivering CGM devices without in-office training was highlighted by the VDC model's notable improvements in HbA1c levels and patient satisfaction (181, 194). By incorporating smartphone apps and wearable fitness trackers into diabetes self-management education and support (DSMES) programs, behavioral goal monitoring has been made easier, allowing teachers to more efficiently monitor patients' adherence to dietary and exercise regimens (188, 195). These innovations highlight the potential of connected health interfaces to enhance patient engagement and improve clinical outcomes.

3 Representative models and performance comparison

3.1 AI comparison and assessment of model performance, interpretability, and external validity

With notable improvements in accuracy, efficiency, and scalability, the use of AI in risk prediction and early diabetes screening has become a game-changing strategy in healthcare. When it comes to diabetes risk prediction, undiagnosed case identification, and high-risk population stratification, AI-based models—especially those that use ML algorithms—have outperformed conventional statistical techniques (53, 196). These models produce predictive insights without the need for expensive laboratory testing by utilizing a variety of readily available and non-invasive clinical features, including age, gender, BMI, family history of diabetes, and lifestyle factors (51, 197). When it comes to detecting undiagnosed T2D, ensemble approaches that combine several risk score systems have demonstrated improved predictive accuracy, surpassing individual risk score systems (53). The robustness of gradient boosting machines (GBM) and random forest (RF) algorithms in diabetes risk prediction has been demonstrated by their high AUC values (198, 199). AI is now a feasible tool for widespread diabetes screening since its incorporation into primary care and community settings has further decreased detection costs and increased cost-effectiveness (200).

The capacity of AI models to capture intricate, non-linear relationships between risk factors—which conventional statistical approaches frequently oversimplify—is one of the major advancements in this field (51, 198). Novel risk factors that were not previously taken into account by traditional risk prediction models, such as a preference for sweet flavors and urine glucose levels, have been discovered by machine learning algorithms (198). More precise and nuanced risk stratification is made possible by this capability, especially in diverse populations with a range of genetic, environmental, and lifestyle traits (53, 196). The distinct epidemiological characteristics of particular populations, like rural Chinese cohorts, where conventional Western-based risk scores perform poorly, have been addressed by AI models (53, 198). These models' increased predictive accuracy through the use of population-specific data emphasizes the significance of regional risk assessment instruments (53, 196).

Numerous datasets and environments have been used to thoroughly validate the effectiveness of AI-based diabetes prediction models. Using data from the Korean National Health and Nutrition Examination Survey (KNHANES), studies comparing ML-based models with conventional statistical methods discovered that ML models consistently produced higher AUC values, ranging from 0.788 to 0.819, than their statistical counterparts (196). Strong predictive performance was shown by models created with Henan Rural Cohort Study data; depending on whether laboratory data was included, AUC values ranged from 0.767 to 0.872 (198). These results highlight AI's potential to improve early diabetes detection and enable prompt interventions, especially in settings with limited resources and restricted access to cutting-edge diagnostic tools (197, 200).

Through methods like SHapley Additive exPlanations (SHAP) and decision tree analyses, AI models provide interpretability and transparency in addition to their predictive power. These techniques help clinicians make well-informed decisions and customize interventions for particular patient profiles by shedding light on the relative significance of individual risk factors (201). The most important predictors of diabetes, according to decision tree analyses, are age, family history of the disease, and BMI. These findings inform focused community interventions and public health initiatives (197). Proactive diabetes management and prevention have been made possible by the combination of AI with wearable technology and EHR, which has allowed for real-time monitoring and customized risk assessment (38, 52).

3.2 Comparison of methodological limitations

A revolutionary but challenging area of healthcare innovation is the combination of AI-based models and traditional statistical methods in precision-equitable diabetes care. While AI-driven methods like deep learning (DL) and machine learning (ML) have proven remarkably effective in tasks like diabetic retinopathy (DR) screening—achieving sensitivities and specificities exceeding 85%—their reliance on large-scale, high-quality datasets creates issues with data standardization, privacy, and equitable access (128). In settings with limited resources, disparities in data annotation quality across regions and the computational requirements of convolutional neural networks (CNNs) may make inequality worse (8). On the other hand, traditional statistical techniques like logistic regression or LASSO provide interpretability and transparency, which are essential for clinical trust. However, they frequently fail to handle the nonlinear, high-dimensional interactions present in diabetes complications like diabetic nephropathy (DN) or cardiovascular disease (CVD) (202). The hybrid use of XAI frameworks, such as XAI4Diabetes, attempts to bridge this gap by combining the predictive power of algorithms like XGBoost (which achieved perfect accuracy in DM classification) with interpretable decision pathways, enabling clinicians to validate model outputs and address biases (203). While traditional methods run the risk of oversimplifying intricate metabolic interactions, AI models trained on heterogeneous datasets may unintentionally perpetuate disparities if they are not calibrated for demographic variability (24, 204). Innovations like federated learning and synthetic data generation could mitigate these limitations by enabling collaborative model training without centralized data sharing, thus preserving privacy and enhancing generalizability (205). Furthermore, the ethical imperative of equitable care demands rigorous validation of AI tools across diverse populations, as seen in studies highlighting the variable performance of DR screening algorithms (206). Ultimately, the synergy of AI and statistical methods must prioritize not only precision but also inclusivity, ensuring that advancements in diabetes care—from early detection of diabetic foot ulcers (DFU) to personalized risk stratification for diabetic ketoacidosis (DKA)—are accessible across socioeconomic and geographic divides (207). Future research should focus on harmonizing algorithmic transparency with clinical utility, leveraging multimodal data integration, and addressing the “last-mile” challenges of implementation in low-resource settings (208).

3.3 Comparison of subgroup applicability for different populations

The applicability of AI-based models vs. conventional statistical techniques in diabetes care across diverse population groups represents a critical area of investigation, given the growing global burden of diabetes and the need for personalized, scalable solutions. AI-driven approaches, including machine learning (ML) and deep learning (DL), have demonstrated superior performance in diabetes classification, risk stratification, and complication prediction compared to traditional methods like logistic regression or Chi-square tests (204). XGBoost achieved perfect accuracy (1.0), sensitivity, and specificity in classifying diabetes using sociodemographic and clinical data, outperforming conventional statistical models such as Student's t-test (204). However, the generalizability of these AI models across heterogeneous populations—particularly underrepresented groups—remains a challenge due to biases in training datasets, which often overrepresent Western cohorts (62). This disparity is exacerbated in low-resource settings, where limited data availability and infrastructural constraints hinder the deployment of AI tools, despite their potential to mitigate healthcare inequities (202).

Conventional statistical techniques, while less computationally intensive, offer advantages in explainability and interpretability, which are crucial for clinical adoption. Fuzzy cognitive maps (FCMs), for example, enable scenario-based simulations to elucidate variable relationships in diabetes risk prediction, a feature often lacking in “black-box” AI models (204). Such transparency is vital for addressing ethical concerns, including algorithmic bias and data privacy, which disproportionately affect marginalized populations (209). In contrast, AI's strength lies in its ability to process high-dimensional data and identify non-linear patterns imperceptible to traditional methods (209). AI algorithms analyzing breath samples for hypoglycemia detection achieved non-invasive monitoring, a breakthrough unattainable with conventional statistics (209). Yet, these innovations must be validated across diverse demographics; studies note that AI performance degrades in subgroups with differing disease prevalence (210).

The integration of AI with telehealth and wearables further underscores its transformative potential for population-specific care. Mobile apps leveraging AI to tailor dietary recommendations for diabetic patients have shown promise in improving glycemic control, particularly in urban, tech-literate cohorts. However, rural or elderly populations may face barriers due to digital literacy gaps or limited access to wearable devices (202). Conversely, conventional methods like risk scores remain widely applicable in low-tech settings but lack the dynamic adaptability of AI to individual behavioral or physiological changes (204).

Ethical and practical challenges persist in both paradigms. AI requires large, annotated datasets for training, raising concerns about data privacy and consent, especially in vulnerable groups (62). Retrospective studies, though scalable, often fail to capture real-world diversity, while prospective trials are constrained by sample size (211). Conventional techniques, though simpler, struggle with complex interactions (212). Future research must prioritize inclusive dataset curation, hybrid models combining AI's predictive power with statistical explainability, and rigorous subgroup analyses to ensure equitable applicability. Policymakers should incentivize collaborations between AI developers and local healthcare providers to address context-specific barriers, from algorithmic bias to resource limitations (8, 202).

While AI-based models excel in precision and scalability for diabetes care, their applicability varies significantly across populations due to data biases and infrastructural disparities. Conventional statistical methods remain indispensable for interpretability and broad accessibility but are limited in handling complex, real-time data. A synergistic approach—leveraging AI's analytical prowess where feasible and statistical robustness where necessary—could optimize diabetes management globally, provided it is grounded in ethical frameworks and tailored to local needs (202).

3.4 Comparison of model updating and iteration capability

The integration of artificial intelligence (AI) and machine learning (ML) models into diabetes care has introduced transformative capabilities in predictive accuracy, personalized intervention, and dynamic model updating, surpassing the limitations of conventional statistical techniques. Unlike traditional methods such as logistic regression, which rely on static assumptions and linear relationships, AI/ML models excel in capturing nonlinear interactions and adapting to evolving clinical data, thereby enhancing their prognostic and diagnostic utility. Random forest (RF) and k-nearest neighbors (KNN) algorithms have demonstrated superior performance in predicting cardiometabolic risk and HbA1c control, outperforming conventional approaches by accounting for complex feature interactions and temporal data shifts. This adaptability is particularly critical in diabetes management, where patient-specific factors and evolving clinical guidelines necessitate continuous model refinement to maintain accuracy.

A key innovation of AI-based models lies in their iterative updating mechanisms, which address the “calibration drift” inherent in dynamic healthcare environments. Conventional statistical models often degrade over time due to shifts in population demographics, clinical practices, or biomarker distributions 26. In contrast, AI/ML frameworks employ proactive strategies such as real-time recalibration, incremental learning, and batch updates to mitigate performance decay. Reinforcement learning (RL) has been leveraged to optimize anesthetic dosing in diabetes-related surgeries, dynamically adjusting to patient responses and operational stimuli (213). Similarly, ensemble methods like gradient-boosted trees and long short-term memory (LSTM) networks enable longitudinal prediction of diabetes progression by integrating time-series data from electronic health records (EHRs) and wearable devices (214). These capabilities are further enhanced by IoT-enabled real-time monitoring, which provides continuous data streams for model retraining—a feature absent in conventional techniques (203).

The transparency and interpretability of AI models also contribute to their clinical adoption. Feature importance analysis in RF and neural networks elucidates the impact of variables such as exercise habits, dietary patterns, and medication timing on glycemic control, facilitating shared decision-making between clinicians and patients (52). This contrasts with the “black-box” perception of early AI systems and aligns with the need for accountability in healthcare. However, challenges persist, including the requirement for large, diverse training datasets to ensure generalizability and the ethical considerations of data privacy (215). Policy initiatives must therefore prioritize standardized protocols for model validation, interoperability, and practitioner training to bridge the gap between algorithmic potential and real-world implementation (216).

AI/ML models represent a paradigm shift in diabetes care by combining predictive precision with adaptive iteration, outperforming conventional statistical methods in scalability and responsiveness. Prospective studies to confirm long-term efficacy across diverse populations and hybrid approaches that combine domain-specific knowledge with AI-driven insights should be the main focus of future research (214). By addressing these frontiers, AI can solidify its role as a cornerstone of precision medicine in diabetes management.

3.5 Technical differences and performance trade-offs of diabetes-related models

The creation and use of various predictive and decision-support models, each with unique methodological advantages and disadvantages, has greatly influenced the field of diabetes care. Machine learning (ML)-based predictive systems and hybrid care pathways are two examples of how knowledge-driven and data-driven approaches diverge critically. Knowledge-driven models, like the Clinical Decision Support (CDS) system, incorporate static clinical guidelines and use established risk equations from frameworks like the UKPDS-OM2 to simulate long-term outcomes and cost-effectiveness (217). In contrast, data-driven ML models, including XGBoost and Random Forest, prioritize feature engineering and ensemble learning to optimize predictive accuracy for diabetes-related complications, achieving AUCs >0.75 through Monte Carlo Cross-Validation (218). The hybrid care pathway model, which integrates both paradigms, demonstrates a 13.12% probability of cost-effectiveness at a €45,000/QALY threshold, though its performance hinges on variables like HbA1c reduction and complication costs (219).

Technological disparities further underscore model divergences. While the CDS system relies on deterministic simulations in Excel® to project complications and mortality, ML models dynamically adapt to EHR-derived features, such as age-diagnostic code pairs, to identify high-impact predictors for complications like amputations (218). Notably, ML models exhibit superior robustness in handling imbalanced datasets through stacking architectures and SHAP-based interpretability, enabling granular feature impact analysis (220). However, knowledge-driven models maintain an edge in clinical translatability by aligning with ADA guidelines, whereas ML models face challenges in generalizability due to sample size limitations and “human bias” in real-world prescription patterns (221).

Trade-offs between scalability and precision are revealed by performance comparisons. Better short-term glycemic control (OR 1.73) and lower long-term complication risks were associated with the hybrid model of the SingHealth Diabetes Registry's 43%–59% concordance with doctor prescriptions for antiglycemic and lipid-lowering medications (221). Conversely, although their retrospective design restricts real-time applicability, unsupervised anomaly detection models stratify high-utilization T2DM patients by identifying outliers with 7.94 annual hospitalizations (compared to 3.12 in typical cohorts). An additional level of complexity is introduced by managed care evaluations: When compared to Fee-for-Service enrollees, Medicare Advantage (MA) beneficiaries show higher rates of preventive care but worse intermediate outcomes (such as elevated HbA1c) and lower adoption of GLP-1RAs/SGLT2is, underscoring systemic disparities in evidence-based therapy access (222).

This is demonstrated by the SynthA1c encoder model, which combines clinical data and image-derived phenotypes (IDPs) to predict diabetes risk with an 87.6% sensitivity—a paradigm shift from conventional lab-based classifiers (223). Similarly, feature selection (FS) techniques in ML models reduce dimensionality without sacrificing accuracy, peaking at 150 features for amputation prediction (218). Future directions must address scalability, equity, and hybrid model optimization through prospective validation (222). Ultimately, the convergence of interpretable ML and guideline-driven logic promises to redefine personalized diabetes care, balancing predictive power with clinical feasibility.

The divergence between representative models reflects a broader tension between clinical tradition and computational innovation. While knowledge-driven systems anchor decisions in established guidelines, data-driven approaches uncover latent patterns in heterogeneous datasets. Their comparative performance underscores the need for context-aware deployment—whether prioritizing cost-effectiveness, precision, or equity (218). In order to ensure that models like SynthA1c or CDS can easily integrate into changing care frameworks while addressing biases in data and delivery, future research must prioritize interoperability (223).

3.6 Conditional effectiveness of diabetes care models

The effectiveness of diabetes care models hinges on their ability to address patient heterogeneity, systemic barriers, and contextual factors, which often determine their success or failure. Shared Medical Appointments (SMAs), for instance, demonstrate success in improving glycemic control and self-management among diverse populations, particularly when they incorporate culturally tailored interventions and address social determinants of health (SDOH) (224). For African American populations, SMAs that integrate family support, community networks, and culturally sensitive education show higher engagement and satisfaction, bridging gaps left by traditional care models (224). However, their implementation often falters in practices lacking key implementation champions, stable staffing, or additional resources—conditions identified as necessary for SMA success through Qualitative Comparative Analysis (QCA) (225). Without these enablers, logistical challenges such as workflow disruptions and patient recruitment barriers undermine scalability (225).

Precision Medicine (PM) and Simulation Modeling offer another lens into conditional success. Dynamic simulation models excel in evaluating PM interventions by capturing patient-specific pathways and heterogeneity, which static models like Markov chains cannot (226). These models succeed when applied to complex decision cascades, such as personalized treatment selection in type 2 diabetes (T2DM), where they outperform traditional cost-effectiveness analyses by integrating real-world variability (227). However, their failure modes emerge in settings with insufficient data granularity or when assumptions about rational decision thresholds (e.g., in Decision Curve Analysis) misalign with clinician behavior (227). While PM models can optimize drug choices for relapsing-remitting multiple sclerosis, their clinical utility diminishes if patient preferences or safety profiles are inaccurately weighted (227).

Agent-Based Models (ABMs) further highlight the role of context in diabetes care innovation. ABMs simulate social determinants like dietary habits and peer influence, proving valuable for obesity-related T2DM prevention (228). Their strength lies in modeling emergent behaviors, but they often fail to translate into policy due to lack of stakeholder collaboration or oversimplified representations of socioeconomic barriers (228). The absence of ABMs in real-world policy implementation underscores the gap between theoretical potential and practical adoption (228).

Machine Learning (ML) and Predictive Modeling reveal similar conditional dynamics. ML models identify multifactorial predictors of Diabetes Self-Management Education (DSME) engagement, such as socioeconomic status and comorbidities, enabling targeted interventions (229). Yet, their success is limited by data biases (e.g., underrepresentation of minority groups) and algorithmic opacity, which may perpetuate disparities if not addressed (229). While ML can stratify Medicare beneficiaries by DSME participation risk, its recommendations may overlook cultural preferences for triadic decision-making (patient-family-clinician) observed in Black communities (230).

Integrated Care Models, such as the Chronic Care Model (CCM), succeed when they emphasize patient-centeredness and community support, as seen in high patient satisfaction scores for family-involved care (231). However, they fail in fragmented health systems where provider coordination is weak or where SDOH (e.g., transportation, income) are unaddressed (232). The legacy effect of early glycemic control in pediatric diabetes further illustrates this: models ignoring metabolic memory or socioeconomic drivers underestimate long-term complications, leading to flawed economic evaluations (233).

The performance of diabetes care models is deeply contextual. SMAs thrive with champions and resources; PM and simulation models require robust data and alignment with clinical realities; ABMs need stakeholder co-production; and ML must mitigate biases. Future efforts should prioritize adaptive implementation frameworks—like PRISM or QCA—to tailor models to local conditions (225), while advancing equity-focused validation in underrepresented groups (224). Only by addressing these conditional determinants can innovations transition from theoretical promise to real-world impact.

4 Unresolved challenges

The potential of AI applications in diabetes care is often limited by disparities in data formats, coding standards, and privacy concerns, which frequently impede smooth data exchange and interoperability (234).

The heterogeneity of health data, which includes multimodal data from wearables, unstructured PGHD, and structured EHRs, is one of the main obstacles. Although AI tools like ML algorithms have demonstrated promise in examining these datasets to identify connections between genetic predispositions, lifestyle factors, and disease outcomes, a major obstacle is the absence of standardized terminology and interoperability frameworks (235). The accuracy of AI models may be jeopardized by problems with data redundancy, incompleteness, and time validation that must be resolved in order to integrate PGHD into EHRs (234). In addition, the lack of widely recognized guidelines for data interchange and representation, like Fast Healthcare Interoperability Resources (FHIR), makes incorporating AI solutions into current healthcare processes even more challenging (236).

Concerns about security and privacy pose a significant obstacle to the use of AI in diabetes treatment. Because health data is sensitive, strict measures must be taken to protect patient confidentiality. However, integrating AI frequently requires using third-party platforms and cloud-based architectures, which may not always adhere to legal frameworks (235). This undermines confidence in AI-driven healthcare solutions by creating vulnerabilities that could expose patient data to cyber threats or unauthorized access. To guarantee fair and efficient treatment for every patient, the ethical ramifications of AI, such as algorithmic bias and the possibility of an excessive dependence on automated systems, must also be addressed (8).

Addressing the issues of data standardization and interoperability requires coordinated efforts in order to fully realize AI's potential in diabetes care. This entails creating standardized data protocols, using open-source data processing platforms like Hadoop and Spark, and putting strong privacy-preserving measures in place (235). Establishing frameworks that enable the safe and effective integration of AI into clinical practice requires cooperation between stakeholders, including legislators, technology developers, and healthcare providers. More research is required to verify how well AI models function in practical situations and investigate new uses, like AI-powered decision support systems and the results of robotic surgery in diabetic patients (209).

Although the use of AI in diabetes treatment has the potential to revolutionize the field, there are many obstacles to overcome, especially with regard to algorithmic bias and ethical governance. AI tools, such as machine learning and deep learning, have shown impressive potential in improving patient outcomes, treatment strategy optimization, and diagnostic accuracy in the management of diabetes (7, 8). AI-driven solutions have demonstrated promise in creating individualized treatment plans and automating ongoing glucose monitoring, enabling patients to make knowledgeable decisions about their care (237). These systems' effectiveness, however, depends on the caliber and representativeness of the training data, which frequently introduces biases that can disproportionately impact marginalized communities (238, 239).

Incomplete or skewed datasets, a lack of diversity in the development teams, and the intrinsic limitations of the algorithms themselves are some of the causes of algorithmic bias in AI systems (238, 239). With sensitivities ranging from 50.98% to 85.90%, a multicenter validation study of AI-based DR screening systems showed notable performance disparities, underscoring the variation in algorithmic efficacy across various population (8). Biases like these can result in unfair healthcare outcomes, especially for underrepresented groups that may already have trouble getting high-quality care (239). Furthermore, a strong ethical framework is required for the incorporation of AI into clinical workflows in order to address issues with patient autonomy, transparency, and data protection (237, 240).

Fairness, accountability, and openness must be given top priority in ethical governance for AI technologies used in diabetes care. Although they offer a fundamental framework, current legal frameworks frequently overlook the complex ethical conundrums raised by AI technologies (238). AI algorithms may perform inconsistently in various healthcare settings due to a lack of standardized evaluation procedures, which could erode public confidence in these systems (241). AI-generated recommendations’ ethical ramifications also need to be carefully examined, especially in light of how they may affect patient autonomy and the doctor-patient relationship (242). Healthcare professionals must make sure that AI-based interventions support patient-centered care rather than obstruct it, and patients must maintain the freedom to accept or reject them (242).

A multifaceted strategy that takes into account technical, ethical, and regulatory aspects is needed to lessen these difficulties. To guarantee consistent performance across various populations, technical efforts should be focused on enhancing data quality and representativeness, creating algorithms that are resistant to biases, and putting strict validation procedures in place (8, 238). Developing inclusive design principles that put the needs of underserved communities first and establishing precise rules for the ethical application of AI in healthcare are two examples of ethical considerations (239, 242). The special difficulties presented by AI technologies, such as the requirement for third-party assessments, openness in algorithmic decision-making, and systems for ongoing observation and enhancement, require regulatory frameworks to change (238, 241).

A paradigm shift in healthcare delivery is also required for the integration of AI into diabetes care, with an emphasis on patient empowerment and interdisciplinary collaboration. In order to create AI solutions that are not only technically sound but also morally and socially responsible, healthcare providers, technologists, and legislators must collaborate (238). It is imperative that patients actively participate in the creation and application of these technologies, guaranteeing that their preferences and values be upheld (242). In order to give patients and healthcare professionals the information and abilities they need to successfully navigate this quickly changing environment, educational programs should be put in place to increase awareness of the possible advantages and hazards of AI in diabetes care (238).

While AI-driven tools, such as automated insulin delivery (AID) systems and clinical decision support systems (AI-CDSS), have demonstrated efficacy in glycemic control and risk stratification (7, 243), their equitable deployment remains hindered by data standardization issues, algorithmic bias, and heterogeneous clinical validation. AI-based DR screening systems exhibit wide variability in sensitivity (50.98%–85.90%) and negative predictive values (82.72%–93.69%) across multicenter studies, raising concerns about generalizability to diverse populations (244). Similarly, the reliance on large, high-quality datasets for training AI models exacerbates disparities in low-resource settings, where data collection infrastructure is often inadequate (245).

A critical unresolved controversy lies in the trade-off between algorithmic performance and interpretability. Deep learning models, though highly accurate, often function as “black boxes,” limiting clinician trust and adherence to AI-generated recommendations. This is particularly problematic in diabetes care, where treatment decisions require transparency to align with patient-specific comorbidities and socioeconomic contexts. While AI-enabled risk stratification tools can identify high-risk patients using variables like waist circumference and BMI, their inability to account for nuanced social determinants of health may perpetuate inequities (246). Furthermore, the lack of standardized regulatory frameworks for AI in healthcare complicates validation and adoption, as evidenced by the reluctance of providers to embrace digital twins (DTs) for personalized treatment simulations due to concerns about data privacy and algorithmic accountability (62).

Methodological challenges also arise from biased training datasets that underrepresent marginalized populations, leading to skewed predictions. AI models trained predominantly on data from high-income countries may fail to generalize to low-resource settings, where diabetes prevalence and complications diverge significantly (247). Unsupervised learning techniques have been proposed to mitigate this by identifying neglected subpopulations (248), but their clinical utility remains unproven. Additionally, conflicting evidence on AI's comparative efficacy complicates its integration: while some studies suggest AI outperforms clinicians in tasks like DR diagnosis, others argue human expertise remains superior for complex cases requiring contextual judgment (244).

Ethical and operational barriers further impede progress. Patient privacy concerns, particularly with wearable technologies that continuously monitor sensitive health data, pose risks of misuse or breaches (247). The high cost of AI-enabled devices exacerbates inequities, as seen in China, where diabetes management is already hampered by disparities in healthcare access (247). Moreover, clinician resistance rooted in inadequate AI literacy and workflow disruptions underscores the need for tailored training programs and sociotechnical integration strategies (246).

Future research must prioritize four key areas: (1) developing FL frameworks to enhance data diversity while preserving privacy (245); (2) advancing XAI techniques to bridge the interpretability gap; (3) establishing equity-focused validation protocols to ensure algorithmic fairness across demographics; and (4) fostering interdisciplinary collaboration among clinicians, data scientists, and policymakers to address sociotechnical barriers. The promise of AI in diabetes care is undeniable—from non-invasive hypoglycemia detection to personalized nutrition apps—but its equitable realization demands rigorous attention to these open problems. Without systemic solutions, AI risks exacerbating rather than alleviating the global diabetes burden (247).

When incorporating federated learning (FL) architectures into AI-based multimodal data streams for precision-equitable diabetes care, the suitability of cross-silo vs. cross-device FL paradigms must be carefully taken into account. Cross-silo FL, which is typified by fewer resource-rich clients, is particularly appropriate for multimodal diabetes care, where various data types—from genomic data and electronic health records (EHRs) to continuous glucose monitoring (CGM) signals and retinal images—require strong computational resources for local model training (14, 249). In keeping with the requirement for privacy-preserving cooperation among healthcare institutions, this architecture guarantees that sensitive patient data remains within institutional silos and permits the aggregation of model updates to improve global predictive performance (14, 249). Cross-silo FL has demonstrated efficacy in federated multi-input models for Alzheimer's disease classification using MRI and tabular data from several hospitals. For diabetic complications like DR or nephropathy, this framework could be adjusted (249). Cross-device FL, designed for scenarios involving thousands of edge devices, is less suitable for multimodal diabetes data due to limitations on bandwidth, processing power, and data heterogeneity. However, by combining lightweight models trained on spectrogram images of CGM signals, it could be helpful for real-time glucose monitoring using wearable sensors, where hypoglycemic events could be identified without endangering personal privacy (250, 251).

Which of these architectures is optimal depends on the trade-offs between clinical utility, resource availability, and data granularity. Cross-silo FL excels in situations requiring high-dimensional data fusion, such as integrating retinal fundus images with EHRs for DR risk stratification, when organizations like tertiary care centers can contribute curated datasets (249). This approach reduces the non-IID (non-independent and identically distributed) data issue that is commonly seen in the healthcare sector by allowing organizations to standardize local data distributions before aggregation (249). The inherent variation in sensor quality and sampling rates among devices makes cross-device FL challenging to manage multimodal data streams despite its scalability (251). However, the volatility of deep learning models like MLPs in cross-device settings, as shown by performance degradation in non-IID scenarios, highlights the need for trustworthy aggregation algorithms appropriate for multimodal data (252).

Innovative FL architectures must also address the integration of conventional statistical techniques, such as multistate models or logistic regression, with AI-driven approaches in order to enhance interpretability and clinical trust. Explainable AI (XAI) frameworks such as SHAP (Shapley Additive Explanations) or LIME (Local Interpretable Model-agnostic Explanations) can bridge this gap by providing clinicians with clear decision-making pathways (62, 253). XAI4Diabetes and DeepNetX2 have demonstrated how interpretable models can elucidate the role of multimodal features in diabetes progression predictions to promote clinician adoption (62, 253). Additionally, federated analytics, a companion to FL, can help harmonize data across silos by combining summary statistics without exchanging raw data, enabling organizations to align disparate data streams for global model training (254, 255). Precision-equitable care depends on ensuring that underrepresented populations benefit from federated insights without increasing biases (253).

Socioeconomic and regulatory factors also have an impact on the choice of FL architecture. FL complies with GDPR and HIPAA through inter-silo Federated learning for DR screening, in which institutions such as the UK Biobank and ADNI (Alzheimer's Disease Neuroimaging Initiative) collaborate to train models without exchanging data (62). Blockchain-secured aggregation, as proposed in decentralized FL (DFL), guarantees model verifiability and lowers single-point-of-failure risks by incorporating an auditable layer into cross-silo workflows (254). On the other hand, in low-resource environments, where wearable-based monitoring prioritizes energy-efficient algorithms like BiLSTM or lightweight CNNs, cross-device FL must manage hardware limitations (251, 256). For DDoS attack detection in IoMT networks, the suggested semi-decentralized FL model—a framework adaptable to diabetes care—offers a compromise by balancing scalability and privacy preservation by clustering IoT devices to minimize communication overhead (256).

The issue of non-independent and identically distributed (non-IID) data in federated learning (FL) is critical in healthcare applications, where real-world datasets display inherent heterogeneity due to variations in patient demographics, clinical practices, and regional healthcare policies (257, 258). This heterogeneity undermines the fundamental tenet of FL, which is that local data distributions are representative of the global population. This leads to unfair client outcomes, poor generalization performance, and biased model convergence (259, 260). Non-IID data in multicenter medical studies may arise from variations in diagnostic criteria, imaging protocols, or disease prevalence, as shown in federated models for detecting delayed cerebral ischemia (DCI), where performance varied significantly across hospitals due to divergent feature distributions and clinical definitions (257). Multilingual FL for depression detection also highlighted the challenges of non-IID data when clients have monolingual datasets with different label distributions, exacerbating imbalances and distorting global model predictions (261).

To solve these problems, creative algorithmic and architectural solutions have been created. One method that lessens feature bias is to separate representation learning from classification using personalized FL frameworks, such as Fed-RepPer. Fed-RepPer uses supervised contrastive loss for local representation learning and combines these into a global model to enable client-specific classifiers and achieve strong performance on non-IID data (262). Another innovation that lessens intra-cluster heterogeneity is clustered FL, which groups clients with similar data distributions. Federated subpopulation models trained on severe DCI cases improved performance for sites with homogeneous severe cohorts, demonstrating the value of targeted aggregation (257). Theoretical developments like weighted local Rademacher complexity, which provide sharper excess risk bounds for non-IID FL, allow frameworks like FedALRC to outperform FedAvg and FedProx by optimizing convergence rates without additional communication overhead (259).

Other strategies to lessen non-IID effects include privacy-preserving enhancements and adaptive aggregation techniques. By employing iterative layer-wise aggregation to address the mismatch between local and global batch normalization parameters—a significant contributor to gradient deviation in non-IID settings—FedTAN achieves strong performance across a range of data distributions (260). Homomorphic encryption and differential privacy (DP) are further integrated to protect sensitive healthcare data while controlling heterogeneity; however, DP's noise addition may inadvertently exacerbate non-IID issues by masking local data patterns (258, 263). Metadata-driven techniques like MetaFedCBT, which employ domain statistics to generate representative brain connectivity templates from decentralized, non-IID neuroimaging data, enhance the global model's centrality and generalizability (264).

In FL, non-IID data has broader implications for fairness and scalability. Label distribution skew and data scarcity can disproportionately disadvantage clients with rare conditions or small datasets, as shown in FL for electronic health records (EHRs), where PEARL's self-supervised learning and fine-tuning scheme improved fairness by tailoring global models to local EHR distributions (263). Communication-efficient methods, such as low-precision decentralized training with Range-EvoNorm, further optimize resource use while preserving accuracy under non-IID conditions and reduce compute energy (265). Fundamental trade-offs still exist, though: while personalized models and clustered FL improve performance, they may also increase complexity or jeopardize global model cohesion (257, 262).

5 Future directions

5.1 Large medical models

The use of AI in diabetes treatment has the potential to completely transform the field thanks to developments in DT technology, multimodal large models, and human-machine co-governance. These developments address the rising global burden of diabetes and its related complications by offering revolutionary potential in real-time monitoring, personalized diabetes management, and predictive analytics (10, 266). This change is being led by multimodal large models, which combine various data sources like clinical, omics, and lifestyle data. Clinicians can now predict the course of diseases and customize interventions with previously unheard-of precision thanks to these models' ability to synthesize complex datasets using machine learning algorithms (214, 267). AI-powered predictive models have proven effective in predicting the course of diabetic kidney disease (DKD), identifying patients at risk for hemodialysis with 71% accuracy (114). This feature highlights how AI can improve early intervention techniques, which will ultimately lower healthcare costs and improve patient outcomes (8).

The use of AI in diabetes treatment has the potential to revolutionize patient management and healthcare delivery, especially when applied to multimodal large models. The accuracy, effectiveness, and personalization of diabetes care could be greatly improved by these models, which incorporate a variety of data types, including clinical, molecular, and lifestyle data (128, 214). Multimodal large models can handle the intricate, nonlinear relationships present in diabetes pathophysiology by utilizing cutting-edge AI techniques like deep learning and ensemble methods. This provides predictive insights that go beyond conventional linear and logistic regression approaches (8, 114). Studies have demonstrated neural networks can accurately predict the onset and complications of diabetes, demonstrating their generalizability across a wide range of patient populations (52, 268). Early intervention and individualized treatment plans are made possible by the creation of dynamic models that capture temporal patterns and trends through the integration of time-series data, such as CGM and EMR (119, 209).

The use of multimodal large models to integrate heterogeneous data sources, such as wearable technology, imaging, and genomic data, is among the most important developments in this field. This all-encompassing strategy makes it possible to fully comprehend each person's risk factors and the course of the disease, allowing for customized interventions that address cardiovascular and metabolic comorbidities (10, 269). The risk of DKD progression has been evaluated using AI-driven predictive models, which have a 71% accuracy rate in forecasting the start of hemodialysis over a ten-year period (38). AI has demonstrated promise in identifying high-risk individuals and optimizing preventive measures in cardiovascular risk assessment for patients with T2D (39, 41). These developments highlight how multimodal large models can fill gaps in existing therapeutic and diagnostic strategies, providing a more patient-centered and sophisticated framework for diabetes care.

The capacity to process and analyze large datasets with high computational efficiency, revealing hidden patterns and correlations that may be missed by conventional methods, is the technological advantage of multimodal large models. For example, by combining logistic regression analysis and convolutional autoencoders, 3,073 features have been extracted from EMR data, making it possible to predict DKD aggravation with exceptional accuracy (43). Additionally, the use of ensemble models—like those optimized with the Adam algorithm—has shown superior predictive performance, with diabetes onset prediction achieving a ROC AUC value of 0.934 (44). In addition to improving diagnostic precision, these models also improve calibration, making predictions that can be understood and used in clinical settings (146). By incorporating sophisticated neural network architectures, like LSTM models, longitudinal data can be analyzed to gain insights into how diseases progress and how well treatments work over time (145).

Multimodal large models are useful for population-level health management in addition to individual patient care. These models can lessen the financial burden of diabetes and increase the effectiveness of healthcare delivery by identifying high-risk cohorts and allocating resources optimally (147, 148). High negative predictive values have been shown by AI-based DR screening systems, allowing for early detection and intervention in environments with limited resources (51). Patients are empowered to actively participate in their disease management through real-time monitoring and feedback made possible by the integration of AI with wearable technology and telehealth. The emergence of AI-driven personalized mobile healthcare platforms that use DT technology to model patient-specific outcomes and inform treatment choices is an example of this paradigm shift towards digital health and precision medicine.

5.2 Digital twin

A virtual depiction of a patient's condition called a DT is another essential component of AI-enabled diabetes treatment. DTs make it easier to continuously monitor, simulate, and optimize treatment plans by building a dynamic, real-time recreation of a patient's physiological state (267). AI-driven recommendations on insulin dosage and lifestyle changes are made possible by integrating insulin pump data, continuous glucose monitoring, and lifestyle variables into a DT framework (10). In addition to empowering people to actively manage their health, this patient-centric approach allows healthcare providers to provide individualized care at scale (266).

The creation of patient-centric DT frameworks based on personal health knowledge graphs (PHKGs) is one of the major advancements in this field. These frameworks combine information from various sources, such as lifestyle data, insulin pump data, and continuous glucose monitoring, to produce a dynamic and thorough depiction of a patient's health (266). Over time, PHKGs improve the precision and accuracy of health insights by facilitating the smooth integration of new patient data. This flexibility is essential for managing diabetes because lifestyle, environmental, and genetic factors can greatly affect how each person reacts to treatment (270). Healthcare professionals can provide focused interventions and preventive measures by using DTs that use AI algorithms to model the course of diseases, forecast treatment outcomes, and perform individualized risk assessments (271).

DTs have proven to be useful in the treatment of diabetes, according to closed-loop trials. To address one of the most difficult aspects of managing diabetes, for example, recent research has used DTs to predict blood glucose levels and optimize insulin dosage in real-time (270). DTs can offer practical insights that enhance glycemic control and lower the risk of complications, according to these trials. Furthermore, DTs have been used to present health information and provide tailored lifestyle advice, enabling patients to take an active role in their treatment (266). DTs' capabilities are further enhanced by the integration of wearable sensors and Internet of Things (IoT) devices, which allow for remote consultations and continuous monitoring—both of which are especially helpful in the management of chronic conditions like diabetes (272).

DTs are also being used to optimize clinical trials and drug development in the treatment of diabetes. DTs expedite the drug discovery process, lower costs, and hasten the release of new treatments by generating virtual patient populations and mimicking treatment outcomes (38). This strategy is especially pertinent to diabetes, where the creation of individualized therapies is crucial to addressing the disease's heterogeneity (39). Additionally, DTs make it easier to find biomarkers and indicators of how a disease will progress, allowing for early detection and treatment (41).

DTs create dynamic, patient-specific models by integrating multimodal data, such as CGM, insulin pump metrics, dietary intake, and activity logs, into virtual replicas of patients (266). By utilizing AI and ML to forecast glycemic fluctuations, optimize insulin dosage, and tailor lifestyle recommendations in real time, these models tackle the interindividual variability that undermines traditional “one-size-fits-all” approaches (273, 274). In addition to improving insulin resistance markers and β-cell function, DT platforms have demonstrated significant reductions in HbA1c (−1.8%) and anti-diabetic medication use (−1.5 medications per patient) over the course of a year (274). The bidirectional feedback loop of DTs enables them to continuously align their simulations with patient outcomes while also assimilating real-world data, thereby enabling adaptive precision medicine (10).

The ability of DTs to operationalize personal health knowledge graphs (PHKGs), which organize diverse data into useful insights, is a crucial benefit (266). PHKGs increase equity by ensuring that disadvantaged groups receive targeted, context-aware interventions, such as considering food insecurity or cultural dietary preferences in glycemic management (273). In terms of incremental glucose control, algorithms such as PNP outperform static dietary guidelines by 16.73% when using gut microbiome data to predict postprandial glucose responses (PPGRs) (273). Additionally, by incorporating telehealth, DTs close gaps in underserved or rural communities through remote monitoring and nudges (274). However, there are still ethical conundrums, such as risks to data privacy and algorithmic biases that could exacerbate inequality if training datasets lack diversity (270). Robust governance frameworks are essential to ensure DTs adhere to HL7 standards for interoperability while safeguarding patient autonomy (275).

Beyond clinical care, DTs have translational potential in the fields of public health and drug development. By accelerating clinical trials, simulating virtual patient cohorts, and predicting therapeutic efficacy and adverse events before actual testing, DTs lower costs and ethical burdens (276). Comparable DT frameworks have enabled respiratory medicine to forecast the disease's progression, indicating comparable uses for diabetes complications (277). As demonstrated by Human Digital Twin (HDT) prototypes for elderly T2D patients, which decreased hypoglycemic events by 45% through adaptive MPC-based insulin delivery, future directions include hybrid modeling (combining physics-based and data-driven approaches) to improve predictive accuracy for comorbid conditions (278). Scalability is still a challenge, requiring federated learning and cloud-based architectures to deploy DTs across healthcare systems without sacrificing data security (279, 280).

5.3 Human-machine co-governance

The concept of human-machine co-governance, which stresses cooperation between AI systems and medical practitioners, is essential to optimizing AI's advantages in diabetes treatment. Although AI is very good at processing data and identifying patterns, human knowledge is essential for contextualizing AI results and making complex medical decisions (146). AI recommendations have been demonstrated to perform better than physician decisions in virtual urgent care settings when it comes to spotting important warning signs and following clinical guidelines, but doctors are still better at tailoring recommendations to the needs of specific patients (146). When handling complex cases, this synergy between AI and human judgment is especially beneficial because AI can be used as a decision-support tool rather than as a substitute for clinical expertise (147). Establishing trust and guaranteeing fair access to AI-driven healthcare solutions also requires addressing the ethical implications of AI, such as data privacy and algorithmic transparency (51).

Algorithmic bias presents unique challenges in FL settings due to decentralized data silos, heterogeneous distributions, and privacy-preserving constraints. To quantify and reduce such biases, novel approaches that balance fairness with the inherent limitations of FL are required. Intersectionality theory-based differential fairness (DF) metrics have been adapted to FL to measure differences across overlapping protected attributes without requiring centralized data access (281). Even with sparse or dispersed data, these metrics assess fairness by evaluating parity in error rates across subgroups (281, 282). In healthcare FL applications, such as forecasting 30-day readmissions, fairness drift—where bias manifests after deployment—has been observed. This necessitates continuous monitoring to find variations in calibration or accuracy across racial or socioeconomic groups using metrics like the generalized entropy index and equalized odds difference (282, 283).

Common mitigation strategies in FL include local debiasing and fairness-aware model aggregation. The Capuchin algorithm uses adversarial debiasing during federated updates, where local models are trained to lessen bias amplification while retaining utility (284). Similarly, Fairlearn has been extended to FL settings so that users can collaboratively optimize for fairness constraints through iterative reweighting of local gradients (284). Post-processing techniques like equalized odds adjustment are also applied to federated outputs in order to mitigate biases in predictions for underrepresented groups (284). Empirical studies show that incorporating race-interaction terms into FL models improves fairness for non-White subgroups by reducing misclassification rates in EHR-continuity predictions by two to three times (285).

In order to align ethical norms with technological debiasing, the SHIFT paradigm highlights the social implications of FL bias and encourages interdisciplinary collaboration (286). Participatory FL frameworks ensure that models reflect a range of care scenarios by incorporating stakeholders in the co-design of fairness restrictions (287). This is operationalized by tools such as D-BIAS, which visualize causal relationships in federated data and enable users to simulate debiased datasets with minimal distortion and iteratively refine biased edges (288).

Data heterogeneity in FL can exacerbate bias, as shown in sleep-scoring algorithms where age-related biases emerged due to unequal training distributions (289). Two strategies to balance subgroup representation are federated reweighting and synthetic data augmentation (288). The NIST AI Bill of Rights and the GDPR's risk-based approach mandate that FL systems document bias mitigation strategies, such as pre-market evaluations for high-risk medical devices (290).

Federated settings need cooperative debiasing techniques, context-aware fairness metrics, and stringent governance to ensure equitable AI. By combining intersectional fairness measures, adversarial training, and stakeholder feedback, FL can advance toward bias-resistant models without compromising privacy or usefulness.

The integration of clinicians, patients, and community stakeholders in the development, validation, and application of human-machine co-governance models—which prioritize ethical accountability and participatory design—represents the TA paradigm shift in healthcare technology. This approach addresses the critical tension between societal benefits and individual risks, particularly in circumstances such as COVID-19 vaccination where model results impact critical decisions (291). Although population-level models may prioritize aggregate outcomes, individual stakeholders, such as remote workers in low-incidence areas, may rationally decide not to participate in interventions like the AZ vaccine due to individualized risk assessments. This divergence emphasizes the need for inclusive modeling frameworks that take into account a range of value judgments, as evidenced by initiatives like ISPOR's recommendations for involving clinicians, legislators, and patients in model development. However, there are still gaps in practice; many models were created without formal stakeholder collaboration, which puts marginalized voices at risk of epistemic injustices (291).

The multidisciplinary lifecycle management for AI/ML tools at Duke Health is an example of an innovative governance framework that shows how structured engagement can reduce biases and improve practical applicability (292). These frameworks guarantee that models are in line with clinical workflows while addressing equity concerns, such as avoiding algorithmic biases that disproportionately harm vulnerable populations, by combining regulatory best practices with ongoing monitoring (292). In a similar vein, The Lancet-FT Commission's participatory approaches to digital health governance prioritize community enfranchisement as a means of fostering trust and facilitating democratic oversight and equitable access (293). There are still gaps in practice, though, as many models were developed without formal stakeholder collaboration, putting marginalized voices at risk of epistemic injustices.

Human-machine co-governance also requires reconsidering traditional hierarchies and shifting toward inclusive modalities that emphasize transparency and cooperative decision-making (294). Levinasian ethics highlights the tension between distributive justice and relational ethics by urging models to find a balance between measurable outcomes and individualized care (295). Technologies like explainable AI (XAI) and dynamic consent mechanisms can bridge this gap by empowering patients to co-manage data use and algorithmic outputs. Self-determined data sharing is made possible by patient-controlled “knowledge commons” in cystic fibrosis care models (296). These systems address issues of enclosure and disempowerment while also adhering to GDPR regulations by granting patients control over their health data (296).

The operationalization of co-governance is hampered, though, particularly when it comes to striking a balance between technical and human factors. Human-in-the-loop (HITL) approaches are robust in psychiatric applications, but they are still limited by dataset quality and require safety measures like diversity sampling and reliability metrics to prevent inaccurate predictions (297). The lack of Global South perspectives in health AI governance and the underrepresentation of AI developers in legal-risk discussions are two examples of how disciplinary silos frequently obstruct interdisciplinary collaboration (298). Suggested solutions include institutionalized norms for public participation, akin to engineering safety standards, and adaptive model architectures that integrate real-time feedback from patients and clinicians (294). Task and theory-of-mind models in HITL systems must be updated frequently to reflect evolving clinical scenarios and ergonomic requirements in order to guarantee alignment with human cognitive and physical constraints (299).

The future of human-machine co-governance will ultimately depend on systemic approaches that integrate emancipatory goals—like equity and trust—into AI design and implementation. This necessitates both structural and technological changes, such as rules requiring providers to be AI literate and roundtable forums for stakeholder discussions (294, 296). These models can go beyond neoliberal efficiency paradigms to provide patient-centered, ethically sound care by elevating marginalized voices and encouraging interdisciplinary discourse (295, 298).

Future prospects for improving diabetes care are extremely bright thanks to the convergence of DTs, multimodal large models, and human-machine co-governance. Future studies should concentrate on creating strong AI algorithms that can reason causally, incorporating data from various populations in the real world, and confirming these technologies through extensive clinical trials (267). A comprehensive, patient-centered approach to healthcare may be made possible by extending DT frameworks to include additional chronic conditions (266). Overcoming technological, moral, and legal obstacles will require promoting interdisciplinary cooperation between computer scientists, physicians, and legislators (8). The application of AI to diabetes care has the potential to revolutionize healthcare and usher in a period of precision medicine and proactive disease management, in addition to improving clinical outcomes as the technology develops (267).

6 Conclusion

AI has proven to be a valuable tool in diabetes care, improving patient outcomes, personalizing treatment plans, and increasing diagnostic accuracy. In order to minimize complications and medical expenses, AI-driven algorithms have demonstrated efficacy in identifying glycemic episodes, forecasting disease progression, and refining treatment plans. Real-time monitoring and remote patient management have been transformed by the combination of AI with wearable technology and telehealth platforms, providing previously unheard-of convenience and accuracy in the treatment of diabetes. These developments highlight AI's revolutionary potential in tackling the worldwide burden of diabetes, a chronic illness with rising incidence and financial consequences.

Harnessing the full potential of AI in diabetes care requires interdisciplinary collaboration between healthcare providers, technologists, policymakers, and industry stakeholders. Collaborations across industries can spur AI algorithm development and guarantee that these algorithms can be tailored to a range of patient types and healthcare environments. Furthermore, in order to address ethical issues like algorithmic bias and data privacy and promote openness and confidence in AI-driven interventions, regulatory frameworks must change. Standardizing AI applications, encouraging fair access, and incorporating human oversight to reduce risks and errors are all critical tasks for policymakers. Stakeholders can jointly promote the moral and efficient application of AI in diabetes care by cultivating a cooperative ecosystem, which will ultimately improve patient outcomes and lessen the cost of healthcare worldwide.

The unrealized potential of AI in diabetes care must be investigated by researchers, physicians, business executives, and regulators working together to solve current issues and spur innovation for a healthier future. Through the prioritization of ethical considerations and interdisciplinary collaboration, the healthcare community can guarantee that AI technologies are not only patient-centered and socially responsible, but also scientifically sound.

Statements

Author contributions

BB: Data curation, Supervision, Writing – review & editing, Formal analysis. HL: Methodology, Writing – original draft, Formal analysis, Writing – review & editing, Investigation. XL: Resources, Funding acquisition, Data curation, Formal analysis, Supervision, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by Science and Technology Department of Jilin Province (20250203164SF).

Acknowledgments

We would like to acknowledge our cooperators for assistance in data collection.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  • 1.

    Meneghini LF Fortmann AL Clark TL Rodriguez K . Making inroads in addressing population health in underserved communities with type 2 diabetes. Diabetes Spectr. (2019) 32(4):30311. 10.2337/ds19-0010

  • 2.

    Novak V Gomez F Dias AC Pimentel DA Alfaro FJ . Diabetes-related cognitive decline, a global health issue, and new treatments approaches. Int J Priv Health Inf Manag. (2017) 5(2):5870. 10.4018/IJPHIM.2017070104

  • 3.

    Hill KE Gleadle JM Pulvirenti M McNaughton DA . The social determinants of health for people with type 1 diabetes that progress to end-stage renal disease. Health Expect. (2015) 18(6):251321. 10.1111/hex.12220

  • 4.

    Leelarathna L Evans ML Neupane S Rayman G Lumley S Cranston I et al Intermittently scanned continuous glucose monitoring for type 1 diabetes. N Engl J Med. (2022) 387(16):147787. 10.1056/NEJMoa2205650

  • 5.

    Kudva YC Raghinaru D Lum JW Graham TE Liljenquist D Spanakis EK et al A randomized trial of automated insulin delivery in type 2 diabetes. N Engl J Med. (2025) 392(18):180112. 10.1056/NEJMoa2415948

  • 6.

    Burnside MJ Lewis DM Crocket HR Meier RA Williman JA Sanders OJ et al Open-source automated insulin delivery in type 1 diabetes. N Engl J Med. (2022) 387(10):86981. 10.1056/NEJMoa2203913

  • 7.

    Jairoun AA Al-Hemyari SS Shahwan M Al-Qirim T Shahwan M . Benefit-risk assessment of ChatGPT applications in the field of diabetes and metabolic illnesses: a qualitative study. Clin Med Insights Endocrinol Diabetes. (2024) 17:11795514241235514. 10.1177/11795514241235514

  • 8.

    Jia W Fisher EB . Application and prospect of artificial intellingence in diabetes care. Med Rev. (2023) 3(1):1024. 10.1515/mr-2022-0039

  • 9.

    Roth EG Chard S . Affective practices of diabetes self-management among older adults: cumulative effects of childhood adversity. Gerontologist. (2022) 62(4):56876. 10.1093/geront/gnab124

  • 10.

    Lee J Yu J Yoon K-H . Opening the precision diabetes care through digital healthcare. Diabetes Metab J. (2023) 47(3):30714. 10.4093/dmj.2022.0386

  • 11.

    Huang S-C Jensen M Yeung-Levy S Lungren MP Poon H Chaudhari AS . A systematic review and implementation guidelines of multimodal foundation models in medical imaging. Res Sq [Preprint]. (2025). 10.21203/rs.3.rs-5537908/v1

  • 12.

    Qiu W Quan C Yu Y Kara E Qian K Hu B et al Federated abnormal heart sound detection with weak to no labels. Cyborg Bionic Syst. (2024) 5:0152. 10.34133/cbsystems.0152

  • 13.

    Ahn J Jang H Jeong Y Choi S Ko J Hwang SH et al Illuminating recent progress in nanotransfer printing: core principles, emerging applications, and future perspectives. Adv Sci. (2024) 11(1):2303704. 10.1002/advs.202303704

  • 14.

    Warner E Lee J Hsu W Syeda-Mahmood T Kahn CE Gevaert O et al Multimodal machine learning in image-based and clinical biomedicine: survey and prospects. Int J Comput Vis. (2024) 132(9):375369. 10.1007/s11263-024-02032-8

  • 15.

    Modena Heming CA Alvarez JA Miranda J Cardoso D Almeida Ghezzi CL Nogueira GF et al Mastering rectal cancer MRI: from foundational concepts to optimal staging. Eur J Radiol. (2025) 183:111937. 10.1016/j.ejrad.2025.111937

  • 16.

    Chin C-Y Hsieh S-Y Tseng VS . eDRAM: effective early disease risk assessment with matrix factorization on a large-scale medical database: a case study on rheumatoid arthritis. PLoS One. (2018) 13(11):e0207579. 10.1371/journal.pone.0207579

  • 17.

    Xie F Chakraborty B Ong MEH Goldstein BA Liu N . Autoscore: a machine learning–based automatic clinical score generator and its application to mortality prediction using electronic health records. JMIR Med Inform. (2020) 8(10):e21798. 10.2196/21798

  • 18.

    Oh M-Y Kim H-S Jung YM Lee H-C Lee S-B Lee SM . Machine learning–based explainable automated nonlinear computation scoring system for health score and an application for prediction of perioperative stroke: retrospective study. J Med Internet Res. (2025) 27:e58021. 10.2196/58021

  • 19.

    van Boekel AM van der Meijden SL Arbous SM Nelissen RGHH Veldkamp KE Nieswaag EB et al Systematic evaluation of machine learning models for postoperative surgical site infection prediction. PLoS One. (2024) 19(12):e0312968. 10.1371/journal.pone.0312968

  • 20.

    Birlik AB Tozan H Köse KB . Machine learning-based hybrid risk estimation system (ERES) in cardiac surgery: supplementary insights from the ASA score analysis. PLOS Digit Health. (2025) 4(6):e0000889. 10.1371/journal.pdig.0000889

  • 21.

    Tao H You L Huang Y Chen Y Yan L Liu D et al An interpreting machine learning models to predict amputation risk in patients with diabetic foot ulcers: a multi-center study. Front Endocrinol. (2025) 16:1526098. 10.3389/fendo.2025.1526098

  • 22.

    Zaka A Mutahar D Gorcilov J Gupta AK Kovoor JG Stretton B et al Machine learning approaches for risk prediction after percutaneous coronary intervention: a systematic review and meta-analysis. Eur Heart J Digit Health. (2025) 6(1):2344. 10.1093/ehjdh/ztae074

  • 23.

    Joddrell M El-Bouri W Harrison SL Huisman MV Lip GYH Zheng Y . Machine learning for outcome prediction in patients with non-valvular atrial fibrillation from the GLORIA-AF registry. Sci Rep. (2024) 14(1):27088. 10.1038/s41598-024-78120-z

  • 24.

    Ning L Su J Li K Dong J Lu H . The study of an integrated diabetes prediction model based on user-defined risk decision making strategy. Medicine (Baltimore). (2025) 104(24):e42680. 10.1097/MD.0000000000042680

  • 25.

    Capoglu R Gonullu E Bayhan Z Coskun M Harmantepe T Kucuk F . Comparison of scoring systems regarding the gender as a parameter with the traditional scoring systems for predicting appendicitis. Updates Surg. (2022) 74(3):103542. 10.1007/s13304-022-01272-y

  • 26.

    Cisnal A Ruiz Rebollo ML Flórez-Pardo C Matesanz-Isabel J Pérez Turiel J Fraile JC . Improved early prediction of acute pancreatitis severity using SHAP-based XGBoost model: beyond traditional scoring systems. Dig Liver Dis. (2025). 10.1016/j.dld.2025.10.017

  • 27.

    Wexner SD . Scoring systems for diverticular disease. Gut. (2022) 71(7):12456. 10.1136/gutjnl-2021-326009

  • 28.

    Keten T Ozercan AY Eroglu U Basboga S Tatlici K Senel C et al Can HALP score, a new prognostic tool, take the place of traditional scoring systems in fournier’s gangrene? Int Urol Nephrol. (2023) 55(10):238995. 10.1007/s11255-023-03695-x

  • 29.

    Cha Y Kim J-T Kim J-W Seo SH Lee S-Y Yoo J-I . Effect of artificial intelligence or machine learning on prediction of hip fracture risk: systematic review. J Bone Metab. (2023) 30(3):24552. 10.11005/jbm.2023.30.3.245

  • 30.

    Wang H Zhou Z Li H Xiang W Lan Y Dou X et al Blood biomarkers panels for screening of colorectal cancer and adenoma on a machine learning-assisted detection platform. Cancer Control. (2023) 30:10732748231222109. 10.1177/10732748231222109

  • 31.

    Shu X Liu Y Qiao X Ai G Liu L Liao J et al Radiomic-based machine learning model for the accurate prediction of prostate cancer risk stratification. Br J Radiol. (2023) 96(1143):20220238. 10.1259/bjr.20220238

  • 32.

    Patterson BW Engstrom CJ Sah V Smith MA Mendonça EA Pulia MS et al Training and interpreting machine learning algorithms to evaluate fall risk after emergency department visits. Med Care. (2019) 57(7):5606. 10.1097/MLR.0000000000001140

  • 33.

    Brentnall AR Atakpa EC Hill H Santeramo R Damiani C Cuzick J et al An optimization framework to guide the choice of thresholds for risk-based cancer screening. NPJ Digit Med. (2023) 6(1):223. 10.1038/s41746-023-00967-9

  • 34.

    Alaka SA Menon BK Brobbey A Williamson T Goyal M Demchuk AM et al Functional outcome prediction in ischemic stroke: a comparison of machine learning algorithms and regression models. Front Neurol. (2020) 11:889. 10.3389/fneur.2020.00889

  • 35.

    Hasegawa S Mizokami F Kameya Y Hayakawa Y Watanabe T Matsui Y . Machine learning versus binomial logistic regression analysis for fall risk based on SPPB scores in older adult outpatients. Digital Health. (2023) 9:20552076231219438. 10.1177/20552076231219438

  • 36.

    Eriksson M Román M Gräwingholt A Castells X Nitrosi A Pattacini P et al European validation of an image-derived AI-based short-term risk model for individualized breast cancer screening—a nested case-control study. Lancet Reg Health Eur. (2024) 37:100798. 10.1016/j.lanepe.2023.100798

  • 37.

    Sun L Yang L Liu X Tang L Zeng Q Gao Y et al Optimization of cervical cancer screening: a stacking-integrated machine learning algorithm based on demographic, behavioral, and clinical factors. Front Oncol. (2022) 12:821453. 10.3389/fonc.2022.821453

  • 38.

    Xiouras C Cameli F Quilló GL Kavousanakis ME Vlachos DG Stefanidis GD . Applications of artificial intelligence and machine learning algorithms to crystallization. Chem Rev. (2022) 122(15):1300642. 10.1021/acs.chemrev.2c00141

  • 39.

    Obermeyer Z Topol EJ . Artificial intelligence, bias, and patients’ perspectives. Lancet. (2021) 397(10289):2038. 10.1016/S0140-6736(21)01152-1

  • 40.

    Gombolay GY Gopalan N Bernasconi A Nabbout R Megerian JT Siegel B et al Review of machine learning and artificial intelligence (ML/AI) for the pediatric neurologist. Pediatr Neurol. (2023) 141:4251. 10.1016/j.pediatrneurol.2023.01.004

  • 41.

    Howell MD Corrado GS DeSalvo KB . Three epochs of artificial intelligence in health care. JAMA. (2024) 331(3):242. 10.1001/jama.2023.25057

  • 42.

    Yao MWM Nguyen ET Retzloff MG Gago LA Copland S Nichols JE et al Improving IVF utilization with patient-centric artificial intelligence-machine learning (AI/ML): a retrospective multicenter experience. J Clin Med. (2024) 13(12):3560. 10.3390/jcm13123560

  • 43.

    Bi WL Hosny A Schabath MB Giger ML Birkbak NJ Mehrtash A et al Artificial intelligence in cancer imaging: clinical challenges and applications. CA Cancer J Clin. (2019) 69(2):12757. 10.3322/caac.21552

  • 44.

    Eaneff S Obermeyer Z Butte AJ . The case for algorithmic stewardship for artificial intelligence and machine learning technologies. JAMA. (2020) 324(14):1397. 10.1001/jama.2020.9371

  • 45.

    Köchert K Friede T Kunz M Pang H Zhou Y Rantou E . On the application of artificial intelligence/machine learning (AI/ML) in late-stage clinical development. Ther Innov Regul Sci. (2024) 58(6):108093. 10.1007/s43441-024-00689-4

  • 46.

    Huang Y-J Chen C Yang H-C . AI-enhanced integration of genetic and medical imaging data for risk assessment of type 2 diabetes. Nat Commun. (2024) 15(1):4230. 10.1038/s41467-024-48618-1

  • 47.

    Parkhi D Periyathambi N Ghebremichael-Weldeselassie Y Patel V Sukumar N Siddharthan R et al Prediction of postpartum prediabetes by machine learning methods in women with gestational diabetes mellitus. iScience. (2023) 26(10):107846. 10.1016/j.isci.2023.107846

  • 48.

    Cubillos G Monckeberg M Plaza A Morgan M Estevez PA Choolani M et al Development of machine learning models to predict gestational diabetes risk in the first half of pregnancy. BMC Pregnancy Childbirth. (2023) 23(1):469. 10.1186/s12884-023-05766-4

  • 49.

    Zhou H Chen W Chen C Zeng Y Chen J Lin J et al Predictive value of ultrasonic artificial intelligence in placental characteristics of early pregnancy for gestational diabetes mellitus. Front Endocrinol. (2024) 15:1344666. 10.3389/fendo.2024.1344666

  • 50.

    Kumar M Chen L Tan K Ang LT Ho C Wong G et al Population-centric risk prediction modeling for gestational diabetes mellitus: a machine learning approach. Diabetes Res Clin Pract. (2022) 185:109237. 10.1016/j.diabres.2022.109237

  • 51.

    Theodosiou AA Read RC . Artificial intelligence, machine learning and deep learning: potential resources for the infection clinician. J Infect. (2023) 87(4):28794. 10.1016/j.jinf.2023.07.006

  • 52.

    Liu M-Y Liu C-F Lin T-C Ma Y-S . Implementing a novel machine learning system for nutrition education in diabetes mellitus nutritional clinic: predicting 1-year blood glucose control. Bioengineering. (2023) 10(10):1139. 10.3390/bioengineering10101139

  • 53.

    Liu Y Ye S Xiao X Sun C Wang G Wang G et al Machine learning for tuning, selection, and ensemble of multiple risk scores for predicting type 2 diabetes. Risk Manag Healthc Policy. (2019) 12:18998. 10.2147/RMHP.S225762

  • 54.

    Pyrros A Borstelmann SM Mantravadi R Zaiman Z Thomas K Price B et al Opportunistic detection of type 2 diabetes using deep learning from frontal chest radiographs. Nat Commun. (2023) 14(1):4039. 10.1038/s41467-023-39631-x

  • 55.

    Qiao Y Zhou H Liu Y Chen R Zhang X Nie S et al A multi-modal fusion model with enhanced feature representation for chronic kidney disease progression prediction. Brief Bioinform. (2024) 26(1):bbaf003. 10.1093/bib/bbaf003

  • 56.

    Zhang L Liu Z Zhu X Song Z Yang X Lei Z et al Weakly aligned feature fusion for multimodal object detection. IEEE Trans Neural Netw Learn Syst. (2025) 36(3):414559. 10.1109/TNNLS.2021.3105143

  • 57.

    Chen L Fu Y Gu L Yan C Harada T Huang G . Frequency-Aware feature fusion for dense image prediction. IEEE Trans Pattern Anal Mach Intell. (2024) 46(12):1076380. 10.1109/TPAMI.2024.3449959

  • 58.

    Cheng H Peng L Yang Z Lin B He X Wu B . Temporal feature fusion for 3D detection in monocular video. IEEE Trans Image Process. (2024) 33:266575. 10.1109/TIP.2024.3378475

  • 59.

    Setiawan D Wiranto Y Girard JM Watts A Ashourvan A . Individualized machine-learning-based clinical assessment recommendation system. medRxiv (2024). 10.1101/2024.07.24.24310941

  • 60.

    Wang Z Xu X Zhu L Bin Y Wang G Yang Y et al Evidence-based multi-feature fusion for adversarial robustness. IEEE Trans Pattern Anal Mach Intell. (2025) 47(10):892337. 10.1109/TPAMI.2025.3582518

  • 61.

    Chellappan D Rajaguru H . Generalizability of machine learning models for diabetes detection a study with nordic islet transplant and PIMA datasets. Sci Rep. (2025) 15(1):4479. 10.1038/s41598-025-87471-0

  • 62.

    Tao Y Hou J Zhou G Zhang D . Artificial intelligence applied to diabetes complications: a bibliometric analysis. Front Artif Intell. (2025) 8:1455341. 10.3389/frai.2025.1455341

  • 63.

    Mendoza-Mendoza M Acosta-Jiménez S Galván-Tejada C Maeda-Gutiérrez V Celaya-Padilla J Galván-Tejada J et al Sex-specific ensemble models for type 2 diabetes classification in the Mexican population. Diabetes Metab Syndr Obes. (2025) 18:150125. 10.2147/DMSO.S517905

  • 64.

    Zheng R Huang Z Deng L . Large-scale predicting protein functions through heterogeneous feature fusion. Brief Bioinform. (2023) 24(4):bbad243. 10.1093/bib/bbad243

  • 65.

    Ouyang D Wang M Zhang N He W Huo D Li Y et al 2D time-stretching anisotropic synapse realizing in-sensor intensity-spanning visual feature fusion. Adv Mater. (2025) 37(33):2507168. 10.1002/adma.202507168

  • 66.

    Gniadek T Kang J Theparee T Krive J . Framework for classifying explainable artificial intelligence (XAI) algorithms in clinical medicine. Online J Public Health Inform. (2023) 15:e50934. 10.2196/50934

  • 67.

    Sultan T Chowdhury M Jahan N Mridha MF Alfarhood S Safran M et al LeafDNet: transforming leaf disease diagnosis through deep transfer learning. Plant Direct. (2025) 9(2):e70047. 10.1002/pld3.70047

  • 68.

    Tosun AB Pullara F Becich MJ Taylor DL Fine JL Chennubhotla SC . Explainable AI (xAI) for anatomic pathology. Adv Anat Pathol. (2020) 27(4):24150. 10.1097/PAP.0000000000000264

  • 69.

    Matin SS Pradhan B . Earthquake-induced building-damage mapping using explainable AI (XAI). Sensors. (2021) 21(13):4489. 10.3390/s21134489

  • 70.

    Kothari S Sharma S Shejwal S Kazi A D’Silva M Karthikeyan M . An explainable AI-assisted web application in cancer drug value prediction. MethodsX. (2024) 12:102696. 10.1016/j.mex.2024.102696

  • 71.

    Manz R Bäcker J Cramer S Meyer P Müller D Muzalyova A et al Do explainable AI (XAI) methods improve the acceptance of AI in clinical practice? An evaluation of XAI methods on gleason grading. J Pathol Clin Res. (2025) 11(2):e70023. 10.1002/2056-4538.70023

  • 72.

    Rajpal A Sehra K Bagri R Sikka P . XAI-FR: explainable AI-based face recognition using deep neural networks. Wireless Pers Commun. (2023) 129(1):66380. 10.1007/s11277-022-10127-z

  • 73.

    Jang J-H Jo Y-Y Kang S Son JM Lee HS Kwon J-m et al A novel XAI framework for explainable AI-ECG using generative counterfactual XAI (GCX). Sci Rep. (2025) 15(1):23608. 10.1038/s41598-025-08080-5

  • 74.

    Bifarin OO Fernández FM . Automated machine learning and explainable AI (AutoML-XAI) for metabolomics: improving cancer diagnostics. J Am Soc Mass Spectrom. (2024) 35(6):1089100. 10.1021/jasms.3c00403

  • 75.

    Chraibi Kaadoud I Bennetot A Mawhin B Charisi V Díaz-Rodríguez N . Explaining aha! moments in artificial agents through IKE-XAI: implicit knowledge extraction for eXplainable AI. Neural Netw. (2022) 155:95118. 10.1016/j.neunet.2022.08.002

  • 76.

    Payrovnaziri SN Chen Z Rengifo-Moreno P Miller T Bian J Chen JH et al Explainable artificial intelligence models using real-world electronic health record data: a systematic scoping review. J Am Med Inform Assoc. (2020) 27(7):117385. 10.1093/jamia/ocaa053

  • 77.

    Pe S Famiglini L Gallazzi E Bortolotto C Carone L Cisarri A et al Alternative strategies to generate class activation maps supporting AI-based advice in vertebral fracture detection in x-ray images. Methods Inf Med. (2024) 63(3–4):12236. 10.1055/a-2562-2163

  • 78.

    Boehm KM Khosravi P Vanguri R Gao J Shah SP . Harnessing multimodal data integration to advance precision oncology. Nat Rev Cancer. (2022) 22(2):11426. 10.1038/s41568-021-00408-3

  • 79.

    Wamil M Hassaine A Rao S Li Y Mamouei M Canoy D et al Stratification of diabetes in the context of comorbidities, using representation learning and topological data analysis. Sci Rep. (2023) 13(1):11478. 10.1038/s41598-023-38251-1

  • 80.

    Meng-Papaxanthos L Zhang R Li G Cuturi M Noble WS Vert J-P . LSMMD-MA: scaling multimodal data integration for single-cell genomics data analysis. Bioinformatics. (2023) 39(7):btad420. 10.1093/bioinformatics/btad420

  • 81.

    Lipkova J Chen RJ Chen B Lu MY Barbieri M Shao D et al Artificial intelligence for multimodal data integration in oncology. Cancer Cell. (2022) 40(10):1095110. 10.1016/j.ccell.2022.09.012

  • 82.

    Tummler K Klipp E . Data integration strategies for whole-cell modeling. FEMS Yeast Res. (2024) 24:foae011. 10.1093/femsyr/foae011

  • 83.

    Yang M Zhao Y Li C Weng X Li Z Guo W et al Multimodal integration of liquid biopsy and radiology for the noninvasive diagnosis of gallbladder cancer and benign disorders. Cancer Cell. (2025) 43(3):398412.e4. 10.1016/j.ccell.2025.02.011

  • 84.

    Xie C Ning Z Guo T Yao L Chen X Huang W et al Multimodal data integration for biologically-relevant artificial intelligence to guide adjuvant chemotherapy in stage II colorectal cancer. eBioMedicine. (2025) 117:105789. 10.1016/j.ebiom.2025.105789

  • 85.

    Nunes FVM Behrens LMP Weimer RD Gonçalves GF da Silva Fernandes G Dorn M . Deep learning methods and applications in single-cell multimodal data integration. Mol Omics. (2025) 21(6):54565. 10.1039/D5MO00062A

  • 86.

    Aguado A Moratalla-Navarro F López-Simarro F Moreno V . Morbinet: multimorbidity networks in adult general population. Analysis of type 2 diabetes mellitus comorbidity. Sci Rep. (2020) 10(1):2416. 10.1038/s41598-020-59336-1

  • 87.

    Li L Zeng J Zhang X . Generalized liquid association analysis for multimodal data integration. J Am Stat Assoc. (2023) 118(543):198496. 10.1080/01621459.2021.2024437

  • 88.

    Abràmoff MD Lavin PT Birch M Shah N Folk JC . Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices. NPJ Digit Med. (2018) 1(1):39. 10.1038/s41746-018-0040-6

  • 89.

    Bilal A Imran A Baig TI Liu X Long H Alzahrani A et al Improved support vector machine based on CNN-SVD for vision-threatening diabetic retinopathy detection and classification. PLoS One. (2024) 19(1):e0295951. 10.1371/journal.pone.0295951

  • 90.

    Mwangi N Ng’ang’a M Gakuo E Gichuhi S Macleod D Moorman C et al Effectiveness of peer support to increase uptake of retinal examination for diabetic retinopathy: study protocol for the DURE pragmatic cluster randomized clinical trial in Kirinyaga, Kenya. BMC Public Health. (2018) 18(1):871. 10.1186/s12889-018-5761-6

  • 91.

    Sundaram S Selvamani M Raju SK Ramaswamy S Islam S Cha J-H et al Diabetic retinopathy and diabetic macular edema detection using ensemble based convolutional neural networks. Diagnostics. (2023) 13(5):1001. 10.3390/diagnostics13051001

  • 92.

    Bhaskaranand M Ramachandra C Bhat S Cuadros J Nittala MG Sadda S et al Automated diabetic retinopathy screening and monitoring using retinal fundus image analysis. J Diabetes Sci Technol. (2016) 10(2):25461. 10.1177/1932296816628546

  • 93.

    Hao Z Cui S Zhu Y Shao H Huang X Jiang X et al Application of non-mydriatic fundus examination and artificial intelligence to promote the screening of diabetic retinopathy in the endocrine clinic: an observational study of T2DM patients in Tianjin, China. Ther Adv Chronic Dis. (2020) 11:2040622320942415. 10.1177/2040622320942415

  • 94.

    Saleh I El-Den NN Elsharkawy M Mahmoud A Sewelam A Wang W et al AI-based methods for diagnosing and grading diabetic retinopathy: a comprehensive review. Artif Intell Med. (2025) 168:103221. 10.1016/j.artmed.2025.103221

  • 95.

    Shah P Mishra D Shanmugam M Doshi B Jayaraj H Ramanjulu R . Validation of deep convolutional neural network-based algorithm for detection of diabetic retinopathy—artificial intelligence versus clinician for screening. Indian J Ophthalmol. (2020) 68(2):398. 10.4103/ijo.IJO_966_19

  • 96.

    Kanagasingam Y Xiao D Vignarajan J Preetham A Tay-Kearney M-L Mehrotra A . Evaluation of artificial intelligence–based grading of diabetic retinopathy in primary care. JAMA Netw Open. (2018) 1(5):e182665. 10.1001/jamanetworkopen.2018.2665

  • 97.

    Li S Zhao R Zou H . Artificial intelligence for diabetic retinopathy. Chin Med J. (2022) 135(3):25360. 10.1097/CM9.0000000000001816

  • 98.

    Elsayed A Ghaith M Yosri A Li Z El-Dakhakhni W . Genetic programming expressions for effluent quality prediction: towards AI-driven monitoring and management of wastewater treatment plants. J Environ Manage. (2024) 356:120510. 10.1016/j.jenvman.2024.120510

  • 99.

    aha T Del Caño R Mahato K De la Paz E Chen C Ding S et al Wearable electrochemical glucose sensors in diabetes management: a comprehensive review. Chem Rev. (2023) 123(12):785489. 10.1021/acs.chemrev.3c00078

  • 100.

    Reynolds L Luo Z Singh K . Diabetic complications and prospective immunotherapy. Front Immunol. (2023) 14:1219598. 10.3389/fimmu.2023.1219598

  • 101.

    Jalgaonkar MP Parmar UM Kulkarni YA Oza MJ . SIRT1-FOXOs activity regulates diabetic complications. Pharmacol Res. (2022) 175:106014. 10.1016/j.phrs.2021.106014

  • 102.

    Chalotra R Gupta T Chib S Amanat M Kumar P Singh R . Treatment of diabetic complications: do flavonoids holds the keys?Crit Rev Food Sci Nutr. (2024) 64(30):11091112. 10.1080/10408398.2023.2232868

  • 103.

    Teymourian H Barfidokht A Wang J . Electrochemical glucose sensors in diabetes management: an updated review (2010–2020). Chem Soc Rev. (2020) 49(21):7671709. 10.1039/D0CS00304B

  • 104.

    Parmar UM Jalgaonkar MP Kulkarni YA Oza MJ . Autophagy-nutrient sensing pathways in diabetic complications. Pharmacol Res. (2022) 184:106408. 10.1016/j.phrs.2022.106408

  • 105.

    Ahlqvist E van Zuydam NR Groop LC McCarthy MI . The genetics of diabetic complications. Nat Rev Nephrol. (2015) 11(5):27787. 10.1038/nrneph.2015.37

  • 106.

    Zhang X Luo Z Li J Lin Y Li Y Li W . Sestrin2 in diabetes and diabetic complications. Front Endocrinol. (2023) 14:1274686. 10.3389/fendo.2023.1274686

  • 107.

    Demir S Nawroth PP Herzig S Ekim Üstünel B . Emerging targets in type 2 diabetes and diabetic complications. Adv Sci. (2021) 8(18):2100275. 10.1002/advs.202100275

  • 108.

    Tan E Khoo J Gani LU Malakar RD Tay TL Tirukonda PS et al Effect of multidisciplinary intensive targeted care in improving diabetes mellitus outcomes: a randomized controlled pilot study—the integrated diabetes education, awareness and lifestyle modification in Singapore (IDEALS) program. Trials. (2019) 20(1):549. 10.1186/s13063-019-3601-3

  • 109.

    Hahr AJ Molitch ME . Management of diabetes mellitus in patients with chronic kidney disease. Clin Diabetes Endocrinol. (2015) 1(1):2. 10.1186/s40842-015-0001-9

  • 110.

    Helou N Talhouedec D Shaha M Zanchi A . The impact of a multidisciplinary self-care management program on quality of life, self-care, adherence to anti-hypertensive therapy, glycemic control, and renal function in diabetic kidney disease: a cross-over study protocol. BMC Nephrol. (2016) 17(1):88. 10.1186/s12882-016-0279-6

  • 111.

    Wu Y Min H Li M Shi Y Ma A Han Y et al Effect of artificial intelligence-based health education accurately linking system (AI-HEALS) for type 2 diabetes self-management: protocol for a mixed-methods study. BMC Public Health. (2023) 23(1):1325. 10.1186/s12889-023-16066-z

  • 112.

    Sasso FC Pafundi PC Simeon V De Nicola L Chiodini P Galiero R et al Efficacy and durability of multifactorial intervention on mortality and MACEs: a randomized clinical trial in type-2 diabetic kidney disease. Cardiovasc Diabetol. (2021) 20(1):145. 10.1186/s12933-021-01343-1

  • 113.

    Sekandi JN Shi W Zhu R Kaggwa P Mwebaze E Li S . Application of artificial intelligence to the monitoring of medication adherence for tuberculosis treatment in Africa: algorithm development and validation. JMIR AI. (2023) 2:e40167. 10.2196/40167

  • 114.

    Makino M Yoshimoto R Ono M Itoko T Katsuki T Koseki A et al Artificial intelligence predicts the progression of diabetic kidney disease using big data machine learning. Sci Rep. (2019) 9(1):11862. 10.1038/s41598-019-48263-5

  • 115.

    Liu X Zhang J . Continuous glucose monitoring: a transformative approach to the detection of prediabetes. JMDH. (2024) 17:55139. 10.2147/JMDH.S493128

  • 116.

    Zhang X Zhang J Ren Y Sun R Zhai X . Unveiling the pathogenesis and therapeutic approaches for diabetic nephropathy: insights from panvascular diseases. Front Endocrinol. (2024) 15:1368481. 10.3389/fendo.2024.1368481

  • 117.

    Sathya Preiya V Kumar VDA . Deep learning-based classification and feature extraction for predicting pathogenesis of foot ulcers in patients with diabetes. Diagnostics. (2023) 13(12):1983. 10.3390/diagnostics13121983

  • 118.

    Ma M Wan X Chen Y Lu Z Guo D Kong H et al A novel explainable online calculator for contrast-induced AKI in diabetics: a multi-centre validation and prospective evaluation study. J Transl Med. (2023) 21(1):517. 10.1186/s12967-023-04387-x

  • 119.

    Anjum M Saher R Saeed MN . Optimizing type 2 diabetes management: AI-enhanced time series analysis of continuous glucose monitoring data for personalized dietary intervention. PeerJ Comput Sci. (2024) 10:e1971. 10.7717/peerj-cs.1971

  • 120.

    Dongling N Ziwei K Juanling S Li Z Chang W Ting L et al Universal nomogram for predicting referable diabetic retinopathy: a validated model for community and ophthalmic outpatient populations using easily accessible indicators. Front Endocrinol. (2025) 16:1557166. 10.3389/fendo.2025.1557166

  • 121.

    Malerbi FK Nakayama LF Melo GB Stuchi JA Lencione D Prado PV et al Automated identification of different severity levels of diabetic retinopathy using a handheld fundus camera and single-image protocol. Ophthalmol Sci. (2024) 4(4):100481. 10.1016/j.xops.2024.100481

  • 122.

    Ganjgahi H Häring DA Aarden P Graham G Sun Y Gardiner S et al AI-driven reclassification of multiple sclerosis progression. Nat Med. (2025) 31(10):341424. 10.1038/s41591-025-03901-6

  • 123.

    Christiansen F Konuk E Ganeshan AR Welch R Palés Huix J Czekierdowski A et al International multicenter validation of AI-driven ultrasound detection of ovarian cancer. Nat Med. (2025) 31(1):18996. 10.1038/s41591-024-03329-4

  • 124.

    Xiong R Aiken E Caldwell R Vernon SD Kozhaya L Gunter C et al AI-driven multi-omics modeling of myalgic encephalomyelitis/chronic fatigue syndrome. Nat Med. (2025) 31(9):29913001. 10.1038/s41591-025-03788-3

  • 125.

    Service RF . AI-driven robots discover record-setting laser compound. Science. (2024) 384(6697):725. 10.1126/science.adq4552

  • 126.

    Cho H Ackom E . Artificial intelligence (AI)-driven approach to climate action and sustainable development. Nat Commun. (2025) 16(1):1228. 10.1038/s41467-024-53956-1

  • 127.

    Sounderajah V Guni A Liu X Collins GS Karthikesalingam A Markar SR et al The STARD-AI reporting guideline for diagnostic accuracy studies using artificial intelligence. Nat Med. (2025) 31(10):32839. 10.1038/s41591-025-03953-8

  • 128.

    Li J Ying C . A sensitivity indicator screening and intelligent classification method for the diagnosis of T2D-CHD. Front Cardiovasc Med. (2024) 11:1358066. 10.3389/fcvm.2024.1358066

  • 129.

    Wang J Xue Q Zhang CWJ Wong KKL Liu Z . Explainable coronary artery disease prediction model based on AutoGluon from AutoML framework. Front Cardiovasc Med. (2024) 11:1360548. 10.3389/fcvm.2024.1360548

  • 130.

    Lee YC Cha J Shim I Park W-Y Kang SW Lim DH et al Multimodal deep learning of fundus abnormalities and traditional risk factors for cardiovascular risk prediction. NPJ Digit Med. (2023) 6(1):14. 10.1038/s41746-023-00748-4

  • 131.

    Sadr H Salari A Ashoobi MT Nazari M . Cardiovascular disease diagnosis: a holistic approach using the integration of machine learning and deep learning models. Eur J Med Res. (2024) 29(1):455. 10.1186/s40001-024-02044-7

  • 132.

    Drouard G Mykkänen J Heiskanen J Pohjonen J Ruohonen S Pahkala K et al Exploring machine learning strategies for predicting cardiovascular disease risk factors from multi-omic data. BMC Med Inform Decis Mak. (2024) 24(1):116. 10.1186/s12911-024-02521-3

  • 133.

    Wang X Cai S Wang H Li J Yang Y . Deep-learning-based renal artery stenosis diagnosis via multimodal fusion. J Appl Clin Med Phys. (2024) 25(3):e14298. 10.1002/acm2.14298

  • 134.

    Chen D Huang RS Jomy J Wong P Yan M Croke J et al Performance of multimodal artificial intelligence chatbots evaluated on clinical oncology cases. JAMA Netw Open. (2024) 7(10):e2437711. 10.1001/jamanetworkopen.2024.37711

  • 135.

    Ogunpola A Saeed F Basurra S Albarrak AM Qasem SN . Machine learning-based predictive models for detection of cardiovascular diseases. Diagnostics. (2024) 14(2):144. 10.3390/diagnostics14020144

  • 136.

    Carletti M Pandit J Gadaleta M Chiang D Delgado F Quartuccio K et al Multimodal AI correlates of glucose spikes in people with normal glucose regulation, pre-diabetes and type 2 diabetes. Nat Med. (2025) 31(9):31217. 10.1038/s41591-025-03849-7

  • 137.

    Yang Z Guo C Li J Li Y Zhong L Pu P et al An explainable multimodal artificial intelligence model integrating histopathological microenvironment and EHR phenotypes for germline genetic testing in breast cancer. Adv Sci. (2025) 12(31):e02833. 10.1002/advs.202502833

  • 138.

    Ploderer B Clark D Brown R Harman J Lazzarini PA Van Netten JJ . Self-monitoring diabetes-related foot ulcers with the MyFootCare app: a mixed methods study. Sensors. (2023) 23(5):2547. 10.3390/s23052547

  • 139.

    Matijevich E Minty E Bray E Bachus C Hajizadeh M Liden B . A multi-faceted digital health solution for monitoring and managing diabetic foot ulcer risk: a case series. Sensors. (2024) 24(9):2675. 10.3390/s24092675

  • 140.

    Hazenberg CEVB Aan De Stegge WB Van Baal SG Moll FL Bus SA . Telehealth and telemedicine applications for the diabetic foot: a systematic review. Diabetes Metabolism Res. (2020) 36(3):e3247. 10.1002/dmrr.3247

  • 141.

    Huang H-N Zhang T Yang C-T Sheen Y-J Chen H-M Chen C-J et al Image segmentation using transfer learning and fast R-CNN for diabetic foot wound treatments. Front Public Health. (2022) 10:969846. 10.3389/fpubh.2022.969846

  • 142.

    Cruvinel-Júnior RH Ferreira JSSP Veríssimo JL Monteiro RL Suda EY Silva ÉQ et al Could an internet-based foot–ankle therapeutic exercise program modify clinical outcomes and gait biomechanics in people with diabetic neuropathy? A clinical proof-of-concept study. Sensors. (2022) 22(24):9582. 10.3390/s22249582

  • 143.

    Haghverdian JC Noori N Hsu AR . Clinical pathway for the management of diabetic foot infections in the emergency department. Foot Ankle Orthop. (2023) 8(1):24730114221148166. 10.1177/24730114221148166

  • 144.

    Cigna E Fino P Onesti MG Amorosi V Scuderi N . Diabetic foot infection treatment and care. Int Wound J. (2016) 13(2):23842. 10.1111/iwj.12277

  • 145.

    Varghese C Harrison EM O’Grady G Topol EJ . Artificial intelligence in surgery. Nat Med. (2024) 30(5):125768. 10.1038/s41591-024-02970-3

  • 146.

    Zeltzer D Kugler Z Hayat L Brufman T Ilan Ber R Leibovich K et al Comparison of initial artificial intelligence (AI) and final physician recommendations in AI-assisted virtual urgent care visits. Ann Intern Med. (2025) 178(4):498506. 10.7326/ANNALS-24-03283

  • 147.

    Bhat M Rabindranath M Chara BS Simonetto DA . Artificial intelligence, machine learning, and deep learning in liver transplantation. J Hepatol. (2023) 78(6):121633. 10.1016/j.jhep.2023.01.006

  • 148.

    Burley SK Arap W Pasqualini R . Predicting proteome-scale protein structure with artificial intelligence. N Engl J Med. (2021) 385(23):21914. 10.1056/NEJMcibr2113027

  • 149.

    Butt H Khosa I Iftikhar MA . Feature transformation for efficient blood glucose prediction in type 1 diabetes mellitus patients. Diagnostics. (2023) 13(3):340. 10.3390/diagnostics13030340

  • 150.

    Mirshekarian S Shen H Bunescu R Marling C . LSTMs and neural attention models for blood glucose prediction: comparative experiments on real and synthetic data. Annu Int Conf IEEE Eng Med Biol Soc (2019). p. 70612. 10.1109/EMBC.2019.8856940

  • 151.

    Wang G Liu X Ying Z Yang G Chen Z Liu Z et al Optimized glycemic control of type 2 diabetes with reinforcement learning: a proof-of-concept trial. Nat Med. (2023) 29(10):263342. 10.1038/s41591-023-02552-9

  • 152.

    Shao J Pan Y Kou W-B Feng H Zhao Y Zhou K et al Generalization of a deep learning model for continuous glucose monitoring–based hypoglycemia prediction: algorithm development and validation study. JMIR Med Inform. (2024) 12:e56909. 10.2196/56909

  • 153.

    Kalia A Krishnan D Hassoun S . CSI: contrastive data stratification for interaction prediction and its application to compound–protein interaction prediction. Bioinformatics. (2023) 39(8):btad456. 10.1093/bioinformatics/btad456

  • 154.

    Zhang K Yang X Wang Y Yu Y Huang N Li G et al Artificial intelligence in drug development. Nat Med. (2025) 31(1):4559. 10.1038/s41591-024-03434-4

  • 155.

    Kistkins S Mihailovs T Lobanovs S Pīrāgs V Sourij H Moser O et al Comparative analysis of predictive interstitial glucose level classification models. Sensors. (2023) 23(19):8269. 10.3390/s23198269

  • 156.

    Rodríguez-Rodríguez I Rodríguez J-V Chatzigiannakis I Zamora Izquierdo . On the possibility of predicting glycaemia ‘on the fly’ with constrained IoT devices in type 1 diabetes mellitus patients. Sensors. (2019) 19(20):4538. 10.3390/s19204538

  • 157.

    Ji C Jiang T Liu L Zhang J You L . Continuous glucose monitoring combined with artificial intelligence: redefining the pathway for prediabetes management. Front Endocrinol. (2025) 16:1571362. 10.3389/fendo.2025.1571362

  • 158.

    Hjort A Iggman D Rosqvist F . Glycemic variability assessed using continuous glucose monitoring in individuals without diabetes and associations with cardiometabolic risk markers: a systematic review and meta-analysis. Clin Nutr. (2024) 43(4):91525. 10.1016/j.clnu.2024.02.014

  • 159.

    Rodriguez JA Palermo NE Song W Lipsitz S Caballero AE Samal L et al Lack of association between hemoglobin A1c and continuous glucose monitor metrics among individuals with prediabetes and normoglycemia. Diabetes Technol Ther. (2025). 10.1177/15209156251379506

  • 160.

    Mo Y Lu J Zhou J . Glycemic variability: measurement, target, impact on complications of diabetes and does it really matter?J Diabetes Invest. (2024) 15(1):514. 10.1111/jdi.14112

  • 161.

    Li J Lu J Tobore I Liu Y Kandwal A Wang L et al Gradient variability coefficient: a novel method for assessing glycemic variability and risk of hypoglycemia. Endocrine. (2022) 76(1):2935. 10.1007/s12020-021-02950-4

  • 162.

    Wang L Xie J Gu Z Miao X Ma L Yan S et al Predicting isolated impaired glucose tolerance without oral glucose tolerance test using machine learning in Chinese Han men. Front Endocrinol. (2025) 16:1514397. 10.3389/fendo.2025.1514397

  • 163.

    Pai R Barua S Kim BS McDonald M Wierzchowska-McNew RA Pai A et al Estimating breakfast characteristics using continuous glucose monitoring and machine learning in adults with or at risk of type 2 diabetes. J Diabetes Sci Technol. (2024). 10.1177/19322968241274800

  • 164.

    Zhang L Guo K Tian Q Ye J Ding Z Zhou Q et al Serum metabolomics reveals a potential benefit of methionine in type 1 diabetes patients with poor glycemic control and high glycemic variability. Nutrients. (2023) 15(3):518. 10.3390/nu15030518

  • 165.

    Mo Y Wang C Lu J Shen Y Chen L Zhang L et al Impact of short-term glycemic variability on risk of all-cause mortality in type 2 diabetes patients with well-controlled glucose profile by continuous glucose monitoring: a prospective cohort study. Diabetes Res Clin Pract. (2022) 189:109940. 10.1016/j.diabres.2022.109940

  • 166.

    Barua S Sabharwal A Glantz N Conneely C Larez A Bevier W et al The northeast glucose drift: stratification of post-breakfast dysglycemia among predominantly hispanic/latino adults at-risk or with type 2 diabetes. eClinicalMedicine. (2022) 43:101241. 10.1016/j.eclinm.2021.101241

  • 167.

    Sato Imuro SE Sabharwal A Bevier W Kerr D . Evaluating HbA1c-to-average glucose conversion with patient-specific kinetic models for diverse populations. Sci Rep. (2024) 14(1):22098. 10.1038/s41598-024-72837-7

  • 168.

    Mo Y Ma X Lu J Shen Y Wang Y Zhang L et al Defining the target value of the coefficient of variation by continuous glucose monitoring in Chinese people with diabetes. J Diabetes Invest. (2021) 12(6):102534. 10.1111/jdi.13453

  • 169.

    Prendin F Pavan J Cappon G Del Favero S Sparacino G Facchinetti A . The importance of interpreting machine learning models for blood glucose prediction in diabetes: an analysis using SHAP. Sci Rep. (2023) 13(1):16865. 10.1038/s41598-023-44155-x

  • 170.

    Xiong X Yang X Cai Y Xue Y He J Su H . Exploring the potential of deep learning models integrating transformer and LSTM in predicting blood glucose levels for T1D patients. Digital Health. (2025) 11:20552076251328980. 10.1177/20552076251328980

  • 171.

    Zhu X Fowler MJ Wells QS Stafford JM Gannon M . Predicting responsiveness to GLP-1 pathway drugs using real-world data. BMC Endocr Disord. (2024) 24(1):269. 10.1186/s12902-024-01798-9

  • 172.

    Adelson RP Garikipati A Zhou Y Ciobanu M Tawara K Barnes G et al Machine learning approach with harmonized multinational datasets for enhanced prediction of hypothyroidism in patients with type 2 diabetes. Diagnostics. (2024) 14(11):1152. 10.3390/diagnostics14111152

  • 173.

    Yang D Peng X Zheng S Peng S . Deep learning-based prediction of autoimmune diseases. Sci Rep. (2025) 15(1):4576. 10.1038/s41598-025-88477-4

  • 174.

    Reichman TW Markmann JF Odorico J Witkowski P Fung JJ Wijkstrom M et al Stem cell–derived, fully differentiated islets for type 1 diabetes. N Engl J Med. (2025) 393(9):85868. 10.1056/NEJMoa2506549

  • 175.

    Herold KC Delong T Perdigoto AL Biru N Brusko TM Walker LSK . The immunology of type 1 diabetes. Nat Rev Immunol. (2024) 24(6):43551. 10.1038/s41577-023-00985-4

  • 176.

    Ziegler A-G Cengiz E Kay TWH . The future of type 1 diabetes therapy. Lancet. (2025) 406(10511):152034. 10.1016/S0140-6736(25)01438-2

  • 177.

    Edelman SV . Regulation catches up to reality: nonadjunctive use of continuous glucose monitoring data. J Diabetes Sci Technol. (2017) 11(1):1604. 10.1177/1932296816667749

  • 178.

    Feig DS Donovan LE Corcoy R Murphy KE Amiel SA Hunt KF et al Continuous glucose monitoring in pregnant women with type 1 diabetes (CONCEPTT): a multicentre international randomised controlled trial. Lancet. (2017) 390(10110):234759. 10.1016/S0140-6736(17)32400-5

  • 179.

    Visser MM Charleer S Fieuws S De Block C Hilbrands R Van Huffel L et al Comparing real-time and intermittently scanned continuous glucose monitoring in adults with type 1 diabetes (ALERTT1): a 6-month, prospective, multicentre, randomised controlled trial. Lancet. (2021) 397(10291):227583. 10.1016/S0140-6736(21)00789-3

  • 180.

    Popova P Anopova A Vasukova E Isakov A Eriskovskaya A Degilevich A et al Trial protocol for the study of recommendation system DiaCompanion with personalized dietary recommendations for women with gestational diabetes mellitus (DiaCompanion I). Front Endocrinol. (2023) 14:1168688. 10.3389/fendo.2023.1168688

  • 181.

    Karter AJ Parker MM Moffet HH Gilliam LK Dlott R . Association of real-time continuous glucose monitoring with glycemic control and acute metabolic events among patients with insulin-treated diabetes. JAMA. (2021) 325(22):2273. 10.1001/jama.2021.6530

  • 182.

    Corbett JP Hsu L Brown SA Kollar L Vleugels K Buckingham B et al Smartwatch gesture-based meal reminders improve glycaemic control. Diabetes Obes Metab. (2022) 24(8):166770. 10.1111/dom.14737

  • 183.

    Lehmann V Föll S Maritsch M van Weenen E Kraus M Lagger S et al Noninvasive hypoglycemia detection in people with diabetes using smartwatch data. Diabetes Care. (2023) 46(5):9937. 10.2337/dc22-2290

  • 184.

    Castorino K Polsky S O'Malley G Levister C Nelson K Farfan C et al Performance of the dexcom G6 continuous glucose monitoring system in pregnant women with diabetes. Diabetes Technol Ther. (2020) 22(12):9437. 10.1089/dia.2020.0085

  • 185.

    Chekima K Noor MI Ooi YBH Yan SW Jaweed M Chekima B . Utilising a real-time continuous glucose monitor as part of a low glycaemic index and load diet and determining its effect on improving dietary intake, body composition and metabolic parameters of overweight and obese young adults: a randomised controlled trial. Foods. (2022) 11(12):1754. 10.3390/foods11121754

  • 186.

    Feig DS Asztalos E Corcoy R De Leiva A Donovan L Hod M et al CONCEPTT: continuous glucose monitoring in women with type 1 diabetes in pregnancy trial: a multi-center, multi-national, randomized controlled trial—study protocol. BMC Pregnancy Childbirth. (2016) 16(1):167. 10.1186/s12884-016-0961-5

  • 187.

    Mackillop LH Bartlett K Birks J Farmer AJ Gibson OJ Kevat DA et al Trial protocol to compare the efficacy of a smartphone-based blood glucose management system with standard clinic care in the gestational diabetic population. BMJ Open. (2016) 6(3):e009702. 10.1136/bmjopen-2015-009702

  • 188.

    Ho PW-L Szeto C-C Chow K-M . Continuous glucose monitoring device causes consternation on chest x-ray. Lancet. (2022) 399(10344):2412. 10.1016/S0140-6736(22)01017-0

  • 189.

    Joshi NK Arora V Purohit A Lohra A Joshi V Shih T et al Defeating diabetes in the desert: a community-based mHealth diabetes screening intervention in Jodhpur Rajasthan. J Family Med Prim Care. (2023) 12(8):15715. 10.4103/jfmpc.jfmpc_2273_22

  • 190.

    Vettoretti M Cappon G Acciaroli G Facchinetti A Sparacino G . Continuous glucose monitoring: current use in diabetes management and possible future applications. J Diabetes Sci Technol. (2018) 12(5):106471. 10.1177/1932296818774078

  • 191.

    Giriraja KV Govindaraj S Chandrakumar HP Ramesh B Prasad L Priyanka BR et al Clinical validation of integrated point-of-care devices for the management of non-communicable diseases. Diagnostics. (2020) 10(5):320. 10.3390/diagnostics10050320

  • 192.

    Bila R Varo R Madrid L Sitoe A Bassat Q . Continuous glucose monitoring in resource-constrained settings for hypoglycaemia detection: looking at the problem from the other side of the coin. Biosensors. (2018) 8(2):43. 10.3390/bios8020043

  • 193.

    Thapa A Chibvunde S Schwartz L Trujillo C Ferrari G Drown L et al Appropriateness and acceptability of continuous glucose monitoring in people with type 1 diabetes at rural first-level hospitals in Malawi: a qualitative study. BMJ Open. (2024) 14(5):e075559. 10.1136/bmjopen-2023-075559

  • 194.

    Bergenstal RM Layne JE Zisser H Gabbay RA Barleen NA Lee AA et al Remote application and use of real-time continuous glucose monitoring by adults with type 2 diabetes in a virtual diabetes clinic. Diabetes Technol Ther. (2021) 23(2):12832. 10.1089/dia.2020.0396

  • 195.

    Wang J Coleman DC Kanter J Ummer B Siminerio L . Connecting smartphone and wearable fitness tracker data with a nationally used electronic health record system for diabetes education to facilitate behavioral goal monitoring in diabetes care: protocol for a pragmatic multi-site randomized trial. JMIR Res Protoc. (2018) 7(4):e10009. 10.2196/10009

  • 196.

    Choi SG Oh M Park D Lee B Lee Y-h Jee SH et al Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods. Sci Rep. (2023) 13(1):13101. 10.1038/s41598-023-40170-0

  • 197.

    Pei D Gong Y Kang H Zhang C Guo Q . Accurate and rapid screening model for potential diabetes mellitus. BMC Med Inform Decis Mak. (2019) 19(1):41. 10.1186/s12911-019-0790-3

  • 198.

    Zhang L Wang Y Niu M Wang C Wang Z . Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan rural cohort study. Sci Rep. (2020) 10(1):4406. 10.1038/s41598-020-61123-x

  • 199.

    Shin J Kim J Lee C Yoon JY Kim S Song S et al Development of Various diabetes prediction models using machine learning techniques. Diabetes Metab J. (2022) 46(4):6507. 10.4093/dmj.2021.0115

  • 200.

    Liu X Zhang W Zhang Q Chen L Zeng T Zhang J et al Development and validation of a machine learning-augmented algorithm for diabetes screening in community and primary care settings: a population-based study. Front Endocrinol. (2022) 13:1043919. 10.3389/fendo.2022.1043919

  • 201.

    Tusongtuoheti X Shu Y Huang G Mao Y . Predicting the risk of subclinical atherosclerosis based on interpretable machine models in a Chinese T2DM population. Front Endocrinol. (2024) 15:1332982. 10.3389/fendo.2024.1332982

  • 202.

    Mo Y Zhao F Yuan L Xing Q Zhou Y Wu Q et al Healthcare providers’ perceptions of artificial intelligence in diabetes care: a cross-sectional study in China. Int J Nurs Sci. (2025) 12(3):21824. 10.1016/j.ijnss.2025.04.013

  • 203.

    Barreto TdO Farias FLdO Veras NVR Cardoso PH Silva GJPC Pinheiro CdO et al Artificial intelligence applied to bed regulation in Rio Grande do norte: data analysis and application of machine learning on the “RegulaRN leitos gerais” platform. PLoS One. (2024) 19(12):e0315379. 10.1371/journal.pone.0315379

  • 204.

    Hoyos W Hoyos K Ruiz R Aguilar J . An explainable analysis of diabetes mellitus using statistical and artificial intelligence techniques. BMC Med Inform Decis Mak. (2024) 24(1):383. 10.1186/s12911-024-02810-x

  • 205.

    Xiao Z Wang M Zhao Y Wang H . A biomarker-driven and interpretable machine learning model for diagnosing diabetes mellitus. Food Sci Nutr. (2025) 13(5):e70234. 10.1002/fsn3.70234

  • 206.

    Hindson J . Proteomics and machine-learning models for alcohol-related liver disease biomarkers. Nat Rev Gastroenterol Hepatol. (2022) 19(8):488. 10.1038/s41575-022-00655-1

  • 207.

    Radhakrishnan A Beaglehole D Pandit P Belkin M . Mechanism for feature learning in neural networks and backpropagation-free machine learning models. Science. (2024) 383(6690):14617. 10.1126/science.adi5639

  • 208.

    Maddali MV Churpek M Pham T Rezoagli E Zhuo H Zhao W et al Validation and utility of ARDS subphenotypes identified by machine-learning models using clinical data: an observational, multicohort, retrospective analysis. Lancet Respir Med. (2022) 10(4):36777. 10.1016/S2213-2600(21)00461-6

  • 209.

    Vu GT Tran BX McIntyre RS Pham HQ Phan HT Ha GH et al Modeling the research landscapes of artificial intelligence applications in diabetes (GAPRESEARCH). Int J Environ Res Public Health. (2020) 17(6):1982. 10.3390/ijerph17061982

  • 210.

    Rogers P McCall T Zhang Y Reese J Wang D Tong W . Leveraging AI to improve disease screening among American Indians: insights from the strong heart study. Exp Biol Med. (2025) 249:10341. 10.3389/ebm.2024.10341

  • 211.

    Vadlamani S Wachira E . AI’s ongoing impact: implications of AI’s effects on health equity for women’s healthcare providers. Rev Panam Salud Publica. (2025) 49:1. 10.26633/RPSP.2025.19

  • 212.

    Abel ED Gloyn AL Evans-Molina C Joseph JJ Misra S Pajvani UB et al Diabetes mellitus—progress and opportunities in the evolving epidemic. Cell. (2024) 187(15):3789820. 10.1016/j.cell.2024.06.029

  • 213.

    Chen J Ren W Liu J Fu Z Yao Y Chen X et al Feasibility of intelligent drug control in the maintenance phase of general anesthesia based on convolutional neural network. Heliyon. (2023) 9(1):e12481. 10.1016/j.heliyon.2022.e12481

  • 214.

    Agliata A Giordano D Bardozzo F Bottiglieri S Facchiano A Tagliaferri R . Machine learning as a support for the diagnosis of type 2 diabetes. Int J Mol Sci. (2023) 24(7):6775. 10.3390/ijms24076775

  • 215.

    Xu Z Lin A Han X . Current AI applications and challenges in oral pathology. Oral. (2025) 5(1):2. 10.3390/oral5010002

  • 216.

    Davis SE Embí PJ Matheny ME . Sustainable deployment of clinical prediction tools—a 360 ° approach to model maintenance. J Am Med Inform Assoc. (2024) 31(5):11958. 10.1093/jamia/ocae036

  • 217.

    Zhuo T Bartelt-Hofer J Brown S Lew E Sauriol L Annemans L et al The use of computer simulation modeling to estimate complications in patients with type 2 diabetes mellitus: comparative validation of the cornerstone diabetes simulation model. PharmacoEconomics Open. (2020) 4(1):3744. 10.1007/s41669-019-0156-x

  • 218.

    Voskergian D Bakir-Gungor B Yousef M . Engineering novel features for diabetes complication prediction using synthetic electronic health records. Front Genet. (2025) 16:1451290. 10.3389/fgene.2025.1451290

  • 219.

    Van Wilder L De Smedt D Sandra L Vandormael D Kaes J Willems R et al Cost-effectiveness of a digital supportive healthcare pathway for type 2 diabetes compared to usual care in Belgium. Digital Health. (2025) 11:20552076251319169. 10.1177/20552076251319169

  • 220.

    Li W Peng Y Peng K . Diabetes prediction model based on GA-XGBoost and stacking ensemble algorithm. PLoS One. (2024) 19(9):e0311222. 10.1371/journal.pone.0311222

  • 221.

    Sun X Bee YM Lam SW Liu Z Zhao W Chia SY et al Effective treatment recommendations for type 2 diabetes management using reinforcement learning: treatment recommendation model development and validation. J Med Internet Res. (2021) 23(7):e27858. 10.2196/27858

  • 222.

    Essien UR Tang Y Figueroa JF Litam TMA Tang F Jones PG et al Diabetes care among older adults enrolled in medicare advantage versus traditional medicare fee-for-service plans: the diabetes collaborative registry. Diabetes Care. (2022) 45(7):154957. 10.2337/dc21-1178

  • 223.

    Yao MS Chae A MacLean MT Verma A Duda J Gee JC et al Syntha1c: towards clinically interpretable patient representations for diabetes risk stratification. Predict Intell Med (2023). p. 4657. 10.1007/978-3-031-46005-0_5

  • 224.

    Reddick AL Gray DC . Impact of culturally tailored shared medical appointments on diabetes self-care ability and knowledge in African Americans. Prim Health Care Res Dev. (2023) 24:e30. 10.1017/S1463423623000166

  • 225.

    Holtrop JS Gurfinkel D Nederveld A Reedy J Rubinson C Kwan BM . What works in implementing shared medical appointments for patients with diabetes in primary care to enhance reach: a qualitative comparative analysis from the invested in diabetes study. Implement Sci Commun. (2024) 5(1):82. 10.1186/s43058-024-00608-6

  • 226.

    Marshall DA Grazziotin LR Regier DA Wordsworth S Buchanan J Phillips K et al Addressing challenges of economic evaluation in precision medicine using dynamic simulation modeling. Value Health. (2020) 23(5):56673. 10.1016/j.jval.2020.01.016

  • 227.

    Chalkou K Vickers AJ Pellegrini F Manca A Salanti G . Decision curve analysis for personalized treatment choice between multiple options. Med Decis Making. (2023) 43(3):33749. 10.1177/0272989X221143058

  • 228.

    Colasanti R MacLachlan A Silverman E McCann M Mitchell R Hunter R et al Using agent-based models to address non-communicable diseases: a review of models and their application to policy. Lancet. (2022) 400:S33. 10.1016/S0140-6736(22)02243-7

  • 229.

    Jiang X Lv G Li M Yuan J Lu ZK . Predicting diabetes self-management education engagement: machine learning algorithms and models. BMJ Open Diab Res Care. (2025) 13(1):e004632. 10.1136/bmjdrc-2024-004632

  • 230.

    Zisman-Ilani Y Khaikin S Savoy ML Paranjape A Rubin DJ Jacob R et al Disparities in shared decision-making research and practice: the case for black American patients. Ann Fam Med. (2023) 21(2):1128. 10.1370/afm.2943

  • 231.

    Boocha K Ploylearmsang C . Development and implementation of diabetes care by a community network based on a chronic care model. J Prim Care Community Health. (2023) 14:21501319231181874. 10.1177/21501319231181874

  • 232.

    Campbell JA Egede LE . Individual-, community-, and health system–level barriers to optimal type 2 diabetes care for inner-city African Americans: an integrative review and model development. Diabetes Educ. (2020) 46(1):1127. 10.1177/0145721719889338

  • 233.

    Cardona-Hernandez R de la Cuadra-Grande A Monje J Echave M Oyagüez I Álvarez M et al Are trends in economic modeling of pediatric diabetes mellitus up to date with the clinical practice guidelines and the latest scientific findings? J Health Econ Outcomes Res. (2025) 12(1):30. 10.36469/001c.127920

  • 234.

    Kawu AA Hederman L Doyle J O’Sullivan D . Patient generated health data and electronic health record integration, governance and socio-technical issues: a narrative review. Inform Med Unlocked. (2023) 37:101153. 10.1016/j.imu.2022.101153

  • 235.

    Xiong H Chen H Xu L Liu H Fan L Tang Q et al A survey of data element perspective: application of artificial intelligence in health big data. Front Neurosci. (2022) 16:1031732. 10.3389/fnins.2022.1031732

  • 236.

    Zhang K Qi Y Wang W Tian X Wang J Xu L et al Future horizons in diabetes: integrating AI and personalized care. Front Endocrinol. (2025) 16:1583227. 10.3389/fendo.2025.1583227

  • 237.

    Asselbergs FW Fraser AG . Artificial intelligence in cardiology: the debate continues. Eur Heart J Digit Health. (2021) 2(4):7216. 10.1093/ehjdh/ztab090

  • 238.

    DeCamp M Lindvall C . Latent bias and the implementation of artificial intelligence in medicine. J Am Med Inform Assoc. (2020) 27(12):20203. 10.1093/jamia/ocaa094

  • 239.

    Rudd J Igbrude C . A global perspective on data powering responsible AI solutions in health applications. AI Ethics. (2024) 4(4):103949. 10.1007/s43681-023-00302-8

  • 240.

    Reddy S Allan S Coghlan S Cooper P . A governance model for the application of AI in health care. J Am Med Inform Assoc. (2020) 27(3):4917. 10.1093/jamia/ocz192

  • 241.

    Larson DB Harvey H Rubin DL Irani N Tse JR Langlotz CP . Regulatory frameworks for development and evaluation of artificial intelligence–based diagnostic imaging algorithms: summary and recommendations. J Am Coll Radiol. (2021) 18(3):41324. 10.1016/j.jacr.2020.09.060

  • 242.

    Alanzi T Alhajri A Almulhim S Alharbi S Alfaifi S Almarhoun E et al Artificial intelligence and patient autonomy in obesity treatment decisions: an empirical study of the challenges. Cureus. (2023) 15:e49725. 10.7759/cureus.49725

  • 243.

    Ahluwalia VS Schapira MM Weissman GE Parikh RB . Primary care provider preferences regarding artificial intelligence in point-of-care cancer screening. MDM Policy Pract. (2025) 10(1):23814683251329007. 10.1177/23814683251329007

  • 244.

    Beam AL Manrai AK Ghassemi M . Challenges to the reproducibility of machine learning models in health care. JAMA. (2020) 323(4):305. 10.1001/jama.2019.20866

  • 245.

    Willem T Shitov VA Luecken MD Kilbertus N Bauer S Piraud M et al Biases in machine-learning models of human single-cell data. Nat Cell Biol. (2025) 27(3):38492. 10.1038/s41556-025-01619-8

  • 246.

    Ho RA Shah E De Armas JS Yan K Kaye R . Artificial intelligence models for dysphonia patient education. Otolaryngol Head Neck Surg. (2025) 173(6):145562. 10.1002/ohn.70030

  • 247.

    Straw I Rees G Nachev P . Sex-based performance disparities in machine learning algorithms for cardiac disease prediction: exploratory study. J Med Internet Res. (2024) 26:e46936. 10.2196/46936

  • 248.

    Liaw WR Ramos Silva Y Soltero EG Krist A Stotts A . An assessment of how clinicians and staff members use a diabetes artificial intelligence prediction tool: mixed methods study. JMIR AI. (2023) 2:e45032. 10.2196/45032

  • 249.

    Casella B Riviera W Aldinucci M Menegaz G . MERGE: a model for multi-input biomedical federated learning. Patterns. (2023) 4(11):100856. 10.1016/j.patter.2023.100856

  • 250.

    Ali M Naeem F Tariq M Kaddoum G . Federated learning for privacy preservation in smart healthcare systems: a comprehensive survey. IEEE J Biomed Health Inform. (2023) 27(2):77889. 10.1109/JBHI.2022.3181823

  • 251.

    Srinivasu PN Ahmed S Hassaballah M Almusallam N . An explainable artificial intelligence software system for predicting diabetes. Heliyon. (2024) 10(16):e36112. 10.1016/j.heliyon.2024.e36112

  • 252.

    Wichmann RM Robiati Bigoto MA Chiavegatto Filho ADP . Federated learning for COVID-19 mortality prediction in a multicentric sample of 21 hospitals. PLoS Comput Biol. (2025) 21(11):e1013695. 10.1371/journal.pcbi.1013695

  • 253.

    Wang Y Deng R Geng X . Exploring the integration of medical and preventive chronic disease health management in the context of big data. Front Public Health. (2025) 13:1547392. 10.3389/fpubh.2025.1547392

  • 254.

    Kalapaaking AP Khalil I Yi X Lam K-Y Huang G-B Wang N . Auditable and verifiable federated learning based on blockchain-enabled decentralization. IEEE Trans Neural Netw Learn Syst. (2025) 36(1):10215. 10.1109/TNNLS.2024.3407670

  • 255.

    Tzortzis IN Gutierrez-Torre A Sykiotis S Agulló F Bakalos N Doulamis A et al Towards generalizable federated learning in medical imaging: a real-world case study on mammography data. Comput Struct Biotechnol J. (2025) 28:10617. 10.1016/j.csbj.2025.03.031

  • 256.

    Alsaleh S Menai MEB Al-Ahmadi S . A heterogeneity-aware semi-decentralized model for a lightweight intrusion detection system for IoT networks based on federated learning and BiLSTM. Sensors. (2025) 25(4):1039. 10.3390/s25041039

  • 257.

    Elhussein A Megjhani M Nametz D Weiss M Savarraj J Kwon SB et al A generalizable physiological model for detection of delayed cerebral ischemia using federated learning. IEEE Int Conf Bioinformatics Biomed (2023). p. 18869. 10.1109/BIBM58861.2023.10385383

  • 258.

    Tong J Li L Reps JM Lorman V Jing N Edmondson M et al Advancing interpretable regression analysis for binary data: a novel distributed algorithm approach. Stat Med. (2024) 43(29):557382. 10.1002/sim.10250

  • 259.

    Wei B Li J Liu Y Wang W . Non-IID federated learning with sharper risk bound. IEEE Trans Neural Netw Learning Syst. (2024) 35(5):690617. 10.1109/TNNLS.2022.3213187

  • 260.

    Wang Y Shi Q Chang T-H . Why batch normalization damage federated learning on non-IID data?IEEE Trans Neural Netw Learning Syst. (2025) 36(1):1692706. 10.1109/TNNLS.2023.3323302

  • 261.

    Khalil SS Tawfik NS Spruit M . Federated learning for privacy-preserving depression detection with multilingual language models in social media posts. Patterns. (2024) 5(7):100990. 10.1016/j.patter.2024.100990

  • 262.

    Jing C Huang Y Zhuang Y Sun L Xiao Z Huang Y et al Exploring personalization via federated representation learning on non-IID data. Neural Netw. (2023) 163:35466. 10.1016/j.neunet.2023.04.007

  • 263.

    Tang T Han Z Cai Z Yu S Zhou X Oseni T et al Personalized federated graph learning on non-IID electronic health records. IEEE Trans Neural Netw Learning Syst. (2024) 35(9):1184356. 10.1109/TNNLS.2024.3370297

  • 264.

    Chen G Wang Q Feng Y Rekik I Chen G . Metadata-driven federated learning of connectional brain templates in non-IID multi-domain scenarios. IEEE Trans Med Imaging. (2025):1. 10.1109/TMI.2025.3631235

  • 265.

    Aketi SA Kodge S Roy K . Low precision decentralized distributed training over IID and non-IID data. Neural Netw. (2022) 155:45160. 10.1016/j.neunet.2022.08.032

  • 266.

    Sarani Rad F Hendawi R Yang X Li J . Personalized diabetes management with digital twins: a patient-centric knowledge graph approach. J Pers Med. (2024) 14(4):359. 10.3390/jpm14040359

  • 267.

    Zhang Y Qin G Aguilar B Rappaport N Yurkovich JT Pflieger L et al A framework towards digital twins for type 2 diabetes. Front Digit Health. (2024) 6:1336050. 10.3389/fdgth.2024.1336050

  • 268.

    Rigla M García-Sáez G Pons B Hernando ME . Artificial intelligence methodologies and their application to diabetes. J Diabetes Sci Technol. (2018) 12(2):30310. 10.1177/1932296817710475

  • 269.

    Singla R Singla A Gupta Y Kalra S . Artificial intelligence/machine learning in diabetes care. Indian J Endocr Metab. (2019) 23(4):495. 10.4103/ijem.IJEM_228_19

  • 270.

    Vallée A . Envisioning the future of personalized medicine: role and realities of digital twins. J Med Internet Res. (2024) 26:e50204. 10.2196/50204

  • 271.

    Thiong’o GM Rutka JT . Digital twin technology: the future of predicting neurological complications of pediatric cancers and their treatment. Front Oncol. (2022) 11:781499. 10.3389/fonc.2021.781499

  • 272.

    Rahmim A Brosch-Lenz J Fele-Paranj A Yousefirizi F Soltani M Uribe C et al Theranostic digital twins for personalized radiopharmaceutical therapies: reimagining theranostics via computational nuclear oncology. Front Oncol. (2022) 12:1062592. 10.3389/fonc.2022.1062592

  • 273.

    Berube LT Popp CJ Curran M Hu L Pompeii ML Barua S et al Diabetes telemedicine mediterranean diet (DiaTeleMed) study: study protocol for a fully remote randomized clinical trial evaluating personalized dietary management in individuals with type 2 diabetes. Trials. (2024) 25(1):506. 10.1186/s13063-024-08337-w

  • 274.

    Shamanna P Erukulapati RS Shukla A Shah L Willis B Thajudeen M et al One-year outcomes of a digital twin intervention for type 2 diabetes: a retrospective real-world study. Sci Rep. (2024) 14(1):25478. 10.1038/s41598-024-76584-7

  • 275.

    Venkatesh KP Brito G Boulos K N M . Health digital twins in life science and health care innovation. Annu Rev Pharmacol Toxicol. (2024) 64(1):15970. 10.1146/annurev-pharmtox-022123-022046

  • 276.

    Akbarialiabad H Pasdar A Murrell DF Mostafavi M Shakil F Safaee E et al Enhancing randomized clinical trials with digital twins. NPJ Syst Biol Appl. (2025) 11(1):110. 10.1038/s41540-025-00592-0

  • 277.

    Gonsard A Genet M Drummond D . Digital twins for chronic lung diseases. Eur Respir Rev. (2024) 33(174):240159. 10.1183/16000617.0159-2024

  • 278.

    Thamotharan P Srinivasan S Kesavadev J Krishnan G Mohan V Seshadhri S et al Human digital twin for personalized elderly type 2 diabetes management. J Clin Med. (2023) 12(6):2094. 10.3390/jcm12062094

  • 279.

    Lv Z . Digital twins in industry 5.0. Research. (2023) 6:0071. 10.34133/research.0071

  • 280.

    Bublil T Cohen R Kenett RS Bortman J . Machine health indicators and digital twins. Sensors. (2025) 25(7):2246. 10.3390/s25072246

  • 281.

    Islam R Keya KN Pan S Sarwate AD Foulds JR . Differential fairness: an intersectional framework for fair AI. Entropy. (2023) 25(4):660. 10.3390/e25040660

  • 282.

    Wang HE Weiner JP Saria S Kharrazi H . Evaluating algorithmic bias in 30-day hospital readmission models: retrospective analysis. J Med Internet Res. (2024) 26:e47125. 10.2196/47125

  • 283.

    Davis SE Dorn C Park DJ Matheny ME . Emerging algorithmic bias: fairness drift as the next dimension of model maintenance and sustainability. J Am Med Inform Assoc. (2025) 32(5):84554. 10.1093/jamia/ocaf039

  • 284.

    Sufian MA Alsadder L Hamzi W Zaman S Sagar ASMS Hamzi B . Mitigating algorithmic bias in AI-driven cardiovascular imaging for fairer diagnostics. Diagnostics. (2024) 14(23):2675. 10.3390/diagnostics14232675

  • 285.

    Anand P Jin Y Liu J Lii J Belitkar S Lin KJ . Minimizing racial algorithmic bias when predicting electronic health record data completeness. Clin Pharma Therapeutics. (2025) 118(5):112231. 10.1002/cpt.3758

  • 286.

    Rana SA Azizul ZH Awan AA . A step toward building a unified framework for managing AI bias. PeerJ Computer Science. (2023) 9:e1630. 10.7717/peerj-cs.1630

  • 287.

    Curto G Comim F . SAF: stakeholders’ agreement on fairness in the practice of machine learning development. Sci Eng Ethics. (2023) 29(4):29. 10.1007/s11948-023-00448-y

  • 288.

    Ghai B Mueller K . D-BIAS: a causality-based human-in-the-loop system for tackling algorithmic bias. IEEE Trans Visual Comput Graph. (2022) 29:47382. 10.1109/TVCG.2022.3209484

  • 289.

    Bechny M Fiorillo L van der Meer J Schmidt M Bassetti C Tzovara A et al Beyond accuracy: a framework for evaluating algorithmic bias and performance, applied to automated sleep scoring. Sci Rep. (2025) 15(1):21421. 10.1038/s41598-025-06019-4

  • 290.

    Bozeman JF Hollauer C Ramshankar AT Nakkasunchi S Jambeck J Hicks A et al Embed systemic equity throughout industrial ecology applications: how to address machine learning unfairness and bias. J Ind Ecol. (2024) 28(6):136276. 10.1111/jiec.13509

  • 291.

    Harvard S Winsberg E Symons J Adibi A . Value judgments in a COVID-19 vaccination model: a case study in the need for public involvement in health-oriented modelling. Soc Sci Med. (2021) 286:114323. 10.1016/j.socscimed.2021.114323

  • 292.

    Bedoya AD Economou-Zavlanos NJ Goldstein BA Young A Jelovsek JE O’Brien C et al A framework for the oversight and local deployment of safe and high-quality prediction models. J Am Med Inform Assoc. (2022) 29(9):16316. 10.1093/jamia/ocac078

  • 293.

    Digital Health and Rights Project Consortium, DavisSLM. Towards digital justice: participatory action research in global digital health. BMJ Glob Health. (2022) 7(5):e009351. 10.1136/bmjgh-2022-009351

  • 294.

    Sigfrids A Leikas J Salo-Pöntinen H Koskimies E . Human-centricity in AI governance: a systemic approach. Front Artif Intell. (2023) 6:976887. 10.3389/frai.2023.976887

  • 295.

    Shaw F McCosker A . Relational ethics in the administration of healthcare technology: AI, automation and proper distance. Sociol Health Illn. (2025) 47(5):e70055. 10.1111/1467-9566.70055

  • 296.

    Hager A Lindblad S Brommels M Salomonsson S Wannheden C . Sharing patient-controlled real-world data through the application of the theory of commons: action research case study. J Med Internet Res. (2021) 23(1):e16842. 10.2196/16842

  • 297.

    Chandler C Foltz PW Elvevåg B . Improving the applicability of AI for psychiatric applications through human-in-the-loop methodologies. Schizophr Bull. (2022) 48(5):94957. 10.1093/schbul/sbac038

  • 298.

    Nunnelley S Flood CM Da Silva M Horsley T Kanathasan S Thomas B et al Cracking the code: a scoping review to unite disciplines in tackling legal issues in health artificial intelligence. BMJ Health Care Inform. (2025) 32(1):e101112. 10.1136/bmjhci-2024-101112

  • 299.

    Krause F Paulheim H Kiesling E Kurniawan K Leva MC Estrada-Lugo HD et al Managing human-AI collaborations within industry 5.0 scenarios via knowledge graphs: key challenges and lessons learned. Front Artif Intell. (2024) 7:1247712. 10.3389/frai.2024.1247712

Summary

Keywords

artificial intelligence, digital-twin, federated learning, health-equity, precision medicine

Citation

Bai B, Liu X and Li H (2026) Federated multimodal AI for precision-equitable diabetes care. Front. Digit. Health 7:1678047. doi: 10.3389/fdgth.2025.1678047

Received

04 August 2025

Revised

10 December 2025

Accepted

16 December 2025

Published

16 January 2026

Volume

7 - 2025

Edited by

Tin-Chih Toly Chen, National Yang Ming Chiao Tung University, Taiwan

Reviewed by

Xu Zhai, China Academy of Chinese Medical Sciences, China

Mariano Vargas, National Council of Science and Technology (CONACYT), Mexico

Updates

Copyright

* Correspondence: Hong Li

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics