- 1Pingliang City Maternity and Child-care Hospital, Pingliang, China
- 2Department of Breast Cancer, Cancer Center, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
- 3SingularityFlow Co. Ltd., Beijing, China
Introduction: To improve the early prediction of hypertensive disorders of pregnancy (HDP) and gestational diabetes mellitus (GDM), we developed and validated an artificial intelligence (AI) model. This initiative was driven by the insufficient accuracy of current clinical tools. Our study aimed to determine whether integrating radiomics and deep learning features from first-trimester ultrasound scans could enhance predictive performance.
Methods: A total of 213 pregnant women who underwent ultrasound at 8 weeks of gestation were enrolled. Clinical data, radiomics features, and deep learning features were collected. Imaging features were selected using LASSO regression. Four predictive models were developed: a clinical model, a radiomics model, a deep learning model, and a fusion model combining all feature types. Model performance was evaluated on an independent test set using metrics including AUC, sensitivity, specificity, calibration, and decision curve analysis.
Results: In the training cohort, all models demonstrated excellent discriminatory ability, with the combined model achieving the highest AUC of 0.987 (95% CI: 0.9733–0.9999), followed by the DLR model (AUC = 0.985). The clinical model (AUC = 0.941) and radiomics model (AUC = 0.939) also performed well. In the test cohort, the combined model maintained superior performance with an AUC of 0.963 (95% CI: 0.9152–1.0000), significantly outperforming all single-modality models. Overall, the combined model exhibited optimal and stable predictive performance across both training and test datasets.
Discussion: This enables accurate early prediction of HDP and GDM. This non-invasive tool supports tailored prenatal care, with potential to improve outcomes. Further validation in diverse groups is needed.
1 Introduction
Hypertensive disorders of pregnancy (HDP) and gestational diabetes mellitus (GDM) are among the most prevalent complications affecting maternal and fetal health and are associated with significant long-term consequences, including preeclampsia and fetal growth restriction (1, 2). Notably, despite their distinct clinical presentations, HDP and GDM share a common pathophysiological foundation characterized by placental dysfunction, systemic inflammation, and endothelial dysfunction, which collectively contribute to profound insulin resistance. This overlapping biological rationale justifies their modeling as a combined outcome in predictive studies, as risk factors and early pathological processes for one condition often predispose to or exacerbate the other (3, 4). Early identification of high-risk pregnancies enables timely interventions, improving maternal and neonatal outcomes. Recent studies suggest that first-trimester metabolic, vascular, and inflammatory changes are independent predictive factors of mid-pregnancy complications (5). However, current screening methods rely on maternal history, biochemical markers, and lifestyle factors, which may lack sufficient predictive power. Due to the increasing prevalence of HDP and GDM worldwide, there is a critical need for non-invasive and early-stage predictive models that enable personalized pregnancy management.
Recent advances in medical imaging and artificial intelligence (AI) have demonstrated the potential of deep learning and radiomics in predicting disease outcomes by extracting high-dimensional, quantitative imaging biomarkers (6, 7). In prenatal care, ultrasound is widely accessible, non-invasive, and routinely used for fetal and maternal assessments (8). The integration of deep learning with first-trimester ultrasound has shown promising results in identifying subtle imaging patterns that may reflect early vascular dysfunction and metabolic abnormalities associated with pregnancy complications (9). Additionally, radiomics enables the extraction of texture, shape, and intensity features from ultrasound images, capturing microstructural variations that are imperceptible to the human eye (10, 11). By using these imaging-derived biomarkers along with maternal clinical parameters, fusion models can improve the accuracy of risk prediction for mid-pregnancy HDP and GDM (12).
Beyond early risk stratification, medical nutritional therapy (MNT) and structured physical activity interventions play a pivotal role in preventing and managing gestational hypertension and GDM (13, 14). Studies have shown that a balanced diet, including controlled carbohydrate intake and adequate protein consumption, reduces the risk of glucose intolerance and insulin resistance in pregnancy (15, 16). Supervised exercise programs can improve vascular function, glycemic control, and reduce gestational weight gain, mitigating the progression to GDM and hypertensive disorders (17). Furthermore, psychological support and stress management strategies have been associated with improved metabolic profiles and reduced inflammation in high-risk pregnancies (18). Incorporating these interventions into early pregnancy care, guided by AI-driven predictive models, may optimize maternal health and pregnancy outcomes.
This study aims to develop a fusion model integrating ultrasound-based deep learning, radiomics biomarkers, and maternal clinical parameters to predict mid-pregnancy HDP and GDM. By identifying patients at risk for HDP and GDM early in pregnancy, our model could facilitate timely interventions, optimize maternal-fetal health outcomes and guide personalized prenatal care strategies.
2 Materials and methods
2.1 Patient inclusion
This retrospective diagnostic study included pregnant women who underwent ultrasound screening at 8 weeks of gestation between January 2018 and December 2023. Inclusion criteria were: (1) singleton pregnancy confirmed by ultrasound; (2) gestational age determined by last menstrual period and confirmed by crown-rump length measurement at 8 weeks; (3) complete clinical data; and (4) availability of follow-up data to assess the development of HDP or GDM during the second trimester. Exclusion criteria were: (1) multiple gestation; (2) history of chronic hypertension, pregestational diabetes, renal disease, or autoimmune disorders; (3) missing clinical or ultrasound data; (4) major fetal structural anomalies; and (5) loss to follow-up before 28 weeks of gestation. Clinical and demographic data, including age, body mass index (BMI), medical history, and laboratory findings, were retrospectively collected from electronic medical records. The study workflow is shown in Figure 1. Clinical trial number: not applicable.
2.2 Ultrasound collection and preprocessing
All ultrasound examinations were performed using a high-resolution transabdominal ultrasound system (Voluson E10, GE Healthcare) equipped with a 3–9 MHz convex probe. The standard scanning protocol included acquisition of grayscale B-mode images of the gestational sac, yolk sac, and embryo in sagittal and transverse planes. Biometric parameters, including crown-rump length, gestational sac diameter, and yolk sac diameter, were recorded. Doppler imaging was used to assess early placental perfusion and maternal-fetal circulation, including uterine artery pulsatility index. Image preprocessing involved standardization of grayscale intensity, resolution normalization to 256 × 256 pixels, and conversion to grayscale for deep learning and radiomics analysis.
2.3 Multi-dimensional health management
For medical nutritional therapy, all participants received standardized dietary counseling at their first-trimester visit. Registered dietitians provided individualized nutritional plans based on maternal weight, body mass index (BMI), and dietary habits. Macronutrient distribution was adjusted according to guidelines for gestational metabolic health, with emphasis on complex carbohydrates, fiber intake, and limited saturated fats. Patients at higher risk of gestational diabetes were encouraged to follow a modified carbohydrate restriction plan with glycemic load control. For physical activity intervention, participants were counseled on maintaining an appropriate level of physical activity during pregnancy. Psychological well-being was assessed using validated stress and anxiety scales, such as the Perceived Stress Scale and the Edinburgh Postnatal Depression Scale, at baseline and during follow-up visits. Participants experiencing moderate to severe stress or anxiety were offered structured psychological support sessions, including cognitive behavioral therapy-based interventions, mindfulness training, and guided relaxation techniques.
2.4 Radiomics analysis
Radiomics feature extraction was performed using the Pyradiomics library. Segmented regions of interest (ROI) included the gestational sac and embryo, with features extracted from grayscale intensity, texture, shape, and wavelet-transformed domains. A total of 1,562 radiomics features were initially computed per ROI. Dimensionality reduction was achieved through variance thresholding, correlation analysis, and least absolute shrinkage and selection operator (LASSO) regression to identify the most predictive features.
2.5 Deep learning analysis
A convolutional neural network (CNN) was developed to automatically extract deep learning features from the ultrasound images. The network architecture was based on DenseNet121, ResNet50, ResNet101, and VGG19, pre-trained on ImageNet and fine-tuned with the study dataset. The model was trained using a cross-entropy loss function and Adam optimizer with an initial learning rate of 0.001. Data augmentation techniques, including rotation, flipping, and contrast normalization, were applied to enhance generalizability. A feature map visualization technique, such as Grad-CAM, was used to interpret the CNN's focus regions. Deep learning features from the penultimate layer of the trained model were extracted and combined with radiomics and clinical features for further feature selection and model construction.
2.6 Model construction
Four predictive models were constructed to evaluate the contribution of different data modalities: (1) Clinical Model: using logistic regression classifier, incorporating maternal age, BMI, blood pressure, glucose levels, and other relevant clinical parameters; (2) Radiomics Model: using a logistic regression classifier trained on selected radiomics features; (3) Deep Learning Model: a CNN-based model trained on ultrasound images for automatic feature extraction and classification; (4) Fusion Model: A multimodal fusion model incorporating all three feature sets. Feature-level fusion was performed by concatenating the radiomics, CNN-extracted, and clinical features before input them into a fully connected layer for final classification.
2.7 Statistical analysis
All statistical analyses were performed using R (version 4.1.0) and Python (version 3.8). Baseline characteristics between groups (with and without gestational hypertension or diabetes) were compared using the student's t-test or Mann–Whitney U test for continuous variables and the chi-square or Fisher's exact test for categorical variables. A two-tailed p-value < 0.05 was considered statistically significant. Model performance was evaluated using area under the curve, sensitivity and specificity. Calibration curves and decision curve analysis were performed to assess the clinical utility of each model. Deep learning model development and training were accelerated using an NVIDIA GeForce RTX 4090 GPU, while conventional machine learning algorithms were executed on an Intel Core i9-14900K CPU with 128 GB RAM.
2.8 Patient characteristics
A total of 213 patients were enrolled, including 47 with adverse outcomes and 166 with non-adverse outcomes (Table 1). Baseline characteristics such as fasting blood glucose (FBG) [median 5 mmol/L in both groups, p = 0.457], weight gain [median 12 kg in both groups, p = 0.479], age distribution (p = 0.259), and body mass index (BMI) categories (p = 0.759) showed no significant differences between two groups. However, significant differences emerged regarding hypertension status; 40.4% of patients in the adverse outcome group were hypertensive compared to only 9.6% in the non-adverse group (p < 0.001). Lifestyle and intervention adherence varied notably; a significantly lower proportion of patients with adverse outcomes attended weekly lectures (78.7% vs. 97%, p < 0.001), participated in psychological counseling (76.6% vs. 98.2%, p < 0.001), and completed online tutoring (17% vs. 3%, p = 0.001). Moreover, regular exercise (five times per week) was paradoxically more common among patients experiencing adverse outcomes (83%) compared to those without adverse outcomes (4.2%, p < 0.001).
2.9 Univariable and multivariable analysis
We conducted extensive univariable logistic regression analyses to screen clinical variables potentially associated with HDP or GDM. Variables such as psychological testing (OR = 1.507, p < 0.05), resistance exercise (OR = 1.783, p < 0.05), online tutoring frequency (OR = 1.783, p < 0.05), aerobic exercise frequency (OR = 1.929, p < 0.05), and relaxation exercises (OR = 2.268, p < 0.05) showed statistically significant association with pregnancy complications. Multivariable analysis identified dedicated psychological counseling (OR = 1.333, p < 0.05), weekly offline pregnancy classes (OR = 1.425, p < 0.05), regular weekly exercises (OR = 1.456, p < 0.05), and relaxation exercises (OR = 1.650, p < 0.05) as independently significant predictors, indicating their clinical relevance for subsequent model integration (Figure 2).
2.10 Deep learning analysis
In the deep learning analyses, various CNN architectures including DenseNet121, ResNet50, ResNet101, and VGG19 were tested. DenseNet121 outperformed the other CNN architectures. In the training cohort, DenseNet121 achieved an AUC of 0.924 (95%CI: 0.865–0.983), demonstrating excellent predictive capability. The sensitivity was 100%, specificity was 72.4%, PPV was 93.4%, and NPV was 100%. In the testing cohort, DenseNet121's performance slightly declined but remained robust, with an accuracy of 0.787 and AUC of 0.747 (95%CI: 0.574–0.920), indicating great generalizability. Sensitivity and PPV remained high (90.2% and 85.2%, respectively), although specificity (20%) and NPV (28.6%) were lower. Grad-CAM visualization further elucidated the interpretability of the deep learning model, highlighting specific ultrasound regions integral to predictive outcomes (Figure 3). Regions of high relevance were visually confirmed, providing transparency to the deep learning features and potential clinical interpretability.
Figure 3. The grad-CAM visualizations for two typical samples, identified as “1” and “26.” these visualizations are instrumental in demonstrating how the model focuses on different regions of the images for making its predictions.
2.11 Radiomics analysis
From ultrasound images obtained at 8 weeks of pregnancy, we extracted 1,562 radiomics features, including first-order intensity, shape, and texture features. Feature selection using LASSO regression identified a subset of highly predictive radiomics features. We evaluated several classifiers, including logistic regression (LR), RandomForest, and ExtraTrees, based on ultrasound-derived deep learning and radiomics features. Logistic regression outperformed other models, demonstrating superior predictive accuracy with an AUC of 0.985 (95%CI: 0.970–1.000) in the training cohort, reflecting perfect discriminatory capability. Performance remained robust in the independent testing cohort with an AUC of 0.820 (95%CI: 0.665–0.974), suggesting excellent model generalizability (Figure 4). In contrast, RandomForest and ExtraTrees models showed overfitting, with marked reductions in AUC from training to testing cohorts (RandomForest: training AUC 0.973, testing AUC 0.757; ExtraTrees: training AUC 0.988, testing AUC 0.816).
Figure 4. Receiver operating characteristic (ROC) curves for the deep learning signature. The figure presents (a) the performance from 5-fold cross-validation, followed by the ROC curves on the (b) training and (c) independent test cohorts, respectively.
2.12 Fusion model construction and comparasion
To optimize predictive performance, we integrated clinical, radiomics, and deep learning features. Initially, we developed a clinical model incorporating lifestyle factors, exercise routines, psychological interventions, and dietary habits, achieving robust predictive performance (AUC, 0.943; 95% CI: 0.887–0.999) in test cohort. The radiomics model, constructed using selected radiomics features and the ExtraTrees classifier, yielded an AUC of 0.751 (95% CI: 0.564–0.938) in the test set. Concurrently, a deep learning model leveraging DenseNet-121 architecture achieved an AUC of 0.747 (95% CI: 0.574–0.920) in the test set.
To further enhance predictive accuracy, we combined these distinct data sources into an integrated Fusion Model, merging clinical characteristics with deep learning and radiomic signatures. The combined model demonstrated significantly improved discriminatory capacity, achieving an AUC of 0.987 (95% CI: 0.973–1.000) in the training cohort and maintaining high performance with an AUC of 0.963 (95% CI: 0.915–1.000) in the test cohort, surpassing individual modalities (Figure 5). Calibration analyses indicated excellent model reliability, with Hosmer-Lemeshow statistics of 0.325 (training cohort) and 0.197 (test cohort), demonstrating close agreement between predicted and observed risk. Statistical comparison using the DeLong test confirmed that the fusion model significantly outperformed the individual radiomics and deep learning models (all p < 0.05).
Figure 5. The receiver operating characteristic (ROC) curves for different signatures across train (a) and test (b) cohorts, offering a visual comparison of their diagnostic abilities.
2.13 Clinical validation of models
To evaluate the clinical applicability of the predictive models, we performed validation using an independent cohort of pregnant women assessed at 8 weeks of gestation. The performance of the clinical, radiomics, deep learning, and combined models was thoroughly compared. Specifically, the fusion model, integrating clinical characteristics, radiomic signatures, and deep learning-derived imaging features, exhibited superior predictive capability. This represented notable improvement compared with the individual models, affirming the additive value of incorporating diverse sources of data into predictive modeling.
The fusion model demonstrated optimal sensitivity (100%) and acceptable specificity (83.6%) in the test cohort, underscoring its practical utility for clinical screening purposes. High sensitivity is particularly crucial for clinical scenarios where early identification of women at risk for GH or GDM is vital for timely preventive measures and interventions. The high NPV further indicates the combined model's strength in accurately identifying women unlikely to develop these complications, thus helping clinicians effectively allocate medical resources. Decision curve analysis further emphasized the substantial clinical net benefit conferred by the fusion model, affirming its applicability to prenatal healthcare practice.
3 Discussion
HDP and GDM are among the most common complications affecting pregnant women, leading to increased maternal and fetal morbidity and mortality. Early prediction and timely intervention remain critical for mitigating adverse pregnancy outcomes. Traditional risk stratification relies on maternal demographic and clinical characteristics, such as age, body mass index (BMI), family history, and metabolic markers, which often fail to capture complex physiological changes occurring early in pregnancy. In this study, we developed a fusion model incorporating first-trimester ultrasonographic deep learning features, radiomic signatures, and clinical parameters to predict mid-pregnancy hypertension or GDM, offering a novel and integrative approach to risk assessment.
Ultrasound imaging during early pregnancy provides a non-invasive and cost-effective modality, widely used for fetal viability and health evaluation (19). However, conventional ultrasound interpretation relies largely on qualitative assessments and is subject to interobserver variability. Compared to ultrasonographer's subjective assessment, deep learning and radiomics tools have demonstrated superior predictive power by leveraging complex spatial and textural information that is imperceptible to the human eye (20). The incorporation of deep learning-based feature extraction enables an objective and automated analysis of high-dimensional image data, enhancing diagnostic consistency and predictive accuracy. Radiomics further complements deep learning approaches by quantifying subtle texture patterns and tissue heterogeneity, providing biomarkers that reflect tissue microstructure, perfusion, and metabolic function related with hypertensive disorders and GDM. The ability of our fusion model to accurately predict the combined outcome of HDP and GDM may be attributed to its capacity to detect these subtle, shared imaging signatures of underlying placental and vascular dysfunction, which aligns with the common pathophysiological basis of these conditions. Several studies have demonstrated the efficacy of deep learning and radiomics in predicting obstetric complications, including preeclampsia and fetal growth restriction, underscoring the potential of image-based biomarkers in maternal-fetal medicine (21, 22). Compared to traditional clinical models, the integration of deep learning and radiomics enhances predictive accuracy and facilitates personalized risk stratification.
Beyond predictive modeling, the clinical impact of early identification extends to targeted intervention strategies, including MNT, structured exercise programs, and psychological support (23). MNT, particularly for high-risk women, plays a pivotal role in optimizing maternal glucose metabolism and blood pressure regulation. Emerging evidence suggests that early dietary modifications, such as a low-glycemic diet and appropriate caloric intake adjustments, can significantly reduce the incidence of GDM (24). Similarly, structured exercise programs tailored to pregnant women have been shown to mitigate the risks of both GDM and hypertensive disorders by improving insulin sensitivity and cardiovascular function (17). Furthermore, psychological counseling and stress management strategies may contribute to improved pregnancy outcomes by modulating maternal stress-related hormonal responses, which are implicated in both hypertensive and metabolic disorders during pregnancy (25). By integrating our model into routine prenatal screening, high-risk women can be identified as early as 8 weeks of gestation, allowing obstetricians to initiate MNT, exercise prescriptions, and psychological support in a timely manner to mitigate complications.
From a clinical perspective, the implementation of AI-driven predictive models holds substantial promise. In comparison to biochemical markers, which often require laboratory testing and may exhibit gestational variability, ultrasound-based tools offer a real-time, non-invasive, and widely accessible solution. Integrating deep learning and radiomics into routine prenatal screening could provide obstetricians with a more refined risk assessment tool, complementing standard laboratory and biometric evaluations. Moreover, our model's ability to identify high-risk pregnancies at an early gestational stage provides an extended window for targeted interventions, potentially improving maternal and neonatal outcomes. This approach aligns with the ongoing shift toward precision medicine, allowing for individualized monitoring and tailored intervention plans. However, several challenges remain before widespread clinical adoption can be achieved. Model generalizability requires validation across diverse populations with varying ethnic and socioeconomic backgrounds. Additionally, the black-box nature of deep learning necessitates efforts to enhance model interpretability, ensuring that predictions align with established pathophysiological mechanisms. Future research should also explore real-time ultrasound image acquisition and automated feature extraction workflows to streamline integration into clinical practice.
Our study has several limitations. First, variability in ultrasound acquisition protocols across institutions may introduce model bias, underscoring the need for standardization in image collection and preprocessing. Secondly, although deep learning models excel in pattern recognition, their black-box nature poses challenges for clinical explainability, necessitating further refinement in interpretable AI tools. Another limitation is the reliance on mid-pregnancy outcomes, which, although clinically relevant, may not fully capture long-term maternal and neonatal sequelae. Future studies should explore the longitudinal impact of early AI-based risk stratification on pregnancy outcomes and postnatal health.
In conclusion, our study demonstrates the feasibility of integrating deep learning, radiomics biomarkers, and clinical factors into a robust predictive model for gestational hypertension and GDM. The model's performance in predicting this combined outcome is supported by the overlapping pathophysiology of these disorders. This approach offers a non-invasive, early risk assessment tool with clinical utility, facilitating personalized prenatal care and timely intervention. Future study should focus on expanding external validation and integrating AI-driven predictions into clinical decision support systems to further improve obstetric care.
4 Limitations
This study has several limitations that should be acknowledged. First and foremost, the single-center, retrospective design is a key limitation. While we implemented rigorous internal validation with a randomized train-test split, which showed no significant demographic differences (Table 1), the generalizability of our findings remains to be verified. The patient population, imaging protocols, and clinical practices at a single institution may introduce inherent biases and may not be fully representative of other centers or broader demographic groups. Therefore, external validation on multi-center and prospective cohorts is essential to confirm the robustness and clinical applicability of our proposed model before it can be widely adopted.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Ethics Committee of Pingliang City Maternity and Child-care Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants' legal guardians/next of kin. Written informed consent was obtained from the individual(s) for the publication of any identifiable images or data included in this article.
Author contributions
JG: Conceptualization, Data curation, Validation, Writing – original draft, Writing – review & editing. YH: Conceptualization, Methodology, Validation, Writing – review & editing. ZZ: Investigation, Software, Writing – original draft. BS: Data curation, Methodology, Writing – original draft. SX: Formal analysis, Investigation, Writing – original draft. YM: Data curation, Investigation, Writing – original draft. YL: Formal analysis, Investigation, Writing – original draft. ZG: Data curation, Software, Writing – original draft. LS: Conceptualization, Supervision, Writing – review & editing.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Conflict of interest
ZZ was employed by SingularityFlow Co. Ltd.
The remaining author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Footnote
Abbreviations AUC, area under the receiver operating characteristic curve; ROC, receiver operating characteristic; CI, confidence interval; CNN, convolutional neural network; DCA, decision curve analysis; DLR, deep learning radiomics; GDM, gestational diabetes mellitus; GLCM, gray-level co-occurrence matrix; GLRLM, gray-level run-length matrix; GLSZM, gray-level size zone matrix; NGTDM, neighborhood gray-tone difference matrix; Grad-CAM, gradient-weighted class activation mapping; HL test, Hosmer-Lemeshow test; LASSO, least absolute shrinkage and selection operator; LR, logistic regression; NPV, negative predictive value; PPV, positive predictive value; OR, odds ratio.
References
1. Rocha M, Daniels K, Chandrasekaran S, Michopoulos V. Trauma and posttraumatic stress disorder as important risk fac-tors for gestational metabolic dysfunction. Am J Perinatol. (2024) 41:1895–907. doi: 10.1055/a-2260-5051
2. Horsch A, Garthus-Niegel S, Ayers S, Chandra P, Hartmann K, Vaisbuch E, et al. Childbirth-related posttraumatic stress disorder: definition, risk factors, pathophysiology, diagnosis, prevention, and treatment. Am J Obstet Gynecol. (2024) 230:S1116–27. doi: 10.1016/j.ajog.2023.09.089
3. Jiang L, Tang K, Magee LA, von Dadelszen P, Ekeroma A, Li X, et al. A global view of hypertensive disorders and diabetes mellitus during pregnancy. Nat Rev Endocrinol. (2022) 18:760–75. doi: 10.1038/s41574-022-00734-y
4. Su Y, Niu Y, Zhao B, Su S, Klein C, Hou X, et al. Hypertensive disorders of pregnancy and gestational diabetes mellitus affect fetal growth and perinatal outcomes in women undergoing in vitro fertilization. AJOG Global Reports. (2025) 5:100571. doi: 10.1016/j.xagr.2025.100571
5. Nyando M, Makombe D, Mboma A, Mwakilama E, Nyirenda L. Perceptions of pregnant women on antenatal care visit during their first trimester at area 25 health center in Lilongwe, Malawi - a qualitative study. BMC Womens Health. (2023) 23:646. doi: 10.1186/s12905-023-02800-7
6. Demir K, Sokmen O, Karabey AI, Torenek-Agirman K. Comprehensive insights into artificial intelligence for dental lesion detection: a systematic review. Diagnostics. (2024) 14:2768. doi: 10.3390/diagnostics14232768
7. Podină N, Gheorghe EC, Constantin A, Cazacu I, Croitoru V, Gheorghe C, et al. Artificial intelligence in pancreatic imaging: a systematic review. United Eur Gastroenterol J. (2025) 13:55–77. doi: 10.1002/ueg2.12723
8. Pluym ID, Afshar Y, Holliman K, Kwan L, Bolagani A, Mok T, et al. Accuracy of automated three-dimensional ultrasound imaging technique for fetal head biometry. Ultrasound Obstet Gynecol. (2021) 57:798–803. doi: 10.1002/uog.22171
9. Coronado-Gutiérrez D, Eixarch E, Monterde E, Matas I, Traversi P, Gratacós E, et al. Automatic deep learning-based pipeline for automatic delineation and measurement of fetal brain structures in routine mid-trimester ultrasound images. Fetal Diagn Ther. (2023) 50:480–90. doi: 10.1159/000533203
10. Gelikman DG, Rais-Bahrami S, Pinto PA, Turkbey B. AI-powered radiomics: revolutionizing detection of urologic malig-nancies. Curr Opin Urol. (2024) 34:1–7. doi: 10.1097/MOU.0000000000001144
11. Tay SH, Zhang X, Chua MLK. Radiomics in precision oncology: hype or ludum mutante. BMC Med. (2023) 21:465. doi: 10.1186/s12916-023-03165-2
12. Pan YQ, Huang XX, Jiang XM. Risk factors and prediction model for low-birth-weight infants born to women with gesta-tional diabetes mellitus. Front Public Health. (2024) 12:1432033. doi: 10.3389/fpubh.2024.1432033
13. Wilkinson SA, Donaldson E, Willcox J. Nutrition and maternal health: a mapping of Australian dietetic services. BMC Health Serv Res. (2020) 20:660. doi: 10.1186/s12913-020-05528-4
14. Burstein R, Henry NJ, Collison ML, Marczak LB, Sligar A, Watson S, et al. Mapping 123 million neonatal, infant and child deaths between 2000 and 2017. Nature. (2019) 574:353–8. doi: 10.1038/s41586-019-1545-0
15. Apolo-Arenas MD, Guerrero-Nogales L, Díaz-Muñoz CL, Caro-Puértolas B, Parraca JA, Caña-Pino A. Use of the hy-perbaric chamber versus conventional treatment for the prevention of amputation in chronic diabetic foot and the in-fluence on fitting and rehabilitation: a systematic review. Int J Vasc Med. (2024) 2024:8450783. doi: 10.1155/ijvm/8450783
16. Giussani M, Lieti G, Orlando A, Parati G, Genovesi S. Fructose intake, hypertension and cardiometabolic risk factors in children and adolescents: from pathophysiology to clinical aspects. A Narrative Review. Front Med. (2022) 9:792949. doi: 10.3389/fmed.2022.792949
17. Campos MDSB, Buglia S, Colombo CSSDS, Buchler RDD, Brito ASXD, Mizzaci CC, et al. Position statement on exercise during pregnancy and the post-partum period - 2021. Arq Bras Cardiol. (2021) 117:160–80. doi: 10.36660/abc.20210408
18. Kokori E, Aderinto N, Olatunji G, Komolafe R, Abraham IC, Babalola AE, et al. Maternal and fetal neurocognitive outcomes in preeclampsia and eclampsia; a narrative review of current evidence. Eur J Med Res. (2024) 29:470. doi: 10.1186/s40001-024-02070-5
19. Shaamash AH, Aly HA, Abdel-Aleem M, Akhnowkh SN. Clinical and ultrasound evaluation of early threatened mis-carriage to predict pregnancy continuation up to 28 weeks: a prospective cohort study. J Ultrasound Med. (2020) 39:1777–85. doi: 10.1002/jum.15282
20. Liao H, Yuan J, Liu C, Zhang J, Yang Y, Liang H, et al. Feasibility and effectiveness of au-tomatic deep learning network and radiomics models for differentiating tumor stroma ratio in pancreatic ductal adenocar-cinoma. Insights Imaging. (2023) 14:223. doi: 10.1186/s13244-023-01553-z
21. Zheng W, Jiang Y, Jiang Z, Li J, Bian W, Hou H, et al. Association between deep learning radiomics based on placental MRI and preeclampsia with fetal growth restriction: a multicenter study. Eur J Radiol. (2025) 184:111985. doi: 10.1016/j.ejrad.2025.111985
22. Li Q, Wei X, Wu F, Qin C, Dong J, Chen C, et al. Development and validation of preeclampsia predictive models using key genes from bioinformatics and machine learning approaches. Front Immunol. (2024) 15:1416297. doi: 10.3389/fimmu.2024.1416297
23. Fullerton JM, Koller DL, Edenberg HJ, Foroud T, Liu H, Glowinski AL, et al. Assessment of first and second degree relatives of individuals with bipolar disorder shows in-creased genetic risk scores in both affected relatives and young at-risk individuals. Am J Med Genet B Neuropsychiatr Genet. (2015) 168:617–29. doi: 10.1002/ajmg.b.32344
24. Zhang C, Rawal S, Chong YS. Risk factors for gestational diabetes: is prevention possible? Diabetologia. (2016) 59:1385–90. doi: 10.1007/s00125-016-3979-3
Keywords: deep learning, gestational diabetes mellitus, hypertensive disorders of pregnancy, radiomics, ultrasound
Citation: Guo JConceptualizationData curationValidationWriting – original draftWriting – review & editing, Huang YConceptualizationMethodologyValidationWriting – review & editing, Zhang ZInvestigationSoftwareWriting – original draft, Shi BData curationMethodologyWriting – original draft, Xi SFormal analysisInvestigationWriting – original draft, Mai YData curationInvestigationWriting – original draft, Liang YFormal analysisInvestigationWriting – original draft, Guo ZData curationSoftwareWriting – original draft and Shang LConceptualizationSupervisionWriting – review & editing () Development and validation of a multi-modality system combining radiomics and deep learning for predicting mid-pregnancy complications and enabling timely pregnancy care. Front. Pediatr. 13:1716073. doi: 10.3389/fped.2025.1716073
Received: 30 September 2025; Revised: 28 November 2025;
Accepted: 29 November 2025;
Published: 18 December 2025.
Edited by:
Jun Shi, Peking University, ChinaReviewed by:
Yuxiao Liu, Sanford Burnham Prebys Medical, La Jolla, Discovery Institute, United StatesLixuan Qian, City University of New York, New York City, United States
Yunfei Li, MRL Global Medical Affairs, China
Copyright: © 2025 Guo, Huang, Zhang, Shi, Xi, Mai, Liang, Guo and Shang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Juan Guo, MTM4MzAzNTU5MDNAMTYzLmNvbQ==; Yuhong Huang, aHVhbmd5aDI4QHNpbmEuY24=; Lantian Shang, NDI1NTU3MTY3QHFxLmNvbQ==
Zhiwei Zhang3