- Department of Obstetrics and Gynecology, Binhai County People’s Hospital, Yancheng, Jiangsu, China
Background: Premature rupture of membranes (PROM) poses significant risks to both maternal and neonatal health. This study aims to construct a risk factor prediction model related to PROM by using machine learning technology and explore the association with nutritional inflammatory index.
Methods: A retrospective analysis was conducted on patients with PROM. Based on the variables screened out by ridge regression and Boruta algorithm, univariate and multivariate logistic regression analyses were further adopted. According to the sample data, it is divided into the training set and the internal validation set in a ratio of 7:3. The research group adopted four machine learning algorithms: Extreme Gradient Boost (XGBoost), Support Vector Machine (SVM), Logistic Regression (LR), and Random Forest (RF). The selected variables were incorporated into model construction, with the area under the receiver operating characteristic (ROC) curve (AUC) serving as a criterion for model selection. Model performance was assessed using AUC values, sensitivity, specificity, recall, F1 score, and accuracy. The variables were selected based on the contribution degree of the variables in Shapley additive Interpretation (SHAP) to construct the nomogram.
Results: A retrospective analysis was conducted involving 800 parturients at Binhai County People’s Hospital from January 2023 to October 2024, comprising 400 with PROM and 400 with normal delivery. The RF model demonstrated superior performance with an AUC of 0.757, sensitivity of 67.4%, and specificity of 65.1%. Key predictive factors identified included body mass index (BMI), prognostic nutritional index (PNI), platelet, albumin, and aggregate index of systemic inflammation (AISI). The ROC of the model also showed good efficacy, with an AUC of 0.777.
Conclusion: This study highlights the potential of machine learning in enhancing the understanding and prediction of PROM, and emphasizes the significance of inflammatory and nutritional indicators, paving the way for future research in maternal-fetal medicine.
1 Introduction
Premature rupture of membranes (PROM) is a well-known risk factor for preterm birth, with an incidence rate as high as 10%, accounting for 30%–40% of all preterm births (1). PROM is a significant obstetric complication characterized by the rupture of fetal membranes before the onset of labor. This condition can lead to increased risks for both maternal and neonatal morbidity and mortality, including infections and adverse fetal outcomes. Pregnant women may suffer from placental abruption, intrauterine infection, puerperal infection and postpartum hemorrhage, while newborns may suffer from fetal distress, NRDS, intracranial hemorrhage and septicemia, and the morbidity and mortality of newborns in the perinatal period are significantly increased (2). The pathophysiological mechanisms underlying PROM are complex and multifactorial, involving factors such as infection, inflammation, and mechanical stress on the membranes. Reproductive tract infections have been shown to affect the structural integrity and function of the fetal membrane, thereby promoting the occurrence of premature PROM (3, 4). Understanding the risk factors associated with PROM is crucial for developing preventive strategies and optimizing management to improve outcomes for both mothers and infants.
Systemic inflammation is characterized by changes in neutrophilia, lymphocytopenia, and thrombocytosis (5). However, the individual sensitivity of these parameters is quite poor, so various indices containing combinations of parameters have been developed. Neutrophil and lymphocyte ratio (NLR), lymphocyte and monocyte ratio (LMR), platelet and lymphocyte ratio (PLR), systemic immune inflammation index (SII) and Prognostic Nutritional Index (PNI) indicators have been studied as new inflammatory markers in many diseases (6–9). One study demonstrated the prognostic significance of NLR, PLR, MLR, and SII in predicting latency in patients with PROM (10). In recent years, studies have found a significant association between the systemic inflammatory response index (SIRI) and pregnancy-related pathological conditions (such as gestational hypertension, PROM, etc.) suggesting that it may play an important role in the inflammatory response and related complications during pregnancy (10, 11). The NLR in advanced lung cancer inflammation index (ALI) reflects the systemic inflammatory state, while the occurrence of PROM may be related to intrauterine infection or inflammatory response. Theoretically, the systemic inflammatory state may affect pregnancy outcomes, but currently there is a lack of direct evidence (12). Aggregate index of systemic inflammation (AISI) has the potential to predict admission to the neonatal intensive care unit and chorioamnionitis in pregnant women with PROM (13).
In recent years, the emergence of machine learning techniques in the field of obstetrics has paved the way for enhanced prediction models aimed at identifying risk factors associated with adverse pregnancy outcomes (14). Understanding the predictive factors for PROM can inform clinical decision-making and improve outcomes for affected patients. A notable gap in the research remains regarding the integration of machine learning methodologies in the identification of risk factors specifically associated with PROM. Traditional statistical methods, while valuable, often fall short in handling the complexity of high-dimensional data. In contrast, machine learning algorithms have demonstrated superior performance in various medical applications by effectively managing such complexities and enhancing predictive accuracy (15, 16).
This research endeavors to harness the capabilities of machine learning to advance our understanding of the risk factors associated with PROM, with the ultimate goal of improving patient outcomes through informed clinical decision-making. By bridging the gap between computational techniques and clinical application, we hope to contribute to the evolving landscape of obstetric care and precision medicine.
2 Materials and methods
2.1 Study population
From January 2023 to October 2024, a total of 800 cases of women giving birth in the obstetrics department were collected in Binhai County People’s Hospital, of which 400 were women giving birth normally and 400 were women with PROM. The study was approved and supervised by the Ethics Committee of Binhai County People’s Hospital (Approval No. 2024-BYKYLL-016) and was conducted in accordance with the standards of the Declaration of Helsinki (revised in 2013).
The inclusion criteria are as follows:
(1) Parturients aged ≥ 18 years old, with singleton pregnancies, and who delivered in the hospital during the gestation period from ≥ 28 weeks to < 42 weeks;
(2) Parturients who meet the diagnostic criteria for PROM and those with normal delivery.
Exclusion criteria:
(1) Pregnancy comorbidities and complications, such as severe maternal diseases (preeclampsia, placental abruption, placenta previa with hemorrhage), and other diseases that may interfere with the analysis of inflammatory indicators;
(2) Within 2 weeks before delivery, there is a clear history of bacterial/viral infection (such as positive Group B Streptococcus, positive TORCH [Toxoplasma; Other; Rubella virus; Cytomegalovirus; Herpessimplex virus)-Immunoglobulin M(IgM)] or treatment with antibiotics/immunosuppressants;
(3) Combined with autoimmune diseases (such as systemic lupus erythematosus, rheumatoid arthritis, etc.) or immune deficiency status like acquired Immune Deficiency Syndrome(AIDS);
(4) Missing data and information, incomplete laboratory indicators or basic personal information;
(5) Pregnant women with malignant tumors.
2.2 Clinical data collection
Our study included the following variables: (1) Basic information: age, smoking and alcohol consumption; (2) Laboratory indicators: white blood cell (WBC), neutrophil count, lymphocyte count, monocyte count and platelet count; (3) Underlying diseases: hypertension and diabetes; (4) Pregnancy status: whether it is multiple pregnancy and whether it is multiparous. The laboratory indicators for this study were collected by trained professionals and the biochemical components were analyzed using the same automated system, performed by the laboratory physician and scored by the specialist. Previous diseases and pregnancy status were mainly determined by the patient’s medical history and B-ultrasound examination.
Requirements for the collection of laboratory indicators: PROM group: All blood samples were collected from the last routine prenatal examination before the occurrence of PROM, and met the following conditions: collected before the occurrence of PROM, and no antibiotics (such as cephalosporins), glucocorticoids (such as dexamethasone), or uterine contraction induction therapy were received; There were no signs of infection from the prenatal examination to the occurrence of PROM. Exclude those who received vaginal procedures or immunomodulatory drugs after prenatal examinations. Normal delivery group: Blood samples were selected from pregnant women with matched gestational ages (± 1 week) and no PROM. They were collected from routine prenatal examinations at the same gestational age (such as examinations at 36 weeks of pregnancy), and met the following conditions: no uterine contractions were initiated and no analgesic drugs were used; No PROM occurred within at least two weeks after the prenatal examination (confirmed by follow-up).
2.3 Calculation of nutritional inflammation index
Neutrophil-to-lymphocyte ratio, LMR, PLR, SII, PNI, SIRI, ALI and AISI indicators are calculated as follows: NLR = neutrophil count/lymphocyte count; LMR = Lymphocyte count/monocyte count; PLR = Platelet count/lymphocyte count; SII = (neutrophil count × platelet count)/lymphocyte count; PNI = albumin + 5 × lymphocyte count; SIRI = Neutrophil count × monocyte count/lymphocyte count; ALI = body mass index (BMI) × serum albumin level/NLR; AISI = platelet count × neutrophil count × monocyte count/lymphocyte count. Due to the right-biased distribution of AISI, AISI underwent a logarithmic transformation (Ln) in the regression analysis.
2.4 Statistical analysis
Baseline characteristics of all included patients were stratified according to whether PROM had occurred. Variables that did not conform to the normal distribution were represented as the quartile range, and the Mann-Whitney U test was used to compare differences between groups. Categorical variables are expressed as percentages and compared using chi-square tests. Firstly, features are selected based on ridge regression analysis and the Boruta algorithm. This method selects important features through compression coefficients and reduces dimensions, screens out features with greater contributions and eliminates redundant features. Subsequently, based on the variables screened out by ridge regression and the Boruta algorithm, univariate and multivariate logistic regression analyses were further adopted to select the variables with statistical differences. Meanwhile, restricted cube plots (RCS) and threshold effect analysis were adopted to screen the conceptuality between variables and TPROM. Subsequently, the sample data were divided into the training set and the internal validation set in a ratio of 7:3. The team used four machine learning algorithms: extreme Gradient Boosting (XGBoost), support vector machines (SVM), logistic regression (LR), and Random Forest (RF). The final selected variables were incorporated into the model construction. In selecting the best model, we use the maximum area under the curve (AUC) of the receiver operating characteristic curve (ROC) in the verification set as the evaluation basis. The performance of the predictive model was evaluated by the AUC value, sensitivity, specificity, recall rate, F1 score and accuracy under the ROC of the training set and validation set. The higher the value of AUC, the better the ability of the model to distinguish. Sensitivity and specificity reflect the ability of the model to correctly identify positive and negative samples, respectively. Recall rates and F1 scores were combined to account for the sensitivity and specificity of the model. In addition, decision curve analysis (DCA) and calibration curves were plotted to demonstrate true clinical utility. Using the Shapley additive explanations (SHAP) method, a bar graph was plotted to show the contribution of each feature to the predicted results. SHAP evaluation of selected cases shows the impact of specific features on a particular sample and helps us understand the decision-making process of the model. Finally, based on the screened variables, we constructed the nomogram to enable readers to have a more intuitive understanding of the factors causing premature rupture of membranes. All statistical analyses were performed using R software (version 4.3.0) and STATA 17.0 (64-bit), with bilateral p-values < 0.05 considered statistically significant.
3 Results
3.1 Demographic and clinical characteristics
A total of 800 parturients were included in this study, of which 400 (50%) were normal parturients and 400 (50%) were PROM. Table 1 shows the baseline information. Compared with normal parturients, the levels of neutrophils, NLR, SII and AISI of parturients with PROM were significantly higher, with statistical significance (all p < 0.05). The indexes of lymphocytes, platelets, albumin, LMR, PLR, PNI and ALI in parturients with PROM were lower than those in normal parturients, and the differences were statistically significant (all p < 0.05). PROM was more likely to occur in multiple pregnancies, multiparous, people with hypertension, diabetes, smoking habits and drinking habits (all p < 0.05). However, there was no statistical significance for age.
3.2 Feature selection
Based on the indicators with statistically significant differences in the previous baseline characteristic analysis, this study optimized variable selection through a dual-feature screening strategy. Ridge regression analysis: To solve the problem of multicollinearity, a 10-fold cross-validation iteration was adopted to determine the penalty coefficient, and the variables that significantly contributed to PROM at term were screened through standardized coefficients (Figures 1A–C). Ultimately, 13 variables were included: BMI, neutrophils, platelets, albumin, SII, PNI, AISI, multiple pregnancies, multiparas, diabetes, hypertension, alcohol consumption and smoking. Boruta algorithm screening: 500 iterations were conducted, and 14 key variables were confirmed through importance scores (Figure 1D): BMI, neutrophils, LMR, PNI, lymphocytes, albumin, platelets, NLR, AISI, ALI, SII, hypertension, multiple pregnancies, and smoking. Retain the 10 variables jointly identified by the two algorithms (such as BMI, neutrophils, platelets, albumin, SII, PNI, AISI, hypertension, multiple pregnancies, and smoking); Add four immune-related indicators unique to the Boruta algorithm (LMR, lymphocytes, NLR, ALI).

Figure 1. Ridge regression analysis. (A–C) A 10–fold cross–validation iteration was used to determine the penalty coefficient, and the variables that significantly contributed to premature rupture of membranes at term were screened through standardized coefficients; (D) Fourteen key variables were confirmed through the importance score.
3.3 Univariate and multivariate analysis of nutritional inflammation index and TROM
Based on the 14 variables screened above, in order to further clarify their relationship with PROM, the research group conducted univariate and multivariate logistic regression analyses (Table 2). Univariate logistic regression analysis indicated that albumin, lymphocytes, platelets, ALI, LMR, and PNI were all protective factors. The remaining variables were all risk factors, with multiple pregnancies being the most significant. Subsequently, multivariate logistic regression analysis was further used for variable screening. There were statistically significant differences in BMI (OR = 1.29, 95% CI 1.20–1.39), albumin (OR = 0.83, 95% CI 0.78–0.89), platelets (OR = 0.99, 95% CI 0.98–0.99), AISI (OR = 2.73, 95% CI 1.13–6.58) and PNI (OR = 0.83, 95% CI 0.78–0.89).
3.4 Non-linear associations between platelets, albumin, AISI, PNI, and PROM
To deeply explore the non-linear dose-response relationship between platelets, albumin, AISI and PNI and PROM, this study used RCS and piecewise regression models for analysis (Figure 2 and Table 3). We observed that platelets had an “L” -shaped relationship with the occurrence of PROM (Figure 2A); Albumin and PNI levels were negatively correlated with the outcome (Figures 2B, C), and positively correlated with AISI (Figure 2D), but not completely linearly correlated. Through threshold effect analysis (Table 3), BMI and AISI were positively correlated with the outcome, and both had threshold effects. The inflection points were 24.81 and 7.694, respectively. Among them, when BMI was higher than 24.81, OR (95% CI): 5.76 (2.10–15.80). When AISI was lower than 7.694, OR (95% CI): 1.36 (1.08–1.71), and there were no statistically significant differences in the rest. Platelet, albumin and PNI also have threshold effects and are negatively correlated. The inflection points are 289.375, 30.003, and 48.152, respectively. When platelet is higher than 289.375, OR (95% CI): 0.97 (0.94–0.99), when Albumin was higher than 30.003, OR (95% CI): 0.88 (0.82–0.94) and when PN was lower than 48.152, OR (95% CI): 0.87 (0.83–0.91).

Figure 2. Restricted cube plots (RCS) between platelets, albumin, aggregate index of systemic inflammation (AISI) and prognostic nutritional index (PNI) and premature rupture of membranes (PROM). (A) platelets; (B) albumin; (C) AISI; (D) PNI.

Table 3. Analyze the threshold effect of aggregate index of systemic inflammation (AISI) and other indicators on premature rupture of membranes through two segmented regression models.
3.5 Multi-model comparison
Based on the screening of the above variables, in order to obtain the best model, we constructed four machine learning models to identify the risk factors of PROM in parturients (Figure 3). Figure 3A shows the discriminatory performance of these four models in terms of the ROC curve. The four models with PROM all exhibited considerable effects in predictive performance, among which the RF model performed the best. The AUC of the four models are as follows: Logistic: 0.756, RF: 0.757, SVM: 0.642 and XGBoost: 0.725. Among them, all three models have good predictive capabilities, but they are ranked in descending order of performance as follows: The RF model is the best, followed by the Logistic, XGBoost and SVM models. Table 4 shows the detailed performance indicators of the four models. The RF model demonstrated superior overall performance (training set sensitivity: 0.721, specificity: 0.683; validation set sensitivity: 0.674, specificity: 0.651). It is notable that both the accuracy rate and the F1 value of the RF model are the highest, with the accuracy rate being (training set: 0.698; validation set: 0.661) and F1 value (training set: 0.694; verification set: 0.670). Meanwhile, the calibration curves of the four models are presented in Figure 3B, demonstrating the good consistency between the predicted probabilities of the RF model and the observed results. Figure 3C shows the DCA, and the result remains consistent with the previous one. It is found that the DCA of the RF model is the best.

Figure 3. Four machine learning models are used to identify the risk factors of PROM. (A) The discriminatory performance of the four models in terms of the receiver operating characteristic (ROC) curve; (B) Calibration curves of the four models; (C) decision curve analysi (DCA); (D,E) Comprehensive group maps; (F) Detailed case.

Table 4. Performances of the machine learning models for predicting premature rupture of membranes (PROM).
3.6 Interpretability analysis
Figures 3D, E present a comprehensive population graph, illustrating the variables in the RF model. The horizontal axis represents the SHAP value, while the vertical axis shows the features that are ranked according to their cumulative SHAP values. Each data point corresponds to a specific instance, and its position on the X-axis represents the SHAP value of that specific instance and feature. The results showed that the contribution proportions of each variable, in descending order, were: BMI, PNI, platelets, albumin and AISI. Figure 3F provides a detailed case study, demonstrating the prediction process of the model for a specific patient. In this visualization, the red indicator represents the negative contribution to the prediction, while the blue indicator represents the positive impact. The f(x) value represents the actual SHAP value of each factor.
3.7 Construction of the nomogram
In order to enhance the clinical applicability of the model and facilitate the rapid decision-making of clinicians, we constructed the nomogram model based on the above screening variables (Figure 4A). These five variables include: BMI, PNI, platelets, albumin and AISI. The ROC of the model also showed good efficacy, with an AUC of 0.777 (Figure 4C). Meanwhile, the calibration curve was plotted (Figure 4B), demonstrating good consistency. Meanwhile, the research group also plotted the ROC of five individual variables, and their AUC were respectively: BMI: 0.656, PNI:0.645, platelet: 0.575, albumin: 0.613 and AISI: 0.573 (Figure 4D). According to the above results, it shows that the model of the research group has a better predictive efficiency compared with other single factors.

Figure 4. The construction of the nomogram. (A) Nomogram model; (B) Calibration curve; (C) Receiver operating characteristic (ROC) of the model; (D) ROC of five individual variables.
4 Discussion
So far, the pathogenesis of PROM has been unclear, often as a result of multifactorial interactions. Once the membrane ruptures, the barrier protection immediately disappears. Pregnant women will irreversibly be subjected to amniotic infection, chorioamniotic inflammation, fetal distress and placental abruption until preterm delivery (17). Although the causes of PROM are varied, infection and inflammation are often highlighted as important factors. During pregnancy, the level of progesterone in the body of women increases, promoting the increase of vaginal secretions and vaginal mucosal congestion, thus providing necessary conditions for bacterial reproduction. According to statistics, the incidence of bacterial vaginitis in pregnant women is 10%–50% (18).
In this retrospective study, we developed a machine learning-based model to identify risk factors associated with PROM and constructed an effective predictive framework. One of the significant innovations of this research lies in the utilization of multiple machine learning algorithms, including XGBoost, SVM, LR, and RF, alongside Lasso regression for feature selection. This combination not only enhances the robustness of our model but also provides a comprehensive approach to understanding the underlying factors contributing to PROM. Previous studies have primarily focused on traditional statistical methods, often neglecting the advantages that machine learning can offer in terms of predictive power and interpretability. Our findings align with existing literature that emphasizes the importance of integrating advanced computational techniques to improve clinical decision-making in obstetrics. Furthermore, our analysis demonstrated that the use of SHAP provided valuable insights into feature contributions to the predictive model. This method effectively illustrated the impact of individual variables on the likelihood of PROM occurrence, enhancing the interpretability of the model’s predictions. By focusing on the five factors: BMI, PNI, platelets, albumin, and AISI, we promote a more intuitive understanding of the risk factors involved, which is critical for clinicians to make informed decisions about prenatal care.
A previous study showed that blood routine examination was a good predictor of premature rupture of membranes, but only leukocyte had statistical significance between the PROM group and the normal group (19). Systemic inflammation characterized by neutrophilism, lymphocytopenia, and thrombocytosis can be evaluated with a simple blood test. However, the reliability of a single parameter in identifying inflammation is low. Therefore, it is necessary to combine the indices of multiple inflammatory parameters. It is of great significance to incorporate inflammatory nutritional index into the prediction of risk factors for PROM. Our study found that neutrophils, NLR, SII, and AISI in PROM were significantly higher, while lymphocytes, platelets, albumin, LMR, PLR, PNI, and ALI were lower, and the differences were statistically significant. During pathogen invasion and infection, neutrophils are stimulated to secrete a series of pro-inflammatory cytokines, regulatory cytokines, and chemokines, thereby inducing an inflammatory cascade. Conversely, lymphocytes, which are a major component of the immune system, suppress the inflammatory response of the body by secreting anti-inflammatory factors, such as IL-10 (20). Monocytes are a crucial component of the innate immune response. After their recruitment, monocytes continuously secrete pro-inflammatory cytokines, enzymes, and growth factors (21). A study by Cappelletti et al. (22) determined that placental inflammation is a result of histological chorioamnionitis, which can be predicted by NLR. Kim et al. (23) found that NLR had a high predictive value for placental inflammation. PLR is more valuable in inflammatory and thrombotic diseases, but it may be affected by pregnant women with gestational diabetes, acute pancreatitis, preeclampsia, or PROM (24). Tanacan et al. (25) demonstrated that SII levels in PROM pregnant women were positively correlated with adverse neonatal outcomes. However, NLR, LMR, PLR and SII only evaluate the occurrence of the systemic inflammatory response induced by the activation of immune function and do not consider the immune and nutritional status of the individuals. PNI, which is an important parameter that reflects the nutritional and inflammatory state of the body, is calculated using serum albumin level and lymphocyte count. In recent years, studies have highlighted a close relationship between PNI and the prognosis of various tumors, myocardial infarction, and congenital heart diseases (9, 26). ALI reflects the systemic inflammatory state, while the occurrence of PROM may be related to intrauterine infection or inflammatory response. Theoretically, the systemic inflammatory state may affect pregnancy outcomes, but currently there is a lack of direct evidence (12). AISI is an emerging systemic inflammatory indicator, aiming to comprehensively assess the inflammatory status of the body. It provides a more comprehensive assessment of inflammation by integrating the levels of multiple inflammatory factors. This indicator is used clinically to predict various pathological conditions, especially in the health management of pregnant women and newborns. The physiological significance of AISI lies in that it can reflect the overall response of the body to inflammatory stimuli, thereby providing clinicians with important information about the severity and prognosis of the disease (27). The research results show that the AISI value is significantly higher in pregnant women who need to be admitted to the neonatal intensive care unit than in those who do not. Moreover, in patients with chorioamnionitis, the AISI value is also significantly higher than that in patients without chorioamnionitis (13).
The monitoring of BMI has important clinical significance during pregnancy. Studies have shown that BMI levels during pregnancy are closely related to a variety of adverse pregnancy outcomes. Both too low and too high BMI are associated with an increased risk of complications such as PROM, preterm birth, and gestational hypertension (28, 29). Obesity is regarded as a low-grade chronic inflammatory state. Adipose tissue in the body secretes various pro-inflammatory factors, such as tumor necrosis factor-α (TNF-α), interleukin-6 (IL-6), etc. These factors may affect pregnancy outcomes, including PROM (30). Hypertension and diabetes are significant risk factors for PROM in pregnant women. Abnormal cytotrophoblast invasion and endothelial dysfunction of spiral arterioles can lead to placental ischemia and placental endothelial injury (31, 32). Elevated blood glucose levels affect the expression of metalloproteinases in the fetal membrane, leading to dehydration of the membrane cells and fibrinogen, which ultimately weakens the tensile strength of the fetal membrane. At the same time, multiple pregnancies and multiparous may cause cervical muscle damage, increase the likelihood of cervical incompetence, and subsequently develop reproductive tract infections after pregnancy, and increase the risk of PROM (33). Smoking and drinking in pregnant women are risk factors for PROM. Nicotine in tobacco constricts blood vessels, increases carbon monoxide levels in the body, and causes PROM to occur when the mother and fetus are deprived of oxygen. Alcohol tends to constrict placental blood vessels, affecting fetal blood supply, resulting in fetal membrane dysplasia, resulting in PROM (34).
However, we acknowledge that our study has certain limitations that are worth discussing. Limitations of this study include its retrospective design, which may introduce selection bias and limit the generality of our findings. In addition, our study is limited to a single institution, which may affect the external validity of the predictive models developed. Reliance on historical data can also result in incomplete or inaccurate clinical records, potentially affecting the accuracy of identifying risk factors. Future studies should aim to validate these findings in larger, multicenter coves to improve the reliability of predictive models and explore the integration of real-time clinical data. Addressing these limitations is critical to advancing the role of machine learning in obstetric practice and improving outcomes for women at risk of PROM.
5 Conclusion
In conclusion, our research successfully identified the key risk factors related to PROM through machine learning methods and explored the relationship between the inflammatory nutrition index and PROM, providing a valuable predictive model for clinical applications. By evaluating multiple machine learning algorithms, we build a model with strong discriminative ability, as can be seen from the AUC value. The application of SHAP analysis further illuminates the contribution of individual characteristics and enhances our understanding of the underlying mechanisms. These findings highlight the potential of machine learning in obstetric practice, paving the way for more targeted risk assessment and management strategies in clinical Settings.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Ethics Committee of Binhai County People’s Hospital (Approval No. 2024-BYKYLL-016). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
MZ: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review and editing. XZ: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Supervision, Writing – original draft, Writing – review and editing. HW: Conceptualization, Data curation, Methodology, Supervision, Writing – original draft, Writing – review and editing. PY: Data curation, Investigation, Resources, Software, Writing – original draft, Writing – review and editing. QY: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review and editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgments
We thanks to all the collaborators of this manuscript for their data analysis and manuscript writing.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Jena B, Biks G, Gete Y, Gelaye K. Incidence of preterm premature rupture of membranes and its association with inter-pregnancy interval: A prospective cohort study. Sci Rep. (2022) 12:5714. doi: 10.1038/s41598-022-09743-3
2. Meller C, Carducci M, Ceriani Cernadas J, Otaño L. Preterm premature rupture of membranes. Arch Argent Pediatr. (2018) 116:e575–81. doi: 10.5546/aap.2018.eng.e575
3. Freeman S, Denoble A, Kuller J, Ellestad S, Dotters-Katz S. Management of preterm premature rupture of membranes in the late preterm period. Obstet Gynecol Surv. (2022) 77:283–92. doi: 10.1097/OGX.0000000000001024
4. Gupta T, Harikrishnan P, Kolte D, Khera S, Aronow W, Mujib M, et al. Outcomes of acute myocardial infarction in patients with hypertrophic cardiomyopathy. Am J Med. (2015) 128:879–887.e1. doi: 10.1016/j.amjmed.2015.02.025
5. Gabay C, Kushner I. Acute-phase proteins and other systemic responses to inflammation. N Engl J Med. (1999) 340:448–54. doi: 10.1056/NEJM199902113400607
6. Wang W, Wang L, Liu Y, Yang F, Zhu L, Zhang X. Value of the ratio of monocytes to lymphocytes for monitoring tuberculosis therapy. Can J Infect Dis Med Microbiol. (2019) 2019:3270393. doi: 10.1155/2019/3270393
7. Wang R, Wen W, Jiang Z, Du Z, Ma Z, Lu A, et al. The clinical value of neutrophil-to-lymphocyte ratio (NLR), systemic immune-inflammation index (SII), platelet-to-lymphocyte ratio (PLR) and systemic inflammation response index (SIRI) for predicting the occurrence and severity of pneumonia in patients with intracerebral hemorrhage. Front Immunol. (2023) 14:1115031. doi: 10.3389/fimmu.2023.1115031
8. Buonacera A, Stancanelli B, Colaci M, Malatino L. Neutrophil to lymphocyte ratio: An emerging marker of the relationships between the immune system and diseases. Int J Mol Sci. (2022) 23:3636. doi: 10.3390/ijms23073636
9. Li M, Cai J, Jiang K, Li Y, Li S, Wang Q, et al. Prognostic nutritional index during hospitalization correlates with adverse outcomes in elderly patients with acute myocardial infarction: A single-center retrospective cohort study. Aging Clin Exp Res. (2024) 36:56. doi: 10.1007/s40520-024-02702-0
10. Karabay G, Bayraktar B, Seyhanli Z, Cakir B, Aktemur G, Sucu S, et al. Predictive value of inflammatory markers (NLR, PLR, MLR, SII, SIRI, PIV, IG, and MII) for latency period in Preterm premature rupture of membranes (PPROM) pregnancies. BMC Pregnancy Childbirth. (2024) 24:564. doi: 10.1186/s12884-024-06756-w
11. Zhang J, Fan C, Xu C, Zhang Y, Liu J, Zhou C, et al. Serum calcium level at 32 weeks of gestation could be applied as a predictor of preterm delivery: A retrospective study. Eur J Med Res. (2024) 29:400. doi: 10.1186/s40001-024-01984-4
12. Zhong P, Chen X, Miao P, Xing Y, Li Y. Advanced lung cancer inflammation index is associated with mortality in critically ill patients with non-traumatic cerebral hemorrhage. Sci Rep. (2025) 15:14972. doi: 10.1038/s41598-025-98854-8
13. Tokalioglu E, Tanacan A, Agaoglu M, Özbebek ÜG, Okutucu G, Kayaalp H, et al. Aggregate index of systemic inflammation: A novel systemic inflammatory index for prediction of neonatal outcomes and chorioamnionitis in women with preterm premature rupture of membranes. Int J Gynaecol Obstet. (2025) 168:640–9. doi: 10.1002/ijgo.15868
14. Wei C, Wei Y, Cheng J, Tan X, Zhou Z, Lin S, et al. Identification and verification of diagnostic biomarkers in recurrent pregnancy loss via machine learning algorithm and WGCNA. Front Immunol. (2023) 14:1241816. doi: 10.3389/fimmu.2023.1241816
15. Xie H, Jia Y, Liu S. Integration of artificial intelligence in clinical laboratory medicine: Advancements and challenges. Interdiscip Med. (2024) 2:e20230056. doi: 10.1002/INMD.20230056
16. Defilippo A, Bertucci G, Zurzolo C, Veltri P, Guzz P. On the computational approaches for supporting triage systems. Interdiscip Med. (2023) 1:e20230015. doi: 10.1002/INMD.20230015
17. Baldwin E, Walther-Antonio M, MacLean A, Gohl D, Beckman K, Chen J, et al. Persistent microbial dysbiosis in preterm premature rupture of membranes from onset until delivery. PeerJ. (2015) 3:e1398. doi: 10.7717/peerj.1398
18. Osman Mohamed A, Suliman Mohamed M, Hussain Mallhi T, Abdelrahman Hussain M, Ali Jalloh M, Ali Omar K, et al. Prevalence of vulvovaginal candidiasis among pregnant women in Africa: A systematic review and meta-analysis. J Infect Dev Ctries. (2022) 16:1243–51. doi: 10.3855/jidc.15536
19. Zhan F, Zhu S, Liu H, Wang Q, Zhao G. Blood routine test is a good indicator for predicting premature rupture of membranes. J Clin Lab Anal. (2019) 33:e22673. doi: 10.1002/jcla.22673
20. Huang Z, Fu Z, Huang W, Huang K. Prognostic value of neutrophil-to-lymphocyte ratio in sepsis: A meta-analysis. Am J Emerg Med. (2020) 38:641–7. doi: 10.1016/j.ajem.2019.10.023
21. Monneret G, Venet F. Sepsis-induced immune alterations monitoring by flow cytometry as a promising tool for individualized therapy. Cytometry B Clin Cytom. (2016) 90:376–86. doi: 10.1002/cyto.b.21270
22. Cappelletti M, Della Bella S, Ferrazzi E, Mavilio D, Divanovic S. Inflammation and preterm birth. J Leukoc Biol. (2016) 99:67–78. doi: 10.1189/jlb.3MR0615-272RR
23. Kim M, Lee Y, Seo K. Assessment of predictive markers for placental inflammatory response in preterm births. PLoS One. (2014) 9:e107880. doi: 10.1371/journal.pone.0107880
24. Esercan A, Demir I. Neutrophil/lymphocyte and platelet/lymphocyte ratio in preterm premature rupture of membranes. Cureus. (2023) 15:e38664. doi: 10.7759/cureus.38664
25. Tanacan A, Uyanik E, Unal C, Beksac MS. A cut-off value for systemic immune-inflammation index in the prediction of adverse neonatal outcomes in preterm premature rupture of the membranes. J Obstet Gynaecol Res. (2020) 46:1333–41. doi: 10.1111/jog.14320
26. Wakita M, Fukatsu A, Amagai T. Nutrition assessment as a predictor of clinical outcomes for infants with cardiac surgery: Using the prognostic nutritional index. Nutr Clin Pract. (2011) 26:192–8. doi: 10.1177/0884533611399922
27. Jiang Y, Luo B, Lu W, Chen Y, Peng Y, Chen L, et al. Association between the aggregate index of systemic inflammation and clinical outcomes in patients with acute myocardial infarction: A retrospective study. J Inflamm Res. (2024) 17:7057–67. doi: 10.2147/JIR.S481515
28. Chahal N, Qureshi T, Eljamri S, Catov J, Fazeli P. Impact of low maternal weight on pregnancy and neonatal outcomes. J Endocr Soc. (2024) 9:bvae206. doi: 10.1210/jendso/bvae206
29. Pigatti Silva F, Souza R, Cecatti J, Passini R, Tedesco R, Lajos G, et al. Role of body mass index and gestational weight gain on preterm birth and adverse perinatal outcomes. Sci Rep. (2019) 9:13093. doi: 10.1038/s41598-019-49704-x
30. Sfregola G, Sfregola P, Ruta F, Zendoli F, Musicco A, Garzon S, et al. Effect of maternal age and body mass index on induction of labor with oral misoprostol for premature rupture of membrane at term: A retrospective cross-sectional study. Open Med (Wars). (2023) 18:20230747. doi: 10.1515/med-2023-0747
31. Granger J, Alexander B, Llinas M, Bennett W, Khalil R. Pathophysiology of hypertension during preeclampsia linking placental ischemia with endothelial dysfunction. Hypertension. (2001) 38:718–22. doi: 10.1161/01.hyp.38.3.718
32. Savvidou M, Hingorani A, Tsikas D, Frölich J, Vallance P, Nicolaides K. Endothelial dysfunction and raised plasma concentrations of asymmetric dimethylarginine in pregnant women who subsequently develop pre-eclampsia. Lancet. (2003) 361:1511–7. doi: 10.1016/S0140-673613177-7
33. Lin D, Hu B, Xiu Y, Ji R, Zeng H, Chen H, et al. Risk factors for premature rupture of membranes in pregnant women: A systematic review and meta-analysis. BMJ Open. (2024) 14:e077727. doi: 10.1136/bmjopen-2023-077727
Keywords: PROM, machine learning, nomogram, nutritional inflammation index, predictive models
Citation: Zheng M, Zhang X, Wang H, Yuan P and Yu Q (2025) Interpretable machine learning model for identification and risk factor of premature rupture of membranes (PROM) and its association with nutritional inflammatory index: a retrospective study. Front. Med. 12:1557919. doi: 10.3389/fmed.2025.1557919
Received: 09 January 2025; Accepted: 20 May 2025;
Published: 18 June 2025.
Edited by:
Maria Luisa Ojeda, University of Seville, SpainReviewed by:
Takuya Kondo, Kyushu University, JapanYefang Huang, Hospital of Chengdu University of Traditional Chinese Medicine, China
Akihiro Yasui, Nagoya University, Japan
Copyright © 2025 Zheng, Zhang, Wang, Yuan and Yu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Qiulan Yu, MTg5MTQ2MzEwOTBAMTYzLmNvbQ==