- Department of Pediatrics, Nantong First People's Hospital, Nantong, Jiangsu, China
Background: Early lactation failure among mothers of moderate preterm infants poses a persistent public health concern. This study identified key predictors and developed a risk stratification tool.
Methods: We conducted a prospective cohort study of 3,210 mother-infant dyads (32.0–34.9 weeks gestation) at a tertiary hospital in China (February 2022–April 2025). Early lactation failure was defined as absence of direct breastfeeding with documented latch or provision of <5 ml cumulative expressed breast milk within 72 h postpartum. Data included psychosocial assessments [breastfeeding self-efficacy scale-short form (BSES-SF), Edinburgh postnatal depression scale (EPDS), family support], obstetric factors, neonatal characteristics, and early care variables. Missing data was imputed using multiple chained equations (50 datasets). Multilevel logistic regression with ward-level random intercepts identified predictors, with bootstrap validation (1,000 resamples) assessing performance.
Results: Among 3,210 mother-infant dyads enrolled, 716 (22.3%) experienced early lactation failure within 72 h postpartum. Key predictors included higher BSES-SF scores [adjusted odds ratio (aOR) = 0.96 per point, 95% confidence interval (CI): 0.94–0.98], higher EPDS scores (aOR = 1.08, 95% CI: 1.04–1.12), cesarean delivery (aOR = 1.42, 95% CI: 1.15–1.75), neonatal respiratory support (aOR = 1.28, 95% CI: 1.05–1.56), and shorter kangaroo care duration (aOR = 0.92 per 10 min, 95% CI: 0.87–0.97). A five-factor risk tool stratified mothers into low-risk (13.0% failure rate), moderate-risk (20.5%), and high-risk (31.0%) groups, with strong discrimination (C-statistic = 0.704; calibration slope = 0.952; Hosmer-Lemeshow p = 0.267). Population-attributable risks were highest for cesarean delivery (20.4%), low self-efficacy (18.9%), and depression (18.2%), with numbers needed to treat (NNT) ranging from 11 to 18.
Conclusion: This study confirms the multifactorial basis of early lactation failure, highlighting maternal psychosocial factors as key predictors. The validated risk tool enables identification of high-risk dyads for targeted nutritional support interventions.
1 Introduction
Preterm birth remains a significant global health concern, accounting for substantial neonatal morbidity, disability, and mortality (1–3). The World Health Organization estimates that over 15 million infants, more than 10% of global live births, are born preterm annually, with disproportionately higher rates in low- and middle-income countries Late- and moderate-preterm (LMPT; 32∧0–36∧6/7 weeks) births constitute the majority of preterm deliveries; the present study restricted inclusion to 32.0–34.9 weeks to reduce heterogeneity and to focus on the neonatal period when feeding support is most consequential (1, 3). From a nutrition standpoint, early lactation success determines the neonate's primary enteral intake during the immediate postnatal period; timely human-milk exposure delivers essential macronutrients, bioactive peptides, and immunomodulatory components that shape gut colonization and metabolic programming, whereas failure redirects infants toward formula or suboptimal intake (1, 4). Nonetheless, initiating and sustaining breastfeeding remains difficult for many mothers, particularly where postnatal support systems are inadequate (4).
In China, preterm birth rates have risen to an estimated 7.8% in recent years (5), alongside persistently high cesarean section rates and institutional practices that often limit rooming-in, delay skin-to-skin contact, and provide inadequate breastfeeding support (6, 7). These factors may impede lactation initiation, particularly among mothers of preterm infants. Early lactation failure, defined as the inability to establish effective breastfeeding within the first 72 h postpartum, represents a critical yet understudied outcome (8, 9). This period, known as the “lactogenesis window,” is essential for hormonal priming of milk production and establishing infant feeding patterns (7, 10). Delays during this window are linked to early breastfeeding cessation, increased formula use, and negative neonatal outcomes (9).
Despite its clinical and public health relevance, the determinants of early lactation failure remain poorly defined, particularly in Chinese populations (10). Previous studies have identified maternal, psychosocial, obstetric, and neonatal contributors, including low breastfeeding self-efficacy, postnatal depressive symptoms, cesarean delivery, and delayed Kangaroo Mother Care (KMC) (10–13). The breastfeeding self-efficacy scale–short form (BSES-SF) and the Edinburgh postnatal depression scale (EPDS) are validated psychometric instruments that have been widely adopted in international studies and increasingly applied in Chinese cohorts (13–16). Additionally, early pacifier use, neonatal respiratory support, and operative delivery may impair maternal-infant bonding and hormonal lactation triggers (17, 18). However, most prior studies are cross-sectional, single-center, and lack multilevel models to address institutional variability in postnatal care (19, 20).
China's postnatal care system exhibits marked variability in resources, staff training, ward-level policies, and implementation of breastfeeding initiatives like KMC (21, 22). These institutional differences may influence breastfeeding outcomes as both modifiers and confounders, underscoring the need for analytic approaches that accommodate hierarchical data. Multilevel modeling, by incorporating random effects, distinguishes individual-level risks from institutional influences and yields more precise, generalizable estimates (22–24). This enhances applicability across diverse clinical settings, supporting targeted public health strategies. Translationally, predictive models and risk stratification tools based on modifiable factors can guide targeted lactation support for high-risk dyads, aligning with the Healthy China 2030 agenda, which emphasizes maternal–child health and equitable newborn care (25–27).
The present study aimed to identify independent determinants of early lactation failure, using a large prospective dataset from a single tertiary referral center in eastern China. It assessed the impact of maternal psychological factors (e.g., breastfeeding self-efficacy, depressive symptoms), delivery characteristics (e.g., mode of birth, parity), neonatal clinical conditions (notably respiratory support), and early-care practices (e.g., KMC duration, pacifier use). A multilevel logistic regression model accounted for postpartum ward clustering and enhanced estimate precision. The study also developed and internally validated a predictive model, created a modifiable risk-based stratification tool, and quantified dose–response associations and population-attributable burdens of key factors. These findings were integrated into an implementation framework to inform targeted interventions and improve breastfeeding outcomes for preterm infants in Chinese maternity care settings.
2 Methodology
2.1 Study design and setting
We conducted a prospective cohort study at the Department of Pediatrics, Nantong First People's Hospital, Jiangsu Province, China, from February 2022 to April 2025. This study was designed within a theoretical framework integrating the Theory of Planned Behavior and the Socioecological Model to examine multilevel determinants of early lactation outcomes in moderately preterm infants. Reporting follows STROBE guidelines for observational cohort studies.
Nantong First People's Hospital is a tertiary academic medical center serving 2.8 million residents across Nantong Prefecture. The facility maintains a 120-bed level III neonatal intensive care unit with ~4,500 annual deliveries, of which 18% are preterm births. Ward-level characteristics including nurse-to-patient ratios and lactation consultant availability were systematically documented quarterly as potential level-2 covariates.
2.2 Theoretical framework
This investigation pursued dual analytic objectives: (1) an etiologic analysis estimating causal associations between theoretically informed determinants and early lactation failure, adjusted for confounding; and (2) a predictive analysis developing and internally validating a clinical risk stratification tool optimized for discrimination and calibration. The etiologic analysis prioritized temporal ordering and confounder control, while the predictive analysis incorporated all strong predictors regardless of causal interpretation.
2.3 Participant recruitment and screening
Research coordinators maintained systematic screening logs identifying all eligible pre-term deliveries during regular staffing periods. Mothers were approached for consent when clinically stable following delivery. All births were prospectively logged to ensure comprehensive screening coverage, with the majority of participants enrolled within the immediate postpartum period. Systematic documentation of screening, eligibility assessment, consent procedures, exclusions, and final analysis cohort was maintained throughout the study period.
2.4 Inclusion and exclusion criteria
Eligible mother-infant dyads met these criteria: singleton pregnancy, moderately preterm neonates (32.0–34.9 weeks) by best obstetric estimate. In this study, we use “neonates” to refer specifically to infants within the first 72 h postpartum (the study observation window), while “infants” refers to the broader pediatric population. Moderate preterm birth is defined as delivery between 32+0 and 34+6 weeks' gestation, representing a distinct clinical category with intermediate morbidity risk between very preterm (< 32 weeks) and late preterm (34–36 weeks) births.
Exclusion criteria included: multiple gestation, maternal contraindications to breastfeeding, severe mental illness requiring hospitalization, neonatal death anticipated within 72 h, maternal transfer within 48 h, previous study enrollment, substance use disorder, and planned adoption. In current study, did not exclude mothers with GDM or elevated pre-pregnancy body mass index (BMI), as both conditions are prevalent in our source population (GDM: 15.0%; BMI ≥25 kg/m2: 18.3%) and represent important clinical subgroups requiring lactation support. While these factors have been associated with delayed lactogenesis II in some populations, we sought to estimate their independent effects through multivariable adjustment rather than restriction, thereby preserving generalizability to real-world NICU populations. Sensitivity analyses stratified by GDM status and BMI category confirmed that primary predictors (BSES-SF, EPDS, cesarean delivery) operated consistently across these subgroups (all interaction p > 0.15).
2.5 Temporal ordering and assessment schedule
To ensure appropriate temporal precedence, we implemented a structured assessment timeline: maternal characteristics and obstetric variables at baseline (0–6 h postpartum); BSES-SF at 24 ± 6 h; EPDS and family support assessment at 48 ± 6 h; and outcome ascertainment covering hours 24–72 postpartum. For etiologic models of EPDS associations, outcome events recorded prior to the EPDS assessment were censored in sensitivity analyses to ensure temporal precedence.
2.6 Primary outcome definition and measurement
Early lactation failure was defined as: (1) complete absence of direct breastfeeding attempts with evidence of infant latch, AND (2) failure to provide ≥5 ml cumulative expressed breast milk to the infant by 72 h postpartum. Expressed volumes were measured using direct pump gradations calibrated monthly; donor milk was excluded from threshold calculations. Outcome assessment employed triangulated methods: structured nursing observations every 8 h, maternal feeding diaries with research assistant verification, and medical record review. When data sources conflicted, the hierarchical adjudication rule prioritized direct observation > nursing record > maternal diary, with discrepancies reviewed weekly by a blinded adjudication committee. However, the 5 ml cumulative threshold was selected based on: (1) physiological evidence that colostrum volumes < 5 ml within 72 h predict delayed lactogenesis II (onset typically > 72 h postpartum, associated with increased formula supplementation and early breastfeeding cessation) (28–31), (2) clinical guidelines recommending minimum 2–3 ml per feeding by 24 h and 5–7 ml per feeding by 48 h for preterm neonates, with cumulative target ≥ 5 ml by 72 h (32, 33), and (3) distribution analysis in our cohort showing bimodal separation: 89.2% of mothers who successfully transitioned to exclusive breastfeeding at discharge produced > 5 ml cumulatively by 72 h, while 78.4% of those requiring ongoing formula supplementation produced < 5 ml (threshold sensitivity 78.4%, specificity 89.2% for discharge breastfeeding failure; AUC = 0.84). This threshold aligns with lactation physiology literature documenting that inadequate colostrum transfer during the lactogenesis I-to-II transition predicts poor milk supply establishment (33–35).
2.7 Data collection procedures
Comprehensive sociodemographic assessment included validated instruments for socioeconomic status, employment classification using international standard classification of occupations (ISCO-08), and cultural factors including traditional postpartum confinement practices.
2.8 Psychosocial assessment battery
2.8.1 Breastfeeding self-efficacy
The validated Chinese BSES-SF (14 items, range 14–70, higher scores indicating greater confidence) was administered at 24 ± 6 h postpartum (36). In our cohort, internal consistency was excellent (Cronbach α = 0.93, McDonald's ω = 0.94), with confirmatory factor analysis supporting unidimensional structure [comparative fit index (CFI) = 0.96, Tucker-Lewis index [TLI] = 0.95, root mean square error of approximation [RMSEA] = 0.058].
2.8.2 Postpartum depression
The Chinese EPDS (10 items, range 0–30, higher scores indicating more depressive symptoms) was administered at 48 ± 6 h postpartum, supplemented by generalized anxiety disorder-7 scale (GAD-7) for anxiety assessment, cohort-specific psychometric properties: EPDS α = 0.89, GAD-7 α = 0.91 (37–40).
2.8.3 Family support
A validated 20-item Chinese postpartum family support scale assessed practical and emotional support from partner, parents, and in-laws (41, 42). Raw scores (0–100) were divided by 10 to yield a 0–10 Family Support Index for analysis (higher scores indicating greater support); regression coefficients therefore represent the change in odds of failure per 10-point increase in raw support. Item-level missingness was handled by prorating (person-mean substitution) when ≤ 2 of 20 items was missing; otherwise, the scale score was set to missing and excluded from analyses involving this measure (Cronbach α = 0.94 in the analytic cohort).
2.9 Neonatal assessment and early care practices
KMC duration was calculated as total documented minutes divided by days at risk (minimum 2 of 3 days required for valid summary) (43). Preterm infant breastfeeding behavior scale (PIBBS) scores used the maximum performance across all attempts, as peak feeding competence demonstrated stronger correlation with subsequent breastfeeding success than mean scores in preliminary analyses (44, 45). For dyads with early discharge or NICU transfer before 72 h, KMC duration was prorated, while respiratory support was initially categorized as none, supplemental oxygen, CPAP, or invasive ventilation, with CPAP and invasive categories subsequently collapsed to “any support” given similar failure rates in preliminary analysis (46, 47).
2.10 Statistical analysis
Sample size calculations targeted odds ratios ≥ 1.5 for key predictors, assuming 20% baseline failure rate (derived from pilot data collected January-June 2021 showing 18.7% failure among 287 moderate preterm dyads, and consistent with systematic review estimates of 15%−30% in similar populations), 80% power, α = 0.05, and design effect 1.2 for clustering (based on preliminary ICC estimate of 0.03 from pilot data across 6 wards). The OR ≥ 1.5 threshold was selected as the minimum clinically meaningful effect size representing 50% increased odds, consistent with effect sizes for established lactation predictors (cesarean delivery OR = 1.4–1.6, depression OR = 1.3–1.8 in prior literature). Accounting for subgroup analyses, missing data (~10% anticipated), and interaction testing yielded target N = 3,200. Simulation-based power analyses confirmed adequacy for planned multilevel analyses. Continuous variables are presented as mean ± SD or median (IQR) based on distributional assessment. Categorical variables appear as n (%). Between-group comparisons used appropriate tests with standardized mean differences for continuous variables and odds ratios with 95% confidence interval (CI) for categorical variables.
Missing data (0.3%−8.7% per variable) were handled using multiple imputations by chained equations (MICE v3.16.0, 50 imputations). Little's MCAR test (χ2 = 127.4, p = 0.81) and pattern-mixture modeling (all p > 0.15) supported missing-at-random assumptions. Variable-specific imputation used predictive mean matching (continuous), logistic regression (binary), and multinomial regression (categorical), with models including all analysis variables, auxiliary predictors, ward identifiers, and outcome status. Imputed values were constrained to plausible ranges (BSES-SF: 14–70; EPDS: 0–30), and convergence confirmed via trace plots (potential scale reduction factors < 1.1). Proportion missing by variable ranged from 0.3% (maternal age) to 8.7% (complete KMC data). Analyses were performed within each imputed dataset and combined using Rubin's rules; for odds ratios, logit coefficients were pooled then exponentiated.
2.10.1 Multilevel modeling framework
The multilevel modeling framework employed generalized linear mixed models (GLMM) with a logit link and random intercepts for postpartum wards, estimated using adaptive Gauss-Hermite quadrature (nAGQ = 15) in lme4 version 1.1–27.1 with R version 4.3.0. Results demonstrated robustness to Laplace approximation (nAGQ = 1) in sensitivity analyses. Ward-level covariates, including quarterly staffing ratios and lactation consultant availability, were incorporated as level-2 predictors. Convergence criteria were set to require a gradient < 0.001 and a relative change in deviance < 10−6, with a random seed of 12,345 established for reproducibility. Furthermore, variable selection strategy considered predictors with p < 0.20 in univariable analyses and those possessing biological plausibility. Educational attainment was collapsed from four to two categories (university vs. basic schooling) following preliminary modeling, which revealed statistically indistinguishable effects for intermediate categories (likelihood ratio test: χ2 = 1.2, df = 2, p = 0.55). Calendar epoch was included as a categorical covariate in etiologic models and evaluated for its impact on predictive model calibration. These approaches ensured a theoretically grounded and statistically rigorous model specification.
Continuous predictors were modeled with careful assessment of linearity using restricted cubic splines with knots at the 5th, 35th, 65th, and 95th percentiles; non-linear terms were retained if the likelihood ratio test yielded p < 0.05. The breastfeeding self-efficacy scale-short form (BSES-SF) and Edinburgh postnatal depression scale (EPDS) scores were analyzed per point to enhance clinical interpretability, while kangaroo mother care (KMC) duration was scaled per 10 min to improve numerical stability. In contrast, model validation and performance assessment incorporated internal validation through bootstrap resampling (n = 1,000, seed = 54,321) at the ward level, estimating optimism-corrected performance measures using rms version 6.7–0. Discrimination was evaluated via C-statistics, calibration through plots, Hosmer-Lemeshow tests, and slopes, and overall performance via Brier scores and net reclassification improvement for risk categories of < 15%, 15%−25%, and >25%. These validation metrics confirmed the model's reliability and predictive utility in similar populations.
2.10.2 Receiver operating characteristic (ROC) curve methodology
The receiver operating characteristic (ROC) curve visualizes model discrimination by plotting sensitivity (true positive rate) against 1-specificity (false positive rate) across all possible classification thresholds from 0 (all positive) to 1 (all negative), with each point representing a different threshold value rather than a single operating point. The area under the ROC curve (AUC, equivalent to C-statistic) quantifies discriminative ability as the probability that the model assigns higher risk to a randomly selected case than to a non-case, with interpretation benchmarks of 0.50 (chance), 0.60–0.70 (acceptable), 0.70–0.80 (good), 0.80–0.90 (excellent), and >0.90 (outstanding); however, complex multifactorial outcomes inherently limit achievable discrimination, and moderate C-statistics may provide substantial clinical utility when combined with good calibration. For clinical implementation, a single operating point must be selected based on context-specific priorities (intervention burden, costs of misclassification, resource availability). We constructed ROC curves using pROC package (version 1.18.0) in R with 95% confidence intervals from 2,000 stratified bootstrap resamples, selecting the optimal threshold by maximizing Youden index (sensitivity + specificity – 1) while considering clinical feasibility, with sensitivity analyses exploring alternative threshold choices.
2.10.3 Advanced analytical considerations
Advanced analytical considerations included interaction testing for pre-specified terms, such as maternal education by parity, delivery mode by pacifier use, and self-efficacy by family support, using likelihood ratio tests (k = 3 total interactions). All interactions were treated as a single family for Benjamini-Hochberg false discovery rate control at 5%. Cross-level interactions examined ward-level lactation staffing effects on individual BSES-SF associations to test structural moderation hypotheses. Multiple comparisons employed two-sided hypothesis tests with α = 0.05, without adjustment for main effects given their theoretical prioritization, while interaction tests utilized false discovery rate control. These considerations addressed potential complexities in the multilevel data structure and enhanced the interpretability of findings.
2.10.4 Risk stratification and clinical translation
Risk stratification and clinical translation involved developing a simplified risk score based on β-coefficients from the final model, with performance assessed through the area under the receiver operating characteristic curve and decision curve analysis. Population attributable risk was calculated for modifiable factors using standardization methods, and number needed to treat estimates were derived from model-based absolute risk differences. Notably, these translational elements facilitated the application of study results to clinical practice and public health interventions.
2.10.5 Quality control and data management
Quality control and data management protocols required research staff to complete 40-h training programs with ongoing competency assessment. Data collection utilized REDCap with real-time validation, audit trails, and automated backups. Inter-rater reliability exceeded κ = 0.85 for all observational measures, and data lock occurred on May 15, 2025. These measures ensured data integrity and reproducibility throughout the analytical process.
3 Results
3.1 Study population and baseline characteristics
We first compared baseline characteristics between mothers who successfully established lactation and those who experienced failure to identify potential confounding structures before multivariable adjustment. A total of 3,210 mother-infant dyads met inclusion criteria and were enrolled during the study period, with 2,494 (77.7%) achieving successful early lactation and 716 (22.3%) experiencing lactation failure within 72 h postpartum. The observed failure rate falls within ranges reported in recent systematic reviews of preterm populations (15%−30%), though direct comparisons are complicated by heterogeneous outcome definitions across studies. Maternal sociodemographic characteristics were largely balanced between groups, suggesting minimal confounding by measured demographic factors. Maternal age (success: 30.1 ± 5.0 years vs. failure: 30.5 ± 4.9 years, p = 0.073), pre-pregnancy BMI (23.0 ± 3.1 vs. 23.1 ± 3.1 kg/m2, p = 0.417), educational attainment distribution, and employment sector showed no meaningful differences (all p > 0.08). However, rural residence was slightly more prevalent among mothers experiencing lactation failure (38.7 vs. 36.1%, p = 0.041), suggesting potential socioeconomic or healthcare access disparities that warranted adjustment in multivariable models.
Cesarean delivery emerged as the most striking obstetric difference, occurring in 72.1% of the failure group vs. 62.8% of the success group (p < 0.001, OR = 1.53). This 9.3 percentage-point absolute difference represents a clinically meaningful disparity consistent with known physiological mechanisms (delayed lactogenesis II, reduced oxytocin release, postoperative pain interfering with positioning). Notably, antenatal complications including gestational diabetes (15.5 vs. 14.9%, p = 0.683), hypertensive disorders (11.0 vs. 9.6%, p = 0.245), and postpartum hemorrhage (9.9 vs. 8.7%, p = 0.323) showed no associations, suggesting that the mode of delivery itself, rather than underlying medical conditions, drives the cesarean-lactation relationship (Table 1).
Table 1. Maternal, neonatal, and psychosocial characteristics of the study cohort, stratified by early lactation outcome.
3.2 Psychosocial and early care factors
Given theoretical frameworks emphasizing maternal psychological state and early care practices as proximal determinants of lactation outcomes, we examined detailed psychosocial profiles and care practices to inform both variable selection and mechanistic interpretation. Mothers experiencing lactation failure exhibited substantially lower breastfeeding self-efficacy (BSES-SF: 48.5 ± 8.2 vs. 50.4 ± 8.1, p < 0.001, SMD = 0.233), with this 1.9-point difference representing nearly one-quarter of a standard deviation and falling within the clinically meaningful range. Mothers scoring below 50, a validated threshold associated with 85% sensitivity for early cessation, comprised 52.4% of the failure group vs. 45.1% of the success group. The failure group demonstrated significantly higher depressive symptomatology (EPDS: 9.4 ± 5.1 vs. 7.9 ± 4.7, p < 0.001, SMD = 0.312), with 28.6% scoring ≥10 (suggesting possible depression) compared to 19.8% in the success group. Family support scores were lower in the failure group (5.9 ± 2.0 vs. 6.2 ± 2.0, p < 0.001, SMD = 0.150), indicating that perceived social support was diminished among mothers struggling to establish breastfeeding. The distribution patterns for BSES-SF and EPDS scores demonstrated clear separation between groups in violin plots (Figures 1A, B), with moderate-to-strong effect sizes providing compelling preliminary evidence that maternal psychological state represents a critical determinant.
Figure 1. Psychosocial determinants and multivariable model results. (A) Violin plots showing BSES-SF score distributions by lactation outcome (success: orange, n = 2,494; failure: red, n = 716). Mann-Whitney U p < 0.001, SMD = 0.233. (B) EPDS score distributions by outcome. Mann-Whitney U p < 0.001, SMD = 0.312. (C) Forest plot of adjusted odds ratios (log scale) from multilevel logistic regression with 95% confidence intervals. Dashed vertical line indicates null effect (aOR = 1.0). Psychosocial factors show strong effects, followed by cesarean delivery and early care practices.
Neonatal profiles showed marginally lower gestational age in the failure group (33.4 ± 0.7 vs. 33.5 ± 0.7 weeks, p = 0.018), though the 0.1-week difference represents minimal clinical significance. Birth weight Z-scores and 5-min Apgar scores were similar between groups, indicating comparable acute perinatal adaptation. Respiratory support differed substantially and exhibited a clear dose-response pattern, with only 44.4% of the failure group requiring no support compared to 52.2% of the success group (p < 0.001), and invasive mechanical ventilation notably higher in the failure group (25.1 vs. 18.4%). Daily kangaroo mother care (KMC) duration was significantly shorter among dyads experiencing lactation failure (62.1 ± 26.2 vs. 68.3 ± 25.3 min/day, p < 0.001, SMD = 0.245), with 38.1% of the failure group receiving < 60 min/day compared to 30.2% of the success group. In contrast, preterm infant breastfeeding behavior scale (PIBBS) scores were comparable (36.5 ± 9.4 vs. 36.0 ± 9.1, p = 0.229), suggesting infant feeding competence did not differ systematically. Early pacifier use showed a non-significant trend toward higher prevalence in the failure group (38.7 vs. 35.2%, p = 0.086). These findings collectively indicate that psychosocial factors and modifiable care practices differentiate failure from success more consistently than neonatal biologic factors within the moderate preterm range. Full comparisons are detailed in Table 1.
3.3 Univariable associations with early lactation failure
Before constructing multivariable models, we conducted systematic univariable screening to identify candidate predictors (p < 0.20 threshold), assess crude effect directions, detect potential confounding structures, and inform functional form decisions. This screening approach follows best practices in prognostic model development, ensuring theoretically plausible variables receive fair consideration while avoiding overfitting. Advanced maternal age showed only borderline association with increased failure risk (OR = 1.02 per year, p = 0.073), while pre-pregnancy BMI and employment sector were unrelated (p > 0.25). Educational attainment showed a non-linear pattern, with mothers lacking formal education facing 58% higher odds of failure compared to those with basic schooling (OR = 1.58, p = 0.028), though higher education levels conferred no additional benefit, a threshold effect that informed our decision to collapse educational categories in multivariable models. Rural hukou status (OR = 1.12, p = 0.162) and NCMS insurance (OR = 1.20, p = 0.118) showed non-significant trends toward elevated risk.
Cesarean delivery emerged as the strongest univariable obstetric predictor, conferring 53% increased odds vs. vaginal birth (OR = 1.53, 95% CI: 1.27–1.83, p < 0.001), consistent with physiological mechanisms including delayed lactogenesis II and reduced oxytocin release. Parity showed no association (all p > 0.39), and pregnancy complications, gestational diabetes, hypertensive disorders, postpartum hemorrhage, were unrelated to outcomes. Psychosocial variables demonstrated the largest effect sizes: each one-point increase in BSES-SF reduced failure odds by 3% (OR = 0.97, p < 0.001), depression scores increased risk by 6% per point (OR = 1.06, p < 0.001), and family support was protective (OR = 0.93, p = 0.006). Earlier gestational age increased failure risk (OR = 0.85 per week, p = 0.024), while respiratory support exhibited dose-response patterns with invasive ventilation conferring 60% increased odds (OR = 1.60, p < 0.001). Daily KMC duration showed strong protection (OR = 0.91 per 10 min, p < 0.001), while pacifier use demonstrated borderline significance (OR = 1.16, p = 0.086). PIBBS scores showed no predictive value (OR = 1.01, p = 0.229). These patterns identified 11 candidate predictors for multivariable inclusion and revealed that psychosocial variables exerted the strongest effects (Table 2).
3.4 Multivariable model results and risk factors
To identify independent predictors while accounting for confounding and institutional clustering, we constructed a multilevel logistic regression model with random intercepts for postpartum wards, distinguishing individual-level risks from structural factors in this hierarchical hospital-based data. The final model incorporated 11 fixed-effect predictors and one random effect, demonstrating strong model fit (−2LL = 2,913.4; AIC = 2,935.4; BIC = 3,012.8), no multicollinearity [all variance inflation factor (VIFs) < 2.0], and modest but significant between-ward heterogeneity [ward variance σ2_u = 0.089, SE = 0.024; intraclass correlation coefficient (ICC) = 0.026], with 2.6% of variance attributable to ward-level factors, validating the multilevel approach and suggesting ward-level quality improvement could yield population-wide benefits.
Among psychosocial factors, breastfeeding self-efficacy emerged as the strongest protective factor (aOR = 0.96 per point, 95% CI: 0.94–0.98, p < 0.001), translating to 37% lower adjusted odds comparing the 25th percentile (BSES-SF = 44) vs. 75th percentile 56. Depression was the strongest risk factor (aOR = 1.08 per point, 95% CI: 1.04–1.12, p < 0.001), with moderate depression (EPDS = 12) conferring 83% higher adjusted odds vs. minimal symptoms (EPDS = 4). Family support showed borderline protection (aOR = 0.94, 95% CI: 0.88–1.00, p = 0.051), suggesting it operates partially through enhancing self-efficacy and buffering mood. Cesarean delivery independently increased failure odds by 42% (aOR = 1.42, 95% CI: 1.15–1.75, p = 0.001), while any respiratory support increased odds by 28% (aOR = 1.28, 95% CI: 1.05–1.56, p = 0.014), likely reflecting maternal-infant separation and heightened anxiety. Each 10-min daily increase in KMC duration reduced odds by 8% (aOR = 0.92, 95% CI: 0.87–0.97, p = 0.002), with mothers providing ≥ 80 min/day having 28% lower adjusted odds than those providing ≤ 50 min/day. Early pacifier use showed borderline risk (aOR = 1.23, 95% CI: 0.99–1.53, p = 0.062). Maternal age (p = 0.156), university education (p = 0.254), and parity ≥1 (p = 0.264) showed no independent associations, suggesting sociodemographic factors exert minimal direct effects within standardized hospital care. Relative effect sizes (Figure 1C) demonstrate psychosocial factors exert the strongest influence, followed by cesarean delivery, respiratory support, and KMC duration, all representing clinically actionable targets (Table 3).
3.5 Model performance and internal validation
To assess generalizability and predictive accuracy, we conducted rigorous internal validation using 1,000-bootstrap resampling, evaluating discrimination, calibration, and overall accuracy. The apparent C-statistic was 0.719 (95% CI: 0.694–0.744), indicating good discrimination, while the optimism-corrected C-statistic was 0.704 (95% CI: 0.681–0.727), reflecting modest optimism of 0.015 and minimal overfitting. This performance compares favorably to existing models: Hoban et al. (C = 0.68), SPIN calculator (C = 0.71), and generic early cessation models (C = 0.62–0.69). The ROC curve (Figure 2B) plots sensitivity (true positive rate) against 1-specificity (false positive rate) across all possible risk classification thresholds, from 0 to 1. This comprehensive visualization displays the full spectrum of sensitivity/specificity trade-offs available when varying the decision threshold; the curve does NOT represent a single fixed sensitivity/specificity pair. Each point along the ROC curve corresponds to a different classification threshold. The curve's substantial elevation above the diagonal reference line (representing chance performance, C = 0.50) confirms discriminative ability, with the area under the curve (C-statistic = 0.704) exceeding conventional benchmarks for acceptable clinical prediction models (C ≥ 0.70). The C-statistic represents the probability that the model assigns a higher risk score to a randomly selected mother experiencing lactation failure than to a randomly selected mother with successful lactation. For clinical implementation, we selected a specific operating point from the infinite possibilities shown on the ROC curve. At the clinically selected high-risk threshold (score ≥4, marked with red triangle on Figure 2B), the model achieves sensitivity of 42.5% and specificity of 72.9%, with positive predictive value of 31.0% and negative predictive value of 82.0% (see Supplementary Table 5 for complete confusion matrix). This threshold represents a deliberate balance between two clinical priorities: (1) identifying a substantial proportion of mothers who will experience lactation failure (sensitivity), and (2) maintaining practical feasibility by limiting intensive interventions to ~31% of the population (1-specificity = 0.271). This operating point reflects the screening context where false positives receive beneficial lactation support without harm, while false negatives still receive standard care protocols. The high-risk category demonstrates 1.39-fold enrichment in failure rate (31.0 vs. 22.3% baseline prevalence), enabling efficient resource allocation. Alternative thresholds would provide different sensitivity/specificity combinations; for example, score ≥3 achieves 61.5% sensitivity with 58.2% specificity (less targeted), while score ≥5 achieves 28.4% sensitivity with 85.6% specificity (highly specific but misses more failures).
Figure 2. Multilevel model validation, discrimination, calibration, and non-linearity. (A) Ward-level random intercepts (log-odds scale) with 95% CI across 16 postpartum wards. ICC = 0.026, ward variance = 0.089. Red dashed lines indicate ±1 SD. (B) Receiver operating characteristic (ROC) curve plotting sensitivity (y-axis) vs. 1-specificity (x-axis) across all possible classification thresholds. The curve represents the complete spectrum of threshold options, NOT a single operating point. Diagonal dashed line = chance performance (C = 0.50). Model curve elevation above chance confirms good discrimination: optimism-corrected C-statistic = 0.704 (95% CI: 0.681–0.727). Shaded region = 95% CI from bootstrap resampling. Red triangle marks the clinically selected threshold (score ≥4: sensitivity = 42.5%, specificity = 72.9%, PPV = 31.0%, NPV = 82.0%), chosen to balance case identification with intervention feasibility. (C) Calibration plot: predicted probabilities (x-axis) vs. observed frequencies (y-axis). Purple circles = decile groups (size ∞ sample size). Diagonal dashed line = perfect calibration. Model demonstrates excellent agreement: calibration slope = 0.952 (ideal = 1.0), calibration-in-the-large = −0.018 (ideal = 0), Hosmer-Lemeshow χ2 = 10.2 (df = 8, p = 0.267). Restricted cubic spline (4 knots: 5th, 35th, 65th, 95th percentiles) assessing gestational age (32.0–34.9 weeks) association with failure probability. Shaded region = 95% CI. Linear relationship justified: non-linearity test p = 0.41. Rug plot shows observed data distribution. (D) Non-linear spline of the modeled probability of lactation failure across gestational age (solid line) with 95% confidence band (shaded); rugs denote failures (top) and successes (bottom).
Calibration-in-the-large was −0.018 (95% CI: −0.089 to 0.053), indicating nearly perfect agreement, while calibration slope was 0.952 (95% CI: 0.847–1.057), approaching the ideal value of 1.0. Hosmer-Lemeshow test (χ2 = 10.2, df = 8, p = 0.267) supported good fit, and the calibration plot (Figure 2C) confirmed strong agreement between predicted and observed probabilities across the entire risk spectrum. Brier score was 0.152 (95% CI: 0.144–0.161), with scaled Brier score of 0.124 indicating 12.4% improvement over the null model. Bootstrap validation revealed optimism of 0.015 for C-statistic and uniform shrinkage factor of 0.967, indicating minimal overfitting likely reflecting large sample size (65 events per candidate predictor), prespecified variable selection, and simple functional forms. Flexible spline modeling revealed a linear gestational age-failure relationship within 32–35 weeks (Figure 2D, non-linearity test p = 0.41), justifying linear modeling. Full validation metrics (Table 4) demonstrate good discrimination, excellent calibration, strong overall accuracy, and minimal overfitting.
3.6 Sensitivity and subgroup analyses
Multiple sensitivity analyses affirmed the robustness of the findings. Complete case analysis excluding 30 dyads with missing KMC data (n = 3,180) yielded consistent results: BSES-SF remained protective (OR = 0.96, 95% CI: 0.94–0.98, p < 0.001), EPDS retained its risk association (OR = 1.08, 95% CI: 1.04–1.12, p < 0.001), and cesarean delivery remained significantly associated with failure (OR = 1.45, 95% CI: 1.17–1.79, p = 0.001). Pre-specified interactions (maternal education × parity; pacifier use × delivery mode) were non-significant (all FDR-adjusted p > 0.05). Subgroup analysis of neonates without respiratory support (n = 1,620) showed similar effect sizes: BSES-SF (OR = 0.95, 95% CI: 0.93–0.98, p < 0.001), EPDS (OR = 1.09, 95% CI: 1.04–1.14, p < 0.001), cesarean delivery (OR = 1.38, 95% CI: 1.05–1.82, p = 0.021), and KMC duration (OR = 0.90, 95% CI: 0.84–0.97, p = 0.005), corroborating generalizability across clinical strata as, shown in Table 5.
3.7 Risk stratification and clinical translation
To facilitate clinical implementation, we translated the multivariable model into a simplified five-factor risk score using integer scoring systematically derived from standardized β-coefficients. Point assignments were calculated by dividing each predictor's β-coefficient by the smallest significant coefficient (BSES-SF, β = −0.041), then rounding to the nearest integer: BSES-SF < 50 (+1 point, β ratio = 1.0), cesarean delivery (+1, β ratio = 0.85), respiratory support (+1, β ratio = 0.61), KMC < 60 min/day (+1, β ratio = 0.93), and EPDS ≥ 10 (+2 points, β ratio = 1.88), with EPDS weighted approximately twice as heavily given its larger standardized effect. This empirically-derived scoring system (range 0–6) preserves relative predictor importance while enabling rapid bedside risk assessment.
The tool stratified participants into three risk categories demonstrating clear dose-response (Figure 3B): low-risk (score 0–1; n = 602, 18.7%) with 13.0% failure rate, moderate-risk (score 2–3; n = 1,627, 50.7%) with 20.5% failure rate (RR = 1.58, 95% CI: 1.25–2.00), and high-risk (score 4–6; n = 981, 30.6%) with 31.0% failure rate (RR = 2.39, 95% CI: 1.89–3.01; linear trend p < 0.001). The high-risk category, while representing 30.6% of the population, captured 42.5% of all failures—a 1.39-fold enrichment enabling efficient targeted intervention. The five-factor tool achieved C-statistic of 0.681 (95% CI: 0.657–0.705), with sensitivity 42.5% and specificity 72.9% using high-risk threshold. Decision curve analysis demonstrated net benefit over treat-all and treat-none strategies across threshold probabilities 0.05–0.40, with peak benefit at 15%−20% threshold corresponding to moderate-to-high risk categories (Figure 3C). Alternative scoring systems (uniform one-point weights, continuous scoring) yielded inferior discrimination (C-statistic 0.67–0.68), validating the β-coefficient-derived approach.
Figure 3. Performance and utility of a lactation failure prediction model. (A) Education × Parity Interaction: Predicted probability of lactation failure decreases with increasing parity, especially among women with university education. (B) Risk Stratification Performance: Lactation failure rates increase across risk groups—13.0% in low-risk, 20.5% in moderate-risk, and 31.0% in high-risk groups. (C) Decision Curve Analysis: The prediction model shows higher net benefit compared to “treat all” or “treat none” strategies across a wide range of threshold probabilities. (D) Population Impact Analysis: Risk stratification achieves the greatest population-level reduction in lactation failure (failures prevented per 100 patients), albeit at higher cost.
3.8 Dose-response relationships and population impact
Strong dose–response associations were observed for all key modifiable predictors. Lower BSES-SF quartiles were progressively associated with higher failure rates: Q1 ( ≤ 44) had a 28.8% failure rate, compared to 22.4% in Q2 (45–50), 21.0% in Q3 (51–56), and 16.1% in Q4 (>56; p < 0.001). Adjusted odds ratios relative to Q1 were 0.72 (95% CI: 0.58–0.89), 0.66 (95% CI: 0.53–0.82), and 0.48 (95% CI: 0.37–0.61), respectively. Similarly, longer KMC duration showed graded protection: ≤ 60 min (26.0%), 61–80 min (21.7%), and >80 min (19.2%; p < 0.001), with corresponding ORs of 0.79 (95% CI: 0.65–0.96) and 0.68 (95% CI: 0.56–0.83) for medium and high categories. EPDS scores also stratified risk, with failure rates of 16.4% (0–5), 21.2% (6–10), and 31.8% (≥11; p < 0.001), and ORs of 1.37 (95% CI: 1.12–1.68) and 2.41 (95% CI: 1.94–2.99) for moderate and high scores, respectively (as shown in Supplementary Table 2).
Population-level impact metrics further highlighted the contribution of these factors. Cesarean delivery accounted for the highest proportion of lactation failures (20.4%), followed by low self-efficacy (18.9%), maternal depression (18.2%), respiratory support (13.4%), and inadequate KMC (< 60 min/day; 6.2%; Supplementary Table 3). Corresponding NNT values to prevent one failure were 11 for EPDS-targeted intervention, 14 for improving BSES-SF or avoiding cesarean delivery, and 18 for increasing KMC duration (Supplementary Table 4). These findings are synthesized in the population impact analysis (Figure 3D), illustrating the relative effectiveness and preventive potential of targeted interventions.
3.9 Ward-level variation and structural factors
Lactation failure rates varied across the 16 postpartum wards, with random intercepts spanning −0.75 to +0.50 on the log-odds scale (Figure 2A), corresponding to ICC = 0.026 indicating 2.6% of variance attributable to ward-level factors. While this proportion is modest relative to individual-level determinants, it justifies the multilevel modeling approach and identifies structural targets for quality improvement initiatives such as nurse staffing optimization, lactation consultant availability, unit culture regarding breastfeeding support, and family visitation policies. The education × parity interaction revealed modest heterogeneity in university education's protective effect by parity, though not statistically significant after multiple testing correction (Figure 3A). Decision curve analysis confirmed clinical utility, with the model demonstrating net benefit over treat-all or treat-none approaches across relevant probability thresholds (Figure 3C), supporting implementation in clinical decision-making. These findings indicate that while individual-level interventions should be prioritized given their larger effects, ward-level quality improvement represents an important complementary strategy that could benefit all patients through enhanced systems of care.
4 Discussion
The present prospective multilevel cohort demonstrates that early lactation failure in moderate preterm neonates is common and nutrition-critical, arising from a multifactorial interplay of psychosocial, obstetric, neonatal, and early-care determinants. Failure occurred in 22.3% (716/3,210) of dyads within 72 h postpartum, a frequency consistent with contemporary preterm literature (≈15%−30%) despite heterogeneity in definitions and gestational-age ranges. Baseline comparisons indicated minimal differences by maternal age, pre-pregnancy BMI, educational attainment, employment sector, or parity; however, mothers in the failure group were modestly more likely to reside in rural settings (38.7 vs. 36.1%; p = 0.041) and to deliver by cesarean section (72.1 vs. 62.8%; p < 0.001). Psychosocial profiles differed markedly, with lower breastfeeding self-efficacy (BSES-SF 48.5 vs. 50.4; p < 0.001), higher depressive symptoms (EPDS 9.4 vs. 7.9; p < 0.001), and slightly lower family support (5.9 vs. 6.2; p < 0.001) among mothers with failure. In multivariable models accounting for ward-level clustering, modifiable factors remained salient, higher BSES-SF scores and longer kangaroo-mother-care duration were protective (aOR 0.96 per point, 95% CI 0.94–0.98; aOR 0.92 per 10 min, 95% CI 0.87–0.97), whereas elevated EPDS scores, cesarean delivery, and any neonatal respiratory support increased risk (aOR 1.08 per point, 95% CI 1.04–1.12; aOR 1.42, 95% CI 1.15–1.75; aOR 1.28, 95% CI 1.05–1.56); maternal age, education, and parity were not independently associated. Collectively, these results underscore that early human-milk provision in this population is determined predominantly by modifiable psychosocial and care exposures, thereby identifying tractable targets for nutrition-oriented intervention (73–75). The strong association between psychosocial factors and early lactation outcomes observed in our study is consistent with prior findings. Our finding that each one-point increase in BSES-SF score reduced failure odds by 4% (aOR = 0.96) aligns closely with Dennis et al.'s seminal work demonstrating that breastfeeding self-efficacy scores below 50 predict early cessation with 85% sensitivity. For instance, a study demonstrated that maternal confidence and social support are key determinants of breast milk production following a preterm birth (1). Similarly, our observation that depression scores independently increased failure risk by 8% per point (aOR = 1.08) is remarkably consistent with Stuebe et al.'s large cohort study showing that each additional EPDS point increased cessation risk by 7% in the first week postpartum. Recent evidence highlights that maternal breastfeeding experiences may be adversely affected by separation and inadequate professional support, while qualitative findings underscore the importance of tailored interventions and proactive guidance within the NICU to address early lactation challenges (48, 49).
The physiological mechanisms underlying these associations likely involve the hypothalamic-pituitary-adrenal axis, where maternal stress and depression can suppress oxytocin release and delay lactogenesis II. Our findings support previous studies neurobiological framework, where psychological stress interferes with the neuroendocrine reflexes essential for milk ejection and production (50, 51). The dose-response relationship we observed, with failure rates increasing from 16.4% in mothers with low depression scores to 31.8% in those with high scores, provides compelling evidence for this biological gradient.
In terms of obstetric factors, our finding that cesarean delivery independently increased the odds of lactation failure (aOR = 1.42) aligns with previous research and is consistent with the meta-analysis by a study, which reported a pooled odds ratio of 1.37 for lactation failure following cesarean delivery (52). Our findings extend this evidence by demonstrating that the cesarean effect persists even after adjusting for multiple confounders, suggesting that the mechanism involves more than just delayed mother-infant contact. The physiological explanation likely includes delayed gut microbiome establishment, altered maternal stress hormones, and reduced early skin-to-skin contact, as demonstrated in Chen et al.'s prospective cohort of 1,200 mother-infant dyads. Although maternal educational attainment has been identified as a predictor of breastfeeding success in some reports, our analysis suggests that, at least within the early postpartum window for moderately preterm infants, psychosocial and clinical factors exert a more pronounced influence than socioeconomic indicators (53, 54). This divergence from term infant studies may reflect the unique NICU environment, where immediate clinical support and standardized protocols may buffer against socioeconomic disparities.
Neonatal factors also played a notable role. The observation that neonates in the lactation failure group were more likely to require respiratory support (with higher frequencies of both CPAP and invasive ventilation) reinforces the concept that the severity of neonatal illness can adversely affect early mother-infant interactions and, consequently, the establishment of lactation (76–78). Our finding that any respiratory support increased failure odds is consistent with Maastrup et al.'s (55) large Danish cohort, which reported that respiratory support duration was inversely associated with exclusive breastfeeding at discharge. This protective effect is remarkably consistent with Conde-Agudelo and Díaz-Rossello's Cochrane review, which demonstrated that KMC increases exclusive breastfeeding rates by 40% in preterm populations. The dose-response relationship we observed, with failure rates decreasing from 26.0% in low-KMC mothers to 19.2% in high-KMC mothers, provides novel evidence for optimal KMC duration targets. The mechanism likely involves both physical barriers to direct breastfeeding and maternal stress from infant illness severity. A study reported how increased respiratory support may interfere with optimal breastfeeding practices, a finding that supports our data. Concurrently, our model identified each additional 10-min increment in daily KMC as being associated with an 8% reduction in the odds of lactation failure (56). This protective effect of KMC is corroborated by previous studies which report that enhanced skin-to-skin contact not only promotes maternal-infant bonding but also facilitates the physiological processes involved in lactogenesis (53, 54, 79). The biological mechanisms underlying KMC's benefits include enhanced maternal oxytocin release, improved milk volume and composition, and strengthened maternal confidence, as demonstrated in randomized controlled trials by Boundy et al. (57).
The multilevel logistic regression model used in our analysis displayed robust discrimination (apparent C-statistic = 0.719) and calibration. Our optimism-corrected C-statistic of 0.704 compares favorably with existing prediction models in this population, including Hoban et al.'s model (C-statistic = 0.68) and the SPIN risk calculator (C-statistic = 0.71), while offering superior calibration performance. This performance is comparable to previous work evaluating NICU-based breastfeeding support interventions, and suggests that accounting for ward-level clustering can help capture variations in NICU practices that influence lactation outcomes (49). The excellent calibration (calibration slope = 0.952, Hosmer-Lemeshow p = 0.267) indicates that our model accurately estimates absolute risk across the probability spectrum, a crucial requirement for clinical decision-making tools. Such methodological rigor is essential for identifying modifiable factors that can be targeted in interventions designed to improve early breastfeeding success.
Our optimism-corrected C-statistic of 0.704, while indicating “acceptable” rather than “excellent” discrimination by conventional thresholds (C > 0.80), represents strong performance within the context of perinatal prediction models where outcomes reflect complex multifactorial processes. For comparison, widely implemented obstetric risk tools achieve similar discrimination: preeclampsia prediction models (C = 0.65–0.76), preterm birth prediction (C = 0.64–0.73), and gestational diabetes screening (C = 0.70–0.78). Even extensively validated cardiovascular risk calculators such as the Framingham risk score achieve C-statistics of 0.70–0.75 in external validation. The modest discrimination likely reflects the inherent unpredictability of complex biopsychosocial outcomes influenced by unmeasured factors (moment-to-moment maternal stress, infant state variability, subtle differences in provider counseling) and stochastic biological variation in lactogenesis physiology. Importantly, our model's excellent calibration (slope = 0.952, Hosmer-Lemeshow p = 0.267) ensures that predicted probabilities accurately estimate absolute risk—a critical property for clinical decision-making that often matters more than discrimination alone.
Our optimism-corrected C-statistic of 0.704, while indicating “acceptable-to-good” rather than “excellent” discrimination by conventional thresholds (C > 0.80), represents strong performance within the context of perinatal and psychosocial prediction models where outcomes reflect complex multifactorial processes. For comparison, widely implemented obstetric risk tools achieve similar or lower discrimination: preeclampsia prediction models (C = 0.65–0.76), preterm birth prediction algorithms (C = 0.64–0.73), gestational diabetes screening tools (C = 0.70–0.78), and postpartum depression prediction (C = 0.68–0.74). Even extensively validated cardiovascular risk calculators such as the Framingham risk score achieve C-statistics of 0.70–0.75 in external validation cohorts, and the widely used APACHE II score for ICU mortality prediction performs at C = 0.70–0.85 depending on population. Our model's C-statistic of 0.704 exceeds that of existing breastfeeding prediction tools, including Hoban et al.'s model (C = 0.68) and the SPIN risk calculator (C = 0.71), while offering superior calibration performance. The modest C-statistic likely reflects the inherent unpredictability of complex biopsychosocial outcomes influenced by unmeasured factors (moment-to-moment variations in maternal stress and fatigue, infant state variability, subtle differences in provider counseling style and support quality, family dynamics during the immediate postpartum period) and stochastic biological variation in lactogenesis physiology that cannot be captured by baseline predictors alone. Additionally, the narrow gestational age range (32.0–34.9 weeks) restricts variance in neonatal factors, potentially limiting discriminative capacity compared to models spanning wider ranges. The 72-h outcome window, while clinically actionable, may include some outcomes influenced by unpredictable acute events (sudden infant illness, unexpected family stressors) occurring after baseline predictor assessment. Critically, discrimination (C-statistic) and calibration represent distinct and complementary aspects of model performance. While our C-statistic of 0.704 indicates good—but not perfect—ability to rank-order mothers by risk, our model's excellent calibration (slope = 0.952, calibration-in-the-large = −0.018, Hosmer-Lemeshow p = 0.267) ensures that predicted probabilities accurately estimate absolute risk—a property essential for clinical decision-making that often matters more than discrimination alone for guiding resource allocation. As emphasize, a well-calibrated model with moderate discrimination can guide clinical decisions more effectively than a poorly calibrated model with high discrimination, because clinicians need accurate estimates of absolute risk to weigh benefits and harms of interventions. Our model provides both good discrimination AND excellent calibration, a combination rarely achieved in complex biopsychosocial prediction. Furthermore, the C-statistic represents average performance across all possible thresholds, whereas clinical utility depends on performance at the specific threshold selected for practice. Decision curve analysis (Figure 3C) demonstrates that our model provides substantial net clinical benefit compared to default strategies (treat all mothers intensively or treat no mothers intensively) across the clinically relevant range of risk thresholds (5%−40%). This confirms practical utility for clinical decision-making despite a C-statistic in the “good” rather than “excellent” range. At our selected threshold (score ≥4), the positive predictive value of 31.0% means that among mothers classified as high-risk, nearly one-third will experience early lactation failure—substantially higher than the baseline rate of 22.3%, representing 1.39-fold enrichment. The negative predictive value of 82.0% means that among mothers classified as low or moderate risk, more than four out of five will achieve successful early lactation. These predictive values, combined with the low-risk nature of enhanced lactation support interventions (false positives receive beneficial counseling without harm), support the model's clinical implementation value.
In sensitivity analyses that excluded 30 dyads with missing Kangaroo Mother Care (KMC) data, the protective effect of breastfeeding self-efficacy measured by the breastfeeding self-efficacy scale-short form (BSES-SF; OR = 0.96) and the detrimental effect of maternal depressive symptoms measured by the Edinburgh postnatal depression scale (EPDS; OR = 1.08) remained statistically significant, as did the association with cesarean delivery (58, 59). The consistency of effect estimates across sensitivity analyses, including restriction to neonates without respiratory support, strengthens causal inference and suggests our findings are not driven by residual confounding or selection bias. These findings align with previous literature emphasizing the impact of maternal self-efficacy and psychological wellbeing on lactation outcomes among mothers of preterm infants (60, 61). The lack of significant interactions between education and parity, or pacifier use and delivery mode, suggests that our identified risk factors operate consistently across demographic subgroups, supporting the generalizability of our risk stratification approach.
The use of a multilevel logistic regression approach to account for clustering by postpartum ward was further supported by methodological studies emphasizing the need to consider hierarchical data structures in healthcare outcomes (62, 63). The population attributable risk calculations reveal that modifiable factors account for substantial proportions of lactation failures: cesarean delivery (20.4%), low self-efficacy (18.9%), depression (18.2%), and inadequate KMC (6.2%). These findings suggest that comprehensive interventions targeting multiple risk factors simultaneously could prevent up to 40%−50% of early lactation failures, representing significant clinical and public health impact. Similar strategies have been recommended in clinical contexts to facilitate early interventions, with previous studies highlighting the clinical utility of incorporating psychosocial as well as clinical markers such as post-cesarean lactation challenges (64, 65). Similar strategies have been recommended in clinical contexts to facilitate early interventions, with previous studies highlighting the clinical utility of incorporating psychosocial as well as clinical markers such as post-cesarean lactation challenges (66–68).
The dose–response analyses further illuminated the impact of modifiable factors. Lower quartiles of BSES-SF scores and shorter durations of KMC were progressively linked with higher failure rates, consistent with established literature on the importance of maternal confidence and skin-to-skin contact in promoting sustained breastfeeding (1, 69). The strong linear trends observed (all p < 0.001) provide compelling evidence for intervention thresholds: BSES-SF scores below 50 and KMC duration below 60 min daily emerge as critical cut-points for risk stratification. These thresholds align with clinical guidelines from the academy of breastfeeding medicine and provide practical targets for quality improvement initiatives. Likewise, increasing EPDS scores were associated with progressively higher odds of lactation failure, reinforcing previous reports that maternal depressive symptoms can adversely affect breastfeeding duration and exclusivity (58, 60, 70). The number needed to treat estimates provide actionable guidance for intervention prioritization: treating depression (NNT = 11) offers the most efficient impact, followed by self-efficacy interventions and cesarean delivery optimization (both NNT = 14). These benchmarks facilitate evidence-based resource allocation in resource-constrained settings. These dose–response relationships provide compelling targets for intervention, suggesting that even modest improvements in self-efficacy and KMC duration could yield clinically meaningful reductions in lactation failure.
The modest but statistically significant ward-level variation [Intraclass Correlation Coefficient (ICC) = 0.026, 2.6% variance] carries important clinical and policy implications despite representing a small proportion of total variance. First, from a quality improvement perspective, even small ICC values indicate opportunities for system-wide interventions that benefit all patients: if ward-level protocols or staffing patterns account for 2.6% of variance, optimizing these factors across all 16 wards could prevent failures among mothers at all individual risk levels. Second, the specific ward-level factors warranting investigation include: (a) lactation consultant-to-patient ratios (ranged 0.8–2.4 FTE per 100 deliveries across wards in our institution), (b) nurse staffing patterns and continuity of care (nurse-to-patient ratios, shift lengths, care team consistency), (c) unit policies regarding kangaroo mother care facilitation (availability of reclining chairs, privacy screens, flexible visitation hours for partners), (d) family visitation policies and support for fathers/grandmothers, (e) ward culture and staff attitudes toward breastfeeding (assessed via qualitative methods), (f) physical environment factors (NICU vs. transitional care unit, single-family rooms vs. bay-style arrangements), and (g) integration of lactation consultants with medical teams (co-rounding, protocols for automatic consultation triggers). Third, the fact that individual-level factors explain 97.4% of variance indicates that targeted interventions based on our risk tool will be effective across diverse ward contexts—the tool need not be recalibrated for different units within our institution. Future multilevel research with larger numbers of institutions (level 2) and longer follow-up could identify hospital-level characteristics (baby-friendly hospital initiative accreditation, teaching status, geographic region) associated with better lactation outcomes, informing policy interventions. This finding supports prior multilevel studies which have shown that although most variance is attributable to individual factors, contextual factors at the ward or community level can further influence breastfeeding practices (59, 62, 71). Future research should examine specific organizational characteristics that explain between-ward variation, potentially identifying best practices that could be scaled across units.
Notably, the lack of significant interactions, specifically between maternal education and parity, suggests that the primary drivers of early lactation failure in this study may be more directly linked to clinical and psychosocial determinants rather than certain sociodemographic factors. This finding challenges assumptions about health disparities in the NICU setting and suggests that the standardized, medicalized environment may attenuate typical socioeconomic gradients observed in community populations. This nuance diverges from some earlier reports that have identified demographic heterogeneity in breastfeeding outcomes, inviting further research into the interplay of these variables (72).
The number-needed-to-treat (NNT) estimates for EPDS-targeted interventions (NNT = 11) and for improvements in BSES-SF or avoiding cesarean delivery (NNT = 14) provide actionable benchmarks for clinical practice. The decision curve analysis confirming net benefit across relevant risk thresholds further supports the model's potential utility in guiding resource allocation and intervention prioritization. Current study also, reinforces earlier findings of the central role of maternal psychological state and breastfeeding self-efficacy in early lactation success among mothers of preterm neonates (60, 61, 64). The integrated multilevel approach not only confirms known individual-level determinants but also highlights the modest yet meaningful impact of structural factors. Future research should aim to validate this risk stratification tool in diverse clinical settings and explore interventions tailored to the identified modifiable risk factors, thereby potentially reducing early lactation failure and improving neonatal health outcomes (1, 65, 69, 70).
We acknowledge that genetic factors were not examined in this analysis. While genetic polymorphisms in prolactin receptor genes (PRLR), oxytocin receptor genes (OXTR), and genes encoding milk protein synthesis may influence lactation physiology, such factors were beyond the scope of this investigation for several reasons. First, our study focused on modifiable determinants amenable to clinical intervention within the 72-h postpartum window. Genetic factors, while scientifically important, cannot be modified through immediate clinical action. Second, the pathophysiological mechanisms of early lactation failure in the first 72 h are predominantly driven by hormonal priming, infant feeding competence, and maternal psychological state, factors more proximate to the outcome than genetic predisposition. Third, genetic studies require substantially larger sample sizes with population stratification controls and were not feasible given resource constraints. Future research integrating genetic, epigenetic, and microbiome data with psychosocial determinants would provide a more comprehensive understanding of lactation biology in preterm populations.
Study design constraints limit generalizability. Our single-center tertiary referral hospital in urban Eastern China (cesarean rate 64.9% vs. national 41.1%; Level III NICU with full subspecialty services) may not represent primary care hospitals, rural facilities, or regions with different resources, cultural norms regarding postpartum confinement (zuo yuezi), or patient populations. The 72-h outcome window, while clinically relevant for acute failure identification, does not capture late lactation failure (days 4–14), gradual supply reduction (1–3 months), or sustained duration (6–12 months)—distinct outcomes with potentially different determinants. Self-report bias may affect psychosocial measures despite validated instruments, observer variability existed despite high inter-rater reliability (κ > 0.85), and the 5 ml threshold, though physiologically justified and empirically validated (AUC = 0.84), represents a somewhat arbitrary cutpoint. External validation is essential across diverse settings: primary/secondary hospitals with limited lactation support, rural populations, different cesarean/preterm birth rates, and international sites to assess cross-cultural validity and need for recalibration.
Study design constraints limit generalizability. Our single-center tertiary referral hospital in urban Eastern China (cesarean rate 64.9% vs. national 41.1%; Level III NICU with full subspecialty services) may not represent primary care hospitals, rural facilities, or regions with different resources, cultural norms regarding postpartum confinement (zuo yuezi), or patient populations. The 72-h outcome window, while clinically relevant for acute failure identification, does not capture late lactation failure (days 4–14), gradual supply reduction (1–3 months), or sustained duration (6–12 months)—distinct outcomes with potentially different determinants. Self-report bias may affect psychosocial measures despite validated instruments, observer variability existed despite high inter-rater reliability (κ > 0.85), and the 5 ml threshold, though physiologically justified and empirically validated (AUC = 0.84), represents a somewhat arbitrary cutpoint. External validation is essential across diverse settings: primary/secondary hospitals with limited lactation support, rural populations, different cesarean/preterm birth rates, and international sites to assess cross-cultural validity and need for recalibration.
Methodological limitations affect causal inference. Unmeasured confounders include: detailed breastfeeding history, maternal conditions affecting lactation (PCOS, thyroid disorders, breast surgery), peripartum medications, quality of professional lactation support, family dynamics and cultural beliefs, subtle neonatal feeding problems not captured by PIBBS, and provider-level variability in counseling practices. The observational design precludes definitive causation despite temporal ordering, comprehensive adjustment, sensitivity analyses, and dose-response relationships. Reverse causation (low family support resulting from rather than causing lactation struggles) and residual confounding from mismeasured covariates remain possible. Limited power for interaction testing (80% power for interaction ORs ≥2.0) may have missed effect heterogeneity. Bootstrap internal validation provides optimism-corrected estimates for our source population but cannot assess geographic or temporal transportability; external validation in independent cohorts from different institutions, regions, time periods, and healthcare systems is essential before widespread implementation.
5 Conclusion
This multilevel analysis provides robust evidence that early lactation failure in moderate preterm populations is driven by multifactorial determinants, with maternal breastfeeding self-efficacy and depressive symptoms emerging as the most potent modifiable predictors. Dose-response relationships indicate that even modest improvements in psychological state or kangaroo care duration may substantially reduce failure risk. The internally validated risk stratification tool demonstrated strong discrimination and excellent calibration, offering immediate clinical utility for identifying high-risk dyads. Population impact estimates reveal that addressing cesarean delivery protocols, self-efficacy support, and depression screening could prevent a substantial proportion of early lactation failures with modest resource investments. Ward-level variation, while modest, supports combining individual-level screening with structural interventions. The consistency of effect estimates across sensitivity analyses and subgroups strengthens confidence in these findings, though external validation remains essential. Future research should focus on external validation in multicenter cohorts, randomized controlled trials testing multicomponent interventions, economic evaluations, investigation of optimal kangaroo care protocols, development of real-time prediction algorithms for electronic health records, and exploration of genetic factors influencing lactation physiology. Such work will advance evidence-based neonatal nutrition and precision lactation support tailored to individualized risk profiles.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
This study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of the Department of Pediatrics, Nantong First People's Hospital, Jiangsu, China (Ethics Committee Reference: IRB-2022-045). Written informed consent was obtained from all participants prior to enrollment.
Author contributions
FH: Investigation, Methodology, Funding acquisition, Software, Writing – original draft, Resources, Data curation, Conceptualization, Project administration, Formal analysis. LS: Data curation, Formal analysis, Project administration, Investigation, Methodology, Software, Conceptualization, Writing – original draft. XC: Project administration, Funding acquisition, Formal analysis, Methodology, Conceptualization, Data curation, Investigation, Writing – original draft, Software. YZ: Writing – original draft, Investigation, Funding acquisition, Data curation, Formal analysis, Conceptualization, Project administration. YW: Methodology, Data curation, Conceptualization, Software, Investigation, Funding acquisition, Visualization, Formal analysis, Writing – original draft, Resources. JW: Investigation, Conceptualization, Methodology, Funding acquisition, Formal analysis, Visualization, Data curation, Writing – original draft. RZ: Resources, Writing – review & editing, Formal analysis, Methodology, Data curation, Supervision, Conceptualization, Investigation. JJ: Software, Investigation, Supervision, Methodology, Funding acquisition, Conceptualization, Formal analysis, Visualization, Writing – review & editing, Project administration, Data curation, Validation, Resources.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by Scientific research project of Nantong Municipal Health Commission, Jiangsu Province (Nos. MSZ2023013 and QNZ2023030) and 2023 Nantong Municipal Health Commission Scientific Research Project (No. MSZ2023021).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2025.1668778/full#supplementary-material
References
1. Krawczyk A, Czerwińska-Osipiak A, Szablewska A, Rozmarynowska W. Psychosocial factors influencing breastmilk production in mothers after preterm birth: the role of social support in early lactation success—a cross-sectional study. Nutrients. (2024) 16:3883. doi: 10.3390/nu16223883
2. Aryeetey R, Dykes F. Global implications of the new WHO and UNICEF implementation guidance on the revised baby-friendly hospital initiative. Matern Child Nutr. (2018) 14:e12637. doi: 10.1111/mcn.12637
3. Ward VC, Lee AC, Hawken S, Otieno NA, Mujuru HA, Chimhini G, et al. Overview of the global and US burden of preterm birth. Clin Perinatol. (2024) 51:301–11. doi: 10.1016/j.clp.2024.02.015
4. Camacho-Morales A, Caba M, García-Juárez M, Caba-Flores MD, Viveros-Contreras R, Martínez-Valenzuela C. Breastfeeding contributes to physiological immune programming in the newborn. Front Pediatr. (2021) 9:744104. doi: 10.3389/fped.2021.744104
5. Jing S, Chen C, Gan Y, Vogel J, Zhang J. Incidence and trend of preterm birth in China, 1990-2016: a systematic review and meta-analysis. BMJ Open. (2020) 10:e039303. doi: 10.1136/bmjopen-2020-039303
6. Moukarzel S, Mamas C, Farhat A, Abboud AA, Daly AJ. A qualitative examination of barriers against effective medical education and practices related to breastfeeding promotion and support in Lebanon. Med Edu Online. (2020) 25:1723950. doi: 10.1080/10872981.2020.1723950
7. Deng K, Liang J, Mu Y, Liu Z, Wang Y, Li M, et al. Preterm births in China between 2012 and 2018: an observational study of more than 9 million women. Lancet Glob Health. (2021) 9:e1226–41. doi: 10.1016/S2214-109X(21)00298-9
8. Scime NV, Metcalfe A, Nettel-Aguirre A, Nerenberg K, Seow CH, Tough SC, et al. Breastfeeding difficulties in the first 6 weeks postpartum among mothers with chronic conditions: a latent class analysis. BMC Pregnancy Childbirth. (2023) 23:90. doi: 10.1186/s12884-023-05407-w
9. Lojander J, Axelin A, Niela-Vilén H. Breastfeeding exclusivity, difficulties, and support in the first days after hospital discharge: a correlational study. Eur J Obstet Gynecol Reprod Biol. (2024) 296:76–82. doi: 10.1016/j.ejogrb.2024.02.029
10. Shi R, Dong X, Wang L, Zhang W, Reddy S, Yu Y. The Chinese adverse prognosis of very preterm infants (CARE-Preterm) cohort: study design and baseline characteristics for a prospective multicenter cohort study. BMC Pediatr. (2025) 25:377. doi: 10.1186/s12887-025-05722-5
11. Huang X, Zhang L, Zhang C, Li J, Li C. Postpartum depression in Northeastern China: a cross-sectional study 6 weeks after giving birth. Front Public Health. (2025) 13:1570654. doi: 10.3389/fpubh.2025.1570654
12. Baeisa RS, Aldardeir NF, Alsulami MT, Alsulami AH, Al-Sharif JH, Alshahrany TA, et al. Breastfeeding self-efficacy is inversely associated with postpartum depression: findings from a tertiary hospital in Saudi Arabia. Saudi J Med Med Sci. (2023) 11:292–8. doi: 10.4103/sjmms.sjmms_601_22
13. Liu L, Feng S, Zhang Y, Xiao G, Zhou M, Li X, et al. The role of breastfeeding self-efficacy in the relationship between perinatal depressive symptoms and exclusive breastfeeding: a longitudinal mediation analysis. BMC Pregnancy Childbirth. (2025) 25:356. doi: 10.1186/s12884-025-07481-8
14. Amini P, Omani-Samani R, Sepidarkish M, Almasi-Hashiani A, Hosseini M, Maroufizadeh S. The breastfeeding self-efficacy scale-short form (BSES-SF): a validation study in Iranian mothers. BMC Res Notes. (2019) 12:622. doi: 10.1186/s13104-019-4656-7
15. Maurer L, Schultz A, Dennis CL, Alexandrowicz RW, McQueen K. The breastfeeding self-efficacy scale-short form (BSES-SF): German translation and psychometric assessment. J Hum Lact. (2024) 40:374–85. doi: 10.1177/08903344241254108
16. Dennis CL, McQueen K, Dol J, Brown H, Beck C, Shorey S. Psychometrics of the breastfeeding self-efficacy scale and short form: a systematic review. BMC Public Health. (2024) 24:637. doi: 10.1186/s12889-024-17805-6
17. Tolppola O, Renko M, Sankilampi U, Kiviranta P, Hintikka L, Kuitunen I. Pacifier use and breastfeeding in term and preterm newborns-a systematic review and meta-analysis. Eur J Pediatr. (2022) 181:3421–8. doi: 10.1007/s00431-022-04559-9
18. Lubbe W, Ten Ham-Baloyi W. When is the use of pacifiers justifiable in the baby-friendly hospital initiative context? A clinician's guide. BMC Pregnancy Childbirth. (2017) 17:130. doi: 10.1186/s12884-017-1306-8
19. Jung S, Chi H, Eom YJ, Subramanian SV, Kim R. Multilevel analysis of determinants in postnatal care utilisation among mother-newborn pairs in India, 2019-21. J Glob Health. (2024) 14:04085. doi: 10.7189/jogh.14.04085
20. Iqbal S, Maqsood S, Zakar R, Fischer F. Trend analysis of multi-level determinants of maternal and newborn postnatal care utilization in Pakistan from 2006 to 2018: evidence from Pakistan demographic and health surveys. BMC Public Health. (2023) 23:642. doi: 10.1186/s12889-023-15286-7
21. Zhang B, Duan Z, Zhao Y, Williams S, Wall S, Huang L, et al. Intermittent kangaroo mother care and the practice of breastfeeding late preterm infants: results from four hospitals in different provinces of China. Int Breastfeed J. (2020) 15:64. doi: 10.1186/s13006-020-00309-5
22. Wu W, Zhang J, Silva Zolezzi I, Fries LR, Zhao A. Factors influencing breastfeeding practices in China: a meta-aggregation of qualitative studies. Matern Child Nutr. (2021) 17:e13251. doi: 10.1111/mcn.13251
23. Li J, Nguyen TT, Wang X, Mathisen R, Fang J. Breastfeeding practices and associated factors at the individual, family, health facility and environmental levels in China. Matern Child Nutr. (2020) 16 Suppl 2:e13002. doi: 10.1111/mcn.13002
24. Mekonnen ZA, Lerebo WT, Gebrehiwot TG, Abadura SA. Multilevel analysis of individual and community level factors associated with institutional delivery in Ethiopia. BMC Res Notes. (2015) 8:376. doi: 10.1186/s13104-015-1343-1
25. Pi X, Wang J, Chu L, Zhang G, Zhang W. Prediction of high-risk pregnancy based on machine learning algorithms. Sci Rep. (2025) 15:15561. doi: 10.1038/s41598-025-00450-3
26. Proceedings of the 2023 International Maternal Newborn Health Conference. BMC Proc. (2024) 18(Suppl 5):6. doi: 10.1186/s12919-024-00289-y
27. Liu H, Yu Y, Zhang X, Pei J, Tang Y, Hu R, et al. Development and validation of an improved prediction model for vaginal birth after previous cesarean section: a retrospective cohort study of a Chinese population. Ann Med. (2025) 57:2523617. doi: 10.1080/07853890.2025.2523617
28. Tao XY, Huang K, Yan SQ, Zuo AZ, Tao RW, Cao H, et al. Pre-pregnancy BMI, gestational weight gain and breast-feeding: a cohort study in China. Public Health Nutr. (2017) 20:1001–8. doi: 10.1017/S1368980016003165
29. Huang Z, Tan X, Wang J, Zhang A. Maternal pre-pregnancy body mass index and gestational weight gain on adverse birth outcomes in Chinese newborns: a retrospective study. J Health Popul Nutr. (2024) 43:165. doi: 10.1186/s41043-024-00652-x
30. Brownell E, Howard CR, Lawrence RA, Dozier AM. Delayed onset lactogenesis II predicts the cessation of any or exclusive breastfeeding. J Pediatr. (2012) 161:608–14. doi: 10.1016/j.jpeds.2012.03.035
31. Nyamagoudar AH, Kannur NS, Chavan V, Kulkarni V. Retrospective analysis of volumes of manually expressed colostrum among healthy postnatal mothers at a tertiary care referral unit in South India. Not enough milk or not enough patience? BMJ Paediatr Open. (2024) 8:e002622. doi: 10.1136/bmjpo-2024-002622
32. May MF, McKinney ML, Cestare D, Hussey A, Zevallos AB, Garber S, et al. Improving time to first feeding for preterm infants: a quality improvement approach. Pediatr Qual Saf. (2025) 10:e798. doi: 10.1097/pq9.0000000000000798
33. Ru X, Huang X, Feng Q. Successful full lactation achieved by mothers of preterm infants using exclusive pumping. Front Pediatr. (2020) 8:191. doi: 10.3389/fped.2020.00191
34. Peng Y, Zhuang K, Huang Y. Incidence and factors influencing delayed onset of lactation: a systematic review and meta-analysis. Int Breastfeed J. (2024) 19:59. doi: 10.1186/s13006-024-00666-5
35. Hård AL, Nilsson AK, Lund AM, Hansen-Pupp I, Smith LEH, Hellström A. Review shows that donor milk does not promote the growth and development of preterm infants as well as maternal milk. Acta Paediatr. (2019) 108:998–1007. doi: 10.1111/apa.14702
36. Economou M, Kolokotroni O, Paphiti-Demetriou I, Kouta C, Lambrinou E, Hadjigeorgiou E, et al. The association of breastfeeding self-efficacy with breastfeeding duration and exclusivity: longitudinal assessment of the predictive validity of the Greek version of the BSES-SF tool. BMC Pregnancy Childbirth. (2021) 21:421. doi: 10.1186/s12884-021-03878-3
37. Sheela CN, Venkatesh S. Screening for postnatal depression in a tertiary care hospital. J Obstet Gynaecol India. (2016) 66:72–6. doi: 10.1007/s13224-015-0766-x
38. Cox JL, Holden JM, Sagovsky R. Detection of postnatal depression. Development of the 10-item Edinburgh postnatal depression scale. Br J Psychiat. (1987) 150:782–6. doi: 10.1192/bjp.150.6.782
39. Spitzer RL, Kroenke K, Williams JB, Löwe B. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med. (2006) 166:1092–7. doi: 10.1001/archinte.166.10.1092
40. Aktürk Z, Hapfelmeier A, Fomenko A, Dümmler D, Eck S, Olm M, et al. Generalized anxiety disorder 7-item (GAD-7) and 2-item (GAD-2) scales for detecting anxiety disorders in adults. Cochrane Database Syst Rev. (2025) 3:Cd015455. doi: 10.1002/14651858.CD015455
41. Qi W, Liu Y, Lv H, Ge J, Meng Y, Zhao N, et al. Effects of family relationship and social support on the mental health of Chinese postpartum women. BMC Pregnancy Childbirth. (2022) 22:65. doi: 10.1186/s12884-022-04392-w
42. Su X, Wang H, McQueen K, Dennis CL, Zhang Y, Chen M, et al. The reliability and validity of the postpartum partner support scale among Chinese women. Midwifery. (2024) 136:104067. doi: 10.1016/j.midw.2024.104067
43. Tumukunde VS, Loucaides E, Medvedev MM, Nyirenda M, Lawn J, Tann CJ. Measuring the duration of kangaroo mother care for neonates: a scoping review. BMJ Open. (2025) 15:e079579. doi: 10.1136/bmjopen-2023-079579
44. North K, Semrau KEA, Bellad RM, Dhaded SM, Das L, Behera JN, et al. The association between breastfeeding and growth among infants with moderately low birth weight: a prospective cohort study. J Pediatr. (2024) 269:114003. doi: 10.1016/j.jpeds.2024.114003
45. Purwandari H, Purnamasari MD, Mulyono WA, Huang MC. Preterm infant cues during breastfeeding and its measurement: a scoping review. Belitung Nurs J. (2023) 9:209–17. doi: 10.33546/bnj.2445
46. Bamat N, Jensen EA, Kirpalani H. Duration of continuous positive airway pressure in premature infants. Semin Fetal Neonatal Med. (2016) 21:189–95. doi: 10.1016/j.siny.2016.02.005
47. Xie X, Chen X, Sun P, Cao A, Zhuang Y, Xiong X, et al. Kangaroo mother care reduces noninvasive ventilation and total oxygen support duration in extremely low birth weight infants. Am J Perinatol. (2021) 38:791–5. doi: 10.1055/s-0039-3402717
48. Hilditch C, Rumbold A, Keir A, Middleton P, Gomersall JS. Effect of neonatal unit interventions designed to increase breastfeeding in preterm infants: an overview of systematic reviews. Neonatology. (2024) 121:411–20. doi: 10.1159/000536660
49. McLeish J, Aloysius A, Gale C, Quigley M, Kurinczuk JJ, Alderdice F. Differences between neonatal units with high and low rates of breast milk feeding for very preterm babies at discharge: a qualitative study of staff experiences. BMC Pregnancy Childbirth. (2024) 24:863. doi: 10.1186/s12884-024-07039-0
50. Deif R, Burch EM, Azar J, Yonis N, Abou Gabal M, El Kramani N, et al. Dysphoric milk ejection reflex: the psychoneurobiology of the breastfeeding experience. Front Glob Women Health. (2021) 2:669826. doi: 10.3389/fgwh.2021.669826
51. Nagel EM, Howland MA, Pando C, Stang J, Mason SM, Fields DA, et al. Maternal psychological distress and lactation and breastfeeding outcomes: a narrative review. Clin Ther. (2022) 44:215–27. doi: 10.1016/j.clinthera.2021.11.007
52. Liu L, Angelica BM. Knowledge, attitude, and behavior of pediatric medical surgical nurses on breastfeeding: basis for capacity building plan. Curr Res Med Sci. (2025) 4:33–65. doi: 10.56397/CRMS.2025.05.05
53. Eerdekens A, Debeer A. Barriers and facilitators in lactation support for the preterm mother–infant dyad: an integrated approach. J Hum Lact. (2024) 40:539–49. doi: 10.1177/08903344241273450
54. Ding L, Chen Y, Zhang W, Song J, Yao X, Wan Y, et al. Effect of family integrated care on breastfeeding of preterm infants: a scoping review. Nursing Open. (2023) 10:5950–60. doi: 10.1002/nop2.1888
55. Maastrup R, Hansen BM, Kronborg H, Bojesen SN, Hallum K, Frandsen A, et al. Factors associated with exclusive breastfeeding of preterm infants. Results from a prospective national cohort study. PLoS ONE. (2014) 9:e89077. doi: 10.1371/journal.pone.0089077
56. Sokou R, Parastatidou S, Iliodromiti Z, Lampropoulou K, Vrachnis D, Boutsikou T, et al. Knowledge gaps and current evidence regarding breastfeeding issues in mothers with chronic diseases. Nutrients. (2023) 15:2822. doi: 10.3390/nu15132822
57. Boundy EO, Dastjerdi R, Spiegelman D, Fawzi WW, Missmer SA, Lieberman E, et al. Kangaroo mother care and neonatal outcomes: a meta-analysis. Pediatrics. (2016) 137:e20152238. doi: 10.1542/peds.2015-2238
58. Landsem IP, Handegård BH. Parental reports of hospital- and community-based follow-up services, self-efficacy, and symptoms of depression a few months after discharge of a prematurely born child. BMC Public Health. (2024) 24:1630. doi: 10.1186/s12889-024-19079-4
59. Prikhodko EA. Belyaeva IA, Kruglyakov AY, Mikheeva AA, Gorev VV. Factors associated with exclusive breastfeeding of late preterm infants in neonatal hospital: cross-sectional study Issues Mod Pediatr. (2022) 21:29–35. doi: 10.15690/vsp.v21i1/2384
60. Brockway M, Benzies K, Carr E, Aziz K. Breastfeeding self-efficacy and breastmilk feeding for moderate and late preterm infants in the family integrated care trial: a mixed methods protocol. Int Breastfeed J. (2018) 13:29. doi: 10.1186/s13006-018-0168-7
61. Viera CS, Moraes GGWd, Toso BRGdO, Christoffel MM, Machineski GG, Linares AM. Breastfeeding mothers' self-confidence: a mixed-method study. Aust J Adv Nurs. (2024) 41:18–28. doi: 10.37464/2024.413.765
62. Adam NS, Twabi HS, Manda S. A simulation study for evaluating the performance of clustering measures in multilevel logistic regression. BMC Med Res Methodol. (2021) 21:245. doi: 10.1186/s12874-021-01417-4
63. Austin PC, Merlo J. Intermediate and advanced topics in multilevel logistic regression analysis. Stat Med. (2017) 36:3257–77. doi: 10.1002/sim.7336
64. Crippa BL, Colombo L, Morniroli D, Consonni D, Bettinelli ME, Spreafico I, et al. Do a few weeks matter? Late preterm infants and breastfeeding issues. Nutrients. (2019) 11:312. doi: 10.3390/nu11020312
65. Purwaningsih D, Sudarmi S. Analysis of determinants of exclusive breastfeeding in breastfeeding mothers. J Kesehatan. (2024) 15:472–8. doi: 10.26630/jk.v15i3.4799
66. Zhou M, Guo JY, Li T-Y, Zhou C, Zhang X, Wei W, et al. Effect of early essential newborn care on breastfeeding and outcomes of mothers/newborns post-cesarean section: a randomized controlled trial in China. Reprod Health. (2024) 21:183. doi: 10.1186/s12978-024-01932-9
67. Pamungkas MHA, Astuti Y, Wulandari N. Breastfeeding self-efficacy at 24 hours postpartum with inadequate milk production: a case report. J Vnus. (2025) 7:39–46. doi: 10.52221/jvnus.v7i1.808
68. Handayani I, Supliyani E. Dukungan Bidan Dalam Meningkatkan Breastfeeding Self Efficacy Masa Awal Laktasi. J Ris Kesehat Poltekkes Depkes Bandung. (2021) 13:152–9. Indonesian. doi: 10.34011/juriskesbdg.v13i1.1871
69. Kuhnly JE. Sustained breastfeeding and related factors for late preterm and early term infants. J Perinat Neonatal Nurs. (2018) 32:175–88. doi: 10.1097/JPN.0000000000000331
70. Aziziz E, Maleki A, Mazloomzadeh S, Pirzeh R. An evaluation of the relationship between perceived stress and breastfeeding self-efficacy in women referred to delivery preparatory classes in Zanjan, Iran in 2018. Prev Care Nurs Midwifery J. (2018) 8:9–17. doi: 10.29252/pcnm.8.2.9
71. Hagos A, Tsadik M, Belachew AB, Tesfahunegn A. Individual and community-level factors influencing optimal breastfeeding: a multilevel analysis from a national survey study of Ethiopia. PLoS ONE. (2021) 16:e0241428. doi: 10.1371/journal.pone.0241428
72. Suartiningsih SAK, Astiti NKE, Ningtyas LAW, Sriasih NGK, Wirata IN. The relationship between breastfeeding contact with breastfeeding self-efficacy in Wangaya Hospital Denpasar City. Jnki. (2023) 10:285. doi: 10.21927/jnki.2022.10(4)0.285-291
73. Majumdar T, Keats EC, Tasic H, El Baz S, Peters DH, Safi N, et al. Setting research priorities for maternal, newborn and child health, sexual and reproductive health and nutrition in Afghanistan: an application of the Child Health and Nutrition Research Initiative methodology. BMJ Global Health. (2025) 10:e018579. doi: 10.1136/bmjgh-2024-018579
74. El Baz S, Keats EC, Tasic H, Black R, Peters DH, Safi N, et al. Setting health systems research priorities for Afghanistan: an application of the child health and nutrition research initiative (CHNRI) methodology to set a roadmap to 2030. BMJ Glob Health. (2025) 10(Suppl 3):e018578. doi: 10.1136/bmjgh-2024-018578
75. Awan UA, Song Q, Ciombor KK, Toriol AT, Choi J, Su T, et al. Demographic and clinicopathologic risk factors for colorectal adenoma recurrence: a large-scale surveillance cohort study of 59,667 adults. medRxiv [Preprint]. (2025). doi: 10.1101/2025.03.28.25324826
76. Khan S, Guo X, Awan UA. Pakistan is failing in maternal and neonatal tetanus elimination. Nat Med. (2024) 30:615. doi: 10.1038/s41591-023-02762-1
77. Awan UA, Hussain M, Qureshi M, Siddique Z, Khattak AA, Akhtar S, et al. From COVID-19 to measles: prioritizing immunization in Pakistan's far-flung regions. J Infect. (2023) 87:76–9. doi: 10.1016/j.jinf.2023.04.017
78. Qureshi H, Khan MI, Ahmad A, Awan UA, Khattak AA, Khan AA, et al. Passive surveillance of malaria in pregnant women, non-pregnant women and children under 5 years of age in Bannu District, Khyber Pakhtunkhwa Pakistan. Front Med (Lausanne). (2021) 8:751456. doi: 10.3389/fmed.2021.751456
Keywords: breastfeeding, preterm infants, lactation failure, risk stratification, self-efficacy, maternal depression
Citation: Hong F, Song L, Chu X, Zhu Y, Wei Y, Wang J, Zou R and Ji J (2025) Determinants of early lactation failure in mothers of moderately preterm neonates: a multilevel analysis of survey data. Front. Nutr. 12:1668778. doi: 10.3389/fnut.2025.1668778
Received: 06 August 2025; Accepted: 28 October 2025;
Published: 25 November 2025.
Edited by:
Minesh Khashu, University Hospitals Dorset NHS Foundation Trust, United KingdomReviewed by:
Zhicheng Peng, University of Pennsylvania, United StatesQuanfu Zhang, Baoan Women's and Children's Hospital, China
Copyright © 2025 Hong, Song, Chu, Zhu, Wei, Wang, Zou and Ji. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Juhua Ji, ampoZWtlczAyMDNAMTYzLmNvbQ==
Fei Hong