Your new experience awaits. Try the new design now and help us make it even better

REVIEW article

Front. Pediatr., 04 December 2025

Sec. Neonatology

Volume 13 - 2025 | https://doi.org/10.3389/fped.2025.1629795

This article is part of the Research TopicArtificial Intelligence and Machine Learning in PediatricsView all 12 articles

Enhancing bronchopulmonary dysplasia prediction in preterm infants using artificial intelligence and multimodal data integration


Xinkai Zhang,,&#x;Xinkai Zhang1,2,†Anping Wang,&#x;Anping Wang3,†Rongwei Xu,Rongwei Xu1,2Dongyun Liu

Dongyun Liu2*
  • 1Qingdao Medical College, Qingdao University, Qingdao, China
  • 2Department of Pediatrics Medical Center, The Affiliated Hospital of Qingdao University, Qingdao, China
  • 3MS in Computer Information Systems, Boston University Metropolitan College, Boston, MA, United States

Bronchopulmonary dysplasia (BPD) remains a prevalent respiratory condition among preterm infants, with its development influenced by a combination of perinatal and postnatal factors. The development of artificial intelligence (AI) and machine learning (ML) technologies has provided new ideas for building BPD prediction models based on multimodal data (such as clinical information, physiological signals, imaging data, biomarkers, and omics data). This article systematically reviewed the research progress of AI in BPD prediction, analyzed representative models and key tools (such as RTI BPD Outcome Estimator), and assessed their performance and limitations in actual clinical settings. It also sorted out the challenges faced by AI models in clinical translation, including data standardization, model interpretability, system integration capabilities, model update mechanisms, and ethical and legal issues. To address the clinical need of “moving from prediction to intervention”, this article discussed the PALM translation framework (Predict–Act–Learn–Monitor) organized around key clinical nodes. In the future, it is necessary to strengthen multi-center data sharing, develop privacy protection technologies such as federated learning, and build a design, validation, integration, regulation, and feedback closed-loop management system to help AI models move from risk prediction to precise intervention, ultimately improving the clinical outcomes of children with BPD.

1 Introduction

Bronchopulmonary dysplasia (BPD) represents a significant respiratory condition frequently observed in preterm infants, with its incidence inversely associated with gestational age. Studies have shown that the incidence of BPD in extremely premature infants with a gestational age of 22–28 weeks is as high as 60.7% (1). The introduction and widespread implementation of antenatal corticosteroids (ACS) and pulmonary surfactant (PS) therapies have contributed to improved survival rates among extremely preterm infants—those with birth weights under 1,000 g and gestational ages below 28 weeks. However, this shift has also been accompanied by a rise in cases of “new BPD,” a phenotype marked by impaired alveolar development and diminished pulmonary microvascularization, with relatively milder alveolar and airway injury or fibrosis (2).

Infants with BPD frequently exhibit adverse long-term outcomes, including growth and developmental delays, as well as persistent impairments in cardiopulmonary and neurodevelopmental functioning (3, 4). In recent years, artificial intelligence (AI) has emerged as a transformative tool in healthcare, particularly in the analysis and management of complex, high-volume clinical data. This review provides a comprehensive overview of the application of AI and related technologies in the context of BPD prediction and management among preterm infants. Emphasis is placed on improving strategies for early identification and intervention, enhancing clinical decision-making, and supporting favorable long-term outcomes in this vulnerable population. Some previous reviews have addressed the ethical challenges and interpretability issues of AI models in the neonatal field (5, 6), but there is still a lack of a systematic assessment and framework analysis centered on BPD and integrated with specific clinical practice procedures. Therefore, this review, on the basis of summarizing the existing BPD prediction models, focuses on the key obstacles in their clinical translation process. At the same time, this article discussed the PALM translation framework (Predict–Act–Learn–Monitor), which is organized around key clinical nodes, with the aim of providing theoretical basis and practical reference for promoting the clinical implementability of BPD prediction models.

2 Overview of the research status of artificial intelligence in BPD prediction

2.1 Advantages of integration of multi-source data and AI models

In Neonatal Intensive Care Units (NICUs), preterm infants constitute a population particularly well-suited for AI-based monitoring of BPD. Currently, predictive models frequently incorporate physiological parameters such as heart rate, respiratory rate, oxygen saturation, and ventilator settings (7, 8). While NICU equipment facilitates continuous acquisition of large-scale physiological data, conventional monitoring techniques often lack the capacity to deliver accurate and real-time analyses due to the high variability of these indicators in preterm infants.

AI offers significant advantages in the prediction and management of BPD by enabling the integration and interpretation of complex, multidimensional clinical datasets. For instance, AI models embedded within electronic health record systems have been utilized to identify early risk signals for BPD from vital sign trends, ventilator metrics, and additional physiological variables. These models support clinical decision-making by contributing to early risk stratification, prognostic assessment, and the development of individualized treatment strategies (9, 10). Moreover, AI systems are capable of integrating diverse data sources, including imaging, genomic data, and biochemical profiles, to examine inter-organ interactions, thereby enhancing both the accuracy and comprehensiveness of predictive models (11, 12). The development of clinically applicable BPD risk assessment tools requires minimization of false-positive rates and maximization of clinical interpretability. It remains essential that healthcare professionals synthesize AI-generated outputs with comprehensive clinical evaluations to ensure that decision-making remains clinically grounded and contextually appropriate.

2.2 Pathway for AI model development and validation

In the context of BPD prediction, supervised learning is the predominant methodology. That is, based on the data with known labels, the model is trained to achieve the mapping between features and outcomes. In terms of application division of labor: structured clinical variables and laboratory data often adopt machine learning approaches (1215). Medical imaging and high-dimensional continuous monitoring signals are more suitable for deep learning (13). During the model development phase, hyperparameter optimization is typically used to improve generalization performance. For model performance evaluation, in addition to AUROC (Area Under the Receiver Operating Characteristic Curve), it is recommended to report AUPRC (Area Under the Precision-Recall Curve) simultaneously to better reflect the ability to identify positive cases. Meanwhile, the Brier score is used to assess the consistency between probabilistic predictions and actual outcomes (16). The model validation phase mainly includes: internal validation (e.g., cross-validation, bootstrapping), which is used to evaluate consistency and robustness; external validation (using data from different institutions or time periods), which is used to test transferability across regions and systems (17). At present, most BPD-related AI models are still in the development or internal validation stage, lacking cross-center external validation and real-world data testing, which limits their feasibility for clinical application. Ultimately, evidence still relies on randomized controlled trials (RCTs) to assess their real-world benefits and safety for clinical outcomes, thereby supporting routine application.

2.3 Analysis of the progress of AI prediction models, tools and related literature

Currently, RTI BPD Outcome Estimator (https://neonatal.rti.org) is a widely used clinical tool for BPD prediction, featuring clear parameter settings, transparent logic, and a user-friendly interface, making it convenient for bedside rapid application (18). However, its limitation lies in that the model is constructed based on a small number of static variables (such as gestational age, birth weight, and oxygen concentration), and it is difficult to incorporate continuous physiological signals, medical images, and multi-omics data in the NICU. It also lacks a dynamic update mechanism and adaptive capabilities, which limits its adaptability in complex scenarios.

In recent years, the application of AI in BPD research has become increasingly widespread. Lei et al. used a random forest model with 6 variables to predict BPD, achieving an AUC of 0.929 (19). Dai et al. combined clinical and genetic information, and the AUC values for predicting BPD and severe BPD were 0.915 and 0.907, respectively—outperforming the model that used only clinical data (20). A study based on chest x-rays demonstrated that the AI model can accurately predict BPD within 24 h of birth, outperforming expert diagnosis (21); another transcriptomics study also showed that a small number of genes can effectively predict BPD, expanding the data dimension (22). However, a high AUC alone is insufficient to support clinical translation. First, in the context of class imbalance, AUC tends to be overestimated. It is recommended to report AUPRC simultaneously to reflect the value of identifying minority classes (patients with diseases). Second, features must be accessible at the time of prediction to avoid data/label leakage. Recent reviews and empirical studies have shown that leakage significantly exaggerates effect estimates, especially in small sample sizes (23); third, external validation must be completed across three dimensions—geography, time, and equipment—and the sample size, distribution of gestational age/body weight, definition of BPD, and its sensitivity analysis should be transparently disclosed; fourth, in terms of evaluation dimensions, the focus should expand from discrimination to calibration and clinical net benefit: beyond AUC/AUPRC, regularly report the Brier score, expected calibration error (ECE), and calibration slope/intercept, use Decision Curve Analysis (DCA) to quantify net benefits at preset adoption thresholds (24), and simultaneously provide the PPV (Positive Predictive Value), NPV (Negative Predictive Value), and alert rate at these thresholds to align with the action thresholds of NICUs; the aforementioned practices collectively enhance the model's robustness and adoptability, serving as a prerequisite for its entry into real-world evaluation and clinical trials.

3 Multimodal data sources supporting AI modeling: current status and integration potential

The integration of diverse data modalities is anticipated to significantly enhance both the understanding and predictive accuracy of AI models for BPD. Figure 1 illustrates the connections between various data sources and BPD: clinical informatics (gestational age, birth weight, duration of oxygen therapy/ventilation, and structured EHR variables) provides patients' baseline information and disease progression context; continuous physiological monitoring (HR, RR, SpO2, and blood pressure) reflects dynamic changes in respiratory/circulatory status; imaging (CXR, LUS, and CT/MRI) offers objective signs of lung aeration and interstitial changes; laboratory biomarkers (inflammatory factors, KL-6, NT-proBNP, VEGF/PLGF, etc.) indicate inflammation, epithelial/interstitial damage, and pulmonary vascular load; and multi-omics (metabolomics, microbiomics, genomics/transcriptomics) reveal pathways and molecular-level mechanisms. On this basis, cross-modal synergy has clear practical value: the combination of imaging and physiological monitoring can be used for early warning and ventilation strategy optimization; the integration of physiological monitoring and laboratory biomarkers helps identify “re-elevation of inflammation-pulmonary vascular load,” guiding fluid management or pharmacological interventions (25); existing studies have shown that models integrating clinical information with multi-omics data yield better performance than single-modal models, suggesting that this direction has feasibility and scalable potential (20).

Figure 1
Diagram illustrating the integrated data sources for BPD (Bronchopulmonary Dysplasia) analysis. It includes multimodal omics data (metabolomics, microbiomics, genomics), clinical informatics (EHRs syntactic information, ventilator settings, GA, birth weight, oxygen therapy duration), laboratory biomarkers (blood count, metabolic panel, VEGF, PLGF, IL, KL-6, NT-proBNP), imaging data (X-rays, LUS, CT), and continuous physiological monitoring (RR, HR, BP, SpO₂). Arrows indicate data flow towards BPD analysis.

Figure 1. Multiple omics data sources used for modeling in BPD prediction by AI. BPD, bronchopulmonary dysplasia; EHRs, electronic health records; GA, gestational age; CT, computed tomography; RR, respiratory rate; HR, heart rate; BP, blood pressure; SpO₂, peripheral capillary oxygen saturation; VEGF, vascular endothelial growth factor; PLGF, placental growth factor; IL, interleukin; KL-6, Krebs von den Lungen-6; NT-proBNP, N-terminal pro-B-type natriuretic peptide; LUS, lung ultrasound.

3.1 Clinical informatics data

BPD is recognized as a multifactorial disorder, and reliance on individual predictors is insufficient for accurate risk estimation, multivariate integration offers greater advantages (26). Observable clinical information corresponding to alveolar simplification is mainly reflected in the distribution and fluctuation of FiO₂ and SpO₂, the frequency and duration of intermittent hypoxemia (IH), and the oxygen saturation index (OSI). Phenotypes associated with increased pulmonary vascular load often manifest as persistently high oxygen demand under similar ventilation settings and difficulty in weaning from ventilation. In NICUs, high-resolution data (e.g., ventilator parameters, continuous SpO₂, blood gas results, OSI) and low-resolution data (e.g., demographics, medical history, medication records) together form longitudinal evidence, enabling early identification of ventilation-oxygenation mismatch and trends in respiratory work (27). Based on this, it is recommended to extract robust temporal features (median, variability, trend slope, etc.) from EHRs across three clinical time windows: ≤24 h after birth, 3–7 days after birth, and approximately 28 days after birth, with the features limited to those accessible at the time of prediction. The model output should provide calibrated individualized risk and report clinical net benefit under preset thresholds, facilitating direct alignment with ventilator parameter fine-tuning, oxygen therapy target setting, and weaning assessment.

It is important to emphasize that despite the richness of available clinical data, multiple challenges persist, including the labor-intensive nature of manual data entry, missing values, changes in disease definitions over time, difficulties in capturing complete longitudinal records, potential biases in data collection, and concerns related to data governance and privacy. These limitations underscore the necessity of advancing data acquisition and storage methodologies to enhance the robustness of BPD-focused research and model development.

3.2 The role of imaging and imaging AI in BPD

It complements the sequential variables such as EHR and enables repeatable quantification. In the NICU, chest X-ray (CXR) and bedside lung ultrasound (LUS) are the most accessible: CXR is convenient for longitudinal follow-up of lung gas volume and interstitial changes, while LUS can be frequently performed bedside and sensitively reflect lung gas volume and edema status (28). CT and MRI are more suitable for evaluating the pulmonary microstructure and ventilation/perfusion conditions in critical decision-making or research scenarios, but they are often limited by factors such as radiation exposure, time requirements, and the need for sedation (29).

The imaging AI technology is mainly divided into radiomics based on manual features and end-to-end deep learning. Usually, it needs to be combined with lung field segmentation, quality control and clinical variables for multimodal fusion to enhance the robustness and interpretability of the model (30). Research shows that the deep learning model based on neonatal CXR can output risk signals comparable to or even better than those of expert interpretations at an extremely early stage of birth. However, cross-center external validation is still insufficient, and it is urgently necessary to evaluate its generalizability under a unified annotation and standardized collection framework (21). Based on this, this article further proposes: The imaging AI evidence should be integrated into specific clinical time windows and intervention nodes, rather than just being a comparison of methodological performance. Thus, the experimental “imaging-AI” is transformed into a clinical pathway of “imaging-AI-guided intervention pathway”.

3.3 Structured biomarker data and the prospects of AI integration

Blood biomarkers are easy to standardize and structurally input, making them important potential input features for AI models. Existing evidence shows the following: Angiogenic factors (VEGF, PLGF) are crucial for pulmonary vascular and lung development (31). Serum VEGF/PLGF is expected to be used for early risk identification and intervention stratification (32). Inflammatory factors (IL-4/6/8, TNF-α, IL-33) reflect the persistence of inflammation and are associated with a high risk of BPD (33, 34). Systemic immune-inflammatory indices (e.g., NLR, SIRI) are significantly elevated at birth and at 36 weeks of gestational age in infants with moderate-to-severe BPD, indicating an increased systemic inflammatory load (35, 36). Among lung injury biomarkers, KL-6 indicates alveolar epithelial and pulmonary interstitial damage (37) and its level at 14 days after birth shows good discriminative power for BPD (AUC = 0.88) (38). NT-proBNP reflects volume load/pulmonary vascular stress (39); in very low birth weight infants, NT-proBNP > 2.264 ng/L at 14 days after birth can early indicate the risk of BPD (sensitivity: 100%, specificity: 86%, AUC = 0.93) (40).

For clinical application, biomarker modeling should align with disease course time windows and pathological pathways: Within ≤24 h after birth, prioritize indicators reflecting alveolar/interstitial damage and early inflammation (e.g., KL-6, IL-6/IL-8) for extremely early risk stratification; from 3 to 7 days after birth, combine NT-proBNP with inflammatory profiles to identify the possible “re-elevation of inflammation-pulmonary vascular load,” so as to indicate the timing of respiratory support and fluid/pharmacological intervention; at approximately 28 days after birth, integrate persistent oxygen dependence, nutritional/growth indicators, and VEGF/PLGF to assess phenotypes of persistent damage and microvascular development disorders and plan follow-up. Integration with other modalities should follow the principle of “starting with the easy and moving to the difficult”: When sample size is limited or modalities are frequently missing, adopt late fusion (train each modality independently, then perform confidence weighting or stacking integration after outputting calibrated probabilities)—this facilitates external validation and allows the use of Decision Curve Analysis (DCA) to demonstrate clinical adoptability; after multi-center data and annotations are improved, transition to mid-level fusion (jointly model “pathway-level” biomarkers with imaging and temporal physiological data within the same time window) to obtain cross-modal information gain and mechanistic interpretability (41).

Although the above studies mostly employed traditional statistical models, their data structure is highly suitable for being incorporated into AI systems for multi-feature modeling. Although the blood biomarkers have shown certain potential initially, due to the small sample size of the research and the high clinical heterogeneity, they are currently still in the research stage and have not yet entered the routine clinical application process. Larger-scale multi-center studies are needed to verify their efficacy (36).

3.4 A large number of biological information data and future AI modeling directions

Multi-omics provides a key supplement for revealing the molecular and pathway-level mechanisms of BPD. At the metabolomics level: Premature infants with BPD show elevated levels of alanine, serine, taurine, and citrulline in bronchoalveolar lavage fluid on the first day after birth (42). Meanwhile, studies have shown that the levels of hydroxy fatty acids and oxidized fatty acids in the amniotic fluid of premature infants with BPD are higher than those in the non-BPD group (43, 44). At the microbiomics level: Research indicates that compared with the non-BPD group, the pulmonary microbial community of the BPD group shows a trend of increasing proportion of Firmicutes and decreasing proportion of Proteobacteria over time (45). Additionally, gut microbiota may affect BPD development through modulation of systemic immunity and inflammatory responses.

At the genomics/transcriptomics level: Variations in angiogenesis pathways (e.g., VEGF) and abnormalities in airway exosomal miRNAs (such as miR-876–3p, miR-20a-5p) are associated with BPD susceptibility (43, 46).

To advance omics evidence from “description” to “computational evidence usable for prediction and stratification,” two types of alignment need to be achieved. First is methodological alignment: perform batch effect correction and scale unification for multi-center/multi-platform data, and reduce high-dimensional omics features to pathway/module-level representations (e.g., aggregating genes/metabolites into activity scores for pathways such as “inflammation, oxidative stress, and angiogenesis”) to lower dimensionality and facilitate mechanism-based interpretation. Second is temporal alignment: align omics data with concurrent imaging data and continuous physiological signals according to clinical prediction windows (0–24 h, 3–7 d, ≈28 d) for joint modeling, ensuring features are accessible at the time of prediction and avoiding data/label leakage.

For integration strategies, progression should follow the principle of “late fusion first, then mid-level fusion.” When sample size and modality accessibility are limited, late fusion should be adopted first. As multi-center samples and annotation standards are gradually refined, transition to mid-level fusion: jointly characterize pathway-level omics, imaging, and other data within the same time window, and use multi-modal Transformers (with cross-modal attention and mask training to enhance robustness to missing modalities) or heterogeneous graph neural networks (constructing graphs of “gene/metabolite/microbiota—pathway—phenotype” and performing message passing for importance attribution) to improve mechanistic interpretability (47, 48). Existing small-sample prototypes have shown that combined metabolomic-microbiomic multi-omics exhibit cross-modal consistent signals in inflammation/oxidative stress pathways (49). However, regardless of the approach, multi-omics integration can only be transformed into interpretable, verifiable, and implementable decision support in NICU settings if discrimination, calibration, net benefit, and robustness all meet the required standards.

4 Analysis of the actual obstacles in the clinical application of AI prediction models

4.1 Data-related limitations

The implementation of DL models within clinical settings necessitates seamless integration with healthcare information systems, including EHRs and imaging databases. However, this process often encounters challenges such as high technical barriers, long implementation periods, large capital investment, and complex compliance requirements. Especially in developing regions with lower levels of informatization, the data foundation is weak, and obtaining high-quality samples is limited. Moreover, the training data has insufficient representativeness for some populations, all of which affect the generalization performance of the model (50). Furthermore, medical records often suffer from missing information and incorrect input. The data structures from different sources such as EHR, wearable devices, and medical images vary greatly, making it difficult to integrate across institutions, which undermines the stability and accuracy of the model (5154). Currently, the related research on BPD lacks open and shared datasets and code libraries, which limits the verification of results and the re-development of models, and hinders its generalizability across different institutions.

4.2 Model-related limitations

1. Lack of interpretability (“Black Box” issue): Deep learning-based AI models often function as “black boxes” due to their inherent complexity and opaque decision-making mechanisms. The lack of transparency may undermine clinician trust, particularly in critical clinical decisions such as treatment discontinuation or experimental interventions, where comprehensible rationale is essential for adoption (55).

2. Insufficient generalization ability and external validation: AI models are susceptible to overfitting, wherein they capture noise or irrelevant patterns specific to the training dataset. While such models may exhibit high accuracy in controlled experiments, their application is ineffective in different populations or medical systems. The prevalence, manifestations and data sources of BPD are greatly influenced by factors such as race, gestational age, and socioeconomic status, which limits the universality of the model (56). At present, most models lack systematic external validation studies, especially cross-institutional and multi-center data validation is rare, and their applicability in real-world environments cannot be confirmed.

3. The model update mechanism is not perfect: Clinical data and disease spectra have dynamic characteristics. If the model lacks a real-time monitoring and update mechanism, it is prone to “model drift”, that is, changes in the distribution of input data lead to performance degradation (57). Ideally, an AI model should have the capabilities of continuous performance monitoring, drift detection, and automated re-training, forming a closed-loop feedback system. Currently, most BPD prediction models lack drift monitoring mechanisms, as well as no standardized procedures for model re-training or version iteration, making it difficult to ensure their long-term stability and clinical applicability (58).

4.3 Clinical integration and ethical supervision aspects

1. The integration and transformation path of clinical applications are unclear: Most BPD AI models are limited to research or tool levels and have not been deeply coupled with treatment paths and intervention processes, making it difficult to guide actual clinical behaviors. Currently, there is a lack of standardized prediction, stratification, intervention processes, and operational guidelines, which limits their practical application in the clinical setting.

2. Ethical compliance and regulatory mechanisms are lagging behind: When AI is applied to the prediction of neonatal diseases, it involves sensitive populations, and its data usage and model deployment must strictly comply with ethical norms. Currently, there is a lack of specialized ethical guidelines and data regulatory mechanisms for pediatric AI models. Issues such as informed consent, data anonymization, algorithm bias correction, and liability attribution for model failures all urgently require clear legal and institutional definitions (59).

5 From prediction to intervention: building an AI-driven clinical intervention path for BPD

5.1 Establishing a “time window–PALM” translation framework for NICUs

Currently, the research on AI in BPD management mainly focuses on risk prediction, but there is less discussion on how to convert the prediction results into specific intervention measures (60). Based on existing systematic evidence, this review proposes a Time Window–PALM (Predict–Act–Learn–Monitor) framework, which takes ≤24 h, 3–7 days, and ≈28 days after birth as key nodes in the disease course, requires the model to output calibrated individualized risk at each node and use Decision Curve Analysis (DCA) to provide net benefits under acceptable thresholds, and maps risk thresholds to specific intervention measures (e.g., ventilator fine-tuning, oxygen therapy targets, medication/fluid management, imaging re-evaluation); within fixed time windows, the following should be routinely reported: calibration and net benefit, alert rate, and robustness across subgroups (e.g., by gestational age or center), which are used to trigger or delay interventions, and subsequently, real-world outcomes are fed back to recalibrate the model and conduct incremental update training while continuous monitoring of model performance and drift is required to ensure traceability and auditability.

5.2 Individualized treatment intervention strategies at different stages

The clinical value of AI lies in achieving refined, phased, and individualized treatment guidance (6). Within the first 24 h after birth, the model provides calibrated risk, net benefits at selected thresholds, and simultaneously presents PPV, NPV, and alert rate; if the risk exceeds the center's threshold, parameter optimization for lung-protective ventilation is first implemented (limiting tidal volume within the allowable oxygenation/carbon dioxide targets, setting appropriate PEEP, and reducing high-pressure impact), and lung recruitment strategies and oxygen therapy targets are reviewed based on CXR/LUS, followed by re-evaluation of risk trajectory and adverse events at 24–48 h to verify the immediate effect.

From 3 to 7 days after birth, calibrated individualized risk assessment is updated under the same methodological framework and interpreted in conjunction with inflammatory profiles/cardiopulmonary load signals (e.g., CRP/IL-6/IL-8, NT-proBNP, LUS scores); when both risk and net benefits meet the standards, stratified medication and fluid management can be initiated: for infants with severe inflammatory phenotypes and weaning difficulties, glucocorticoid regimens are selected based on evidence, while infants with suspected increased fluid/pulmonary vascular load receive fluid restriction/diuresis and supportive therapy, with re-testing at 48–72 h to evaluate efficacy and safety.

At approximately 28 days after birth, if infants still have persistent oxygen dependence, longitudinal oxygen demand curves, weight gain/nutrient intake, KL-6/NT-proBNP, and imaging features are comprehensively interpreted to distinguish phenotypes dominated by interstitial damage/alveolar simplification or accompanied by significant pulmonary vascular load: the former prioritizes nutritional and antioxidant support, rehabilitation, and weaning assessment, while the latter undergoes multi-disciplinary cardiopulmonary evaluation and necessary advanced imaging/functional tests, with the establishment of a long-term follow-up plan.

To facilitate understanding of practical implementation, two contextual examples are provided below: A very preterm infant was identified as high-risk by the model 6 h after admission, with positive threshold net benefits shown by DCA; the team accordingly adjusted down ventilator parameters and calibrated oxygen therapy targets, and re-evaluation at 48 h showed decreased OI/OSI and controllable alert rate, so the intervention was retained and documented. On day 5, NT-proBNP increased and LUS score worsened, with risk exceeding the threshold again; short-course low-dose glucocorticoids and fluid restriction were administered in accordance with prescription guidelines, and re-evaluation at 72 h determined whether to upgrade imaging or discontinue pharmacological intervention.

5.3 Operational key points and required resources for the PALM closed-loop

Predict: Use features accessible at the time of prediction, and routinely report calibration and threshold ranges indicated by DCA; it is recommended to centrally establish time-windowed risk control tables (including: risk, threshold, net benefit, preset actions, and re-evaluation time points) (61).

Act: Solidify “threshold-action” into executable medical orders/pathways (ventilator parameter targets, oxygen therapy ranges, recruitment strategies, inclusion/exclusion criteria for glucocorticoid and fluid management); run in parallel (shadow mode) in the initial launch phase to prevent premature automation (62).

Learn: Conduct incremental/periodic retraining and threshold recalibration based on real-world outcomes; supplement with feature ablation and key subgroup reviews to maintain stability (62).

Monitor: Continuously monitor performance, fairness, and drift; retain audit logs and version control; align with incident reporting-rollback processes, as well as information notification and appeal channels for family members and medical staff to ensure traceability and accountability (63).

6 Future research prospects

Although the application of artificial intelligence (AI) in the risk prediction and individualized management of bronchopulmonary dysplasia (BPD) has achieved initial results (6, 12). However, its clinical application still faces multiple challenges such as data, models, ethics, and clinical pathways. Future research can focus on the following key directions:

First, a multi-center, high-quality clinical database should be established. Currently, most models are based on single-center data, which is limited by sample size and population heterogeneity, resulting in weak generalization ability. Efforts should be made to establish structured clinical databases covering multiple regions, multiple ethnic groups, and multiple NICUs. The data format and label system should be unified to provide a data foundation for model development, external validation, and multimodal integration, and to promote algorithm openness and research reproducibility.

Second, enhance model transparency and interpretability. Black-box models limit doctors' trust and willingness to adopt. In the future, interpretable artificial intelligence (XAI) technologies should be introduced (64) to provide visualizations of the contributions of clinical variables and tracking of decision paths, enhancing the clinical reviewability and adoption intention of the model.

Third, promote the deep integration of the model with the clinical system. The AI model should seamlessly integrate with clinical information system (HIS/EHR) through standard interfaces, automatically extracting key variables (such as gestational age, ventilator parameters, oxygen saturation, blood gas indicators, etc.) to enable real-time prediction and risk alerts (6568). At the same time, a dynamic update mechanism should be established to adapt to data drift and changes in the clinical environment, ensuring the long-term stable operation of the model. For example, a periodic re-training mechanism and an automatic warning system should be introduced to build an “adaptive model update platform”.

Fourth, develop individualized prediction and mechanism-driven models. The development process of BPD is time-dependent, and it is suitable to adopt time series models to integrate continuous physiological signals (6971). This enables dynamic identification of the disease progression trend. To reduce the impact of model drift in the BPD intervention path, it is necessary to regularly collect the latest data of BPD children for re-training (72). A real-time data monitoring and performance warning mechanism can be established, and the deviation between the input data and the initial training set can be analyzed regularly. To improve the model's adaptability, XAI tools such as SHAP can be introduced to trace the reasons for drift, clarify the reasons for changes in feature contributions, and optimize the individualized intervention strategies of the clinical path in a targeted manner. In addition, integrating multi-omics data (such as transcriptome, proteome, microbiome, etc.) will help identify the biological subtypes of BPD (73), it helps to identify biological subtypes and promotes precise prediction and the formulation of treatment paths.

Fifth, improve ethical governance and regulatory mechanisms. The clinical application of AI models must comply with ethical and legal norms. Data security can be protected through technologies such as differential privacy and federated learning (74), to clarify the ownership and responsibility of data usage. At the same time, establish an AI ethics approval and supervision mechanism for newborns to ensure their compliant and safe application in clinical settings.

7 Discussion

The advancement of AI within the medical domain has highlighted its increasing potential in the prediction and management of BPD among preterm infants. The present review has provided a comprehensive evaluation of AI applications in the development of BPD prediction models, including aspects such as multimodal data integration, model architecture and refinement, performance assessments of existing models, and limitations related to clinical implementation. Collectively, AI offers innovative tools to facilitate the early detection and individualized management of BPD. However, the practical application of these tools remains constrained by several unresolved challenges, necessitating ongoing interdisciplinary collaboration and model refinement.

Currently, a substantial proportion of AI-based BPD prediction models are limited to internal validation, with minimal utilization of external datasets for independent evaluation. Given the considerable variability in neonatal care practices, data acquisition protocols, and infant characteristics across NICUs, model generalizability and applicability require verification through external validation. Consequently, the establishment of a broad, multi-center collaborative framework is imperative. Such a network would support the harmonization of data standards and facilitate data sharing, thereby enhancing the diversity and representativeness of training datasets and strengthening the reliability of predictive models in diverse clinical contexts.

Ethical and legal issues must be considered in the clinical promotion of AI models. AI models typically rely on personal data, such as health records, hematological biomarkers, imaging data, and multimodal data, including genomics. As the volume of these data continues to increase, particularly when integrating multiple data sources and developing new ones, privacy concerns become more prominent (75). However, overly stringent privacy protections during data collection, use, and sharing may impede the potential benefits that patients could derive from using this data to advance AI-driven solutions, such as new diagnostic or treatment options (75, 76). Emerging methods, such as generating synthetic data through AI-generated samples, offer a way to protect privacy while fostering innovation. This synthetic data can be shared securely and used to develop new models (77). Additionally, legal disputes persist regarding the ownership and usage rights of health data, with current regulations failing to clearly define the ownership of AI training data. Therefore, prior to the clinical application of AI models, it is essential to establish comprehensive safety assessments and accountability mechanisms.

The Good Machine Learning Practice (GMLP) proposed by the FDA/IMDRF outlines 10 full-lifecycle principles. The EU's AI Act classifies AI for medical use as a high-risk system, requiring the establishment of lifecycle-wide risk management, high-quality data governance, transparency and user information, human oversight, as well as post-market monitoring and traceable documentation—all of which align with the NICU's goal of protecting minors (78). For pediatric scenarios, the ACCEPT-AI framework and UNICEF's AI Policy Guidelines for Children can be integrated to implement child-centered rights protection.

Under this framework, this study proposes three categories of actionable solutions:

Informed consent adopts a “stratified + dynamic” model (signed by parents/guardians as proxies, with reviews at admission, discharge, and follow-up nodes). The consent document must define the purpose and secondary use of data, withdrawal procedures and timelines, and implement “minimum necessary collection” and de-identification.

Bias auditing runs through both data and models: assess representativeness by gestational age, weight, gender, and region/center; report calibration and subgroup performance simultaneously during internal and external validation, and use Decision Curve Analysis (DCA) to present net benefits at different thresholds; develop plans for regular fairness reviews and bias correction.

For accountability and operational governance, a three-line responsibility system consisting of “clinical lead + data protector + engineering lead” is implemented: complete model and Data Protection Impact Assessment (DPIA) before launch; first run in shadow mode (model operates in parallel with manual processes without directly issuing medical orders) and gradually integrate with “gating thresholds”; during operation, conduct drift monitoring, retain audit logs and version management, and establish incident reporting-rollback channels as well as explanation and appeal channels for family members and medical staff.

In recent years, various AI models related to BPD have emerged continuously, but the ones that have truly achieved clinical routine application are still very limited. Most AI medical models are still at the development and preliminary validation stage, and only a very few can pass strict external validation. Moreover, the models that can be deployed in clinical settings or undergo prospective clinical trials are even more limited (7981). For instance, a systematic review included 65 studies, covering 158 developed models and 108 externally validated models. Although some models have good discrimination, they generally have high bias risks, and most lack external validation and calibration assessment, which limits their clinical application (81). Although the French EPIPAGE-2 cohort has undergone cross-border validation, its predictive ability has reached a relatively high level (c-statistic 0.80–0.81), yet due to regional calibration deviations, it is still difficult to be generalized to real clinical settings (82).

To promote the effective transformation of AI models in the clinical management of BPD, it is suggested that the clinical needs be incorporated into the model development process to construct an interpretable and integrable model architecture. In terms of data, a multi-center shared database should be established to promote the standardization of data collection, storage and exchange, and enhance the generalization and reproducibility of the model (83). After deployment, a model performance monitoring mechanism should be set up and a dynamic update strategy should be adopted to deal with “model drift”. In terms of technical implementation, privacy-preserving computing (such as federated learning) can be introduced to facilitate cross-institutional collaboration. In terms of regulation, a unified AI medical regulation system has not yet been established globally, and only some countries have explored preliminary paths. It is recommended to draw on the management ideas of the FDA regarding AI medical software (74, 84). The supervision should cover model approval review, performance assessment, ethical review, continuous monitoring and exit mechanism, to ensure the safety, effectiveness and fairness of the model at all stages of clinical application. It is suggested that the national health administrative department take the lead in supervision, and collaborate with interdisciplinary experts from ethics, law, information technology, etc., to establish a “Medical AI Safety Assessment Platform”, formulate unified supervision norms, and achieve cross-institutional and cross-regional collaborative supervision.

The application of AI in BPD prediction has demonstrated considerable potential; however, its clinical translation still faces several challenges, including poor model interpretability, insufficient data quality, and inadequate ethical regulations. Moving forward, AI technology can effectively assist physicians in identifying high-risk infants at an early stage and optimizing intervention strategies, thereby improving both the quality of survival and long-term prognosis for premature infants, provided there is collaborative effort across technical, ethical, and institutional domains.

8 Conclusion

BPD is a prevalent and serious respiratory disease among premature infants, artificial intelligence technology provides a brand-new perspective for its early identification and precise intervention. This review systematically summarizes the research progress of BPD prediction models from aspects such as AI modeling principles, representative tools, clinical transformation obstacles and future directions. It focuses on the critical comprehensive evaluation of AI methods and clinical tools, highlighting core challenges such as model transparency, data standardization, system integration and ethical supervision, at the same time, this article discussed the PALM translation framework (Predict–Act–Learn–Monitor), which is organized around key clinical nodes. Future research needs to focus more on the deployment adaptability of AI models in real clinical environments, and build a complete transformation chain covering model design, verification, integration, regulation and feedback. To this end, priority should be given to promoting the construction of multi-center standardized databases, developing AI tools with interpretability and dynamic update capabilities, exploring privacy protection mechanisms such as federated learning, and developing closed-loop intelligent systems that can be embedded in clinical processes. This will truly achieve the transformation from “risk prediction” to “precise intervention”, and ultimately improve the long-term health outcomes of premature infants with BPD.

Author contributions

XZ: Writing – original draft, Formal analysis, Data curation. AW: Writing – review & editing, Conceptualization, Formal analysis. RX: Writing – original draft, Formal analysis. DL: Writing – original draft, Writing – review & editing, Funding acquisition.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by a grant from Shandong Province Medical and Health Science and Technology Project (No. 202406031054).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

BPD, bronchopulmonary dysplasia; AI, artificial intelligence; ML, machine learning; NICHD, National Institute of Child Health and Human Development; ACS, antenatal corticosteroid; NICU, neonatal intensive care unit; DL, deep learning; AUC, area under the curve; RCT, randomized controlled trial; NLP, natural language processing; EHR, electronic health records; VEGF, vascular endothelial growth factor; PLGF, placental growth factor; IL, interleukin; SIRI, systemic inflammatory response index; KL-6, Krebs von den Lungen-6; NT-proBNP, N-terminal pro-B-type natriuretic peptide; MAP, mean airway pressure.

References

1. Bell EF, Hintz SR, Hansen NI, Bann CM, Wyckoff MH, DeMauro SB, et al. Mortality, in-hospital morbidity, care practices, and 2-year outcomes for extremely preterm infants in the US, 2013–2018. JAMA. (2022) 327(3):248–63. doi: 10.1001/jama.2021.23580

PubMed Abstract | Crossref Full Text | Google Scholar

2. Voynow JA. “New” bronchopulmonary dysplasia and chronic lung disease. Paediatr Respir Rev. (2017) 24:17–8. doi: 10.1016/j.prrv.2017.06.006

PubMed Abstract | Crossref Full Text | Google Scholar

3. Van Marter LJ, Kuban KC, Allred E, Bose C, Dammann O, O’Shea M, et al. Does bronchopulmonary dysplasia contribute to the occurrence of cerebral palsy among infants born before 28 weeks of gestation? Arch Dis Child Fetal Neonatal Ed. (2011) 96(1):F20–9. doi: 10.1136/adc.2010.183012

PubMed Abstract | Crossref Full Text | Google Scholar

4. Jeon GW, Oh M, Chang YS. Definitions of bronchopulmonary dysplasia and long-term outcomes of extremely preterm infants in Korean neonatal network. Sci Rep. (2021) 11(1):24349. doi: 10.1038/s41598-021-03644-7

PubMed Abstract | Crossref Full Text | Google Scholar

5. Keles E, Bagci U. The past, current, and future of neonatal intensive care units with artificial intelligence: a systematic review. NPJ Digit Med. (2023) 6(1):220. doi: 10.1038/s41746-023-00941-5

PubMed Abstract | Crossref Full Text | Google Scholar

6. Jha T, Suhail S, Northcote J, Moreira AG. Artificial intelligence in bronchopulmonary dysplasia: a review of the literature. Information. (2025) 16(4):262. doi: 10.3390/info16040262

Crossref Full Text | Google Scholar

7. Rusin CG, Acosta SI, Vu EL, Ahmed M, Brady KM, Penny DJ. Automated prediction of cardiorespiratory deterioration in patients with single ventricle. J Am Coll Cardiol. (2021) 77(25):3184–92. doi: 10.1016/j.jacc.2021.04.072

PubMed Abstract | Crossref Full Text | Google Scholar

8. Miller C, Manious M, Portnoy J. Artificial intelligence and machine learning for anaphylaxis algorithms. Curr Opin Allergy Clin Immunol. (2024) 24(5):305–12. doi: 10.1097/ACI.0000000000001015

PubMed Abstract | Crossref Full Text | Google Scholar

9. Saria S, Rajani AK, Gould J, Koller D, Penn AA. Integration of early physiological responses predicts later illness severity in preterm infants. Sci Transl Med. (2010) 2(48):48ra65. doi: 10.1126/scitranslmed.3001304

PubMed Abstract | Crossref Full Text | Google Scholar

10. Moreira AG, Husain A, Knake LA, Aziz K, Simek K, Valadie CT, et al. A clinical informatics approach to bronchopulmonary dysplasia: current barriers and future possibilities. Front Pediatr. (2024) 12:1221863. doi: 10.3389/fped.2024.1221863

PubMed Abstract | Crossref Full Text | Google Scholar

11. Pammi M, Aghaeepour N, Neu J. Multiomics, artificial intelligence, and precision medicine in perinatology. Pediatr Res. (2023) 93(2):308–15. doi: 10.1038/s41390-022-02181-x

PubMed Abstract | Crossref Full Text | Google Scholar

12. Verder H, Heiring C, Ramanathan R, Scoutaris N, Verder P, Jessen TE, et al. Bronchopulmonary dysplasia predicted at birth by artificial intelligence. Acta Paediatr. (2021) 110(2):503–9. doi: 10.1111/apa.15438

PubMed Abstract | Crossref Full Text | Google Scholar

13. Sarker IH. Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions. SN Comput Sci. (2021) 2(6):420. doi: 10.1007/s42979-021-00815-1

PubMed Abstract | Crossref Full Text | Google Scholar

14. Rubinger L, Gazendam A, Ekhtiari S, Bhandari M. Machine learning and artificial intelligence in research and healthcare. Injury. (2023) 54(Suppl 3):S69–73. doi: 10.1016/j.injury.2022.01.046

PubMed Abstract | Crossref Full Text | Google Scholar

15. Janiesch C, Zschech P, Heinrich K. Machine learning and deep learning. Electron Markets. (2021) 31:685–95. doi: 10.1007/s12525-021-00475-2

Crossref Full Text | Google Scholar

16. Martin B, Bennett TD, DeWitt PE, Russell S, Sanchez-Pinto LN. Use of the area under the precision-recall curve to evaluate prediction models of rare critical illness events. Pediatr Crit Care Med. (2025) 26(6):e855–9. doi: 10.1097/PCC.0000000000003752

PubMed Abstract | Crossref Full Text | Google Scholar

17. Nevin L. Advancing the beneficial use of machine learning in health care and medicine: toward a community understanding. PLoS Med. (2018) 15(11):e1002708. doi: 10.1371/journal.pmed.1002708

PubMed Abstract | Crossref Full Text | Google Scholar

18. Laughon MM, Langer JC, Bose CL, Smith PB, Ambalavanan N, Kennedy KA, et al. Prediction of bronchopulmonary dysplasia by postnatal age in extremely premature infants. Am J Respir Crit Care Med. (2011) 183(12):1715–22. doi: 10.1164/rccm.201101-0055OC

PubMed Abstract | Crossref Full Text | Google Scholar

19. Lei J, Sun T, Jiang Y, Wu P, Fu J, Zhang T, et al. Risk identification of bronchopulmonary dysplasia in premature infants based on machine learning. Front Pediatr. (2021) 9:719352. doi: 10.3389/fped.2021.719352

PubMed Abstract | Crossref Full Text | Google Scholar

20. Dai D, Chen H, Dong X, Chen J, Mei M, Lu Y, et al. Bronchopulmonary dysplasia predicted by developing a machine learning model of genetic and clinical information. Front Genet. (2021) 12:689071. doi: 10.3389/fgene.2021.689071

PubMed Abstract | Crossref Full Text | Google Scholar

21. Chou HY, Lin YC, Hsieh SY, Chou HH, Lai CS, Wang B, et al. Deep learning model for prediction of bronchopulmonary dysplasia in preterm infants using chest radiographs. J Imaging Inform Med. (2024) 37(5):2063–73. doi: 10.1007/s10278-024-01050-9

PubMed Abstract | Crossref Full Text | Google Scholar

22. Moreira A, Tovar M, Smith AM, Lee GC, Meunier JA, Cheema Z, et al. Development of a peripheral blood transcriptomic gene signature to predict bronchopulmonary dysplasia. Am J Physiol Lung Cell Mol Physiol. (2023) 324(1):L76–87. doi: 10.1152/ajplung.00250.2022

PubMed Abstract | Crossref Full Text | Google Scholar

23. Kapoor S, Narayanan A. Leakage and the reproducibility crisis in machine-learning-based science. Patterns (N Y). (2023) 4(9):100804. doi: 10.1016/j.patter.2023.100804

PubMed Abstract | Crossref Full Text | Google Scholar

24. Van Calster B, McLernon DJ, van Smeden M, Wynants L, Steyerberg EW, Topic group “evaluating diagnostic tests and prediction models” of the STRATOS initiative. Calibration: the Achilles heel of predictive analytics. BMC Med. (2019) 17(1):230. doi: 10.1186/s12916-019-1466-7

PubMed Abstract | Crossref Full Text | Google Scholar

25. Ambalavanan N, Weese-Mayer DE, Hibbs AM, Claure N, Carroll JL, Moorman JR, et al. Cardiorespiratory monitoring data to predict respiratory outcomes in extremely preterm infants. Am J Respir Crit Care Med. (2023) 208(1):79–97. doi: 10.1164/rccm.202210-1971OC

PubMed Abstract | Crossref Full Text | Google Scholar

26. Gursoy T, Hayran M, Derin H, Ovali F. A clinical scoring system to predict the development of bronchopulmonary dysplasia. Am J Perinatol. (2015) 32(7):659–66. doi: 10.1055/s-0034-1393935

PubMed Abstract | Crossref Full Text | Google Scholar

27. Xie F, Yuan H, Ning Y, Ong MEH, Feng M, Hsu W, et al. Deep learning for temporal data representation in electronic health records: a systematic review of challenges and methodologies. J Biomed Inform. (2022) 126:103980. doi: 10.1016/j.jbi.2021.103980

PubMed Abstract | Crossref Full Text | Google Scholar

28. Corsini I, Parri N, Ficial B, Ciarcià M, Migliaro F, Capasso L, et al. Lung ultrasound in Italian neonatal intensive care units: a national survey. Pediatr Pulmonol. (2022) 57(9):2199–206. doi: 10.1002/ppul.26025

PubMed Abstract | Crossref Full Text | Google Scholar

29. Vanhaverbeke K, Van Eyck A, Van Hoorenbeeck K, De Winter B, Snoeckx A, Mulder T, et al. Lung imaging in bronchopulmonary dysplasia: a systematic review. Respir Med. (2020) 171:106101. doi: 10.1016/j.rmed.2020.106101

PubMed Abstract | Crossref Full Text | Google Scholar

30. Xiao SZ, Zhou LW, Wang LL, Zheng Q, Yang J. Clinical applicatton value of imaging-AI in bronchopulmonary dysplasia of prematurity. J Clin Images Med Case Rep. (2025) 6(2):3459. doi: 10.52768/2766-7820/3459

Crossref Full Text | Google Scholar

31. Draker N, Torry DS, Torry RJ. Placenta growth factor and sFlt-1 as biomarkers in ischemic heart disease and heart failure: a review. Biomark Med. (2019) 13(9):785–99. doi: 10.2217/bmm-2018-0492

PubMed Abstract | Crossref Full Text | Google Scholar

32. Procianoy RS, Hentges CR, Silveira RC. Vascular endothelial growth factor/placental growth factor heterodimer levels in preterm infants with bronchopulmonary dysplasia. Am J Perinatol. (2016) 33(5):480–5. doi: 10.1055/s-0035-1566294

PubMed Abstract | Crossref Full Text | Google Scholar

33. Zhang Z, Wu W, Hou L, Jiang J, Wan W, Li Z. Cytokines and exhaled nitric oxide are risk factors in preterm infants for bronchopulmonary dysplasia. Biomed Res Int. (2021) 2021:6648208. doi: 10.1155/2021/6648208

PubMed Abstract | Crossref Full Text | Google Scholar

34. Cakir U, Tayman C, Yucel C. A novel diagnostic marker for the severity of bronchopulmonary dysplasia in very low birth weight infants: interleukin-33. Pediatr Allergy Immunol Pulmonol. (2019) 32(1):12–7. doi: 10.1089/ped.2019.0994

PubMed Abstract | Crossref Full Text | Google Scholar

35. Jiang J, Mao Y, Zhou Q, Wu J. Neutrophil-to-lymphocyte ratio and platelet-to-lymphocyte ratio are early predictors of bronchopulmonary dysplasia. Medicine (Baltimore). (2023) 102(35):e34987. doi: 10.1097/MD.0000000000034987

PubMed Abstract | Crossref Full Text | Google Scholar

36. Cakir U, Tayman C, Tugcu AU, Yildiz D. Role of systemic inflammatory indices in the prediction of moderate to severe bronchopulmonary dysplasia in preterm infants. Arch Bronconeumol. (2023) 59(4):216–22. doi: 10.1016/j.arbres.2023.01.003

PubMed Abstract | Crossref Full Text | Google Scholar

37. He Q, Tang Y, Huang J, Rao Y, Lu Y. The value of KL-6 in the diagnosis and assessment of interstitial lung disease. Am J Transl Res. (2021) 13(8):9216–23.34540037

PubMed Abstract | Google Scholar

38. Dilli D, Özyazici A, Dursun A, Beken S. Predictive values of plasma KL-6 in bronchopulmonary dysplasia in preterm infants. Turk J Med Sci. (2017) 47(2):621–6. doi: 10.3906/sag-1512-78

PubMed Abstract | Crossref Full Text | Google Scholar

39. Khan S, Concina VA, Schneider D, Westgate P, Arriagada S, Bada H. Role of NT-proBNP in the prediction of moderate to severe bronchopulmonary dysplasia in preterm infants. Pediatr Pulmonol. (2020) 55(2):376–82. doi: 10.1002/ppul.24610

PubMed Abstract | Crossref Full Text | Google Scholar

40. Méndez-Abad P, Zafra-Rodríguez P, Lubián-López S, Benavente-Fernández I. NTproBNP is a useful early biomarker of bronchopulmonary dysplasia in very low birth weight infants. Eur J Pediatr. (2019) 178(5):755–61. doi: 10.1007/s00431-019-03347-2

PubMed Abstract | Crossref Full Text | Google Scholar

41. Stahlschmidt SR, Ulfenborg B, Synnergren J. Multimodal deep learning for biomedical data fusion: a review. Brief Bioinform. (2022) 23(2):bbab569. doi: 10.1093/bib/bbab569

PubMed Abstract | Crossref Full Text | Google Scholar

42. Piersigilli F, Lam TT, Vernocchi P, Quagliariello A, Putignani L, Aghai ZH, et al. Identification of new biomarkers of bronchopulmonary dysplasia using metabolomics. Metabolomics. (2019) 15(2):20. doi: 10.1007/s11306-019-1482-9

PubMed Abstract | Crossref Full Text | Google Scholar

43. Pintus MC, Lussu M, Dessì A, Pintus R, Noto A, Masile V, et al. Urinary 1H-NMR metabolomics in the first week of life can anticipate BPD diagnosis. Oxid Med Cell Longev. (2018) 2018:7620671. doi: 10.1155/2018/7620671

Crossref Full Text | Google Scholar

44. Baraldi E, Giordano G, Stocchero M, Moschino L, Zaramella P, Tran MR, et al. Untargeted metabolomic analysis of amniotic fluid in the prediction of preterm delivery and bronchopulmonary dysplasia. PLoS One. (2016) 11(10):e0164211. doi: 10.1371/journal.pone.0164211

PubMed Abstract | Crossref Full Text | Google Scholar

45. Lohmann P, Luna RA, Hollister EB, Devaraj S, Mistretta TA, Welty SE, et al. The airway microbiome of intubated premature infants: characteristics and changes that predict the development of bronchopulmonary dysplasia. Pediatr Res. (2014) 76(3):294–301. doi: 10.1038/pr.2014.85

PubMed Abstract | Crossref Full Text | Google Scholar

46. Lal CV, Olave N, Travers C, Rezonzew G, Dolma K, Simpson A, et al. Exosomal microRNA predicts and protects against severe bronchopulmonary dysplasia in extremely premature infants. JCI Insight. (2018) 3(5):e93994. doi: 10.1172/jci.insight.93994

PubMed Abstract | Crossref Full Text | Google Scholar

47. Im JE, Yoon SA, Shin YM, Park S. Real-time prediction for neonatal endotracheal intubation using multimodal transformer network. IEEE J Biomed Health Inform. (2023) 27(6):2625–34. doi: 10.1109/JBHI.2023.3267521

PubMed Abstract | Crossref Full Text | Google Scholar

48. Qu Z, Chen T, Zhou X, Zhu F, Guo W, Xu Y, et al. Multi-modal disease prediction with hierarchical self-supervised learning. IEEE J Biomed Health Inform. (2025) 29(10):7057–67. doi: 10.1109/JBHI.2025.3561546

PubMed Abstract | Crossref Full Text | Google Scholar

49. El Saie A, Fu C, Grimm SL, Robertson MJ, Hoffman K, Putluri V, et al. Metabolome and microbiome multi-omics integration from a murine lung inflammation model of bronchopulmonary dysplasia. Pediatr Res. (2022) 92(6):1580–9. doi: 10.1038/s41390-022-02002-1

PubMed Abstract | Crossref Full Text | Google Scholar

50. Franklin G, Stephens R, Piracha M, Tiosano S, Lehouillier F, Koppel R, et al. The sociodemographic biases in machine learning algorithms: a biomedical informatics perspective. Life (Basel). (2024) 14(6):652. doi: 10.3390/life14060652

PubMed Abstract | Crossref Full Text | Google Scholar

51. Picard M, Scott-Boyer MP, Bodein A, Périn O, Droit A. Integration strategies of multi-omics data for machine learning analysis. Comput Struct Biotechnol J. (2021) 19:3735–46. doi: 10.1016/j.csbj.2021.06.030

PubMed Abstract | Crossref Full Text | Google Scholar

52. Baena-Miret S, Reverter F, Vegas E. A framework for block-wise missing data in multi-omics. PLoS One. (2024) 19(7):e0307482. doi: 10.1371/journal.pone.0307482

PubMed Abstract | Crossref Full Text | Google Scholar

53. Tang AS, Woldemariam SR, Miramontes S, Norgeot B, Oskotsky TT, Sirota M. Harnessing EHR data for health research. Nat Med. (2024) 30(7):1847–55. doi: 10.1038/s41591-024-03074-8

PubMed Abstract | Crossref Full Text | Google Scholar

54. Kim MK, Rouphael C, McMichael J, Welch N, Dasarathy S. Challenges in and opportunities for electronic health record-based data analysis and interpretation. Gut Liver. (2024) 18(2):201–8. doi: 10.5009/gnl230272

PubMed Abstract | Crossref Full Text | Google Scholar

55. Markus AF, Kors JA, Rijnbeek PR. The role of explainability in creating trustworthy artificial intelligence for health care: a comprehensive survey of the terminology, design choices, and evaluation strategies. J Biomed Inform. (2021) 113:103655. doi: 10.1016/j.jbi.2020.103655

PubMed Abstract | Crossref Full Text | Google Scholar

56. Lewis TR, Kielt MJ, Walker VP, Levin JC, Guaman MC, Panitch HB, et al. Association of racial disparities with in-hospital outcomes in severe bronchopulmonary dysplasia. JAMA Pediatr. (2022) 176(9):852–9. doi: 10.1001/jamapediatrics.2022.2663

PubMed Abstract | Crossref Full Text | Google Scholar

57. Cabanillas Silva P, Sun H, Rezk M, Roccaro-Waldmeyer DM, Fliegenschmidt J, Hulde N, et al. Longitudinal model shifts of machine learning-based clinical risk prediction models: evaluation study of multiple use cases across different hospitals. J Med Internet Res. (2024) 26:e51409. doi: 10.2196/51409

PubMed Abstract | Crossref Full Text | Google Scholar

58. Feng J, Gossmann A, Sahiner B, Pirracchio R. Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees. J Am Med Inform Assoc. (2022) 29(5):841–52. doi: 10.1093/jamia/ocab280

PubMed Abstract | Crossref Full Text | Google Scholar

59. Chng SY, Tern MJW, Lee YS, Cheng LT, Kapur J, Eriksson JG, et al. Ethical considerations in AI for child health and recommendations for child-centered medical AI. NPJ Digit Med. (2025) 8(1):152. doi: 10.1038/s41746-025-01541-1

PubMed Abstract | Crossref Full Text | Google Scholar

60. Hwang JK, Kim DH, Na JY, Son J, Oh YJ, Jung D, et al. Two-stage learning-based prediction of bronchopulmonary dysplasia in very low birth weight infants: a nationwide cohort study. Front Pediatr. (2023) 11:1155921. doi: 10.3389/fped.2023.1155921

PubMed Abstract | Crossref Full Text | Google Scholar

61. Collins GS, Moons KGM, Dhiman P, Riley RD, Beam AL, Van Calster B, et al. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. Br Med J. (2024) 385:e078378. doi: 10.1136/bmj-2023-078378

PubMed Abstract | Crossref Full Text | Google Scholar

62. Efthimiou O, Seo M, Chalkou K, Debray T, Egger M, Salanti G. Developing clinical prediction models: a step-by-step guide. Br Med J. (2024) 386:e078276. doi: 10.1136/bmj-2023-078276

PubMed Abstract | Crossref Full Text | Google Scholar

63. Moons KGM, Damen JAA, Kaul T, Hooft L, Andaur Navarro C, Dhiman P, et al. PROBAST+AI: an updated quality, risk of bias, and applicability assessment tool for prediction models using regression or artificial intelligence methods. Br Med J. (2025) 388:e082505. doi: 10.1136/bmj-2024-082505

PubMed Abstract | Crossref Full Text | Google Scholar

64. Hassan SU, Abdulkadir SJ, Zahid MSM, Al-Selwi SM. Local interpretable model-agnostic explanation approach for medical imaging analysis: a systematic literature review. Comput Biol Med. (2025) 185:109569. doi: 10.1016/j.compbiomed.2024.109569

PubMed Abstract | Crossref Full Text | Google Scholar

65. Taylor JA, Stout JW, de Greef L, Goel M, Patel S, Chung EK, et al. Use of a smartphone app to assess neonatal jaundice. Pediatrics. (2017) 140(3):e20170312. doi: 10.1542/peds.2017-0312

PubMed Abstract | Crossref Full Text | Google Scholar

66. Ji J, Ling XB, Zhao Y, Hu Z, Zheng X, Xu Z, et al. A data-driven algorithm integrating clinical and laboratory features for the diagnosis and prognosis of necrotizing enterocolitis. PLoS One. (2014) 9(2):e89860. doi: 10.1371/journal.pone.0089860

PubMed Abstract | Crossref Full Text | Google Scholar

67. Reed RA, Morgan AS, Zeitlin J, Jarreau PH, Torchin H, Pierrat V, et al. Machine-learning vs. expert-opinion driven logistic regression modelling for predicting 30-day unplanned rehospitalisation in preterm babies: a prospective, population-based study (EPIPAGE 2). Front Pediatr. (2021) 8:585868. doi: 10.3389/fped.2020.585868

PubMed Abstract | Crossref Full Text | Google Scholar

68. Cabrera-Quiros L, Kommers D, Wolvers MK, Oosterwijk L, Arents N, van der Sluijs-Bens J, et al. Prediction of late-onset sepsis in preterm infants using monitoring signals and machine learning. Crit Care Explor. (2021) 3(1):e0302. doi: 10.1097/CCE.0000000000000302

PubMed Abstract | Crossref Full Text | Google Scholar

69. Sullivan BA, Fairchild KD. Vital signs as physiomarkers of neonatal sepsis. Pediatr Res. (2022) 91(2):273–82. doi: 10.1038/s41390-021-01709-x

PubMed Abstract | Crossref Full Text | Google Scholar

70. Kumar N, Akangire G, Sullivan B, Fairchild K, Sampath V. Continuous vital sign analysis for predicting and preventing neonatal diseases in the twenty-first century: big data to the forefront. Pediatr Res. (2020) 87(2):210–20. doi: 10.1038/s41390-019-0527-0

PubMed Abstract | Crossref Full Text | Google Scholar

71. Fairchild KD, Srinivasan V, Moorman JR, Gaykema RP, Goehler LE. Pathogen-induced heart rate changes associated with cholinergic nervous system activation. Am J Physiol Regul Integr Comp Physiol. (2011) 300(2):R330–9. doi: 10.1152/ajpregu.00487.2010

PubMed Abstract | Crossref Full Text | Google Scholar

72. Jenkins DA, Martin GP, Sperrin M, Riley RD, Debray TPA, Collins GS, et al. Continual updating and monitoring of clinical prediction models: time for dynamic prediction systems? Diagn Progn Res. (2021) 5(1):1. doi: 10.1186/s41512-020-00090-3

PubMed Abstract | Crossref Full Text | Google Scholar

73. Lal CV, Bhandari V, Ambalavanan N. Genomics, microbiomics, proteomics, and metabolomics in bronchopulmonary dysplasia. Semin Perinatol. (2018) 42(7):425–31. doi: 10.1053/j.semperi.2018.09.004

PubMed Abstract | Crossref Full Text | Google Scholar

74. Hanif A, Lu C, Chang K, Singh P, Coyner AS, Brown JM, et al. Federated learning for multicenter collaboration in ophthalmology: implications for clinical diagnosis and disease epidemiology. Ophthalmol Retina. (2022) 6(8):650–6. doi: 10.1016/j.oret.2022.03.005

PubMed Abstract | Crossref Full Text | Google Scholar

75. Price WN 2nd, Cohen IG. Privacy in the age of medical big data. Nat Med. (2019) 25(1):37–43. doi: 10.1038/s41591-018-0272-7

PubMed Abstract | Crossref Full Text | Google Scholar

76. Seastedt KP, Schwab P, O’Brien Z, Wakida E, Herrera K, Marcelo PGF, et al. Global healthcare fairness: we should be sharing more, not less, data. PLoS Digit Health. (2022) 1(10):e0000102. doi: 10.1371/journal.pdig.0000102

PubMed Abstract | Crossref Full Text | Google Scholar

77. Torkzadehmahani R, Nasirigerdeh R, Blumenthal DB, Kacprowski T, List M, Matschinske J, et al. Privacy-preserving artificial intelligence techniques in biomedicine. Methods Inf Med. (2022) 61(S 01):e12–27. doi: 10.1055/s-0041-1740630

PubMed Abstract | Crossref Full Text | Google Scholar

78. Aboy M, Minssen T, Vayena E. Navigating the EU AI act: implications for regulated digital medical products. NPJ Digit Med. (2024) 7(1):237. doi: 10.1038/s41746-024-01232-3

PubMed Abstract | Crossref Full Text | Google Scholar

79. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. (2019) 25(1):44–56. doi: 10.1038/s41591-018-0300-7

PubMed Abstract | Crossref Full Text | Google Scholar

80. Nagendran M, Chen Y, Lovejoy CA, Gordon AC, Komorowski M, Harvey H, et al. Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies. Br Med J. (2020) 368:m689. doi: 10.1136/bmj.m689

PubMed Abstract | Crossref Full Text | Google Scholar

81. Romijn M, Dhiman P, Finken MJJ, van Kaam AH, Katz TA, Rotteveel J, et al. Prediction models for bronchopulmonary dysplasia in preterm infants: a systematic review and meta-analysis. J Pediatr. (2023) 258:113370. doi: 10.1016/j.jpeds.2023.01.024

PubMed Abstract | Crossref Full Text | Google Scholar

82. Torchin H, Dhiman P, Ancel PY, Durrmeyer X, Jarreau PH, Nuytten A, et al. Early prediction of bronchopulmonary dysplasia: comparison of modelling methods, development and validation studies. Pediatr Res. (2025). doi: 10.1038/s41390-025-04170-2

PubMed Abstract | Crossref Full Text | Google Scholar

83. Mandel JC, Kreda DA, Mandl KD, Kohane IS, Ramoni RB. SMART On FHIR: a standards-based, interoperable apps platform for electronic health records. J Am Med Inform Assoc. (2016) 23(5):899–908. doi: 10.1093/jamia/ocv189

PubMed Abstract | Crossref Full Text | Google Scholar

84. Elmas G, Dar SUH, Korkmaz Y, Ceyani E, Susam B, Ozbey M, et al. Federated learning of generative image priors for MRI reconstruction. IEEE Trans Med Imaging. (2023) 42(7):1996–2009. doi: 10.1109/TMI.2022.3220757

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: artificial intelligence, bronchopulmonary dysplasia, machine learning, multimodal data, predictive models

Citation: Zhang X, Wang A, Xu R and Liu D (2025) Enhancing bronchopulmonary dysplasia prediction in preterm infants using artificial intelligence and multimodal data integration. Front. Pediatr. 13:1629795. doi: 10.3389/fped.2025.1629795

Received: 16 May 2025; Revised: 7 November 2025;
Accepted: 18 November 2025;
Published: 4 December 2025.

Edited by:

Kaya Kuru, University of Central Lancashire, United Kingdom

Reviewed by:

Kee Hyun Cho, Kangwon National University, Republic of Korea
Tng Chang Kwok, University of Nottingham, United Kingdom

Copyright: © 2025 Zhang, Wang, Xu and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dongyun Liu, bGl1ZG9uZ3l1bjAwNkAxNjMuY29t

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.