Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell., 16 January 2026

Sec. Medicine and Public Health

Volume 8 - 2025 | https://doi.org/10.3389/frai.2025.1702087

This article is part of the Research TopicArtificial Intelligence and Machine Learning in PediatricsView all 14 articles

Artificial intelligence and precision medicine: a pilot study predicting optimal ceftaroline dosage for pediatric patients

Maria Frasca&#x;Maria Frasca1Gianluca Gazzaniga&#x;Gianluca Gazzaniga2Agnese Graziosi,
&#x;Agnese Graziosi1,3*Valentina De NicoloValentina De Nicolo1Costantino De GiacomoCostantino De Giacomo4Stefano MartinelliStefano Martinelli5Michele SenatoreMichele Senatore6Alessandra RomandiniAlessandra Romandini6Chiara MorettiChiara Moretti4Giulia Angela Carla PattarinoGiulia Angela Carla Pattarino4Alice ProtoAlice Proto5Romano Danesi,Romano Danesi1,6Francesco ScaglioneFrancesco Scaglione1Gianluca VagoGianluca Vago1Davide La Torre,,Davide La Torre1,7,8Arianna PaniArianna Pani1
  • 1Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
  • 2Department of General Surgery and Surgical Specialty Paride Stefanini, Sapienza University of Rome, Rome, Italy
  • 3Department of Medical Biotechnology and Translational Medicine, Postgraduate School of Clinical Pharmacology and Toxicology, University of Milan, Milan, Italy
  • 4Pediatric Unit, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy
  • 5Neonatal Intensive Care Unit, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy
  • 6Chemical-Clinical Analyses Unit, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy
  • 7SKEMA Business School, Université Côte d’Azur, Nice, France
  • 8Department of Mathematics, University of Bologna, Bologna, Italy

Background: Accurate drug dosing in pediatrics is complicated by age-related physiological variability. Standard weight-based dosing may result in either subtherapeutic exposure or toxicity. Machine learning (ML) models can capture complex relationships among clinical variables and support individualized therapy.

Methods: We analyzed clinical and pharmacokinetic data from 20 pediatric patients enrolled in the PUERI study (January 2020–November 2021, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy). Eight ML models-including linear regression (LR), ridge regression (RR), lasso regression (LaR), Huber regression (HR), random forest (RF), XGBoost, LightGBM, and a neural network (MLP)-were trained to predict ceftaroline doses that would achieve plasma concentrations close to the therapeutic target of 10 mg/L. Model performance was evaluated using mean absolute error (MAE), root mean squared error (RMSE), and the coefficient of determination (R2). To ensure interpretability, we applied local interpretable model-agnostic explanations (LIME) to identify the most influential predictors of dose.

Results: MLP (MAE 1.53 mg, R2 0.94) and XGBoost (MAE 2.04 mg, R2 0.89) outperformed linear models. Predicted doses were more frequently aligned with therapeutic concentrations than those clinically administered. Model-based simulated concentrations fell within the therapeutic range in approximately 85% of cases, and the best ML models showed over 90% patient-level clinical. RF, LightGBM and XGBoost achieved the highest clinical alignment, with 94.2, 92.4 and 91.5% of patients reaching therapeutic levels. Renal function markers, such as serum creatinine and azotemia, together with anthropometric parameters including weight, height, and body mass index, were consistently the most influential features.

Conclusion: Advanced ML models can optimize ceftaroline dosing in pediatric patients and outperform traditional dosing strategies. Combining predictive accuracy with interpretability (via LIME) supports clinical trust and may enhance precision antibiotic therapy while reducing the risks of antimicrobial resistance and toxicity.

1 Introduction

The correct administration of antibiotics in pediatrics represents a complex challenge for modern pharmacotherapy. Unlike adults, children exhibit significant physiological variability related to age, weight, renal function, and liver maturation, all of which critically influence drug metabolism and elimination (Willems et al., 2021). Standardized dosing based on simple proportional formulas (e.g., mg/kg) is often insufficient to ensure effective therapy personalization, exposing patients to the risk of overdosing, which can lead to adverse effects, or underdosing, with potential therapeutic failure (Anderson and Holford, 2008; Kearns et al., 2003). This challenge is compounded by the growing threat of antimicrobial resistance (AMR). Inappropriate dosing not only risks therapeutic failure but also accelerates the selection and proliferation of drug-resistant bacterial strains, rendering treatments less effective and contributing to one of the most critical global public health threats of this century, with an estimated 10 million deaths per year by 2050 (Murray et al., 2022; Ahmed et al., 2024). In this context, Artificial Intelligence (AI) and Machine Learning (ML) techniques are emerging as innovative tools to address the complexities of personalized medicine, with a particular focus on optimizing drug dosages (Arnold et al., 2025; Smith et al., 2020). By analyzing vast amounts of clinical data and identifying complex patterns among physiological variables, AI models offer a novel approach to individualized pharmacotherapy. This capability enables clinicians to predict optimal antibiotic dosages based on specific patient characteristics, thereby improving treatment efficacy and reducing toxicity risks (Poweleit et al., 2023). Ceftaroline, a fifth-generation cephalosporin, has gained attention for its broad spectrum of action, which includes resistant pathogens like Methicillin-Resistant Staphylococcus Aureus (MRSA; Welte et al., 2019). However, its optimal dosage in pediatric patients remains a challenge due to the scarcity of clinical data and significant interindividual variability. While standard doses are outlined in ceftaroline’s summary of product characteristics, ensuring therapeutic drug concentrations while avoiding toxicity is crucial. This critical balance underscores the urgent need for individualized dosing strategies (Ruggiero et al., 2019). This study leverages multiple regression models to predict the optimal ceftaroline dose required to achieve a target therapeutic plasma concentration of 10 mg/L. We compare these AI-predicted doses with those actually administered to identify potential instances of under- or overdosing. We also employ Explainable AI (XAI) technologies, such as Local Interpretable Model-agnostic Explanations (LIME), to enhance model interpretability by highlighting the most influential patient characteristics in dose prediction. This research aims to demonstrate how AI can increase the accuracy of ceftaroline administration, paving the way for safer and more effective treatments for pediatric patients.

2 Materials and methods

2.1 Study design and data source

This retrospective analysis utilized data from the “Observational, single-center, open-label, sequential, pharmacokinetic and tolerability study of ceftaroline in pediatric patients from 2 to 24 months of age, with suspected or confirmed infection (PUERI).” The PUERI study was conducted between January 2020 and November 2021 at the Pediatric Department of the ASST Grande Ospedale Metropolitano Niguarda (Milan, Italy). The study adhered to the Declaration of Helsinki and Good Clinical Practice guidelines and was approved by the Niguarda Hospital Ethics Committee (protocol no. 189–042019, April 15, 2019). Written informed consent was obtained from parents or legal guardians.

2.2 Patient population and data collection

The dataset comprised pharmacological, clinical, and demographic information from 21 pediatric patients; one was excluded due to insufficient data, yielding a final cohort of 20. Ceftaroline therapy was prescribed at physician discretion, with documentation of infusion parameters, dosage, and administration time. Opportunistic plasma sampling used residual specimens from routine clinical procedures. Variables collected are detailed in Supplementary Table S1.

2.3 Data preprocessing and feature selection

Prior to model development, the dataset underwent a systematic preprocessing pipeline. First, missing data were handled by mean imputation for each numerical feature. Categorical variables were then converted into a numerical format. To ensure uniform scaling and improve model stability, all selected features were standardized to a mean of 0 and a standard deviation of 1. The features used for training, including anthropometric, biochemical, and derived variables, e.g., BMI and the ratio between target and real plasma concentrations (DV ratio), were selected based on their clinical relevance and availability (Table 1). Time elapsed since the first drug administration (TIME) and Infusion rate (mg/h; RATE) were included as retrospective descriptors of the pharmacokinetic context at sampling and were not used for prospective pre-dose prediction. DV ratio was used only as an exploratory feature during training and had no role in dose optimisation. For patients with multiple treatment cycles, each cycle was treated as a distinct event to ensure accurate representation.

Table 1
www.frontiersin.org

Table 1. Features used in the ML models for dose prediction.

2.4 Model development and evaluation

The dataset included 20 pediatric patients, contributing a total of 58 pharmacokinetic events. Since several patients had more than one therapeutic cycle, events were treated as independent units. The 80/20 split was performed at the event level (46 training events and 12 independent test events) using a fixed random seed of 42. The independent test set was not used during hyperparameter tuning or model selection. This approach was used to evaluate model generalization and prevent overfitting. We utilized a range of eight ML regression models to predict the personalized optimized ceftaroline dose, including linear regression (LR), ridge regression (RR), Lasso Regression (LaR), Huber regression (HR), Random Forest (RF), XGBoost, LightGBM, and a Neural Network (MLP). Each model was trained with the optimized dose (Optimized AMT) as the dependent variable. Model performance was evaluated on the test set using standard metrics for regression tasks: Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), Coefficient of Determination (R2), and Mean Absolute Percentage Error (MAPE; Supplementary file 2).

2.5 Model validation and overfitting control

To ensure robustness given the limited sample size, all models were internally validated using a leave-one-out cross-validation (LOOCV) strategy, which maximizes data utilization and provides an unbiased estimate of generalization performance. Additionally, bootstrap resampling (1,000 iterations) was performed to derive 95% confidence intervals for MAE, RMSE and R2 and to assess model stability. Regularization techniques (L1/L2 penalties) were applied to linear models, while early stopping and dropout were used for tree-based and neural architectures to minimize overfitting and improve generalization. Hyperparameter tuning for RF, XGBoost, LightGBM and MLP was performed outside the LOOCV framework, using predefined configurations based on best practices in the literature.

2.6 Explainable AI analysis and dose comparison

After model training, we calculated a personalized optimized dose for each patient by minimizing the absolute deviation between the predicted drug concentration and the target value of 10 mg/L. This Optimized AMT was then compared to the dose actually administered clinically (AMT) to identify potential under- or overdosing. Finally, to enhance model interpretability, the best-performing models were analyzed using Local Interpretable Model-agnostic Explanations (LIME). This technique was used to identify the most significant clinical variables contributing to the dose predictions, providing insight into the key features driving the model’s recommendations. LIME was implemented using 5,000 perturbed samples, a kernel width of 0.75, quantile-based discretization, and a fixed random seed of 42.

3 Results

3.1 Model performance and comparison

The ML models demonstrated varying capabilities in predicting the Optimized AMT. A key metric was the R2 score, which represents the proportion of variability in the dependent variable explained by the model. The Neural Network showed the highest R2 score at 0.94, followed by Random Forest (0.93), XGBoost (0.89), and LightGBM (0.84). The performance of linear models was generally lower, with the Linear Regression achieving an R2 of 0.74 and the Ridge, Lasso, and Huber models ranging from 0.35 to 0.54. Prediction accuracy was further assessed using Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). The Neural Network achieved the lowest MAE (1.53 mg) and RMSE (3.38 mg), indicating high accuracy. XGBoost also performed exceptionally well, with a low MAE of 2.04 mg and RMSE of 4.70 mg. In contrast, linear models showed higher errors, with Ridge Regression having the largest MAE (7.92 mg) and RMSE (9.94 mg). A summary of the performance metrics for all models is presented in Table 2, the whole results are uploaded in Supplementary file 2.

Table 2
www.frontiersin.org

Table 2. Performance comparison of all ML models used in the study.

3.2 Clinical alignment of predicted doses

The clinical utility of the models was evaluated by comparing simulated plasma concentrations derived from predicted doses with the therapeutic range. Clinical alignment was defined as the proportion of patients whose model-predicted dose resulted in simulated plasma concentrations falling within the predefined therapeutic window of 1–10 mg/L. RF achieved the highest clinical alignment (94.2% within range). XGBoost and LGBM performed well (91.5 and 92.4% within range, respectively). In contrast, the MLP and LR showed lower alignment (~49.9%), underscoring that models with slightly higher error may still yield more clinically relevant recommendations. Administered versus predicted doses for all models are provided in Table 3.

Table 3
www.frontiersin.org

Table 3. Comparison between clinically administered doses and model-predicted optimized doses (mg) for representative patients.

3.3 Visual representation of model performance

To clearly assess the accuracy and clinical coherence of the best-performing model (XGBoost), two graphical analyses were generated and are presented in Figure 1. Figure 1A shows the relationship between the predicted and observed ceftaroline doses, highlighting a strong correspondence between the two values across patients. This result confirms the model’s ability to capture individual variability in dosing requirements. Figure 1B presents the Visual Predictive Check (VPC), which compares the observed plasma concentrations with those simulated using the optimized doses predicted by the XGBoost model (Supplementary file 2; Ratain et al., 2003). Approximately 85% of the simulated plasma concentrations fall within the therapeutic range of 1–10 mg/L. Although some variability is still present, which is expected in a small pediatric cohort, the overall consistency of the results supports the validity of XGBoost as a robust proof-of-concept tool for individualized dose optimization.

Figure 1
Scatter plot on the left compares predicted versus true dose using XGBoost, with a red dashed line indicating a perfect fit. A visual predictive check on the right shows plasma concentration for patients, with observed data, median, and percentile ranges highlighted and therapeutic limits marked with dashed lines.

Figure 1. Visual representation of model performance (XGBoost). Left Observed vs. predicted ceftaroline doses. The dashed red line represents the identity line (y = x); blue points correspond to individual patients. Right Distributional visual predictive check (VPC). The shaded yellow area represents the 5th–95th percentile prediction interval derived from 1,000 bootstrap simulations, and the orange line indicates the model median. Blue crosses show the observed plasma concentrations. The dashed red lines delimit the therapeutic range (1–10 mg/L). This VPC illustrates the agreement between the observed data and the predictive distribution generated by the model, rather than a point-by-point fit.

To further validate the model’s predictive consistency, an additional VPC was performed by comparing the observed plasma concentrations with those simulated using the optimized doses predicted by the XGBoost model. As shown in Figure 2, the observed and simulated concentration curves exhibited a consistent overall trend across patients, confirming that the model accurately captured the shape and magnitude of the plasma concentration distribution. Approximately 85% of simulated concentrations and 37.5% of observed concentrations were within the therapeutic range of 1–10 mg/L, supporting the model’s ability to enhance dosing precision and minimize the risk of sub- or supra-therapeutic exposure. It is important to distinguish patient-level clinical alignment (91–94% depending on the model) from the Visual Predictive Check, where approximately 85% of simulated concentrations fell within the therapeutic window. Moreover, the VPCs demonstrated that the model preserved patient-specific exposure patterns while recentering the simulated concentrations around the therapeutic target (≈10 mg/L), indicating that the XGBoost-based optimization improved the dose homogeneity without diminishing interindividual variability.

Figure 2
Line graph comparing observed and simulated plasma concentrations across patient IDs. Simulated data is shown in black, observed in blue. Red dotted lines indicate the therapeutic range of 1 to 10 mg/L.

Figure 2. Combined visual predictive check (XGBoost). Blue line: median observed plasma concentrations (DV) per patient. Black line: median simulated concentrations obtained from model-predicted optimized doses. Red dashed lines: therapeutic range limits (1–10 mg/L). This combined VPC compares the observed and model-simulated concentration trends across patients. The divergence between curves reflects the difference between empirical dosing and model-optimized dosing, confirming the model’s ability to reproduce the overall distributional pattern while reducing inter-patient variability.

3.4 Correlation between clinical variables and dose

A correlation matrix was used to analyze the linear relationships between the clinical variables and the target dose. This analysis was crucial for identifying the variables with the greatest impact on dose prediction and for understanding the underlying relationships in the data. As expected, physiological parameters like WT (r = 0.82), HT (r = 0.76), and SCR (r = 0.68) showed the strongest positive correlations with the dose. This reflects their direct influence on drug pharmacokinetics. Other variables, such as albumin and liver function parameters (AST, ALT), showed weaker correlations, suggesting a less significant direct impact on dose prediction. Figure 3 displays the complete correlation matrix, highlighting the relationships among all variables.

Figure 3
Correlation matrix heatmap displaying relationships between various medical and demographic variables. The color gradient ranges from blue (negative correlation) to red (positive correlation), with white indicating no correlation. The matrix includes variables like AGE, WT, SCR, BMI, and others, with color intensity reflecting correlation strength. A legend on the right indicates correlation values from -1.0 to 1.0.

Figure 3. Correlation matrix of clinical variables. The figure highlights the linear relationships between key predictors, such as WT, HT, SCR, and the AMT. Variables like albumin, AST, and ALT show weaker correlations, indicating a less direct influence on the target variable.

3.5 Explainable AI with LIME

In this study, one of the main objectives was to ensure the interpretability of the predictions made by the ML models. LIME creates a simpler model to explain the behaviour of a complex model for a single prediction, highlighting the features that contributed most to a specific outcome. This approach proved invaluable for local interpretation at an individual patient level. LIME analyses consistently identified WT, HT, SCR, and azotemia as the most influential features across models, with additional contributions from albumin and liver enzymes in some cases. The five most important predictors across models are summarized in Table 4. An illustrative LIME example for MLP is shown in Figure 4, which highlights how specific clinical variables influenced the Optimized AMT for an individual patient. The example shown corresponds to the best-performing model, the Neural Network, selected for its superior predictive accuracy and robustness.

Table 4
www.frontiersin.org

Table 4. The top five features contributing to dose predictions across ML models.

Figure 4
Bar chart titled

Figure 4. LIME explanation for a representative patient generated by MLP. The graph illustrates the contribution of individual clinical variables to the predicted dose. Positive contributions indicate variables that increased the predicted dose, while negative contributions represent variables that reduced the prediction.

4 Discussion

This study presents a comprehensive evaluation of various ML models for predicting an optimized Ceftaroline dose, focusing on both predictive accuracy and clinical utility. Our results establish a clear hierarchy of performance, with advanced models like the MLP and XGBoost demonstrating excellent predictive capability. Specifically, the MLP achieved a MAE of 1.53 mg and an R2 of 0.94, while XGBoost performed similarly with an MAE of 2.04 mg and an R2 of 0.89. The performance of these models significantly surpassed that of traditional linear methods. RR and LaR, for instance, yielded notably lower R2 scores (0.51 and 0.54, respectively) and higher MAEs (7.92 mg and 7.61 mg), confirming their limitations in capturing the complex, non-linear relationships inherent in pharmacokinetic data. Similarly, HR, while more stable with outliers, did not match the performance of the tree-based and neural models, achieving a MAE of 7.20 mg and an R2 of 0.35. The efficacy of gradient boosting was further underscored by LightGBM, which delivered performance metrics close to those of XGBoost (MAE of 2.92 mg and R2 of 0.84). Beyond mathematical accuracy, our analysis revealed a crucial finding regarding the clinical utility of the models. While the MLP provided the most precise predictions from a statistical standpoint, ensemble models like RF and XGBoost showed superior performance in producing clinically meaningful recommendations. Specifically, XGBoost, RF and LGBM predicted doses that resulted in therapeutic plasma concentrations for 91.5, 94.2 and 92.4% of patients, respectively, while the MLP and LR achieved lower alignment (≈49.9%). This discrepancy highlights a critical point for clinical decision support systems: a low mathematical error does not always equate to high clinical relevance. The high accuracy of the MLP may stem from its ability to model complex, non-linear patterns that lead to highly precise yet potentially out-of-range dose predictions. Conversely, the ensemble models, despite slightly lower accuracy, consistently recommended doses that fall within a clinically safe and effective therapeutic window, making them more valuable in a real-world setting. This suggests that, for dose optimization, models that prioritize a high percentage of clinically aligned predictions may be more valuable than those focused solely on minimizing mathematical error. The interpretability of these models was a cornerstone of our study, achieved through both correlation analysis and the application of LIME. Correlation analysis validated the strong influence of key physiological parameters, such as WT, HT, and SCR, which aligns with established pharmacokinetic principles for Ceftaroline. Notably, we observed a strong correlation between WT and HT (r = 0.89) and between SCR and blood urea nitrogen (r = 0.72). The application of LIME to “black-box” models, such as the MLP, provided critical, per-patient explanations for predictions. This is vital for clinical adoption as it allows physicians to trust and understand the reasoning behind a recommended dose, ensuring consistency with their expertise. Our analysis of feature importance confirmed the central role of renal parameters like SCR and azotemia in predicting dosage, which is consistent with the primary route of Ceftaroline elimination (Zhou et al., 2021). However, given the pharmacokinetic vulnerability of neonatal and especially preterm patients, all model-derived optimized doses in this study were considered strictly exploratory and were not used for clinical decision-making. Moreover, the uneven distribution of renal function markers in this small cohort may bias model behaviour and dose suggestions, and this potential source of error cannot be robustly quantified without larger prospective datasets. Conversely, the appearance of features such as AST and ALT among locally important variables in LIME explanations is likely influenced by multicollinearity and small-sample variability, and was therefore not interpreted as indicating a direct mechanistic role in dose–concentration prediction. Interestingly, our analysis also uncovered subtle sex-based differences, with males tending to require slightly higher doses. While modest, this finding suggests that sex-specific physiological factors, such as differences in body composition or renal function, may warrant further investigation to enhance the precision and equity of future dosing models. Traditional population pharmacokinetic (popPK) models remain the cornerstone of dose optimization, offering valuable mechanistic insights into how drugs are absorbed, distributed, metabolized, and eliminated. However, these models often require predefined structural assumptions and may not easily capture the complex, non-linear relationships that exist among clinical variables in real-world settings. In this work, we propose a complementary, data-driven perspective: the use of ML to model such variability directly from clinical data, without relying on strict compartmental structures. Rather than replacing popPK approaches, our method can enhance them, helping clinicians explore patient-specific factors and identify subtle interactions that traditional frameworks might overlook especially for peculiar population such as infants. The inclusion of explainability tools such as LIME further bridges the gap between computational modeling and clinical reasoning, allowing predictions to be interpreted in a way that aligns with medical decision-making.

5 Significance of the study and limitations

This study, although based on a limited pediatric cohort (n = 20), serves as a proof of concept for the application of ML in personalized dose optimization. The small sample size reflects the well-known challenges of conducting pharmacokinetic research in neonates and young infants but nonetheless provides valuable insights in this difficult-to-study population. The strong performance of advanced models warrants validation in larger, multicenter cohorts. Despite the limited sample size, the risk of model overfitting was carefully mitigated through the use of LOOCV and bootstrap-based validation, combined with regularization and early stopping strategies. The convergence and performance metrics remained stable across validation iterations, supporting the reliability of the findings within the intended proof-of-concept framework. Nevertheless, larger multicentric datasets and external validation will be essential to confirm model generalizability and clinical applicability. Future research will also explore the use of synthetic data generation to expand training variability and further enhance model robustness. Formal hypothesis testing for pairwise model comparison was not performed because paired statistical tests are underpowered and unstable in datasets of this size; therefore, model performance is reported descriptively using LOOCV metrics and bootstrap confidence intervals.

6 Conclusion

Advanced ML models, particularly XGBoost, RF, and MLP, improve the prediction of ceftaroline dosing in pediatric patients. While MLP achieved the greatest statistical accuracy, ensemble models demonstrated superior clinical alignment, underscoring the importance of balancing precision with therapeutic applicability. The successful application of LIME and correlation analysis further validates that ML models can be both powerful and interpretable, a prerequisite for their seamless integration into clinical practice. The analysis of clinical parameters reaffirmed the predominant role of renal function and anthropometric characteristics in dose determination, which is consistent with known pharmacological principles. These findings advocate for AI-driven dose optimization to advance personalized pediatric pharmacotherapy and contribute to mitigating AMR (Branda and Scarpa, 2024).

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Niguarda Hospital Ethics Committee (protocol no. 189–042019, April 15, 2019). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

MF: Formal analysis, Investigation, Software, Writing – original draft, Writing – review & editing. GG: Data curation, Writing – review & editing. AG: Data curation, Writing – original draft, Writing – review & editing. VD: Writing – review & editing. CD: Writing – review & editing. SM: Writing – review & editing. MS: Writing – review & editing. AR: Writing – review & editing. CM: Writing – review & editing. GP: Writing – review & editing. APr: Writing – review & editing. RD: Writing – review & editing. FS: Writing – review & editing. GV: Writing – review & editing. DL: Writing – review & editing. APe: Writing – review & editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Acknowledgments

We thank the Pediatric Unit and Neonatal Intensive Care Unit at ASST Grande Ospedale Metropolitano Niguarda for their collaboration.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/frai.2025.1702087/full#supplementary-material

References

Ahmed, S. K., Hussein, S., Qurbani, K., Ibrahim, R. H., Fareeq, A., Mahmood, K. A., et al. (2024). Antimicrobial resistance: impacts, challenges, and future prospects. J. Med. Surg. Public Health 2:100081. doi: 10.1016/j.glmedi.2024.100081

Crossref Full Text | Google Scholar

Anderson, B. J., and Holford, N. H. G. (2008). Mechanism-based concepts of size and maturity in pharmacokinetics. Annu. Rev. Pharmacol. Toxicol. 48, 303–332. doi: 10.1146/annurev.pharmtox.48.113006.094708,

PubMed Abstract | Crossref Full Text | Google Scholar

Arnold, A., McLellan, S., and Stokes, J. M. (2025). How AI can help us beat AMR. NPJ Antimicrob. Resist. 3:18. doi: 10.1038/s44259-025-00085-4,

PubMed Abstract | Crossref Full Text | Google Scholar

Branda, F., and Scarpa, F. (2024). Implications of artificial intelligence in addressing antimicrobial resistance: innovations, global challenges, and healthcare’s future. Antibiotics 13:502. doi: 10.3390/antibiotics13060502,

PubMed Abstract | Crossref Full Text | Google Scholar

Kearns, G. L., Abdel-Rahman, S. M., Alander, S. W., Blowey, D. L., Leeder, J. S., and Kauffman, R. E. (2003). Developmental pharmacology — drug disposition, action, and therapy in infants and children. N. Engl. J. Med. 349, 1157–1167. doi: 10.1056/NEJMra035092,

PubMed Abstract | Crossref Full Text | Google Scholar

Murray, C. J. L., Ikuta, K. S., Sharara, F., Swetschinski, L., Robles Aguilar, G., Gray, A., et al. (2022). Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet 399, 629–655. doi: 10.1016/S0140-6736(21)02724-0,

PubMed Abstract | Crossref Full Text | Google Scholar

Poweleit, E. A., Vinks, A. A., and Mizuno, T. (2023). Artificial intelligence and machine learning approaches to facilitate therapeutic drug management and model-informed precision dosing. Ther. Drug Monit. 45, 143–150. doi: 10.1097/FTD.0000000000001078,

PubMed Abstract | Crossref Full Text | Google Scholar

Ratain, MJ, William, K., and Plunkett, J. (2003). Principles of pharmacokinetics. Journal of Medicine, surgery, and Public Health, Elsevier. Available online at: https://www.sciencedirect.com/science/article/pii/S2949916X24000343

Google Scholar

Ruggiero, A., Ariano, A., Triarico, S., Capozza, M. A., Ferrara, P., and Attinà, G. (2019). Neonatal pharmacology and clinical implications. Drugs Context 8:212608. doi: 10.7573/dic.212608,

PubMed Abstract | Crossref Full Text | Google Scholar

Smith, N. M., Lenhard, J. R., Boissonneault, K. R., Landersdorfer, C. B., Bulitta, J. B., Holden, P. N., et al. (2020). Using machine learning to optimize antibiotic combinations: dosing strategies for meropenem and polymyxin B against carbapenem-resistant Acinetobacter baumannii. Clin. Microbiol. Infect. 26, 1207–1213. doi: 10.1016/j.cmi.2020.02.004,

PubMed Abstract | Crossref Full Text | Google Scholar

Welte, T., Kantecki, M., Stone, G. G., and Hammond, J. (2019). Ceftaroline fosamil as a potential treatment option for Staphylococcus aureus community-acquired pneumonia in adults. Int. J. Antimicrob. Agents 54, 410–422. doi: 10.1016/j.ijantimicag.2019.08.012,

PubMed Abstract | Crossref Full Text | Google Scholar

Willems, J., Hermans, E., Schelstraete, P., Depuydt, P., and De Cock, P. (2021). Optimizing the use of antibiotic agents in the pediatric intensive care unit: a narrative review. Pediatr. Drugs 23, 39–53. doi: 10.1007/s40272-020-00426-y,

PubMed Abstract | Crossref Full Text | Google Scholar

Zhou, J., You, X., Guo, G., Ke, M., Xu, J., Ye, L., et al. (2021). Ceftaroline dosage optimized for pediatric patients with renal impairment using physiologically based pharmacokinetic modeling. J. Clin. Pharmacol. 61, 1646–1656. doi: 10.1002/jcph.1944,

PubMed Abstract | Crossref Full Text | Google Scholar

Glossary

AI - Artificial Intelligence

AMR - Antimicrobial Resistance

AMT - Amount administered

ALT - Alanine Aminotransferase

AST - Aspartate Aminotransferase

BMI - Body Mass Index

DV ratio - Ratio between target and real plasma concentration

eGFR - Estimated Glomerular Filtration Rate

HR - Huber Regression

LaR - Lasso Regression

LGBM - Light Gradient Boosting Machine

LIME - Local Interpretable Model-agnostic Explanations

LR - Linear Regression

MAE - Mean Absolute Error

MAPE - Mean Absolute Percentage Error

MLP - Neural Network

PK - Pharmacokinetics

RF - Random Forest

RMSE - Root Mean Squared Error

RR - Ridge Regression

SCR - Serum Creatinine

WT - Weight

HT - Height

XAI - Explainable AI

XGBoost - Extreme Gradient Boosting

Keywords: artificial intelligence, ceftaroline, drug dosing, explainable AI, machine learning, pediatric pharmacokinetics, personalized medicine

Citation: Frasca M, Gazzaniga G, Graziosi A, De Nicolo V, De Giacomo C, Martinelli S, Senatore M, Romandini A, Moretti C, Pattarino GAC, Proto A, Danesi R, Scaglione F, Vago G, La Torre D and Pani A (2026) Artificial intelligence and precision medicine: a pilot study predicting optimal ceftaroline dosage for pediatric patients. Front. Artif. Intell. 8:1702087. doi: 10.3389/frai.2025.1702087

Received: 09 September 2025; Revised: 19 December 2025; Accepted: 24 December 2025;
Published: 16 January 2026.

Edited by:

Kaya Kuru, University of Central Lancashire, United Kingdom

Reviewed by:

Vinothkumar Kolluru, Stevens Institute of Technology, United States
Liu Bo, Shanghai University of Engineering Sciences, China

Copyright © 2026 Frasca, Gazzaniga, Graziosi, De Nicolo, De Giacomo, Martinelli, Senatore, Romandini, Moretti, Pattarino, Proto, Danesi, Scaglione, Vago, La Torre and Pani. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Agnese Graziosi, YWduZXNlLmdyYXppb3NpQHVuaW1pLml0

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.