Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy

Zhang, Xiaote; Xie, Qiaoyi; Wu, Ganggang

doi:10.3389/fped.2025.1614495

ORIGINAL RESEARCH article

Front. Pediatr., 19 June 2025

Sec. Pediatric Otolaryngology

Volume 13 - 2025 | https://doi.org/10.3389/fped.2025.1614495

Xiaote Zhang¹*

Qiaoyi Xie²

Ganggang Wu¹

¹Department of Otolaryngology Head and Neck Surgery, Ningbo Yinzhou No.2 Hospital, Ningbo, Zhejiang, China
²Department of Pediatrics, The Affiliated People’s Hospital of Ningbo University, Ningbo, Zhejiang, China

Background: Otitis media with effusion (OME) affects a significant proportion of children with adenoid hypertrophy (AH) and can lead to developmental sequelae when chronic. Current non-invasive screening modalities rely predominantly on acoustic immittance measurements, which demonstrate variable diagnostic performance. Given the urgent need for improved diagnostic methods and extensive characterization of risk factors for OME in AH children, developing diagnostic models represents an efficient strategy to enhance clinical identification accuracy in practice.

Objective: This study aims to develop and validate an optimal machine learning (ML)-based prediction model for OME in AH children by comparing multiple algorithmic approaches, integrating clinical indicators with acoustic measurements into a widely applicable diagnostic tool.

Methods: A retrospective analysis was conducted on 847 pediatric patients with AH. Five ML algorithms were developed to identify OME using demographic, clinical, laboratory, and acoustic immittance parameters. The dataset underwent 7:3 stratified partitioning into training and testing cohorts. Within the training cohort, hyperparameters were first tuned through randomized search with 5-fold cross-validation, after which each model was refitted on the full training cohort with the selected parameters. Model performance was evaluated in the testing cohort using discrimination, calibration, clinical utility metrics, and confusion matrix-derived statistics. The optimal ML model was subsequently analyzed through SHapley Additive exPlanations (SHAP) methodology for interpretability, with sequential ablation testing performed to identify critical predictive variables.

Results: Among 847 children with AH, 262 (30.9%) were diagnosed with OME. The Random Forest (RF) model demonstrated superior performance with the highest discrimination (area under the receiver operating characteristic curve = 0.919), balanced calibration (Brier score = 0.102), and the greatest net benefit across threshold probabilities of 0.4–0.6. Confusion matrix analysis further confirmed RF as the optimal model, achieving 0.875 accuracy and substantial agreement between predicted and observed diagnoses (Cohen's kappa coefficient = 0.696) in the testing cohort. SHAP analysis identified the adenoid-to-nasopharyngeal ratio as the predominant diagnostic indicator, followed by tympanometric type and history of recurrent respiratory infections.

Conclusion: An RF-based diagnostic model effectively identifies OME in AH children by integrating anatomical, functional, and inflammatory parameters, providing a clinically applicable tool for enhanced diagnostic accuracy and evidence-based management decisions.

Introduction

Otitis media with effusion (OME) represents one of the most common childhood conditions, with approximately 2.2 million new cases diagnosed annually in the United States (1). While most episodes resolve spontaneously within 3 months, about 25% of cases persist for ≥3 months, classified as chronic OME (2). It represents the leading cause of acquired hearing loss in pediatric populations and may be associated with significant developmental sequelae including speech delays, vestibular disturbances, behavioral problems, and educational difficulties (3, 4). The etiology of OME is multifactorial, with adenoid hypertrophy (AH) established as a primary contributor in young children (5). AH mechanically obstructs the Eustachian tube, creating negative middle ear pressure, and serves as a pathogen reservoir, facilitating the retrograde migration of microorganisms into the middle ear, disrupting mucosal function, and promoting persistent effusion (6).

Despite the established relationship between AH and OME, accurate identification of OME in AH children presents significant challenges. Young children demonstrate limited ability to recognize and articulate subtle hearing changes, complicating early detection by caregivers. While tympanocentesis represents the diagnostic gold standard, its invasive nature precludes widespread implementation for screening purposes (7). Among non-invasive alternatives, acoustic immittance measurement has gained prominence due to its simplicity, brief administration time, minimal cooperation requirements, and widespread availability across healthcare settings (8). However, conventional tympanometry demonstrates important limitations in diagnostic reliability, with area under the receiver operating characteristic curve (AUROC) values ranging from 0.68 to 0.93 in detecting middle ear effusion (9–12). These diagnostic uncertainties highlight the need for more accurate, child-appropriate diagnostic methods to identify AH children at risk for developing OME.

While wideband acoustic immittance represents a promising advancement in tympanometric assessment, its clinical integration faces significant temporal constraints (13, 14). These technologies remain in early validation phases, with widespread implementation delayed by requirements for additional efficacy studies, specialized equipment procurement, and healthcare provider training. This implementation timeline creates an urgent diagnostic gap for the substantial population of AH children who require immediate, accurate assessment for OME during critical developmental windows. In contrast, extensive research has thoroughly characterized risk factors for OME development in AH children, providing a robust knowledge foundation (15–17); yet this evidence remains underutilized in clinical practice due to the absence of validated predictive instruments. With the rapid advancement of artificial intelligence, particularly machine learning (ML) technologies, a timely solution emerges to bridge this implementation gap (18, 19). ML algorithms can rapidly process multidimensional clinical data, identifying complex patterns and relationships between variables that conventional statistical approaches may fail to capture (20–24). These computational techniques enable efficient integration of readily available clinical indicators into unified predictive frameworks that can be rapidly deployed across all levels of healthcare facilities.

Therefore, this study aims to develop and validate an optimal ML-based diagnostic model for OME in AH children by comparing the performance of multiple algorithmic approaches. By incorporating readily available clinical indicators with acoustic immittance findings, we seek to create a practical, non-invasive diagnostic tool implementable across various healthcare settings. The resulting model will facilitate individualized risk stratification, guiding appropriate clinical interventions while minimizing unnecessary invasive procedures, ultimately supporting evidence-based management decisions in this vulnerable pediatric population.

Materials and methods

Study design

The study protocol received formal ethical approval from the Ethics Committee of Ningbo Yinzhou No.2 Hospital (2025-014) and adhered to all principles established in the Declaration of Helsinki. Given the retrospective, observational design, the ethics committee granted a waiver of individual informed consent. Patient confidentiality was maintained through comprehensive deidentification procedures, with systematic removal of all personal identifiers from electronic health records prior to analysis in accordance with institutional privacy standards.

Sample size determination followed established methodological principles for predictive model development. Based on previous studies and clinical experience (25), we estimated the prevalence of OME among AH children at approximately 30%. Adhering to the recommended minimum of 10 events per predictor variable to minimize overfitting risk, and planning to evaluate up to 10 potential predictors (26), we calculated a required minimum cohort of 333 patients. This sample size would yield approximately 100 cases with confirmed OME, ensuring sufficient statistical power for robust model development and internal validation.
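The arithmetic behind this estimate can be reproduced in a few lines. The sketch below is illustrative only: the function name is hypothetical, and the inputs are the figures stated above (10 events per predictor, 10 candidate predictors, 30% expected prevalence). Rounding 100/0.30 up to a whole patient gives 334, one more than the 333 reported, which corresponds to rounding down.

```python
import math


def minimum_sample_size(events_per_variable: int, n_predictors: int,
                        prevalence: float) -> tuple[int, int]:
    """Return (required events, required patients) under the events-per-variable rule."""
    required_events = events_per_variable * n_predictors
    # Patients needed so that the expected number of events reaches the target.
    required_patients = math.ceil(required_events / prevalence)
    return required_events, required_patients


events, patients = minimum_sample_size(events_per_variable=10,
                                       n_predictors=10,
                                       prevalence=0.30)
print(events, patients)
```

Either rounding leaves the final cohort of 847 patients well above the planned minimum.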

Study population

Consecutive pediatric patients diagnosed with AH at our institution were enrolled from January 2021 until 1,000 cases were identified in January 2025. From this cohort, patients were included in the study if they: (1) were aged 3–12 years; (2) had complete clinical documentation; and (3) had a confirmed AH diagnosis. Exclusion criteria encompassed: (1) alternative causes of hearing abnormalities, including cranial trauma, middle or inner ear injury, or congenital hearing impairment; (2) craniofacial anomalies attributable to other conditions such as Down syndrome or congenital cleft palate; or (3) severe underlying systemic disease. Bilateral acoustic immittance testing and lateral nasopharyngeal radiography were performed on all subjects. AH was diagnosed when the adenoid-to-nasopharyngeal (A/N) ratio measured from lateral radiographs exceeded 0.70 (27–29), with this parameter also utilized to quantify hypertrophy severity. OME diagnosis was established through a two-step protocol involving bilateral acoustic immittance measurement followed by otoendoscopic examination when indicated. Tympanograms were categorized as normal (bilateral Type A) or abnormal (Type B or C) (30). Patients exhibiting abnormal tympanometric findings underwent confirmatory otoendoscopic examination to exclude cerumen impaction and confirm middle ear effusion; those with confirmed effusion were classified as the AH + OME Group, while subjects with normal tympanograms or those with abnormal tympanograms but no effusion on otoendoscopy were designated as the AH Group.

Potential predictors

Multiple potential predictive variables were systematically extracted from electronic medical records. Demographic and clinical history data included age, gender, duration of AH-related symptoms (including nasal obstruction, mouth breathing, and snoring), and body mass index (BMI). Physical examination findings incorporated tonsil size grading. Environmental and comorbidity factors were documented, including passive smoke exposure, chronic rhinosinusitis, allergic rhinitis, asthma, and history of recurrent respiratory infections (defined as ≥6 episodes within 12 months). Laboratory parameters included a hematologic assessment comprising differential leukocyte distribution (neutrophil, lymphocyte, monocyte, eosinophil, and basophil percentages) and quantitative total IgE measurements.

Data preprocessing and partitioning

The dataset underwent stratified partitioning to maintain proportional representation of OME cases, with 70% allocated to model training and 30% reserved for independent testing. This stratification process preserved the outcome distribution across subsets while ensuring statistical independence between cohorts. Balanced distribution of patient characteristics between partitions was confirmed using standardized mean differences, with values below 0.1 considered indicative of adequate equilibrium. Feature preprocessing employed an adaptive standardization protocol based on distribution characteristics. Variables exhibiting approximately normal distributions (skewness < 2, kurtosis < 7) underwent z-score normalization to center at zero with unit standard deviation. For non-normally distributed variables or those with substantial outliers (>10%), robust scaling was implemented to achieve median centering with interquartile range normalization, thereby minimizing the influence of extreme values while preserving relative relationships. Categorical variables received targeted encoding: binary factors were processed through dichotomous encoding (0/1), while ordinal variables underwent sequential encoding to preserve their inherent hierarchical relationships.
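The balance check and the distribution-dependent scaling rule described above can be sketched with the Python standard library alone. This is a minimal illustration, not the study code. Several details are assumptions that the text does not specify: the pooled-standard-deviation form of the standardized mean difference, Pearson kurtosis (normal distribution = 3), Tukey's 1.5 × IQR fences as the outlier definition, and fitting the scaling parameters on the training cohort only. All function names and the two example lists are hypothetical.

```python
import statistics


def smd(a, b):
    """Absolute standardized mean difference using a pooled standard deviation."""
    pooled_sd = ((statistics.variance(a) + statistics.variance(b)) / 2) ** 0.5
    return abs(statistics.fmean(a) - statistics.fmean(b)) / pooled_sd


def shape_stats(x):
    """Return (skewness, Pearson kurtosis) computed from population moments."""
    m, s = statistics.fmean(x), statistics.pstdev(x)
    z = [(v - m) / s for v in x]
    return statistics.fmean([v ** 3 for v in z]), statistics.fmean([v ** 4 for v in z])


def outlier_fraction(x):
    """Share of values outside Tukey's fences (assumed outlier definition)."""
    q1, _, q3 = statistics.quantiles(x, n=4)
    iqr = q3 - q1
    return sum(v < q1 - 1.5 * iqr or v > q3 + 1.5 * iqr for v in x) / len(x)


def fit_scaler(train):
    """Choose z-score or robust scaling from the training values only."""
    skewness, kurtosis = shape_stats(train)
    if abs(skewness) < 2 and kurtosis < 7 and outlier_fraction(train) <= 0.10:
        method, center, spread = "zscore", statistics.fmean(train), statistics.stdev(train)
    else:
        q1, median, q3 = statistics.quantiles(train, n=4)
        method, center, spread = "robust", median, q3 - q1
    return method, lambda values: [(v - center) / spread for v in values]


# Hypothetical example columns: one roughly symmetric, one heavily right-skewed.
symmetric = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
skewed = [1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 6, 6, 7, 8, 9, 10, 12, 15, 40, 200]

method_a, scale_a = fit_scaler(symmetric)
method_b, scale_b = fit_scaler(skewed)
print(method_a, method_b)
```

The returned scaling function can be applied to both cohorts, so that testing-cohort values never influence the fitted center or spread.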

Model training and hyperparameter tuning

Five ML algorithms were implemented for OME identification in AH children. Logistic regression (LR) was selected for its transparent coefficient interpretation and established clinical utility; random forest (RF) and eXtreme Gradient Boosting (XGBoost) were incorporated for their ability to model complex non-linear relationships and quantify variable importance; support vector machine (SVM) was included for its efficacy in handling high-dimensional feature spaces; and K-nearest neighbors (KNN) was employed to capture local patterns through its instance-based learning approach. Algorithm-specific hyperparameter optimization was executed through randomized search with 5-fold cross-validation. The parameter space for LR encompassed regularization strength and penalty type; RF optimization targeted maximum tree depth, estimator count, and minimum samples per leaf; XGBoost tuning addressed learning rate, maximum depth, and estimator quantity; SVM optimization included kernel selection, regularization parameter, and kernel coefficient; and KNN parameter tuning focused on neighbor count, distance metric, and weighting function. For each algorithm, 100 candidate configurations were sampled from the parameter space and compared using validation loss as the optimization metric. Final models were then trained on the complete training dataset with these optimized parameters to maximize statistical power and enhance generalizability.
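For the RF model, the tuning procedure described above could look roughly like the following scikit-learn sketch. It is an illustration under stated assumptions rather than the study configuration: the data are synthetic stand-ins generated with `make_classification`, the candidate values in `param_distributions` are hypothetical, log loss is assumed as the "validation loss", and only 5 configurations are sampled to keep the example fast (the study reports 100).

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import RandomizedSearchCV, StratifiedKFold, train_test_split

# Synthetic stand-in data with the cohort size and roughly 31% positives.
X, y = make_classification(n_samples=847, n_features=10, n_informative=5,
                           weights=[0.69, 0.31], random_state=0)

# 7:3 stratified partition into training and testing cohorts.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

# Hypothetical search space covering the three RF parameters named in the text.
param_distributions = {
    "n_estimators": [50, 100],
    "max_depth": [3, 5, 8, None],
    "min_samples_leaf": [1, 2, 5, 10],
}

search = RandomizedSearchCV(
    estimator=RandomForestClassifier(random_state=0),
    param_distributions=param_distributions,
    n_iter=5,                # reduced from the 100 iterations reported in the study
    scoring="neg_log_loss",  # assumed definition of validation loss
    cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0),
    refit=True,              # refit the best configuration on the full training cohort
    random_state=0,
)
search.fit(X_train, y_train)

best_model = search.best_estimator_
print(len(X_train), len(X_test), search.best_params_)
```

With `refit=True`, the selected configuration is retrained on all training rows, which mirrors the final training step described in the text.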

Model performance evaluation

Comparative assessment of algorithm performance was executed in the independent testing cohort using a multi-faceted evaluation framework. Discriminative capacity was quantified through AUROC analysis. Calibration curves and Brier scores (BS) were employed to assess the correspondence between predicted probabilities and observed outcomes, quantifying the models' potential over- or under-estimation tendencies. Clinical utility was examined through decision curve analysis (DCA), which assessed net benefit across a range of clinically relevant threshold probabilities while accounting for intervention consequences. Threshold-dependent performance was characterized using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and overall accuracy at clinically determined decision thresholds. Additionally, Cohen's kappa coefficients were calculated to measure agreement between predicted and observed outcomes while accounting for chance concordance in classification tasks. Based on this comprehensive evaluation protocol, the algorithm demonstrating optimal performance across multiple metrics was identified as the preferred prediction model for clinical implementation.
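Two of these quantities have compact definitions that are easy to state in code. The sketch below assumes the standard formulations: the Brier score as the mean squared difference between predicted probability and observed outcome, and net benefit at threshold probability p as TP/N − FP/N × p/(1 − p). The outcome and probability lists are hypothetical toy values, not study data.

```python
def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome (0 or 1)."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def net_benefit(y_true, y_prob, threshold):
    """Net benefit of acting on predictions at or above the threshold probability."""
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    return tp / n - fp / n * threshold / (1 - threshold)


def net_benefit_treat_all(y_true, threshold):
    """Reference strategy in which every patient is treated as positive."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)


# Hypothetical toy outcomes and predicted probabilities.
y_true = [1, 0, 1, 0, 0, 1, 0, 0]
y_prob = [0.9, 0.2, 0.7, 0.6, 0.1, 0.4, 0.3, 0.05]

print(brier_score(y_true, y_prob))
print(net_benefit(y_true, y_prob, threshold=0.5))
print(net_benefit_treat_all(y_true, threshold=0.5))
```

A decision curve is obtained by evaluating `net_benefit` over a grid of thresholds and comparing it with the treat-all line and the treat-none line, which is zero at every threshold.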

Interpretability analysis

The mechanistic underpinnings of the optimal prediction model were elucidated through comprehensive interpretability analysis employing SHapley Additive exPlanations (SHAP) methodology. SHAP values were calculated to quantify individual feature contributions based on cooperative game theory principles (31). Global feature importance was determined by computing mean absolute SHAP values across the testing cohort, identifying the relative contribution of each predictor to model discrimination. SHAP summary bar plots were generated to visualize the relative importance of features, while SHAP beeswarm plots were constructed to illustrate both magnitude and directionality of feature effects, characterizing their influence on OME probability. To validate these findings, sequential feature ablation testing was conducted by iteratively removing important variables and quantifying performance changes through confusion matrix metrics, thereby confirming the practical significance of identified predictors.
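The game-theoretic idea underlying SHAP can be illustrated with an exact Shapley computation on a toy model. The sketch below is not the SHAP library and not the study's RF model. It enumerates all feature coalitions, which is feasible only for a handful of features, and it assumes that "absent" features are replaced by a single baseline value. The model, instances, and baseline are hypothetical.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one instance; absent features take baseline values."""
    n = len(x)

    def value(coalition):
        mixed = [x[j] if j in coalition else baseline[j] for j in range(n)]
        return predict(mixed)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            # Weight of a coalition of this size in the Shapley average.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                members = set(subset)
                total += weight * (value(members | {i}) - value(members))
        phi.append(total)
    return phi


def toy_model(f):
    """Hypothetical model with two main effects and one interaction term."""
    return 2.0 * f[0] + 1.0 * f[1] + 0.5 * f[0] * f[2]


instances = [[1.0, 0.0, 2.0], [0.5, 1.0, 0.0], [0.0, 1.0, 1.0]]
baseline = [0.0, 0.0, 0.0]

per_instance = [shapley_values(toy_model, x, baseline) for x in instances]

# Global importance: mean absolute Shapley value per feature across instances.
global_importance = [
    sum(abs(phi[i]) for phi in per_instance) / len(per_instance) for i in range(3)
]
print(per_instance[0], global_importance)
```

For each instance the values sum to the difference between the model output and the baseline output, and the interaction term is shared equally between the two features involved.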

Statistical analysis

Statistical evaluations were conducted using methodologies aligned with variable distribution characteristics. Descriptive statistics were presented as frequencies with percentages for categorical variables and as medians with interquartile ranges for continuous variables given their predominantly non-normal distributions. Between-group comparisons employed chi-square or Fisher exact tests for categorical variables and Mann–Whitney U tests for continuous measures. Statistical significance was established at p < 0.05. All analyses were implemented in Python (version 3.12.0) utilizing scikit-learn (version 1.4.0) for machine learning operations and matplotlib (version 3.8.0) for visualization.

Results

Patient characteristics

Among the 1,000 AH children initially evaluated, 847 met eligibility criteria after applying inclusion and exclusion parameters (Figure 1). Within this cohort, 262 patients (30.9%) were diagnosed with OME comorbid with AH, while 585 children presented with isolated AH. Comparative analysis of clinical characteristics between these groups revealed several significant predictors associated with OME development, as detailed in Table 1. Children in the AH + OME group exhibited higher BMI and longer symptom duration compared to the AH group. Advanced tonsillar hypertrophy was more prevalent among OME patients, with Grade 3 tonsils being notably overrepresented. Comorbid conditions were observed with significantly greater frequency in the AH + OME group, including allergic rhinitis, asthma, chronic rhinosinusitis, passive smoke exposure, and recurrent respiratory infections. Laboratory parameters revealed elevated monocyte and neutrophil percentages, along with increased total IgE levels in the AH + OME group. The A/N ratio was significantly higher in AH + OME patients. Tympanometric findings differed markedly between groups, with Type B tympanograms predominating in the AH + OME group.

Figure 1. Patient recruitment and study flow diagram. Of 1,000 children with AH, 847 were eligible after applying the inclusion criteria (age 3–12 years, complete documentation, confirmed AH) and the exclusion criteria (alternative hearing pathologies, craniofacial anomalies, severe systemic disease). Eligible participants comprised the AH group (n = 585) and the AH + OME group (n = 262) and were split into a training cohort (n = 592) and a testing cohort (n = 255).

Table 1. Comparison of clinical characteristics between AH group and AH + OME group.

For model training and validation purposes, the study population was stratified into training (n = 592) and testing (n = 255) cohorts, with 183 (30.9%) and 79 (30.9%) OME cases distributed proportionally between groups. No statistically significant differences were observed between the training and testing cohorts across all evaluated parameters (all p > 0.05), as detailed in Supplementary Table S1.

Model training and optimization

Five distinct ML algorithms were implemented to identify OME in AH patients: LR, RF, XGBoost, SVM, and KNN. The optimization process was visualized in Supplementary Figure S1, with training and validation loss trajectories indicating appropriate model fitting and no significant overfitting across all algorithms. The comprehensive hyperparameter configurations determined through systematic cross-validation are detailed in Supplementary Table S2. These optimized parameters were subsequently employed for final model training on the complete training dataset.

Model performance comparison

A comprehensive evaluation of the five ML models revealed varying capabilities in OME risk stratification, as shown in Figure 2, which illustrates model performance across three critical dimensions: discrimination ability, calibration accuracy, and clinical utility. In the testing cohort, ROC curve assessment identified SVM, RF, and KNN as superior performers with AUROC values exceeding 0.9. Calibration curve analysis subsequently eliminated KNN due to substantial probability underestimation, while RF and SVM maintained balanced calibration profiles. DCA further differentiated between the remaining contenders, with RF demonstrating superior net benefit across the clinically relevant threshold probability range (0.4–0.6).

Figure 2. Comprehensive model evaluation metrics for LR, SVM, RF, XGBoost, and KNN, with each model color-coded consistently across panels. (A) ROC curves for all models with corresponding AUC values. (B) Calibration curves demonstrating the relationship between predicted and observed probabilities across models, with Brier scores indicated. (C) DCA illustrating clinical net benefit across various threshold probabilities relative to treat-all and treat-none strategies.

Confusion matrix analysis further validated these findings, as demonstrated in Figure 3, with RF correctly identifying 58 OME cases while misclassifying only 21 OME patients as non-OME, in contrast to SVM's performance of 55 correct and 24 misclassified OME cases. Additionally, RF demonstrated enhanced specificity, accurately classifying 165 non-OME cases with merely 11 false positives, surpassing SVM's 163 correct non-OME classifications with 13 false positives. As summarized in Table 2, RF achieved superior performance metrics across critical parameters, including sensitivity (0.734), specificity (0.938), accuracy (0.875), and Cohen's kappa coefficient (0.696). Through this comprehensive evaluation process, RF emerged as the optimal algorithm for OME risk stratification in pediatric patients with AH.
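The reported RF metrics follow directly from the confusion matrix counts given above. The short sketch below recomputes them with standard definitions; the function name is hypothetical, and the counts are taken from the text (RF: 58 true positives, 21 false negatives, 165 true negatives, 11 false positives; SVM: 55, 24, 163, 13).

```python
def confusion_metrics(tp, fn, tn, fp):
    """Derive threshold-dependent metrics and Cohen's kappa from a 2x2 confusion matrix."""
    n = tp + fn + tn + fp
    accuracy = (tp + tn) / n
    # Agreement expected by chance, from the row and column totals.
    expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / n ** 2
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": accuracy,
        "kappa": (accuracy - expected) / (1 - expected),
    }


rf = confusion_metrics(tp=58, fn=21, tn=165, fp=11)
svm = confusion_metrics(tp=55, fn=24, tn=163, fp=13)

for name, metrics in (("RF", rf), ("SVM", svm)):
    print(name, {key: round(value, 3) for key, value in metrics.items()})
```

For RF, this reproduces the sensitivity, specificity, accuracy, and kappa values summarized in Table 2 to three decimal places.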

Figure 3. Comparative confusion matrices displaying classification performance of five ML algorithms in the testing cohort; color intensity indicates value magnitude. (A) LR: 160 true negatives, 16 false positives, 24 false negatives, 55 true positives. (B) RF: 165 true negatives, 11 false positives, 21 false negatives, 58 true positives. (C) XGBoost: 171 true negatives, 5 false positives, 56 false negatives, 23 true positives. (D) SVM: 163 true negatives, 13 false positives, 24 false negatives, 55 true positives. (E) KNN: 174 true negatives, 2 false positives, 60 false negatives, 19 true positives.

Table 2. Performance metrics of ML models in the testing cohort.

Model interpretability and ablation analysis

SHAP analysis of the optimal RF model revealed the relative influence of predictor variables on OME risk assessment. Feature importance quantification (Figure 4A) identified A/N ratio as the predominant predictor with substantially higher impact than other variables, followed by tympanometric type and history of recurrent respiratory infections. The top five factors also included chronic rhinosinusitis and total IgE levels, reflecting the multifactorial etiology of OME in AH children. The SHAP summary plot (Figure 4B) demonstrated that elevated A/N ratio strongly predicted OME occurrence, while abnormal tympanometric findings (particularly Type B) exhibited consistent association with positive classification.

Figure 4. SHAP analysis of the RF model. (A) Feature importance ranking based on mean absolute SHAP values, with A/N ratio, tympanometric type, and recurrent respiratory infections ranked highest. (B) SHAP summary plot illustrating feature effects on model output, with A/N ratio the most influential feature; the color gradient from blue to pink represents low to high feature values.

Ablation experiments validated these contribution patterns, with sequential elimination of features producing systematic performance degradation proportional to SHAP-derived importance rankings (Table 3). Removal of A/N ratio caused the most substantial decline in sensitivity (from 0.734 to 0.519) while preserving specificity, indicating its critical role in identifying true OME cases. Similarly, eliminating tympanometric type reduced sensitivity to 0.544, underscoring its complementary diagnostic value. These findings demonstrate that structural factors and functional measurements provide the foundation for accurate OME prediction, while inflammatory and immunological parameters contribute additional discriminative value through interactions with primary predictors.
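A feature ablation experiment of this kind can be sketched as follows. The text does not state whether features were removed one at a time or cumulatively, nor whether the model was retrained after each removal; the example assumes one-at-a-time removal with retraining. The data are synthetic, the feature names are hypothetical placeholders, and the RF settings are arbitrary.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data; feature names are placeholders, not study variables.
X, y = make_classification(n_samples=600, n_features=6, n_informative=4,
                           n_redundant=0, weights=[0.7, 0.3], random_state=0)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)


def fit_and_score(columns):
    """Train an RF on the given columns and report sensitivity and specificity."""
    model = RandomForestClassifier(n_estimators=100, random_state=0)
    model.fit(X_train[:, columns], y_train)
    predicted = model.predict(X_test[:, columns])
    return {
        "sensitivity": recall_score(y_test, predicted, pos_label=1),
        "specificity": recall_score(y_test, predicted, pos_label=0),
    }


all_columns = list(range(X.shape[1]))
results = {"full model": fit_and_score(all_columns)}
for index, name in enumerate(feature_names):
    kept = [column for column in all_columns if column != index]
    results[f"without {name}"] = fit_and_score(kept)

for label, scores in results.items():
    print(label, {key: round(value, 3) for key, value in scores.items()})
```

Comparing each reduced model with the full model shows which removals lower sensitivity, specificity, or both.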

Table 3. Ablation analysis results showing performance metrics with sequential feature removal from the RF model.

Discussion

The diagnosis of OME in AH children remains challenging due to limitations in existing screening modalities, particularly conventional tympanometry, which exhibits variable diagnostic performance. In this study, an ML-based diagnostic tool was developed and validated that integrates clinical, demographic, and acoustic immittance parameters to enhance identification of OME in AH children. The optimal RF model demonstrated excellent discrimination, substantial agreement between predicted and observed diagnoses, and superior clinical utility in the testing cohort. Unlike previous studies that predominantly utilized LR for linear predictions, this investigation conducted a comprehensive comparison of multiple ML algorithms, enabling thorough examination of both linear and non-linear associations between variables and clinical outcomes. This model offers clinicians an interpretable diagnostic instrument that effectively categorizes patients according to anatomical, functional, and inflammatory parameters without requiring specialized equipment or invasive procedures.

The diagnosis and management of OME in pediatric populations constitute significant public health concerns with substantial developmental consequences. Chronic OME, commonly observed in children with AH, represents a primary cause of acquired hearing impairment during critical developmental periods (32). This auditory deficit often leads to developmental complications including speech delay, communication difficulties, behavioral abnormalities, and educational challenges (33, 34). Untreated persistent OME may evolve to more severe middle ear conditions, including adhesive otitis media, tympanosclerosis, cholesterol granuloma, and acquired primary cholesteatoma. OME identification in pediatric AH patients presents considerable diagnostic difficulties, as young children typically cannot adequately express subtle auditory deficits due to limited metacognitive awareness and language capabilities (35). Parental detection of hearing impairment often occurs only after significant auditory deterioration or when attention deficits become evident in educational environments (36). Tympanometric assessment, although widely used as a non-invasive screening method, exhibits notable diagnostic limitations (10, 37), demonstrated by current findings wherein 11.8% of confirmed OME cases presented with Type A tympanograms (conventionally interpreted as normal), while 13.0% of children without OME displayed Type B tympanograms (traditionally associated with middle ear effusion). These diagnostic inconsistencies highlight the need for improved diagnostic methodologies that overcome the limitations of single-parameter assessment approaches.

In this investigation, the comparative analysis of five ML algorithms identified RF as the optimal approach for OME diagnosis in AH children, demonstrating superior performance across multiple evaluation metrics. The RF model achieved excellent discrimination, balanced calibration, and superior clinical utility, significantly outperforming conventional LR approaches (Figure 2). While LR exhibited acceptable diagnostic accuracy, its inferior PPV highlighted the limitations of linear modeling in capturing complex pathophysiological relationships. To our knowledge, this study represents the first AI-based predictive model for OME risk stratification in AH children. While previous studies have identified individual risk factors using conventional statistical methods (17, 38, 39), our RF model integrates multiple variables to achieve superior predictive performance.

SHAP analysis provided crucial insights into the multifactorial etiology of OME in this population, identifying A/N ratio and tympanometric type as predominant diagnostic indicators. This aligns with recent studies, which identified adenoid grade as a primary risk factor (16, 17). The substantial contribution of these parameters supports established pathophysiological mechanisms whereby AH mechanically obstructs the Eustachian tube orifice, creating negative middle ear pressure with subsequent effusion formation (40, 41). Additionally, inflammatory parameters including recurrent respiratory infections and chronic rhinosinusitis emerged as significant contributors to diagnostic accuracy, consistent with findings from Restuti et al. (38) and previous investigations confirming that bacterial colonization may exert greater influence on OME development than mechanical obstruction alone, with inflammatory responses disrupting ciliary transport and Eustachian tube function (42–44). Immunological parameters (Total IgE) and environmental exposures (passive smoke) demonstrated modest but clinically relevant contributions to the diagnostic model, consistent with prior research indicating tobacco smoke exposure adversely affects both innate and adaptive immunity in children while directly impairing mucociliary clearance through increased goblet cell proliferation and mucus production (45, 46). Importantly, our integrated ML approach enables simultaneous consideration of multiple interacting variables, providing superior predictive performance compared to traditional single-factor analyses while maintaining clinical interpretability through SHAP methodology.

The developed ML-based diagnostic framework provides clinicians with a practical decision support tool that integrates routinely collected clinical parameters to improve OME identification in children with AH. In clinical practice, physicians can implement this model by entering standard patient data to generate an OME probability score that enables risk stratification, allowing clinicians to identify high-risk children who would benefit from immediate confirmatory testing vs. low-risk cases suitable for routine monitoring. For busy clinical practices, this tool offers particular value in optimizing resource allocation by identifying which patients require immediate specialist consultation vs. those who can be safely monitored with standard care protocols. The model can be easily implemented through simple electronic interfaces without requiring additional equipment or specialized training, making it accessible across diverse healthcare settings from community clinics to academic medical centers. Furthermore, the transparent nature of the model allows physicians to understand which specific factors contribute most to OME risk in individual patients, facilitating more informed discussions with families about diagnostic recommendations and treatment plans while improving overall diagnostic confidence in this challenging pediatric population.

Despite the promising results, several methodological limitations warrant consideration. The retrospective single-center design imposed constraints on both generalizability and variable selection. The analysis was restricted to routinely collected clinical parameters, necessarily excluding potentially influential socioeconomic and environmental determinants such as breastfeeding duration and household allergen exposures. Similarly, we were unable to incorporate potentially significant biochemical markers, particularly vitamin D3 levels, despite emerging evidence supporting their association with OME pathogenesis (47, 48). Furthermore, while otoscopic examination represents a routine clinical assessment, these findings were not incorporated due to the challenges in pediatric patient cooperation and the inherent subjectivity in interpretation across clinicians. Additionally, although our internal validation demonstrated robust predictive metrics, the model lacks external validation across diverse clinical settings, geographic regions, and patient populations. This validation gap limits immediate broad clinical implementation. Future research directions should address these limitations through prospective multicenter validation studies, incorporation of comprehensive variable panels including standardized otoscopic findings, socioeconomic factors, and relevant biomarkers, and evaluation of model performance across seasonal variations and diverse clinical contexts. Such refinements would strengthen diagnostic precision and facilitate practical implementation of machine learning approaches in routine pediatric otolaryngology practice.

In conclusion, an RF-based diagnostic model was established that effectively identifies OME in children with AH through integration of accessible clinical parameters. Comparative analysis revealed RF superiority in capturing complex pathophysiological relationships between variables and clinical outcomes. This model addresses limitations in conventional tympanometric assessment by reducing misclassifications, thereby providing a reliable diagnostic instrument for clinical implementation. The framework offers significant value in pediatric populations where symptom reporting remains challenging and early detection prevents developmental sequelae. Through enhanced diagnostic confidence and improved clinical decision-making, this approach advances evidence-based management of OME in the pediatric AH population.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by the Ethics Committee of Ningbo Yinzhou No.2 Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee waived the requirement of written informed consent for participation from the participants or the participants' legal guardians/next of kin because of the retrospective nature of the study.

Author contributions

XZ: Methodology, Software, Conceptualization, Data curation, Investigation, Validation, Writing – review & editing, Formal analysis, Writing – original draft, Visualization, Project administration. QX: Methodology, Writing – review & editing, Validation, Software, Writing – original draft, Formal analysis, Visualization. GW: Writing – original draft, Visualization, Resources, Formal analysis, Data curation, Investigation, Validation, Software, Conceptualization.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fped.2025.1614495/full#supplementary-material

References

1. Rosenfeld RM, Shin JJ, Schwartz SR, Coggins R, Gagnon L, Hackell JM, et al. Clinical practice guideline: otitis media with effusion (update). Otolaryngol Head Neck Surg. (2016) 154(1_suppl):S1–41. doi: 10.1177/0194599815623467

2. Daly KA, Hunter LL, Giebink GS. Chronic otitis media with effusion. Pediatr Rev. (1999) 20(3):85–94. doi: 10.1542/pir.20-3-85

3. Vanneste P, Page C. Otitis media with effusion in children: pathophysiology, diagnosis, and treatment. A review. J Otol. (2019) 14(2):33–9. doi: 10.1016/j.joto.2019.01.005

4. Bennett KE, Haggard MP, Silva PA, Stewart IA. Behaviour and developmental effects of otitis media with effusion into the teens. Arch Dis Child. (2001) 85(2):91–5. doi: 10.1136/adc.85.2.91

5. Bhat V, Mani IP, Aroor R, Saldanha M, Goutham MK, Pratap D. Association of asymptomatic otitis media with effusion in patients with adenoid hypertrophy. J Otol. (2019) 14(3):106–10. doi: 10.1016/j.joto.2018.12.001

6. Sogebi OA, Oyewole EA, Ogunbanwo O. Asymptomatic otitis media with effusion in children with adenoid enlargement. J Natl Med Assoc. (2021) 113(2):158–64. doi: 10.1016/j.jnma.2020.08.005

7. Harmes KM, Blackwood RA, Burrows HL, Cooke JM, Harrison RV, Passamani PP. Otitis media: diagnosis and treatment. Am Fam Physician. (2013) 88(7):435–40. PMID: 24134083

8. Merchant GR, Al-Salim S, Tempero RM, Fitzpatrick D, Neely ST. Improving the differential diagnosis of otitis media with effusion using wideband acoustic immittance. Ear Hear. (2021) 42(5):1183–94. doi: 10.1097/aud.0000000000001037

9. Chianese J, Hoberman A, Paradise JL, Colborn DK, Kearney D, Rockette HE, et al. Spectral gradient acoustic reflectometry compared with tympanometry in diagnosing middle ear effusion in children aged 6 to 24 months. Arch Pediatr Adolesc Med. (2007) 161(9):884–8. doi: 10.1001/archpedi.161.9.884

10. Anwar K, Khan S, Rehman HU, Javaid M, Shahabi I. Otitis media with effusion: accuracy of tympanometry in detecting fluid in the middle ears of children at myringotomies. Pak J Med Sci. (2016) 32(2):466–70. doi: 10.12669/pjms.322.9009

11. Şentürk M, Ardıç FN, Tümkaya F, Kara CO. Wideband tympanometry and absorbance for diagnosing middle ear fluids in otitis media with effusion. J Int Adv Otol. (2023) 19(2):140–8. doi: 10.5152/iao.2023.22697

12. Keefe DH, Sanford CA, Ellison JC, Fitzpatrick DF, Gorga MP. Wideband aural acoustic absorbance predicts conductive hearing loss in children. Int J Audiol. (2012) 51(12):880–91. doi: 10.3109/14992027.2012.721936

13. Merchant GR, Neely ST. Conductive hearing loss estimated from wideband acoustic immittance measurements in ears with otitis media with effusion. Ear Hear. (2023) 44(4):721–31. doi: 10.1097/aud.0000000000001317

14. Merchant GR, Neely ST. The influence of otitis media with effusion on middle-ear impedance estimated from wideband acoustic immittance measurements. J Acoust Soc Am. (2021) 150(2):969–78. doi: 10.1121/10.0005822

15. Songu M, Islek A, Imre A, Aslan H, Aladag I, Pinar E, et al. Risk factors for otitis media with effusion in children with adenoid hypertrophy. Acta Otorhinolaryngol Ital. (2020) 40(2):133–7. doi: 10.14639/0392-100x-2456

16. Hu R, Xia L, Shi C, Zhou Y, Guo X. Otitis media with effusion in preschool children with adenoid hypertrophy: risk factors and nursing care. Nurs Open. (2024) 11(5):e2165. doi: 10.1002/nop2.2165

17. Chen W, Yin G, Chen Y, Wang L, Wang Y, Zhao C, et al. Analysis of factors that influence the occurrence of otitis media with effusion in pediatric patients with adenoid hypertrophy. Front Pediatr. (2023) 11:1098067. doi: 10.3389/fped.2023.1098067

18. Kononenko I. Machine learning for medical diagnosis: history, state of the art and perspective. Artif Intell Med. (2001) 23(1):89–109. doi: 10.1016/S0933-3657(01)00077-X

19. Deo RC. Machine learning in medicine. Circulation. (2015) 132(20):1920–30. doi: 10.1161/CIRCULATIONAHA.115.001593

20. Basubrin O. Current status and future of artificial intelligence in medicine. Cureus. (2025) 17(1):e77561. doi: 10.7759/cureus.77561

21. Orphanidou C, Wong D. Machine learning models for multidimensional clinical data. In: Khan SU, Zomaya AY, Abbas A, editors. Handbook of Large-Scale Distributed Computing in Smart Healthcare. Cham: Springer International Publishing (2017). p. 177–216.

22. Mateussi N, Rogers MP, Grimsley EA, Read M, Parikh R, Pietrobon R, et al. Clinical applications of machine learning. Ann Surg Open. (2024) 5(2):e423. doi: 10.1097/as9.0000000000000423

23. Peiffer-Smadja N, Rawson TM, Ahmad R, Buchard A, Georgiou P, Lescure FX, et al. Machine learning for clinical decision support in infectious diseases: a narrative review of current applications. Clin Microbiol Infect. (2020) 26(5):584–95. doi: 10.1016/j.cmi.2019.09.009

24. Faiyazuddin M, Rahman SJQ, Anand G, Siddiqui RK, Mehta R, Khatib MN, et al. The impact of artificial intelligence on healthcare: a comprehensive review of advancements in diagnostics, treatment, and operational efficiency. Health Sci Rep. (2025) 8(1):e70312. doi: 10.1002/hsr2.70312

25. Khayat FJ, Dabbagh LS. Incidence of otitis media with effusion in children with adenoid hypertrophy. Zanco J Med Sci. (2011) 15(2):57–63. doi: 10.15218/zjms.2011.022

26. Riley RD, Snell KI, Ensor J, Burke DL, Harrell FE Jr, Moons KG, et al. Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes. Stat Med. (2019) 38(7):1276–96. doi: 10.1002/sim.7992

27. Somayaji G, Jain M. Significance of adenoid nasopharyngeal ratio in the assessment of adenoid hypertrophy in children. Res Otolaryngol. (2012) 1:1–5. doi: 10.5923/j.otolaryn.20120101.01

28. Elwany S. The adenoidal-nasopharyngeal ratio (AN ratio). Its validity in selecting children for adenoidectomy. J Laryngol Otol. (1987) 101(6):569–73. doi: 10.1017/s0022215100102269

29. Moideen SP, Mytheenkunju R, Govindan Nair A, Mogarnad M, Afroze MKH. Role of adenoid-nasopharyngeal ratio in assessing adenoid hypertrophy. Indian J Otolaryngol Head Neck Surg. (2019) 71(Suppl 1):469–73. doi: 10.1007/s12070-018-1359-7

30. Lidén G, Peterson JL, Björkman G. Tympanometry. Arch Otolaryngol. (1970) 92(3):248–57. doi: 10.1001/archotol.1970.04310030038009

31. Bordt S, von Luxburg U. From Shapley values to generalized additive models and back. In: Ruiz F, Dy J, van de Meent JW, editors. Proceedings of the 26th International Conference on Artificial Intelligence and Statistics. Proceedings of Machine Learning Research. PMLR (2023). p. 709–45.

32. Rana AK, Singh R, Upadhyay D, Prasad S. Chronic otitis media and its correlation with unilateral sensorineural hearing loss in a tertiary care centre of north India. Indian J Otolaryngol Head Neck Surg. (2019) 71(2):1580–5. doi: 10.1007/s12070-019-01671-5

33. Lieu JEC, Kenna M, Anne S, Davidson L. Hearing loss in children: a review. J Am Med Assoc. (2020) 324(21):2195–205. doi: 10.1001/jama.2020.17647

34. Roland L, Fischer C, Tran K, Rachakonda T, Kallogjeri D, Lieu JE. Quality of life in children with hearing impairment: systematic review and meta-analysis. Otolaryngol Head Neck Surg. (2016) 155(2):208–19. doi: 10.1177/0194599816640485

35. Chiappini E, Ciarcià M, Bortone B, Doria M, Becherucci P, Marseglia GL, et al. Updated guidelines for the management of acute otitis media in children by the Italian society of pediatrics: diagnosis. Pediatr Infect Dis J. (2019) 38(12S):S3–9. doi: 10.1097/inf.0000000000002429

36. Swierniak W, Gos E, Skarzynski PH, Czajka N, Skarzynski H. The accuracy of parental suspicion of hearing loss in children. Int J Pediatr Otorhinolaryngol. (2021) 141:110552. doi: 10.1016/j.ijporl.2020.110552

37. Onusko E. Tympanometry. Am Fam Physician. (2004) 70(9):1713–20. PMID: 15554489

38. Restuti RD, Tamin S, Nugroho DA, Hutauruk SM, Mansyur M. Factors affecting the occurrence of otitis media with effusion in preschool and elementary school children: a comparative cross-sectional study. BMJ Open. (2022) 12(9):e065291. doi: 10.1136/bmjopen-2022-065291

39. Dange PS, Bhat VK, Yadav M. Adenoid morphology and other prognostic factors for otitis media with effusion in school children. Indian J Otolaryngol Head Neck Surg. (2022) 74(3):3649–53. doi: 10.1007/s12070-020-02332-8

40. Manno A, Iannella G, Savastano V, Vittori T, Bertin S, Pasquariello B, et al. Eustachian tube dysfunction in children with adenoid hypertrophy: the role of adenoidectomy for improving ear ventilation. Ear Nose Throat J. (2021) 145561321989455. doi: 10.1177/0145561321989455

41. Durgut O, Dikici O. The effect of adenoid hypertrophy on hearing thresholds in children with otitis media with effusion. Int J Pediatr Otorhinolaryngol. (2019) 124:116–9. doi: 10.1016/j.ijporl.2019.05.046

42. Nistico L, Kreft R, Gieseke A, Coticchia JM, Burrows A, Khampang P, et al. Adenoid reservoir for pathogenic biofilm bacteria. J Clin Microbiol. (2011) 49(4):1411–20. doi: 10.1128/jcm.00756-10

43. Fagö-Olsen H, Dines LM, Sørensen CH, Jensen A. The adenoids but not the palatine tonsils serve as a reservoir for bacteria associated with secretory otitis media in small children. mSystems. (2019) 4(1):e00169-18. doi: 10.1128/msystems.00169-18

44. Massa HM, Lim DJ, Kurono Y, Cripps AW. Chapter 101: Middle ear and Eustachian tube mucosal immunology. In: Mestecky J, Strober W, Russell MW, Kelsall BL, Cheroutre H, Lambrecht BN, editors. Mucosal Immunology, 4th ed. Boston: Academic Press (2015). p. 1923–42.

45. Tarhun YM. The effect of passive smoking on the etiology of serous otitis media in children. Am J Otolaryngol. (2020) 41(3):102398. doi: 10.1016/j.amjoto.2020.102398

46. Patel MA, Mener DJ, Garcia-Esquinas E, Navas-Acien A, Agrawal Y, Lin SY. Tobacco smoke exposure and eustachian tube disorders in US children and adolescents. PLoS One. (2016) 11(10):e0163926. doi: 10.1371/journal.pone.0163926

47. Akcan FA, Dündar Y, Akcan HB, Uluat A, Cebeci D, Sungur MA, et al. Clinical role of vitamin D in prognosis of otitis media with effusion. Int J Pediatr Otorhinolaryngol. (2018) 105:1–5. doi: 10.1016/j.ijporl.2017.11.030

48. Asghari A, Bagheri Z, Jalessi M, Salem MM, Amini E, GhalehBaghi S, et al. Vitamin D levels in children with adenotonsillar hypertrophy and otitis media with effusion. Iran J Otorhinolaryngol. (2017) 29(90):29–33. PMID: 28229060

Keywords: otitis media with effusion, adenoid hypertrophy, machine learning, random forest, diagnostic accuracy, SHAP analysis

Citation: Zhang X, Xie Q and Wu G (2025) Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy. Front. Pediatr. 13:1614495. doi: 10.3389/fped.2025.1614495

Received: 19 April 2025; Accepted: 6 June 2025;
Published: 19 June 2025.

Edited by:

Yanda Meng, University of Exeter, United Kingdom

Reviewed by:

Aline Sanches, Centro Universitário Einstein de Limeira, Brazil
Junbo Zeng, Sun Yat-sen Memorial Hospital, China

Copyright: © 2025 Zhang, Xie and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xiaote Zhang, zhangxiaote163@163.com
