Heavy metal biomarkers and their impact on hearing loss risk: a machine learning framework analysis

Nabavi, Ali; Kashkooli, Mohammad; Nabavizadeh, Sara Sadat; Safari, Farimah

doi:10.3389/fpubh.2025.1471490

ORIGINAL RESEARCH article

Front. Public Health, 16 April 2025

Sec. Environmental Health and Exposome

Volume 13 - 2025 | https://doi.org/10.3389/fpubh.2025.1471490

Heavy metal biomarkers and their impact on hearing loss risk: a machine learning framework analysis

Ali Nabavi¹

Mohammad Kashkooli¹

Sara Sadat Nabavizadeh²

Farimah Safari¹^*

¹Student Research Committee, Shiraz University of Medical Sciences, Shiraz, Iran
²Department of Otolaryngology, Otolaryngology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran

Background: Exposure to heavy metals has been implicated in adverse auditory health outcomes, yet the precise relationships between heavy metal biomarkers and hearing status remain underexplored. This study leverages a machine learning framework to investigate these associations, offering a novel approach to understanding the interplay between environmental exposures and hearing loss.

Methods: We conducted a retrospective cross-sectional analysis using data from the 2012–2018 National Health and Nutrition Examination Survey (NHANES), encompassing 2,772 participants after applying exclusion criteria. Demographic, clinical, and heavy metal biomarker data (e.g., blood lead and cadmium levels) were analyzed as features, with hearing loss status—defined as a pure-tone average threshold exceeding 25 dB HL across 500, 1,000, 2000, and 4,000 Hz in the better ear—serving as the binary outcome. Multiple machine learning algorithms, including Random Forest, XGBoost, Gradient Boosting, Logistic Regression, CatBoost, and MLP, were optimized and evaluated. Model performance was assessed using accuracy, area under the curve (AUC), sensitivity, and specificity, while SHAP (SHapley Additive exPlanations) elucidated feature contributions.

Results: The CatBoost model demonstrated the strongest performance, achieving an accuracy of 74.9% and an AUC of 0.792 on test data. Age, education level, gender, and blood levels of lead and cadmium emerged as the most significant features associated with hearing loss, as determined by SHAP analysis. These findings highlight key correlates of hearing impairment within the study population.

Conclusion: This study underscores the utility of a machine learning framework in identifying associations between heavy metal biomarkers and hearing loss in a nationally representative sample. While not designed to forecast hearing loss over time, our findings suggest potential clinical relevance for identifying individuals with elevated heavy metal exposure who may warrant further audiometric evaluation. This work lays a foundation for future longitudinal studies to explore these relationships more comprehensively.

Introduction

Hearing loss is a critical global concern affecting approximately 27.7 million adults in the United States alone (1, 2). Beyond its prevalence, hearing loss imposes considerable psychological and socioeconomic burdens (3). Evidence accumulated over recent decades suggests the ototoxic effects of heavy metals like lead, cadmium, and mercury, even at low levels of exposure (4, 5). Agricultural, pharmaceutical, industrial settings and certain medical applications are common pathways for heavy metal exposure, with exposure rates rising dramatically in recent decades (6, 7).

Some proposed mechanisms of heavy metal-induced hearing loss include damage to the structures and nervous system of the inner ear, reduced blood flow, and lipid peroxidation in the cochlea (8, 9). A study conducted by Wang et al. proved the relationship between heavy metal exposure and risk of hearing loss through a meta-analysis of recent studies (5). Also, the exacerbating effect of heavy metals even in noise-induced hearing loss has been determined in the Korean population (10). Many animal studies have also demonstrated the correlation and underlying mechanism (11–13). However, no studies to date have utilized machine learning (ML) to formally quantify these relationships and detect hearing loss based on objective exposure biomarker values. Since the screening and diagnostic fields are becoming smarter, an automatic system and platform based on artificial intelligence that investigates hearing loss can be developed to reflect the complex relationship between heavy metals and hearing outcome.

Due to prolonged exposure to heavy metals leading to an increased risk of hearing loss, an accurate investigation tool for high-risk populations enables early intervention and reduces the burden of hearing loss. Artificial intelligence, especially supervised learning which uses labeled inputs and outputs, can develop accurate models to explore the risk of hearing loss. In this study, we try different supervised classification algorithms on a large, nationally representative sample from the National Health and Nutrition Examination Survey (NHANES) dataset to help the model learn the complex relationships between heavy metal levels and other attributes and the risk of hearing loss. As ML models become more complex, it is necessary to have explainable methods that can clarify the contribution of different features to the final result. Feature importance-based explanations have been used to enhance the safety and tractability of the models. We conduct an interpretability analysis to address a key limitation of black box models by explaining the reasons behind their performances. This kind of analysis provides clinicians and researchers more confidence in using the models for risk assessment purposes.

Method

Study design and data source

We conducted a retrospective analysis using data from the National Health and Nutrition Examination Survey (NHANES) from 2012 to 2018. NHANES is an ongoing cross-sectional survey conducted by the National Center for Health Statistics to assess the health and nutritional status of adults and children in the United States. The survey combines interviews, physical examinations, and laboratory tests using a complex, multistage probability sampling design to obtain nationally representative samples (14).

From the initial 28,874 NHANES participants, we applied several exclusion criteria to obtain our final analytic sample. We excluded individuals under 20 years of age, those with incomplete audiometric data, and those who self-reported any hearing loss or related conditions. This exclusion criterion was applied to focus the analysis on objectively classified hearing loss based on audiometric data, thereby reducing potential biases from heterogeneous underlying conditions and supporting the study’s aim of identifying associations rather than longitudinal prediction. Participants were excluded if they answered affirmatively to questions about potential causes of hearing loss such as genetic/hereditary factors, ear infections, ear diseases, illnesses/infections, drugs/medications, head/neck injuries, exposure to loud brief noise, long-term noise exposure, or aging. We excluded participants who self-reported significant exposure to loud noise based on the NHANES Audiology Questionnaire. Specifically, individuals who indicated past or current occupational noise exposure or substantial off-work exposure to loud noise were removed from the analysis. To account for the potential impact of ototoxic medications on hearing loss, we reviewed data from the NHANES Prescription Medication Section (DSQ), which records prescription drugs taken by participants in the past 30 days. Based on clinical evidence and prior research, we identified a list of ototoxic medications, including acetaminophen (15), hydrocodone (16), ciprofloxacin (17), phenytoin (18), levofloxacin (19), rifampin (20), minocycline (21), aspirin (22), metronidazole (23), nitroglycerin (24), and bumetanide (25, 26). Only 35 participants (1.3% of the total sample) reported using these medications, and due to the low prevalence of exposure, we retained these individuals in the analysis to avoid unnecessary reduction in sample size and maintain the representativeness of the cohort. After rigorously applying these exclusion criteria, our final analytic sample consisted of 2,772 eligible NHANES participants (Figure 1).

Figure 1

Figure 1. Flow diagram of the cohort study.

To develop our model for hearing impairment, we selected demographic variables (gender, age, race/ethnicity, education level, marital status, family income-to-poverty ratio), clinical measures (blood pressure, physical activity, smoking status, diabetes diagnosis, body mass index, health insurance status), and biomarkers of heavy metal exposure (mercury, lead, cadmium, arsenic, barium, cobalt, cesium, molybdenum, manganese, antimony, tin, thallium, tungsten, and uranium) consistently measured in the NHANES dataset from 2012 to 2018. These variables were chosen based on their potential associations with hearing health, as identified through an extensive literature review.

Hearing loss status

We defined hearing impairment as a binary outcome based on the speech-frequency pure-tone average (PTA), calculated as the mean hearing threshold in decibels hearing level (dB HL) across 500, 1,000, 2000, and 4,000 Hz frequencies for the better ear. This approach aligns with the audiometric testing procedures and guidelines established by the American Speech-Language-Hearing Association. Specifically, we categorized participants as having a hearing impairment (coded as 1) if their PTA value exceeded 25 dB HL, indicating mild or worse impairment. Participants with PTA values below or equal to 25 dB HL were considered to have normal hearing (coded as 0) (27) (Figure 2).

Figure 2

Figure 2. Proposed methodology.

Preprocessing

Prior to model development, a series of preprocessing steps were undertaken to optimize the NHANES dataset for analysis. Responses indicating refusal or lack of knowledge were considered missing values to mitigate potential biases in performance metrics. A threshold was established to filter out variables and participants with substantial missing information — in particular, variables lacking over 20% of their data and participants missing crucial data points were removed from the study.

To address the presence of outliers, a two-stage detection strategy was employed. Initially, DBSCAN (Density-Based Spatial Clustering of Applications with Noise) was applied to isolate outliers based on density estimations, effectively managing clusters of varying densities. This was complemented by a tree-based anomaly detection method that isolates outliers through random feature partitioning, enhancing robustness in the multidimensional feature space. This dual approach ensured comprehensive outlier management without relying on distributional assumptions, further refining the dataset for analysis.

Categorical and ordinal features were one-hot encoded, transforming them into a machine-readable numerical format, thereby facilitating their inclusion in the modeling process. For numerical variables, scalar normalization techniques were applied to ease the influence of outliers and ensure equitable feature contributions (28).

To address missing data, a hybrid imputation strategy was implemented to enhance dataset robustness. For categorical variables, missing values were imputed using the mode, maintaining their integrity. For numerical variables, an iterative imputation technique was employed, predicting missing values based on relationships with other features across multiple iterations. This approach offers a more sophisticated alternative to mean imputation, minimizing bias and preserving the dataset’s predictive power while ensuring all relevant cases were retained for analysis.

To address missing data, an imputation strategy was implemented, utilizing the mode for categorical variables and the mean for numerical variables. This approach preserved the integrity of the dataset while enabling the inclusion of all relevant cases in the analysis.

Given the dataset’s noticeable imbalance, predominantly skewed toward individuals without hearing impairment, the Synthetic Minority Over-sampling Technique (SMOTE) was implemented. SMOTE enhances model performance on imbalanced data by creating synthetic examples of the under-represented class, thereby balancing the dataset and promoting a more equitable learning environment.

For feature selection, Recursive Feature Elimination (RFE) was utilized in conjunction with a Random Forest estimator. RFE is an iterative process that recursively eliminates the least important features based on the estimator’s feature importance rankings, ultimately retaining the 20 most relevant features for hearing impairment. This approach facilitated the identification of the most significant factors, enhancing the interpretability and performance of the final model.

Machine learning model evaluation

In our study, we employed a rigorous evaluation process to assess the performance of several machine learning algorithms in exploring hearing outcomes based on heavy-metal exposure. We used Python version 3.8.8. to develop Random Forest, XGBoost, Gradient Boosting, Logistic Regression, CatBoost, and MLP (29–31). These algorithms were selected based on their proven track record in similar tasks across various domains, as demonstrated by their superior performance metrics in related research (32, 33).

To optimize the capabilities of each model, we undertook a comprehensive hyperparameter tuning process. This involved an extensive exploration of the hyperparameter spaces for each algorithm, to identify the configurations that yielded the most accurate performances. We employed a randomized search strategy across 5 iterations, with 5-fold cross-validation for each model. This approach ensured that the models were fine-tuned to their optimal settings for our specific task (Supplementary Table S1).

Our validation methodology involved splitting the dataset into training and test subsets, with 70% of the data allocated for training purposes and the remaining 30% used for model evaluation. The models’ performances were assessed using a comprehensive suite of metrics, including accuracy, sensitivity, specificity, precision, the area under the curve (AUC), and the F1 score. These metrics provided a holistic view of each model’s strength and reliability, taking into account their ability to correctly identify true positives (sensitivity) and true negatives (specificity), as well as their overall accuracy and precision.

Model interpretation

To enhance the interpretability of these models, we leveraged SHAP (SHapley Additive exPlanations), a powerful framework that provides intuitive explanations by attributing the model’s results to the contributions of individual features (34).

One of the key visualizations we utilized was the SHAP beeswarm plot. These plots aggregate the SHAP values of all features across the dataset, illustrating the distribution of each feature’s impact on the model’s output. The color coding within these plots aids in discerning whether a feature increases or decreases the likelihood of hearing loss, offering a clear understanding of the feature’s influence.

Furthermore, we employed SHAP decision plots to gain a granular understanding of the decision-making process for individual outcomes. These plots trace the path from the base value (the model’s output value without any feature information) to the final performance, sequentially adding the effect of each feature. This step-by-step breakdown provides a transparent narrative of how each feature contributes to the outcome, highlighting the complex interactions and nonlinear relationships between features (35).

Integrating SHAP into our analysis bridged the gap between model accuracy and interpretability, allowing for a deeper comprehension of the underlying patterns and relationships within the data.

Result

Characteristics of the study population

Among 2,779 participants who retrained for the analysis, 468 (16.88%) contributors had experienced hearing loss. A comparative analysis between the hearing loss and no hearing loss groups revealed significant differences in their baseline characteristics, as presented in Tables 1, 2. Participants with hearing loss tended to be older, smokers, with lower educational levels and family income, and a higher prevalence of hypertension and diabetes (all p < 0.001). The proportion of male participants decreased from 63.03% in the hearing loss group to 47.74% in the no hearing loss group, suggesting potential gender differences between the groups (p < 0.001).

Table 1

Table 1. Baseline numerical characteristics of the participants.

Table 2

Table 2. Baseline categorical characteristics of the participants.

Blood sample analysis showed significantly higher levels of lead, cadmium, and ethylmercury in the hearing loss group (all p < 0.001). Furthermore, urine samples from the same group exhibited considerably higher levels of lead, cadmium, arsenous acid, thallium, antimony, tungsten, and monomethylarsonic acid (all p < 0.001).

Machine learning model performance

We compared the performance of our six ML algorithms which were each trained and tested precisely. To prevent overfitting or uncertainty in the models, we utilized RFE to penalize and select our 22 optimal features for model development. The details of models’ performance metrics are presented in Table 3 and the visualized comparison between ML models is shown in Figure 3.

Table 3

Table 3. Prediction performance of ML models.

Figure 3

Figure 3. Comparison of ML models’ performance.

CatBoost classifier outperformed other models with an accuracy of 74.9 and 74.0% and AUC of 0.792 and 0.786 for train and test groups. Compared to Logistic Regression, our best model achieved a satisfactory increase of more than 4% in AUC and accuracy. Among all six models, XGBoost had the best precision with 0.786 for test group, while the best sensitivity belongs to MLP with 0.701 for train group. The results of the AUC metric are depicted in Figure 4, which shows the averaged ROC curves across the full range of specificity and sensitivity thresholds for all six models.

Figure 4

Figure 4. AUC-ROC curves of ML models.

Feature importance

We applied the Bee Swarm SHAP method to explain the role of each feature and the effect on the model for detecting hearing loss in the CatBoost model, where the red and blue features represent associated factors and protective factors, respectively (Figure 5A). Also, in terms of SHAP values, longer bars meant more importance. Additionally, we sorted the importance of variables in ascending order according to the average value as presented in Figure 5B. The result shows that age was the most influential associated factor for HL classification, followed by lower educational levels. Male gender was also positively correlated with hearing loss which made it more likely that the model classifies the participant as a hearing loss case. Among heavy metals, higher blood lead and cadmium levels were the most important associated factors, while selenium and barium were other important heavy metals with lower impact on positive association.

Figure 5

Figure 5. SHAP BeeSwarm plot and feature importance. (A) SHAP BeeSwarm plot for detecting hearing loss in the CatBoost model. (B) Feature importance ranking for detecting hearing loss in the CatBoost model. ug/L: Mean concentration; BMI: Body mass index.

By utilizing the SHAP decision plot, we tried to clarify how the CatBoost model arrived at the outcome for each particular instance. As shown in Figure 6, each line represents a participant in the decision plot. Lines toward the right side mean that features like older age, higher blood lead levels, and less education pushed the model result toward hearing loss. Conversely, lines toward the left indicate that features like female gender, nonsmoking status, and lower selenium levels led the model to demonstrate no hearing loss.

Figure 6

Figure 6. SHAP decision plot.

Discussion

We evaluated six ML models to expedite early detection of hearing loss caused by heavy metal exposure. Our study used a representative NHANES dataset sample to identify the model with the highest classification accuracy. An estimated AUC in the range of 0.7–0.8 in our models is generally considered a “Good” capacity (36). The CatBoost algorithm, the best-performing model, attained a notable accuracy of 74.9% and an AUC of 0.792. Besides, the SHAP algorithm identified age, education level, gender, lead, and cadmium concentration as the main features that contributed to the conclusion. The timely identification of hearing loss through the application of this screening tool facilitates early intervention and treatment, thereby enhancing the quality of life.

Recently many studies have utilized ML to examine hearing outcomes based on various features. An AUC of 0.93 and accuracy of 85% based on speech-in-noise testing, an AUC of 0.80 and accuracy of 80% based on noise exposure, and an AUC of 0.96 based on demographics and clinical outcomes are some of the best examples of exploring hearing outcome. However, we believe our study represents a pioneering effort to utilize heavy metals as associated factors of hearing outcomes (37–39). SVM, RF, and LightGBM were the best models in recent studies, while, almost none of them used the CatBoost algorithm in their study. In addition, recent studies mainly focused on minimizing the absolute error rate and maximizing the precision. While having a precise performance and minimizing error across all cases is statistically favorable, it does not necessarily translate to the best real-world results in a medical setting. If a model is overly focused on precision and mean error, it risks being too conservative in its performance. As a result, patients who do need additional follow-up may be missed. This could have serious consequences for individuals’ long-term hearing health and quality of life. Therefore, instead of focusing solely on precision, we have opted to optimize sensitivity and recall along with accuracy and AUC. This approach is more appropriate for diagnostic evaluations for hearing, which are relatively simple, inexpensive, and low-risk. By combining ML screening with standardized diagnostic testing only for flagged individuals, populations can be efficiently monitored for hearing loss.

The CatBoost algorithm provided our first aim to find the best ML model performance. This particular classifier is known for its ability to effectively manage categorical variables through the application of Ordered Boosting, a variant of gradient boosting. The algorithm’s use of multiple weak models (decision trees) to create a strong ensemble model results in a high level of accuracy, which is particularly useful for capturing nonlinear patterns in data (40). While our primary objective was to develop the most effective hearing loss classification algorithm based on heavy metal exposure, we were also interested in understanding the key factors that influence the decision-making process of these classifiers. To this end, we opted to utilize the SHAP algorithm, as it allows for a greater degree of transparency and interpretability in our model outputs.

The SHAP analysis assigns a value to each feature, which denotes its contribution to the outcome. By using this technique, we can obtain insights into the contribution of each feature in noticing hearing loss for a particular patient (local interpretation), the accuracy of the result for that patient (local explanation), and the contribution of each feature to the model’s performance across the entire dataset (global explanation and interpretation) (41). To provide better global and local insights, and to avoid focusing solely on the global interpretation of Bee Swarm SHAP, we utilized the SHAP decision plot. This plot helps to show the impact of each feature on the model’s output for a specific instance, thus providing a clearer understanding of the diagnosis and reasons behind it.

Our models retained known associated factors from patient characteristics such as increasing age, worse education level, and male gender. Aging is the primary associated factor for hearing loss, while lower education levels are associated with poorer healthcare and greater exposure to the related factors (42, 43). We have also identified being male as a significant associated factor for hearing loss, likely due to increased exposure to occupational and environmental noise that men face on average, as well as biological differences resulting from male sex hormones and their potential effects on hearing vulnerability compared to female hormones (44, 45). In addition, blood lead and cadmium concentrations have been shown to impact hearing outcomes due to their toxic effects on the cochlea and auditory nerve (46, 47). However, urine lead and cadmium appear to have a lower impact on hearing outcomes due to their nature as daily exposure indicators.

Heavy metals, including lead, cadmium, mercury, barium, arsenic, and manganese, have been conclusively linked to hearing loss. Exposure to these heavy metals through various routes, such as diet, inhalation, or dermal absorption, can increase hearing loss by anywhere from 3 to 75% (5, 48). Heavy metals can damage both the axons and myelin of peripheral nerves, with small nerve fibers being more susceptible than their larger counterparts (49). Furthermore, heavy metals can induce oxidative stress, inflammation, and vascular damage in the inner ear (48). The combination of these mechanisms can explain the association between heavy metal exposure and hearing loss. Yang et al. (50) demonstrated a significant association between blood lead levels and hearing loss in their meta-analysis, reinforcing the evidence that heavy metal exposure plays a role in auditory health. Our study builds on this foundation by employing machine learning and SHAP-based analysis to classify hearing status, thereby offering a more detailed, data-driven interpretation of how heavy metals, particularly lead and cadmium, influence hearing impairment.

The ML models developed and evaluated in this study show promise as effective screening tools to help address the growing problem of heavy metal-induced hearing loss. This screening approach could lighten the burden on healthcare systems when prospectively implemented, enabling the efficient monitoring of large populations. By facilitating early detection, the models may help improve long-term patient outcomes by allowing for timely clinical intervention that is tailored to an individual’s risk profile. The non-invasive nature of ML screening makes it a suitable first-line assessment prior to more resource-intensive diagnostic evaluations.

This study is the first of its kind to develop ML models incorporating heavy metal biomarker data in order to discover hearing loss risk. By analyzing which features like lead and cadmium levels contributed most to model outcomes, our study provides a novel perspective on their relative importance in auditory function. This pioneering work lays the foundation for more comprehensively understanding the relationship between environmental exposures and hearing outcomes through the use of advanced analytical techniques. The examination of a high-quality population-based registry was conducted to ensure the accuracy of the outcome. It appears more reasonable to consider this model when selecting high-risk patients rather than ruling out low-risk ones, considering the high recall and low precision. This approach can help capture all high-risk participants due to the availability of diagnosis approaches. However, the study has some limitations that require careful consideration. First, due to the cross-sectional nature of the dataset and lack of follow-up, the causality between the features and hearing loss could not be determined. Second, the use of a single retrospective dataset may lead to confounding effects of unmeasured factors, thereby making it difficult to apply this model to other institutions. Nonetheless, this study provides a foundation for developing a model that fits well with each institution. Third, the analysis of available data was limited by the fact that a significant proportion of participants did not undergo urine testing for cadmium and arsenic. Therefore, further studies are warranted to prospectively examine the detection of hearing loss through heavy metals using a large sample of the general population. Another limitation is the potential confounding effect of ototoxic medications; however, the low prevalence of exposure (n = 35, 1.3%) in our cohort suggests minimal impact on results, though future studies with larger samples and detailed medication data, including dosage and long-term use, could provide further insights. Additionally, while exposure to volatile organic compounds (VOCs) and pesticides is known to contribute to hearing loss, their frequent co-occurrence with heavy metals in environmental and occupational settings made it impractical to exclude such participants, as this would have reduced the representativeness of our cohort and introduced selection bias. Due to data limitations, our study could not explicitly account for vibration exposure, a well-established risk factor for hearing loss, which is frequently encountered alongside noise. Future research should also explore the combined effects of noise and vibration on hearing outcomes. While this study made important advances by focusing specifically on investigating hearing loss with heavy metal exposures, future work could aim to achieve even higher performance by integrating these biomarker features within models alongside additional clinical and demographic factors examined in other related research.

Conclusion

Overall, our findings provide preliminary validation of ML as a decision support tool for monitoring and detecting heavy metal-induced hearing loss at a population level. Specifically, we demonstrate that the CatBoost classifier achieved the highest performance with an AUC of 0.792 and accuracy of 74.9%. With prospective validation in larger, independent cohorts, this framework could aid public health efforts by facilitating early detection efforts and guiding clinical resource allocation.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found at: National Health and Nutrition Examination Survey (NHANES) repository, https://www.cdc.gov/nchs/nhanes/index.htm.

Ethics statement

The studies involving humans were approved by National Center for Health Statistics Research Ethics Review Board. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

AN: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. MK: Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – review & editing. SN: Investigation, Supervision, Validation, Writing – review & editing, Conceptualization. FS: Investigation, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2025.1471490/full#supplementary-material

References

1. Hoffman, HJ, Dobie, RA, Losonczy, KG, Themann, CL, and Flamme, GA. Declining prevalence of hearing loss in US adults aged 20 to 69 years. JAMA Otolaryngol Head Neck Surg. (2017) 143:274–85. doi: 10.1001/jamaoto.2016.3527

PubMed Abstract | Crossref Full Text | Google Scholar

2. Kim, JS. Prevalence and factors associated with hearing loss and hearing aid use in Korean elders. Iran J Public Health. (2015) 44:308–17.

Google Scholar

3. Haile, LM, Kamenov, K, Briant, PS, Orji, AU, Steinmetz, JD, Abdoli, A, et al. Hearing loss prevalence and years lived with disability, 1990–2019: findings from the global burden of disease study 2019. Lancet. (2021) 397:996–1009. doi: 10.1016/S0140-6736(21)00516-X

PubMed Abstract | Crossref Full Text | Google Scholar

4. Castellanos, M-J, and Fuente, A. The adverse effects of heavy metals with and without noise exposure on the human peripheral and central auditory system: a literature review. Int J Environ Res Public Health. (2016) 13:1223. doi: 10.3390/ijerph13121223

PubMed Abstract | Crossref Full Text | Google Scholar

5. Wang, F, Böhnke, F, Böck, K, and Wirth, M. Population-based study of environmental heavy metal exposure and hearing loss: a systematic review and meta-analysis. Laryngoscope investigative. Otolaryngology. (2024) 9:1230. doi: 10.1002/lio2.1230

PubMed Abstract | Crossref Full Text | Google Scholar

6. He, ZL, Yang, XE, and Stoffella, PJ. Trace elements in agroecosystems and impacts on the environment. J Trace Elem Med Biol. (2005) 19:125–40. doi: 10.1016/j.jtemb.2005.02.010

PubMed Abstract | Crossref Full Text | Google Scholar

7. Hubbard, A. Heavy metals in the environment: Origin, interaction and remediation, Elsevier/Academic Press, London Journal of Colloid and Interface Science. (2005) 291:307. doi: 10.1016/j.jcis.2005.05.069

Crossref Full Text | Google Scholar

8. Park, SK. Role of free radicals in hearing loss due to heavy metals. In: J. Miller, C. Le Prell, L. Rybak. (eds.). Free radicals in ENT. Pathology. Oxidative Stress in Applied Basic Research and Clinical Practice. Cham:Humana Press. (2015):93–109. doi: 10.1007/978-3-319-13473-4_5

PubMed Abstract | Crossref Full Text | Google Scholar

9. Fábelová, L, Loffredo, CA, Klánová, J, Hilscherová, K, Horvat, M, Tihányi, J, et al. Environmental ototoxicants, a potential new class of chemical stressors. Environ Res. (2019) 171:378–94. doi: 10.1016/j.envres.2019.01.042

PubMed Abstract | Crossref Full Text | Google Scholar

10. Choi, Y-H, and Kim, K. Noise-induced hearing loss in Korean workers: co-exposure to organic solvents and heavy metals in nationwide industries. Plo S one. (2014) 9:e97538. doi: 10.1371/journal.pone.0097538

PubMed Abstract | Crossref Full Text | Google Scholar

11. Lilienthal, H, and Winneke, G. Lead effects on the brain stem auditory evoked potential in monkeys during and after the treatment phase. Neurotoxicol Teratol. (1996) 18:17–32. doi: 10.1016/0892-0362(95)02010-1

PubMed Abstract | Crossref Full Text | Google Scholar

12. Apostoli, P, Catalani, S, Zaghini, A, Mariotti, A, Poliani, PL, Vielmi, V, et al. High doses of cobalt induce optic and auditory neuropathy. Exp Toxicol Pathol. (2013) 65:719–27. doi: 10.1016/j.etp.2012.09.006

PubMed Abstract | Crossref Full Text | Google Scholar

13. Carlson, K, Schacht, J, and Neitzel, RL. Assessing ototoxicity due to chronic lead and cadmium intake with and without noise exposure in the mature mouse. J Toxic Environ Health A. (2018) 81:1041–57. doi: 10.1080/15287394.2018.1521320

PubMed Abstract | Crossref Full Text | Google Scholar

14. Centers for Disease Control and Prevention (CDC). NHANES questionnaires, datasets, and related documentation. (2023). Available online at:https://wwwn.cdc.gov/nchs/nhanes/Default.aspx#.

Google Scholar

15. Yorgason, JG, Kalinec, GM, Luxford, WM, Warren, FM, and Kalinec, F. Acetaminophen ototoxicity after acetaminophen/hydrocodone abuse: evidence from two parallel in vitro mouse models. Otolaryngology—head and neck. Surgery. (2010) 142:814–9. doi: 10.1016/j.otohns.2010.01.010

PubMed Abstract | Crossref Full Text | Google Scholar

16. Tang, H, Vrabec, JT, and Burton, AW. Hydrocodone use and sensorineural hearing loss. Pain Physician. (2007) 10:467.

Google Scholar

17. Dirain, CO, and Antonelli, PJ. Cytotoxicity of tetracyclines in human tympanic membrane fibroblasts. Otol Neurotol. (2023) 44:520–4. doi: 10.1097/MAO.0000000000003867

PubMed Abstract | Crossref Full Text | Google Scholar

18. Hamed, SA. The auditory and vestibular toxicities induced by antiepileptic drugs. Expert Opin Drug Saf. (2017) 16:1281–94. doi: 10.1080/14740338.2017.1372420

PubMed Abstract | Crossref Full Text | Google Scholar

19. Baluku, JB, and Bongomin, F. Treatment outcomes of pregnant women with drug-resistant tuberculosis in Uganda: a retrospective review of 18 cases. Int J Infect Dis. (2021) 105:230–3. doi: 10.1016/j.ijid.2021.02.032

PubMed Abstract | Crossref Full Text | Google Scholar

20. Goodall, RL, Meredith, SK, Nunn, AJ, Bayissa, A, Bhatnagar, AK, Bronson, G, et al. Evaluation of two short standardised regimens for the treatment of rifampicin-resistant tuberculosis (STREAM stage 2): an open-label, multicentre, randomised, non-inferiority trial. Lancet. (2022) 400:1858–68. doi: 10.1016/S0140-6736(22)02078-5

PubMed Abstract | Crossref Full Text | Google Scholar

21. Reese, S, and Grundfast, K. Minocycline-induced hyperpigmentation of tympanic membrane, sclera, teeth, and pinna. Laryngoscope. (2015) 125:2601–3. doi: 10.1002/lary.25365

PubMed Abstract | Crossref Full Text | Google Scholar

22. Langworthy, B, Wu, Y, and Wang, M. An overview of propensity score matching methods for clustered data. Stat Methods Med Res. (2023) 32:641–55. doi: 10.1177/09622802221133556

PubMed Abstract | Crossref Full Text | Google Scholar

23. Rotman, A, Michael, P, Tykocinski, M, and O’Leary, SJ. Sudden sensorineural hearing loss secondary to metronidazole ototoxicity. Med J Aust. (2015) 203:253–253e.1. doi: 10.5694/mja15.00222

PubMed Abstract | Crossref Full Text | Google Scholar

24. Qi, R, Zhang, J, Diao, T, and Yu, L. The auditory function in migraine model rats induced by postauricular nitroglycerin injection. Front Neurol. (2023) 14:1259982. doi: 10.3389/fneur.2023.1259982

Crossref Full Text | Google Scholar

25. Rybak, LP. Ototoxicity of loop diuretics. Otolaryngol Clin N Am. (1993) 26:829–44. doi: 10.1016/S0030-6665(20)30770-2

Crossref Full Text | Google Scholar

26. Wang, S, Luo, J, Zhang, F, Zhang, R, Ju, W, Wu, N, et al. Association between blood volatile organic aromatic compound concentrations and hearing loss in US adults. BMC Public Health. (2024) 24:623. doi: 10.1186/s12889-024-18065-0

PubMed Abstract | Crossref Full Text | Google Scholar

27. Clark, JG. Uses and abuses of hearing loss classification. ASHA. (1981) 23:493–500.

PubMed Abstract | Google Scholar

28. Rodríguez, P, Bautista, MA, Gonzalez, J, and Escalera, S. Beyond one-hot encoding: lower dimensional target embedding. Image Vis Comput. (2018) 75:21–31. doi: 10.1016/j.imavis.2018.04.004

Crossref Full Text | Google Scholar

29. Shamshirband, S, Fathi, M, Dehzangi, A, Chronopoulos, AT, and Alinejad-Rokny, H. A review on deep learning approaches in healthcare systems: taxonomies, challenges, and open issues. J Biomed Inform. (2021) 113:103627. doi: 10.1016/j.jbi.2020.103627

PubMed Abstract | Crossref Full Text | Google Scholar

30. Joloudari, JH, Hassannataj Joloudari, E, Saadatfar, H, Ghasemigol, M, Razavi, SM, Mosavi, A, et al. Coronary artery disease diagnosis; ranking the significant features using a random trees model. Int J Environ Res Public Health. (2020) 17:731. doi: 10.3390/ijerph17030731

PubMed Abstract | Crossref Full Text | Google Scholar

31. Jalali, A, Lonsdale, H, Do, N, Peck, J, Gupta, M, Kutty, S, et al. Deep learning for improved risk prediction in surgical outcomes. Sci Rep. (2020) 10:9289. doi: 10.1038/s41598-020-62971-3

PubMed Abstract | Crossref Full Text | Google Scholar

32. Pedregosa, F, Varoquaux, G, Gramfort, A, Michel, V, Thirion, B, Grisel, O, et al. Scikit-learn: machine learning in Python. J Machine Learn Res. (2011) 12:2825–30.

Google Scholar

33. Ketkar, N, and Santana, E. Deep learning with Python. Berkeley, CA, USA: Apress (2017).

Google Scholar

34. Lundberg, SM, and Lee, S-I. A unified approach to interpreting model predictions. Adv Neural Inf Proces Syst. (2017) 30:4768–4777.

Google Scholar

35. Pan, X, Feng, T, Liu, C, Savjani, RR, Chin, RK, and Sharon, QX. A survival prediction model via interpretable machine learning for patients with oropharyngeal cancer following radiotherapy. J Cancer Res Clin Oncol. (2023) 149:6813–25. doi: 10.1007/s00432-023-04644-y

PubMed Abstract | Crossref Full Text | Google Scholar

36. Grinspan, ZM, Shapiro, JS, Abramson, EL, Hooker, G, Kaushal, R, and Kern, LM. Predicting frequent ED use by people with epilepsy with health information exchange data. Neurology. (2015) 85:1031–8. doi: 10.1212/WNL.0000000000001944

PubMed Abstract | Crossref Full Text | Google Scholar

37. Gathman, TJ, Choi, JS, Vasdev, RMS, Schoephoerster, JA, and Adams, ME. Machine learning prediction of objective hearing loss with demographics, clinical factors, and subjective hearing status. Otolaryngol Head Neck Surg. (2023) 169:504–13. doi: 10.1002/ohn.288

PubMed Abstract | Crossref Full Text | Google Scholar

38. Zhao, Y, Li, J, Zhang, M, Lu, Y, Xie, H, Tian, Y, et al. Machine learning models for the hearing impairment prediction in workers exposed to complex industrial noise: a pilot study. Ear Hear. (2019) 40:690–9. doi: 10.1097/AUD.0000000000000649

PubMed Abstract | Crossref Full Text | Google Scholar

39. Lenatti, M, Moreno-Sánchez, PA, Polo, EM, Mollura, M, Barbieri, R, and Paglialonga, A. Evaluation of machine learning algorithms and Explainability techniques to detect hearing loss from a speech-in-noise screening test. Am J Audiol. (2022) 31:961–79. doi: 10.1044/2022_AJA-21-00194

PubMed Abstract | Crossref Full Text | Google Scholar

40. Prokhorenkova, L, Gusev, G, Vorobev, A, Dorogush, AV, and Gulin, A. Cat boost: unbiased boosting with categorical features. Adv Neural Inf Proces Syst. (2018) 31

Google Scholar

41. Nohara, Y, Matsumoto, K, Soejima, H, and Nakashima, N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput Methods Prog Biomed. (2022) 214:106584. doi: 10.1016/j.cmpb.2021.106584

PubMed Abstract | Crossref Full Text | Google Scholar

42. Vuckovic, D, Biino, G, Panu, F, Pirastu, M, Gasparini, P, and Girotto, G. Age related hearing loss and level of education: an epidemiological study on a large cohort of isolated populations. Hear Balance Commun. (2014) 12:94–8. doi: 10.3109/21695717.2014.911472

Crossref Full Text | Google Scholar

43. Bowl, MR, and Dawson, SJ. Age-related hearing loss. Cold Spring Harb Perspect Med. (2019) 9:a033217. doi: 10.1101/cshperspect.a033217

PubMed Abstract | Crossref Full Text | Google Scholar

44. Asghari, A, Farhadi, M, Daneshi, A, Khabazkhoob, M, Mohazzab-Torabi, S, Jalessi, M, et al. The prevalence of hearing impairment by age and gender in a population-based study. Iran J Public Health. (2017) 46:1237–46.

PubMed Abstract | Google Scholar

45. Jerger, J, and Johnson, K. Interactions of age, gender, and sensorineural hearing loss on ABR latency. Ear Hear. (1988) 9:168–76. doi: 10.1097/00003446-198808000-00002

PubMed Abstract | Crossref Full Text | Google Scholar

46. Osman, K, Pawlas, K, Schütz, A, Gazdzik, M, Sokal, JA, and Vahter, M. Lead exposure and hearing effects in children in Katowice. Poland Environ Res. (1999) 80:1–8. doi: 10.1006/enrs.1998.3886

PubMed Abstract | Crossref Full Text | Google Scholar

47. Shargorodsky, J, Curhan, SG, Henderson, E, Eavey, R, and Curhan, GC. Heavy metals exposure and hearing loss in US adolescents. Arch Otolaryngol Head Neck Surgery. (2011) 137:1183–9. doi: 10.1001/archoto.2011.202

PubMed Abstract | Crossref Full Text | Google Scholar

48. Zou, P, Li, M, Chen, W, Ji, J, Xue, F, Wang, Z, et al. Association between trace metals exposure and hearing loss. Front Public Health. (2022) 10:973832. doi: 10.3389/fpubh.2022.973832

PubMed Abstract | Crossref Full Text | Google Scholar

49. Koszewicz, M, Markowska, K, Waliszewska-Prosol, M, Poreba, R, Gac, P, Szymanska-Chabowska, A, et al. The impact of chronic co-exposure to different heavy metals on small fibers of peripheral nerves. A study of metal industry workers. J Occup Med Toxicol. (2021) 16:1–8. doi: 10.1186/s12995-021-00302-6

Crossref Full Text | Google Scholar

50. Yang, T, Chen, HJ, Zhang, CY, He, D, and Yuan, W. Association of blood heavy metal concentrations with hearing loss: a systematic review and meta-analysis. Public Health. (2024) 227:95–102. doi: 10.1016/j.puhe.2023.10.019

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: hearing loss, heavy metals, machine learning, association, risk

Citation: Nabavi A, Kashkooli M, Nabavizadeh SS and Safari F (2025) Heavy metal biomarkers and their impact on hearing loss risk: a machine learning framework analysis. Front. Public Health. 13:1471490. doi: 10.3389/fpubh.2025.1471490

Received: 27 July 2024; Accepted: 31 March 2025;
Published: 16 April 2025.

Edited by:

Jason Brant, University of Wisconsin-Madison, United States

Reviewed by:

Renata Sisto, National Institute for Insurance against Accidents at Work (INAIL), Italy
Volker Hohmann, University of Oldenburg, Germany

Copyright © 2025 Nabavi, Kashkooli, Nabavizadeh and Safari. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Farimah Safari, ZmFyaW1hc2FmYXJpNDBAZ21haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.