Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 01 October 2025

Sec. Hematology

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1605868

Development and application of machine learning models for hematological disease diagnosis using routine laboratory parameters: a user-friendly diagnostic platform

Jingya Liu,,&#x;Jingya Liu1,2,3Yang Gou,,&#x;Yang Gou1,2,3Wuchen Yang,,&#x;Wuchen Yang1,2,3Hao Wang,,Hao Wang1,2,3Jing Zhang,,Jing Zhang1,2,3Shengwang Wu,,Shengwang Wu1,2,3Siheng Liu,,Siheng Liu1,2,3Tinglu Tao,,Tinglu Tao1,2,3Yongjie Tang,,Yongjie Tang1,2,3Cheng Yang,,Cheng Yang1,2,3Siyin Chen,,Siyin Chen1,2,3Ping Wang,,Ping Wang1,2,3Yimei Feng,,Yimei Feng1,2,3Cheng Zhang,,Cheng Zhang1,2,3Shuiqing Liu,,
Shuiqing Liu1,2,3*Xiangui Peng,,
Xiangui Peng1,2,3*Xi Zhang,,
Xi Zhang1,2,3*
  • 1Medical Center of Hematology, The Second Affiliated Hospital of Army Medical University, Chongqing, China
  • 2State Key Laboratory of Trauma, Burns and Combined Injury, Jinfeng Laboratory, Chongqing, China
  • 3Chongqing Key Laboratory of Hematology and Microenvironment, Chongqing, China

Aim: In recent years, with the change of social environment, the incidence and detection rate of hematological diseases have shown an increasing trend. Early diagnosis and detection of hematological diseases are very important to improve the quality of life and prognosis of patients.

Methods: In this study, we employed 54 clinical and conventional laboratory parameters. By optimally combining multiple feature selection methods and machine learning algorithms, we developed 7 machine learning models with varying feature set sizes. We comprehensively evaluated the performance of these models, analyzed the interpretability of the optimal and simplified models using SHapley Additive exPlanations (SHAP), and compared these two models with the diagnostic performance of hematologists. Finally, we developed a user-friendly diagnostic platform.

Results: The results showed that the ensemble model_1 with 46 feature parameters (EnMod1-46) and the simple ensemble model_2 with 12 feature parameters (EnMod2-12) demonstrated significant performance in diagnosing 16 types of hematological diseases. On the temporally distinct test set_1, the EnMod1-46 achieved an accuracy of 0.804 and an area under the curve (AUC) of 0.964, while EnMod2-12 attained an accuracy of 0.784 and an AUC of 0.961. To further validate the model’s generalization performance, EnMod1-46 achieved an accuracy of 0.738 and an AUC of 0.973 on the independent external test set_2, while EnMod2-12 yielded an accuracy of 0.705 and an AUC of 0.962. SHAP analysis showed that PLT, WBC, MCV, HGB, RBC and age were significant parameters in both models. Comparative analysis of clinical diagnosis revealed that the performance of EnMod1-46 and EnMod2-12 outperformed junior hematologists, while EnMod1-46 was comparable to senior hematologists. Concurrently, based on EnMod2-12, we developed a user-friendly diagnostic platform to facilitate risk assessment and improve access to accurate diagnosis.

Conclusion: This study provides an efficient and accurate screening method for hematological diseases, especially in resource-limited countries and regions.

1 Introduction

Hematological diseases include a wide range of benign and malignant disorders that affect the blood, bone marrow and lymphatic system. In recent years, with the change of social environment, especially the aging of population and the continuous development of medical technology, the incidence and detection rate of hematological diseases have shown an increasing trend (1). However, due to the strong heterogeneity of laboratory indicators in hematological diseases (2), diverse clinical manifestations, and frequent overlap with other systemic diseases, some patients have difficulty in obtaining timely and accurate diagnosis at the first visit. This often results in multiple medical consultations, increasing both patients’ suffering and the waste of medical resources. If benign hematological diseases are not detected early or managed properly, these conditions can lead to serious symptoms and complications that can significantly affect the quality of life for patients. According to the global cancer statistics from GLOBOCAN 2022, malignant hematological diseases such as non-Hodgkin lymphoma and leukemia exhibited high incidence and mortality rates worldwide (3). Modern treatment methods, such as targeted therapy, and hematopoietic stem cell transplantation, have significantly improved the prognosis of patients and increased the survival rate of patients (4, 5). However, misdiagnosis or delayed diagnosis in the early stages can significantly compromise patient’s prognosis (6).

Machine learning (ML) as a subset of artificial intelligence has been explored in various domains of hematological diagnosis, including laboratory testing, histopathology, flow cytometry and molecular data (7). Moreover, the model built using ML can accurately diagnose various types of hematological diseases and identify key parameters (8, 9). In clinical practice, easily accessible laboratory parameters have been proved to be an important data source for implementing disease diagnosis models. For example, Erler et al. improved the classification accuracy of thalassemia by building a backpropagation artificial neural network using erythrocyte related parameters such as red blood cell count and average red blood cell volume, with sensitivity and specificity both reaching 94.2% (10). With the employment of random forest algorithms, Guncar et al. developed models with 181 and 61 parameters to classify 43 hematological diseases. The accuracy for diagnosing the most likely hematological disease was 59% and 57%, respectively, while the accuracy for predicting the top five diagnoses improved to 88% and 86% (11). Park et al. further developed an ensemble model based on deep neural networks, using 88 different parameters to classify 39 hematological disorders, achieving accuracies of 65% and 93% for predicting the most likely single and top five diagnoses, respectively (12). These models achieved accurate prediction of hematologic diseases. However, some of them involve a large number of parameters, which complicates model training and computation. Moreover, the prediction accuracy for the most likely single disease remains relatively low. While other studies have reported accuracy above 0.90, these have generally focused on predicting or differentiating between specific diseases, such as certain types of leukemia (13, 14).

Therefore, developing highly accurate and computationally efficient prediction models and tools for common hematological diseases are significant in aiding clinical diagnosis and decision-making. In this study, a total of 16 types of hematological diseases and a group of healthy individuals were included. The disease types cover about 80% of hematological diseases, which have a high prevalence. We developed a prediction model, named EnMod1-46, which was an ensemble model trained using 46 parameters selected by random forest combined with recursive feature elimination with cross-validation (RF-RFECV) and achieved a high classification accuracy. In order to facilitate clinical application, we simultaneously constructed a simplified model, EnMod2-12, which utilized an ensemble of four ML algorithms trained on 12 routine complete blood count (CBC) parameters. This model also supported a practical online prediction tool, facilitating real-world application. This study provided valuable decision-making support for clinical practice and helped to realize early diagnosis and screening of hematological diseases.

2 Materials and methods

2.1 Participants

This study included 10,401 patients newly diagnosed at the Medical Center of Hematology of the Second Affiliated Hospital of Army Medical University from January 2015 to March 2024. The cohort comprised 16 groups of patients with newly diagnosed hematological diseases and a group of healthy individuals as controls (including healthy donors and healthy medical examination patients). Malignant hematological diseases include ALL, APL, AML-nonAPL, CLL, CML, CMML, MDS, MM, MPN, and Lymphoma. Benign hematological diseases include Hemocytopenia, AA, Thalassemia, MgA, HA, and IDA. The full names of the disease types are provided in the Glossary. Additionally, 342 patients from April 2024 to June 2024 were collected as the temporally distinct test set_1. The multi-center and independent external test set_2 included 149 patients recruited from two tertiary medical centers: the First Affiliated Hospital of Chongqing Medical University and Southwest Hospital of Army Medical University. Inclusion criteria: (1) No restrictions were placed on sex or age; (2) All cases were confirmed and diagnosed clearly by the hospital; (3) Laboratory data were available for analysis; (4) Patients were not receiving any treatment at the time of diagnosis. Exclusion criteria: (1) Cases with unclear or ambiguous diagnoses; (2) Insufficient laboratory data for analysis; (3) Patients who had received relevant medications prior to diagnosis. This study was approved by the Medical Ethics Committee of Second Affiliated Hospital of Army Medical University (2024–236-01).

2.2 Data collection

A total of 54 potential features were collected at the time of patient diagnosis. These parameters included 2 demographic parameters (sex and age) and 52 blood test parameters: WBC, HGB, PLT, MCV, NEUT%, LYM%, MXD%, BASO%, EO%, RBC, EO#, BASO#, HCT, LYM#, MCH, MCHC, MPV, MXD#, NEUT#, PDW, PCT, RDW-CV, RDW-SD, RET%, ALB, GLB, A/G, ALP, ALT, AST, CREA, DBIL, TBIL, GGT, LDH, PT, APTT, TT, Fg, IgA, IgE, IgG, IgM, Igκ, Igλ, EPO, FOL, VB12, SF, FE, IAT, and DAT. The full names of the blood test parameters are provided in the Glossary.

2.3 Data preprocessing

Among the 52 blood test parameters included, the missing value ratio of 23 CBC parameters was relatively low, all less than 15%, while the missing values ratio for the other parameters exceeded 30%. First, we performed data cleaning by removing special characters such as “>” and “<” from the parameters in the dataset. To address the missing values, categorical variables were imputed with the mode, while continuous variables were imputed with the median. Subsequently, we standardized the units of all parameters to the international system of units (IS units). Additionally, to eliminate the influence of scales, we applied the StandardScaler to normalize all continuous variables.

2.4 Feature selection

To identify the key features for model training, we employed three feature selection methods to create three distinct feature subsets:

1 RF-RFECV (15, 16)

  Using random forest classifier (v1.5.1) with 5-fold cross-validation (CV) and RFECV (v1.5.1), we progressively eliminated less relevant features to obtain an optimal feature subset that enhanced model generalization while improving model interpretability.

2 Light Gradient Boosting Machine (LightGBM) (17, 18)

  We trained a LightGBM model (v4.4.0) and ranked features based on their importance in the tree structure, selecting those with above-average importance values.

3 Information Gain (IG) (19)

  We ranked features based on their entropy-based contribution to the target variable and selected those with information gain values above the average to form the IG feature subset, reducing noise features.

Additionally, we included the complete dataset of all 54 parameters and a clinically relevant subset of 12 parameters (WBC, HGB, PLT, RBC, MCV, NEUT%, LYM%, MXD%, EO%, BASO%, age and sex) validated by three hematology experts. The specific screening method was conducted as follows: from the 23 CBC parameters with minimal missing values, the experts unanimously agreed on 7 core parameters (WBC, HGB, PLT, RBC, MCV, age and sex) for direct inclusion in the model. For the remaining parameters, the experts chose to incorporate NEUT%, LYM%, MXD%, EO%, and BASO% after discussion, as these percentage-based parameters are relative values unaffected by WBC count, making them more comparable across different individuals. This makes them particularly suitable for initial disease screening or classification, especially in the classification of leukemia. Parameters not selected by the expert panel were excluded. Therefore, a total of 5 parameter sets were used for the training of the model.

2.5 Model training and optimization

The dataset was randomly divided into a training set and a validation set at a 7:3 ratio. To address the issue of class imbalance, we applied the synthetic minority over-sampling technique (20), which generates synthetic samples to balance the class distribution in the training set. The Optuna framework was utilized for automated hyperparameter tuning through bayesian optimization (21), with an objective function defined as maximizing the coefficient of determination (R2) on the validation set.

We evaluated candidate models, including k-nearest neighbor (KNN), random forest (RF), categorical boosting (CatBoost), extreme gradient boosting (XGBoost), LightGBM, alongside two ensemble models: EnMod1, which combines RF, XGBoost, and LightGBM, and EnMod2, which integrates RF, XGBoost, LightGBM, and CatBoost. Model performance was assessed using five-fold cross-validation, with evaluation metrics including precisions, recalls, F1-scores, and area under the curves (AUCs). The initial validation was conducted on the temporally distinct test set (test set_1) and the optimal model demonstrating superior performance was selected for further evaluation. This final model underwent rigorous validation on the multi-center and independent external test set (test set_2), confirming its robust generalization capabilities.

2.6 Interpretability

We used SHapley Additive exPlanations (SHAP; version 0.46.0), a widely recognized framework for explaining ML models, to quantify and interpret the contribution of input features. SHAP summary plots visualize the overall impact of each feature, while SHAP dependency plots reveal the relationship between individual features and model predictions. SHAP values quantitatively measure the contribution of each variable to the prediction outcomes, enabling us to rank and evaluate feature importance (22). This analysis enhanced the transparency and interpretability of our ML models, identifying the critical blood laboratory parameters that most influenced the predictions in hematological disease diagnosis.

2.7 Clinical diagnostic comparison test

To further evaluate the diagnostic performance and practical application of these models, we conducted a clinical diagnostic comparison test with 100 randomly selected patients from the dataset. The cases were randomly divided into two groups: a 46-parameter group and a 12-parameter group of 50 patients each, all covering 16 disease categories and a group of healthy individuals. Specifically, 4 senior hematologists and 4 junior hematologists were invited for diagnosis. The cases were randomly distributed to each hematologist, and each doctor needed to analyze 20 cases (10 cases in each of the two groups). Finally, the diagnostic accuracy of the two model’s and the two groups of hematologists was evaluated across the two distinct parameter sets.

2.8 Development of predictive tools

A web-based calculator using a simplified 12-parameter model has been developed to provide diagnostic support to clinicians and serve as a health assessment platform for users. This platform was employed using the Shiny for Python framework (shiny: version 0.5.0; shinyswatch: version 0.3.1), integrating libraries such as Pandas and Matplotlib for data processing and visualization. In our study, users can input clinical parameters to obtain predicted outcomes for the potential hematological diseases, thereby enhancing the model’s utility and accessibility in clinical diagnosis and patient health assessment.

2.9 Statistics method

Data analysis was performed using Python version 3.12.3. For continuous variables, median and quartile ranges (IQR) were used for representation, and comparisons were performed using the Mann-Whitney U test. For discrete variables, they were represented as counts and precentages, and the Chi-square test was used for comparative analysis. During the significance test, the p value of parameters was calculated. When the p value was less than 0.05, the difference between the two groups of data was considered to be statistically significant.

3 Results

3.1 Study population

Our dataset comprised 10,401 patients representing 16 distinct hematological disease types and 1 healthy control group. We analyzed 54 clinical and laboratory parameters for each patient. The dataset was randomly divided into training and validation sets at a 7:3 ratio, consisting of 7,280 and 3,121 cases, respectively. Demographic characteristics were comparable across all sets. The training set had a median age of 46.00 years (IQR: 29.00–57.00) with a male-to-female ratio of 0.85. Similarly, the validation set showed a median age of 46.00 years (IQR: 29.00–58.00) with a male-to-female ratio of 0.88. The test set_1 included 342 patients with a median age of 49.00 years (IQR: 30.00–60.00), comprising 159 males (46.49%) and 183 females (53.51%). The test set_2 with 149 patients had a median age of 55.0 years (IQR: 45.0–69.0) with 78 males (52.35%) and 71 females (47.65%). Supplementary Tables 1, 2 provide the full baseline characteristics of the training/validation sets and test sets. In addition, regarding the dataset distribution, Figure 1A illustrates the overall breakdown of the internal dataset into training, validation, and temporally distinct test sets, demonstrating the proportion of samples allocated to each subset for model development and evaluation. Figures 1BD further detail the disease category distribution within each of these subsets, respectively. These pie charts show the proportion of each disease category and healthy control samples in the training, validation, and temporally distinct test sets.

Figure 1
(A) A pie chart showing the internal dataset breakdown: training set at 67.77%, validation set at 29.05%, and test set_1 at 3.18%.(B) A pie chart of the training set composition, highlighting Hemocytopenia (28.13%), AML-nonAPL (11.80%), and IDA (11.40%) as major components.(C) A pie chart of the validation set, with Hemocytopenia at 28.32%, AML-nonAPL at 12.18%, and IDA at 11.28%.(D) A pie chart of the test set_1, showing Hemocytopenia at 24.85%, AML-nonAPL at 12.28%, and IDA at 13.16%.

Figure 1. (A) Dataset proportions. Disease distribution across datasets of (B) Training set. (C) Validation set. (D) Test set_1.

3.2 Selection of feature parameters

For feature selection, Figure 2A illustrates the relationship between the number of parameters and the mean CV score as determined by the RF-RFECV method. The curve shows a gradual increase in CV scores as more parameters were included, reaching an optimal point at 46 parameters, beyond which the score plateaus. This indicated that 46 parameters were sufficient to capture the essential patterns in the data for optimal model performance. Figures 2B,C show the feature importance scores calculated using the LightGBM algorithm and IG scores, respectively. In total, 23 parameters scored above the average importance threshold with LightGBM, while 24 parameters were selected by IG. Significant parameters like PLT, WBC, and PCT were identified as crucial for the prediction by all methods, although there are slightly different evaluation criteria among them. Detailed information about the selected parameters for each method is shown in Supplementary Table 3.

Figure 2
(A) Line graph showing RF-RFECV feature selection with the number of parameters on the x-axis and mean CV scores on the y-axis, indicating performance improvement with more parameters. (B) Bar chart displaying feature importance based on LightGBM, with features such as age and WBC having higher importance scores. (C) Bar chart showing feature importance based on information gain, with features like PLT and PCT having higher scores.

Figure 2. Feature selection. (A) RF-RFECV. (B) LightGBM. (C) IG.

3.3 Model performance and feature importance analysis of the validation set

We assessed the performance of our model using accuracy and AUC as the primary evaluation metrics. On the validation set, the model with 54 parameters achieved a CV score of 0.796, while EnMod1-46, which was developed using the RF-RFECV method to select 46 parameters, achieved a slightly lower CV score of 0.793. Both models demonstrated identical AUC values of 0.979, indicating comparable predictive capabilities. While the performance of the two models was nearly equivalent, EnMod1-46 utilized fewer features, thereby reducing computational costs and minimizing dependence on parameters. Based on this analysis, we selected the 46-parameter EnMod1-46 as our final model. At the same time, EnMod2-12, which was constructed using a simplified parameter set of 12 features, achieved a CV score of 0.739 and an AUC of 0.962. Although its accuracy was lower than that of EnMod1-46, EnMod2-12 offered a practical advantage by requiring fewer and more easily obtainable parameters. Table 1 presents the CV scores and AUCs of different feature subsets across the models.

Table 1
www.frontiersin.org

Table 1. The CV scores and AUCs of different models and different parameter sets on the validation set.

3.3.1 Model EnMod1-46

The classification performance of the optimal EnMod1-46, constructed using the 46 key feature parameters was evaluated on the validation set, with results presented in Figure 3. Figure 3A displays the precision, recall, and F1-score for each group on the validation set using the EnMod1-46 model. It shows that the model achieved the highest precision and recall for healthy individuals, at 0.961 and 0.940, respectively. The precision and recall of CLL, CML, MM, MPN, hemocytopenia, and IDA were all above 0.800. However, for ALL, APL, and AML-not APL, the precision were slightly lower, at 0.608, 0.650 and 0.703, respectively. The precision of AA, CMML, MDS, and HA were below 0.600, with CMML showing the poorest performance, achieving precision and recall of only 0.333. This might be attributed to the limited sample size of CMML, with only 12 cases available, potentially leading to biased results.

Figure 3
(A) Bar chart showing precisions, recalls, and F1-scores of EnMod1-46 for various hematological diseases on a validation set, with values ranging from 0.333 to 0.950. (B) ROC curves and AUC values for different labels, highlighting strong model performance with AUC values mostly above 0.950. (C) Confusion matrix displaying prediction labels against actual labels, indicating accurate classification for most categories. (D-F) SHAP value bar charts for RF, XGBoost, and LightGBM models, showing feature importance with PLT, HGB, and HCT as significant features.

Figure 3. Performance and feature importance analysis of EnMod1-46. (A) Classification performance. (B) ROC and AUC. (C) Confusion matrix. SHAP analysis for EnMod1-46. (D) RF. (E) XGBoost. (F) LightGBM.

Figure 3B presents the receiver operating characteristic (ROC) curves and AUC values for EnMod1-46 on the validation set. The results showed that the model exhibited excellent performance in diagnosing CLL and healthy group, achieving AUC values of 0.999 for both. In contrast, the AUC value for MDS was the lowest at 0.930. However, excluding MDS, all other groups achieved AUC values exceeding 0.950, demonstrating the model’s robust classification ability across most conditions. Figure 3C presents the confusion matrix for EnMod1-46 on the validation set. As shown in the matrix, the model faced challenges in accurately diagnosing APL, with approximately half of APL cases being misclassified as AML-nonAPL. Additionally, high classification errors were observed among AA, ALL, AML-nonAPL, MDS and Hemocytopenia, suggesting significant overlap in peripheral blood parameters across these conditions. However, the model demonstrated high accuracy in diagnosing CLL and CML, indicating strong differentiation capabilities for these two diseases.

Figures 3DF present SHAP analysis for EnMod1-46. Eight parameters consistently emerged as top predictors across all three models, including PLT, HGB, WBC, MCV, RBC, PCT, MCH and age. Disease-specific patterns revealed PCT and HGB as primary contributors to Hemocytopenia predictions, while WBC showed strong predictive value for CML and MCV for Thalassemia. Age exhibited particular significance in ALL predictions, and MCH had a notable impact on IDA.

3.3.2 Model EnMod2-12

The classification performance of EnMod2-12 was evaluated on the validation set, with results presented in Figure 4. The figure shows that the classification report results of EnMod2-12 were consistently lower than those of EnMod1-46. Specifically, as demonstrated in Figure 4A, the lowest precision was observed for APL at 0.240, while the highest precision was achieved for MPN at 0.929. In terms of recall, CMML showed the poorest performance at 0.250, while CML achieved the highest recall at 0.938. For F1-score, CMML exhibited the lowest value, while Healthy achieved the highest F1-score at 0.912. In addition, unlike EnMod1-46, which demonstrated precision and recall exceeding 0.800 for seven categories (Healthy, CLL, MPN, CML, Hemocytopenia, IDA, and MM), EnMod2-12 showed a significant decline in precision and recall for MM (Figure 4A).

Figure 4
(A) Bar chart showing precisions, recalls, and F1-scores of EnMod2-12 for different hematological diseases with varying performance metrics. (B) ROC curves and AUC values for each label, indicating high diagnostic performance. (C) Confusion matrix for the ensemble model highlighting prediction accuracy across different diseases. (D-G) SHAP value bar charts for RF, XGBoost, LightGBM, and CatBoost models highlighting feature importance, with PLT, WBC, and RBC having significant impacts.

Figure 4. Performance and feature importance analysis of EnMod2-12. (A) Classification performance. (B) ROC and AUC. (C) Confusion matrix. SHAP analysis for EnMod2-12. (D) RF. (E) XGBoost. (F) LightGBM. (G) CatBoost.

Figure 4B illustrates the ROC and AUC values for EnMod2-12 on the validation set. The highest AUC value of 0.998 was achieved for CLL, while the AUC value for the healthy group decreased to 0.995 compared to EnMod1-46. MDS still exhibited the lowest AUC value at 0.912, with Lymphoma showing the second lowest AUC value of 0.931.

The confusion matrices for EnMod2-12 on the validation set are presented in Figure 4C. Similar to EnMod1-46, this model struggled with APL diagnosis, frequently misclassifying it as AML-nonAPL. Significant confusion was observed among AA, ALL, AML-nonAPL, MDS, and Hemocytopenia cases, reflecting similarities in their peripheral blood parameters.

Figures 4DG display corresponding analyses for EnMod2-12. Six of the eight most influential parameters (PLT, WBC, MCV, HGB, RBC, and age) overlapped with those identified in EnMod1-46, maintaining consistent disease-specific associations. For example, PCT retained its strong association with Hemocytopenia predictions across both model architectures, while WBC continued to demonstrate particular importance for CML classification. Based on the data analysis of EnMod1-46 and EnMod2-12, it can be concluded that PLT, WBC, MCV, HGB, RBC and age were the most important characteristic parameters of the prediction models.

3.4 Model performance of the test set

On the test set_1, the accuracy of EnMod1-46 was 0.804 and the AUC was 0.964, both of which were better than the 56-parameter models. EnMod2-12 had an accuracy of 0.784 and an AUC of 0.961 (Table 2). This performance advantage persisted in test set_2, the EnMod1-46 attained 0.738 accuracy and 0.973 AUC versus 0.705 accuracy and 0.962 AUC for EnMod2-12. Model diagnostic visualizations are provided in Supplementary Figures: ROC curves with corresponding AUC values for EnMod1-46 (Supplementary Figure S1A, test set_1; Supplementary Figure S2A, test set_2) and EnMod2-12 (Supplementary Figure S1C, test set_1; Supplementary Figure S2C, test set_2), alongside confusion matrices for EnMod1-46 (Supplementary Figure S1B, test set_1; Supplementary Figure S2B, test set_2) and EnMod2-12 (Supplementary Figure S1D, test set_1; Supplementary Figure S2D, test set_2).

Table 2
www.frontiersin.org

Table 2. The accuracy and AUC for different models and parameter sets on the test set_1.

3.5 Diagnostic agreement analysis

Figure 5 presents a comparison of diagnostic accuracy among different groups using different parameter sets. When evaluating diagnoses based on the 12-parameter subset, the EnMod2-12 model demonstrated a diagnostic accuracy of 0.840. In contrast, the mean diagnostic accuracy among five junior hematologists was 0.530, while that of senior hematologists was 0.550. For EnMod1-46, which utilized an expanded 46-parameter dataset, the model achieved a diagnostic accuracy of 0.800. Senior hematologists demonstrated improved performance with this extended parameter set, achieving an accuracy of 0.750, which was slightly lower than the model’s performance. However, junior hematologists showed only a marginal improvement, with their accuracy increasing by just 0.050.

Figure 5
Bar chart comparing average scores by experience level and parameter count. For 46 parameters, scores are: Junior 0.580, Senior 0.750, All 0.650, EnMod1-46 0.800. For 12 parameters, scores are: Junior 0.530, Senior 0.550, All 0.540, EnMod2-12 0.840. Legend denotes experience levels and models.

Figure 5. The diagnostic accuracy of clinicians and models using two different parameter sets. Junior, junior hematologist; Senior, senior hematologists; All, both junior and senior hematologists.

3.6 Predictive tool

Considering the challenges that less experienced clinicians may face during the diagnostic process, we have developed an online prediction tool using the EnMod2-12. This practical tool is designed to provide auxiliary diagnostic support for clinical settings, especially for institutions with limited resources. The platform is user-friendly, requiring only the input of 12 key CBC parameters. Among these parameter, “Age” must be entered as an integer and “Gender” requires the user to select the appropriate option. The remaining parameters can accept numeric values. Notably, WBC, HGB, PLT, MCV, and RBC must not be 0, otherwise, the system will issue a warning that the inputs are invalid and request the user to enter the correct value. After inputting the valid values, the user can click the “Predict” button, then the system will rapidly generate precise diagnostic predictions by classifying disease as benign or malignant and providing the corresponding probability for the most possible disease. Therefore, facilitating accurate and timely diagnosis in clinical settings. The system is open to the public and users can use the tool by visiting https://xqidch.shinyapps.io/ham_classification/.

Figure 6 displays a detailed overview of the online prediction platform using an IDA case as an example. The patient is female, and her age is 66 years old. After entering corresponding data into the platform, the diagnosis revealed a 90.1% likelihood of the patient having a benign hematological disease, and IDA was identified as the most likely disease, with the highest possibility at 85.2%. The diagnostic prediction aligned with the actual clinical diagnosis for this patient.

Figure 6
Online form titled

Figure 6. Schematic diagram of the online prediction website.

4 Discussion

The heterogeneity of hematological diseases leads to complex correlations between clinical presentation and laboratory parameters. While routine indicators such as CBC and biochemical markers provide valuable diagnostic clues, integrating multi-parameter analyses remains challenging. ML techniques, leveraging their powerful data processing and analysis capabilities, have been widely applied in the diagnosis, prognosis, and prediction of complications in hematologic diseases (2326). Meanwhile, clinical parameter-based risk prediction models have also demonstrated strong predictive performance in clinical practice (11, 13, 27). ML models based on clinical and laboratory parameters can be cost-effective and efficient for screening hematological diseases, especially malignant ones (28). This study established diagnosis models for common hematological diseases using routine clinical laboratory parameters, achieving high accuracy, stability, and efficiency at low cost, thereby facilitating early diagnosis and screening in clinical settings.

This study developed a predictive model for hematological diseases using peripheral blood parameters. Feature selection is pivotal in ML model construction, as it identifies key features, reduces complexity, and improves generalization. This study employed three feature selection algorithms, with the RF-RFECV algorithm yielding 46 parameters that formed an integrated model, EnMod1-46, showing strong diagnostic performance in all validation and test sets (Tables 1, 2). The advantage of RF-RFECV is that it combines the integration capabilities of random forests with cross-verified feature elimination. This approach better captures complex patterns and feature interactions, which are essential for classifying multiple blood disorders in unbalanced datasets. The integrated model, combining RF, XGBoost, and LightGBM, outperformed single-model approaches by leveraging the strengths of multiple algorithms (29). Additionally, to optimize hyperparameters, we introduced Optuna, an automated optimization framework, simplifying the search process and reducing the time and computational cost of manual tuning (30).

EnMod1-46 achieved accuracies of 0.804 and 0.738 on the test set_1 and the test set_2, respectively, surpassing existing RF-based studies reporting accuracies of 0.57–0.59 (11). While some studies have reported accuracies exceeding 0.90 (13, 14), these models predominantly target single diseases (e.g., specific leukemia subtypes). Additionally, the simplified 12-parameter model, EnMod2-12, achieved test set accuracies of 0.784 (test set_1) and 0.705 (test set_2), offering a practical solution for primary healthcare settings. Despite the reduction in parameters, EnMod2-12 maintained strong predictive capabilities and generalizability, highlighting its potential for applications requiring simpler models. In the validation set, the EnMod1-46 demonstrated high accuracies and recalls for CLL, CML, MM, MPN, hemocytopenia, and IDA (all>0.800). However, performance was lower for CMML (precision: 0.333, recall: 0.333), likely due to insufficient sample size. The confusion matrix analysis demonstrated strong performance in identifying Healthy, CLL, CML, and MM. However, it still faced challenges in distinguishing ALL, APL, and AML-nonAPL.

There were certain differences in the key feature parameters identified by different models, but the overall trend remained consistent in EnMod1-46 and EnMod2-12. SHAP analysis highlighted key parameters contributing to disease prediction, including PLT, WBC, HGB, MCV, RBC, and age. These parameters directly reflect the development of megakaryocytes, myeloid cells, lymphocytes, and red blood cells. Notably, PLT was the most important parameter across all models, consistent with its role in acute leukemia (31). In addition, studies have confirmed that WBC and RBC are key features for identifying leukemia (14). The occurrence of leukemia is due to the abnormal proliferation of leukemic cells in the bone marrow, which often leads to an increase in WBC. This abnormal proliferation of leukemic cells can inhibit normal hematopoiesis, thereby affecting RBC level. HGB is a critical indicator for detecting the presence of anemia, while MCV, MCH, and MCHC are the main parameters for differentiating types of anemia (32). Additionally, different types of hematological diseases have distinct age distributions at onset (33, 34). These key parameters are closely related to the characteristics of disease development. Notably, even without specific parameters such as IgM, the simplified model effectively identified MM (precision: 0.643) on the validation set, demonstrating its validity and reliability. Other studies have demonstrated that clinical parameters such as cytokines, lymphocyte subsets, and cell population data have proven to be highly valuable in the classification and differential diagnosis of hematological diseases (35, 36). In future research, we will incorporate a broader range of clinically valuable parameters to further enhance the predictive scope and accuracy of our models.

Comparative analysis of clinical diagnosis showed that the performance of EnMod2-12 was better than that of hematologists with different experience, indicating that even in the case of fewer parameters, the model can accurately identify complex relationship patterns between different parameters and achieve efficient and accurate predictions. Senior hematologists performed better on 46 parameters, achieving results comparable to EnMod1-46 because hematologists were able to interpret relationships between complex parameters. However, junior hematologists did not improve accuracy when adding parameters. Among them, we also saw that the accuracy rate of EnMod2-12 was better than that of EnMod1-46, which was somewhat different from our research results, which might be caused by differences in data distribution. To facilitate clinical application, we developed a user-friendly web prediction platform based on the simplified EnMod2-12, which can be accessed via the following link: https://xqidch.shinyapps.io/ham_classification/.

Despite the high accuracy achieved in this study, there are still some limitations. First, this single-center retrospective study had incomplete data and insufficient sample sizes for rare diseases, which affected diagnostic accuracy. Second, although this study covered about 80% of hematological disease types, which basically meets clinical needs, further expanding the prediction range will be more clinically valuable. Future studies should increase sample sizes for rare diseases to ensure balanced data and conduct multi-center prospective studies to improve generalizability and clinical application.

5 Conclusion

In this study, we developed two high-performance diagnostic models for hematological diseases by employing Optuna for automatic hyperparameter tuning and optimizing the combination of feature algorithms with various ML models. The clinical diagnostic comparison test further validated the significant clinical practical value and diagnostic performance of both models. Additionally, we developed a practical online tool to provide an efficient screening solution and facilitate early diagnosis opportunities for hematological diseases.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

This study was approved by the Medical Ethics Committee of Second Affiliated Hospital of Army Medical University (2024–236-01). The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because our study was a retrospective clinical study.

Author contributions

JL: Writing – review & editing, Methodology, Software, Visualization, Writing – original draft, Formal analysis, Conceptualization. YG: Writing – review & editing, Data curation, Investigation, Methodology, Writing – original draft, Conceptualization. WY: Data curation, Investigation, Methodology, Writing – original draft, Conceptualization, Writing – review & editing. HW: Writing – review & editing, Investigation. JZ: Writing – review & editing, Investigation. SW: Writing – review & editing, Investigation. SiL: Validation, Writing – review & editing, Investigation. TT: Validation, Writing – review & editing, Investigation. YT: Validation, Writing – review & editing, Investigation. CY: Writing – review & editing, Investigation. SC: Validation, Writing – review & editing. PW: Validation, Writing – review & editing. YF: Writing – review & editing, Methodology. CZ: Funding acquisition, Supervision, Writing – review & editing. ShL: Supervision, Conceptualization, Project administration, Writing – review & editing. XP: Supervision, Conceptualization, Project administration, Writing – review & editing. XZ: Project administration, Supervision, Writing – review & editing, Funding acquisition.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the Science and Technology Innovation Key Research and Development Program of Chongqing (CSTB2023TIAD-STX0008) and the Chongqing Major Project for Joint Medical Research by the Science and Health Commission (2024DBXM004).

Acknowledgments

We would like to express our gratitude to the 4 junior hematologists and 4 senior hematologists who participated in the diagnosis accuracy test.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1605868/full#supplementary-material

References

1. Yan, B, Yuan, Q, and Guryanova, OA. Epigenetic mechanisms in hematologic aging and premalignant conditions. Epigenomes. (2023) 7:32. doi: 10.3390/epigenomes7040032

PubMed Abstract | Crossref Full Text | Google Scholar

2. Zini, G. Hematological cytomorphology: where we are. Int J Lab Hematol. (2024) 46:789–94. doi: 10.1111/ijlh.14330

PubMed Abstract | Crossref Full Text | Google Scholar

3. Bray, F, Laversanne, M, Sung, H, Ferlay, J, Siegel, RL, Soerjomataram, I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834

PubMed Abstract | Crossref Full Text | Google Scholar

4. Wang, Y, Chang, YJ, Chen, J, Han, M, Hu, J, Hu, J, et al. Consensus on the monitoring, treatment, and prevention of leukaemia relapse after allogeneic haematopoietic stem cell transplantation in China: 2024 update. Cancer Lett. (2024) 605:217264. doi: 10.1016/j.canlet.2024.217264

PubMed Abstract | Crossref Full Text | Google Scholar

5. Zhang, C, He, J, Liu, L, Wang, J, Wang, S, Liu, L, et al. Novel CD19 chimeric antigen receptor T cells manufactured next-day for acute lymphoblastic leukemia. Blood Cancer J. (2022) 12:96. doi: 10.1038/s41408-022-00688-4

PubMed Abstract | Crossref Full Text | Google Scholar

6. Wang, Y, Chen, H, Chen, J, Han, M, Hu, J, Hu, J, et al. The consensus on the monitoring, treatment, and prevention of leukemia relapse after allogeneic hematopoietic stem cell transplantation in China. Cancer Lett. (2018) 438:63–75. doi: 10.1016/j.canlet.2018.08.030

Crossref Full Text | Google Scholar

7. Muhsen, IN, Shyr, D, Sung, AD, and Hashmi, SK. Machine learning applications in the diagnosis of benign and malignant hematological diseases. Clin Hematol Int. (2020) 3:13–20. doi: 10.2991/chi.k.201130.001

PubMed Abstract | Crossref Full Text | Google Scholar

8. El Alaoui, Y, Padmanabhan, R, Elomri, A, Qaraqe, MK, El Omri, H, and Yasin Taha, R. An Artificial Intelligence-Based Diagnostic System for Acute Lymphoblastic Leukemia Detection. Stud Health Technol Inform. (2023) 305:265–8. doi: 10.3233/SHTI230479

Crossref Full Text | Google Scholar

9. Alcazer, V, Meur, GL, Roccon, M, Barriere, S, and Calvez, BL. Evaluation of a machine-learning model based on laboratory parameters for the prediction of acute leukaemia subtypes: a multicentre model development and validation study in France. Lancet Digital health. (2024) 6:e323–33. doi: 10.1016/S2589-7500(24)00044-X

PubMed Abstract | Crossref Full Text | Google Scholar

10. Erler, BS, Vitagliano, P, and Lee, S. Superiority of neural networks over discriminant functions for thalassemia minor screening of red blood cell microcytosis. Arch Pathol Lab Med. (1995) 119:350–4.

Google Scholar

11. Guncar, G, Kukar, M, Notar, M, Brvar, M, Cernelc, P, Notar, M, et al. An application of machine learning to haematological diagnosis. Sci Rep. (2018) 8:411. doi: 10.1038/s41598-017-18564-8

PubMed Abstract | Crossref Full Text | Google Scholar

12. Park, DJ, Park, MW, Lee, H, Kim, Y-J, Kim, Y, and Park, YH. Development of machine learning model for diagnostic disease prediction based on laboratory tests. Sci Rep. (2021) 11:7567. doi: 10.1038/s41598-021-87171-5

PubMed Abstract | Crossref Full Text | Google Scholar

13. Haider, RZ, Ujjan, IU, Khan, NA, Urrechaga, E, and Shamsi, TS. Beyond the in-practice CBC: the research CBC parameters-driven machine learning predictive modeling for early differentiation among Leukemias. Diagnostics (Basel). (2022) 12:138. doi: 10.3390/diagnostics12010138

PubMed Abstract | Crossref Full Text | Google Scholar

14. Yu, C, Peng, Y-y, Liu, L, Wang, X, and Xiao, Q. Leukemia can be effectively early predicted in routine physical examination with the assistance of machine learning models. J Healthc Eng. (2022) 2022:8641194. doi: 10.1155/2022/8641194

PubMed Abstract | Crossref Full Text | Google Scholar

15. Escanilla, NS, Hellerstein, L, Kleiman, R, Kuang, Z, Shull, JD, and Page, D. Recursive feature elimination by sensitivity testing. Proceed Int Conference Machine Learn Applications. (2018) 2018:40–7. doi: 10.1109/ICMLA.2018.00014

PubMed Abstract | Crossref Full Text | Google Scholar

16. Ravishankar, H, Madhavan, R, Mullick, R, Shetty, T, Marinelli, L, and Joel, SE. Recursive feature elimination for biomarker discovery in resting-state functional connectivity. Annual Int Conference IEEE Eng Med Biol Society IEEE Eng Med Biol Society Annual Int Conference. (2016) 2016:4071–4. doi: 10.1109/EMBC.2016.7591621

Crossref Full Text | Google Scholar

17. Ke, G, Meng, Q, Finley, T, Wang, T, Chen, W, Ma, W, et al. (2017). “LightGBM: a highly efficient gradient boosting decision tree.” Paper presented at the NIPS'17.

Google Scholar

18. Wang, D, Zhang, Y, and Zhao, Y. (2017). “LightGBM: an effective miRNA classification method in breast Cancer patients.” Paper presented at the Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics.

Google Scholar

19. Martín-Valdivia, MT, Díaz-Galiano, MC, Montejo-Raez, A, and Ureña-López, LA. Using information gain to improve multi-modal information retrieval systems. Inf Process Manag. (2008) 44:1146–58. doi: 10.1016/j.ipm.2007.09.014

Crossref Full Text | Google Scholar

20. Chawla, NBK, Hall, L, and Kegelmeyer, W. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. (2022) 16:321–57. doi: 10.1613/jair.953

Crossref Full Text | Google Scholar

21. Akiba, T, Sano, S, Yanase, T, Ohta, T, and Koyama, M. (2019). “Optuna: a next-generation Hyperparameter optimization framework.” Paper presented at the KDD '19.

Google Scholar

22. Lundberg, SM, and Lee, S-I. (2017). “A unified approach to interpreting model predictions.” Paper presented at the proceedings of the 31st international conference on neural information processing systems, Long Beach, California, USA.

Google Scholar

23. Cheng, ZJ, Li, H, Liu, M, Fu, X, Liu, L, Liang, Z, et al. Artificial intelligence reveals the predictions of hematological indexes in children with acute leukemia. BMC Cancer. (2024) 24:993. doi: 10.1186/s12885-024-12646-3

PubMed Abstract | Crossref Full Text | Google Scholar

24. Duchmann, M, Wagner-Ballon, O, Boyer, T, Cheok, M, Fournier, E, Guerin, E, et al. Machine learning identifies the independent role of dysplasia in the prediction of response to chemotherapy in AML. Leukemia. (2022) 36:656–63. doi: 10.1038/s41375-021-01435-7

PubMed Abstract | Crossref Full Text | Google Scholar

25. Brown, AL, Harris, R, Taylor, OA, Zobeck, M, Bernhardt, MB, Gramatges, MMM, et al. Evaluation of methotrexate pharmacogenomic variants to predict acute neurotoxicity in children with acute lymphoblastic leukemia. Blood. (2024) 144:4190–14. doi: 10.1182/blood-2024-200891

Crossref Full Text | Google Scholar

26. Karami, K, Akbari, M, Moradi, MT, Soleymani, B, and Fallahi, H. Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. PLoS One. (2021) 16:e0254976. doi: 10.1371/journal.pone.0254976

PubMed Abstract | Crossref Full Text | Google Scholar

27. Lee, H, Yoo, G, Pak, D, and Lee, JH. Evaluation of D-dimer and prothrombin time in alcohol related liver cirrhosis with comparison of machine learning analyses. Int J Med Inform. (2024) 186:105407. doi: 10.1016/j.ijmedinf.2024.105407

PubMed Abstract | Crossref Full Text | Google Scholar

28. Sun, D, Chen, W, He, J, He, Y, Jiang, H, Jiang, H, et al. A novel method for screening malignant hematological diseases by constructing an optimal machine learning model based on blood cell parameters. BMC Med Inform Decis Mak. (2025) 25:72. doi: 10.1186/s12911-025-02892-1

PubMed Abstract | Crossref Full Text | Google Scholar

29. Opitz, D, and Maclin, R. Popular ensemble methods: an empirical study. J Artif Intell Res. (1999) 11:169–98.

Google Scholar

30. Hassanali, M, Soltanaghaei, M, Javdani Gandomani, T, and Zamani Boroujeni, F. Software development effort estimation using boosting algorithms and automatic tuning of hyperparameters with Optuna. J Softw Evol Process. (2024) 36:e2665. doi: 10.1002/smr.2665

Crossref Full Text | Google Scholar

31. Zhang, L, Liu, J, Qin, X, and Liu, W. Platelet–acute leukemia interactions. Clin Chim Acta. (2022) 536:29–38. doi: 10.1016/j.cca.2022.09.015

PubMed Abstract | Crossref Full Text | Google Scholar

32. Darshan, BSD, Sampathila, N, Bairy, GM, Prabhu, S, Belurkar, S, Chadaga, K, et al. Differential diagnosis of iron deficiency anemia from aplastic anemia using machine learning and explainable artificial intelligence utilizing blood attributes. Sci Rep. (2025) 15:505. doi: 10.1038/s41598-024-84120-w

PubMed Abstract | Crossref Full Text | Google Scholar

33. Romero-Morelos, P, González-Yebra, AL, Bueno-Rosario, LJ, and González-Yebra, B. Leukemia types and subtypes analysis: epidemiological age-standardized exploration in the Mexican Bajio region. Medicina (Kaunas). (2024) 60:731. doi: 10.3390/medicina60050731

PubMed Abstract | Crossref Full Text | Google Scholar

34. Yang, W, Liu, J, Gou, Y, Huang, X, Chen, M, Huang, D, et al. Development and validation of an interpretable machine learning model for predicting Philadelphia chromosome-positive acute lymphoblastic leukaemia using clinical and laboratory parameters: a single-Centre retrospective study. BMJ Open. (2025) 15:e097526. doi: 10.1136/bmjopen-2024-097526

PubMed Abstract | Crossref Full Text | Google Scholar

35. Qi, Y, Liu, X, Ding, Z, Yu, Y, and Zhuang, Z. A potential predictive model based on machine learning and CPD parameters in elderly patients with aplastic anemia and myelodysplastic neoplasms. BMC Med Inform Decis Mak. (2024) 24:379. doi: 10.1186/s12911-024-02781-z

PubMed Abstract | Crossref Full Text | Google Scholar

36. Wan, X, Tao, T, Zhang, J, Li, N, Gou, Y, Yang, W, et al. Effect of age, sex and season on acute myeloid leukemia clinical characteristics: a retrospective study. J Inflamm Res. (2025) 18:2363–75. doi: 10.2147/jir.s495615

PubMed Abstract | Crossref Full Text | Google Scholar

Glossary

AA - Aplastic Anemia

ALL - Acute Lymphoblastic Leukemia

APL - Acute Promyelocytic Leukemia

AML-nonAPL - Acute Myeloid Leukemia-nonAPL

CLL - Chronic Lymphocytic Leukemia

CML - Chronic Myeloid Leukemia

CMML - Chronic Myelomonocytic Leukemia

MDS - Myelodysplastic Syndrome

MM - Multiple Myeloma

MPN - Myeloproliferative Neoplasm

Hemocytopenia - Immune-related or unclear hemocytopenia

MgA - Megaloblastic Anemia

HA - Hemolytic Anemia

IDA - Iron Deficiency Anemia

WBC - White blood cell count

HCT - Hematocrit

PT - Prothrombin time

HGB - Hemoglobin

MCH - Mean corpuscular hemoglobin

APTT - Activated partial thromboplastin time

PLT - Platelet count

MCHC - Mean corpuscular hemoglobin concentration

Fg - Fibrinogen

MCV - Mean corpuscular volume

MPV - Mean platelet volume

TT - Thrombin time

LYM% - Lymphocyte ratio

PDW - Platelet distribution width

IgA - Immunoglobulin A

LYM# - Lymphocyte count

PCT - Thrombocytocrit

IgE - Immunoglobulin E

BASO% - Basophil ratio

ALB - Albumin

IgG - Immunoglobulin G

BASO# - Basophil count

GLB - Globulin

IgM - Immunoglobulin M

EO% - Eosinophil ratio

ALB/GLB - Albumin/globulin ratio

Igκ - Kappa light chain

EO# - Eosinophil count

ALP - Alkaline phosphatase

Igλ - Lambda light chain

MXD% - Monocyte ratio

ALT - Alanine aminotransferase

EPO - Erythropoietin

MXD# - Monocyte count

AST - Aspartate aminotransferase

FOL - Folic acid

NEUT% - Neutrophil ratio

CREA - Creatinine

VB12 - Vitamin B12

NEUT# - Neutrophil count

DBIL - Direct bilirubin

SF - Ferritin

RET% - Reticulocyte ratio

TBIL - Total bilirubin

FE - Serum iron

RBC - Red blood cell count

GGT - Gamma-glutamyl transpeptidase

IAT - Indirect antiglobulin

RDW-CV - Coefficient of variation of red blood cell width

LDH - Lactate dehydrogenase

DAT - Direct antiglobulin

RDW-SD - Standard deviation of red blood cell width.

Keywords: hematological diseases, machine learning, prediction model, SHAP, laboratory parameters, diagnostic platform

Citation: Liu J, Gou Y, Yang W, Wang H, Zhang J, Wu S, Liu S, Tao T, Tang Y, Yang C, Chen S, Wang P, Feng Y, Zhang C, Liu S, Peng X and Zhang X (2025) Development and application of machine learning models for hematological disease diagnosis using routine laboratory parameters: a user-friendly diagnostic platform. Front. Med. 12:1605868. doi: 10.3389/fmed.2025.1605868

Received: 04 April 2025; Accepted: 07 August 2025;
Published: 01 October 2025.

Edited by:

Eleni Gavriilaki, Aristotle University of Thessaloniki, Greece

Reviewed by:

Pier Paolo Piccaluga, University of Bologna, Italy
Chang Won Jeong, Wonkwang University, Republic of Korea

Copyright © 2025 Liu, Gou, Yang, Wang, Zhang, Wu, Liu, Tao, Tang, Yang, Chen, Wang, Feng, Zhang, Liu, Peng and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shuiqing Liu, bGl1c2h1aXFpbmcwNjJAdG1tdS5lZHUuY24=; Xiangui Peng, cHhncHhnMTk2NEB0bW11LmVkdS5jbg==; Xi Zhang, emhhbmd4eGlAc2luYS5jb20=

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.