Optimization of diagnosis-related groups for patients with acute appendicitis using a machine learning model

Gu, Xinlong; Li, Niannian; Wang, Heng

doi:10.3389/fpubh.2025.1581441

ORIGINAL RESEARCH article

Front. Public Health, 02 September 2025

Sec. Health Economics

Volume 13 - 2025 | https://doi.org/10.3389/fpubh.2025.1581441

This article is part of the Research TopicPublic Health Outcomes: The Role of Social Security Systems in Improving Residents' Health WelfareView all 104 articles

Optimization of diagnosis-related groups for patients with acute appendicitis using a machine learning model

Xinlong Gu¹

Niannian Li²

Heng Wang^3,4^*

¹Teaching Management Department, The Second Affiliated Hospital of Anhui Medical University, Hefei, China
²Department of Research Administration Office, The First Affiliated Hospital of Anhui Medical University, Hefei, China
³Department of Dean’s Office, The First Affiliated Hospital of Anhui Medical University, Hefei, China
⁴Department of Health Services Management, School of Health Services Management, Anhui Medical University, Hefei, China

Background: The diagnosis-related groups prospective payment system (DRG-PPS) is widely implemented worldwide. Its core components include disease classification and pricing mechanisms. Developing a disease grouping and pricing approach that aligns with local conditions is essential. This study examines the factors influencing hospitalization costs for acute appendicitis (AA) patients and proposes strategies for disease grouping and pricing.

Methods: Stratified random sampling was used to select research sites from provincial, municipal, and county hospitals in Hefei, China. Data were obtained from the hospitalization information systems of three hospitals from 2017 to 2019. The primary diagnosis was defined as AA. Single-factor analysis and multiple linear stepwise regression were used to identify the main factors influencing hospitalization costs. Additionally, a classification and regression tree (CART) model, based on the exhaustive chi-square automatic interaction detection (E-CHAID) algorithm, was applied to establish the DRG grouping model.

Results: A total of 4,066 patients were included. Significant differences in hospitalization costs were observed based on length of stay (LOS), marital status, surgery, and hospital level (p < 0.05). By incorporating age, type of surgery, and LOS into the CART model, AA inpatients were classified into 10 DRG groups. The standardized disease cost ranged from 3,047 CNY to 15,569 CNY.

Conclusion: Hospitalization costs for AA patients are primarily influenced by LOS, marital status, surgery, and hospital level. The decision tree model provides a basis for DRG grouping. Health administration departments may consider implementing precise and individualized hospitalization cost reimbursement mechanisms accordingly.

1 Introduction

Appendicitis is an inflammation of the vermiform appendix (1) and is among the most common surgical emergencies in both children and adults (2). Globally, the annual incidence of appendicitis ranges from 96.5 to 100 cases per 100,000 individuals (1). Acute appendicitis (AA), a prevalent form of appendicitis, is most frequently diagnosed between the ages of 10 and 30, with the lowest incidence occurring in children aged nine years or younger (3). The typical symptoms of appendicitis include vague periumbilical pain, anorexia, intermittent vomiting, nausea, pain radiating to the lower right abdomen, and low-grade fever (4). When symptoms persist for more than 24 h, the risk of localized ischemia, perforation, gangrene, and abscess formation increases. As a result, AA complications pose a significant threat to public health.

The clinical diagnosis of AA is based on patient history, physical examination, laboratory findings, and imaging studies (5). Prompt treatment is essential to prevent severe complications. Open and laparoscopic appendectomy are the primary treatment methods, and antibiotic therapy has been shown to be a viable and effective option for acute uncomplicated appendicitis (6). The Society of American Gastrointestinal and Endoscopic Surgeons (SAGES) recommends laparoscopic appendectomy as the first-line treatment for adult patients with acute uncomplicated appendicitis (7, 8). However, with increasing age, both recovery time and length of stay (LOS) following AA surgery tend to be longer. Additionally, the mortality rate for AA rises significantly in individuals aged 65 and older (3). The high incidence and mortality rates, coupled with the substantial treatment costs, have created a severe clinical and economic burden globally (8, 9). Globally, the economic burden of AA is substantial. For instance, the average cost of hospitalization for appendicitis in the United States is approximately $13,000 per case, with the costs rising significantly for complicated cases such as perforated appendicitis (9). In China, the average hospitalization cost for acute appendicitis varies by region but typically ranges between 5,000 to 15,000 RMB per patient, depending on the complexity of the case and the type of surgical intervention required (10). These costs reflect not only the price of surgery and hospital stay but also the long-term healthcare expenses related to complications, extended recovery times, and follow-up care. The growing financial burden associated with AA, alongside its high incidence and mortality rates, highlights the need for more efficient cost-control measures, including the implementation of DRG systems. Reducing hospitalization and treatment costs through optimized healthcare management can mitigate the economic strain on both patients and healthcare systems. To reduce the financial strain on patients, government interventions aimed at controlling medical expenses are necessary.

Diagnosis-related groups (DRG) are widely recognized as one of the most advanced medical payment management systems. Numerous studies have demonstrated its effectiveness in controlling medical costs and reducing the financial burden on patients (10). Initially developed at Yale University, DRG was first implemented in the United States in 1983 (11). It is a payment model that classifies diseases with similar clinical symptoms and resource consumption into specific groups. These classifications serve as the foundation for medical institutions to generate patient bills and for insurance agencies to establish reimbursement standards (12). However, DRG grouping rules vary across different countries and regions.

The DRG system was introduced in China in 1994, with its feasibility first studied by Huang (13). Due to economic disparities between regions, the Chinese government allows each locality to develop grouping rules that reflect its specific conditions. Over time, various DRG models such as BJ-DRG, C-DRG, and CN-DRG have been established in China (14). Significant differences exist in the design of DRG grouping rules across different regions. To standardize grouping criteria in pilot cities, the National Healthcare Security Administration (NHSA) launched the China Health and Safety Diagnosis-Related Group (CHS-DRG) in 2019. As a result, well-structured DRG grouping is essential for an effective cost-payment system, helping to regulate medical expenses and monitor unreasonable charges.

Recently, machine learning models have gained significant attention for their potential to enhance the accuracy and efficiency of DRG grouping systems. Several studies have utilized machine learning techniques to refine disease classification and predict healthcare costs, particularly for specific conditions. For instance, algorithms have been used to forecast hospitalization costs, length of stay, and patient outcomes in diseases, such as heart failure, diabetes, and cancer (15, 16). These studies have demonstrated promising results in optimizing DRG systems by improving cost prediction accuracy and resource allocation. However, the application of machine learning to DRG models for acute appendicitis remains largely unexplored, with limited research examining its potential benefits. A review of existing machine learning-based DRG models for diseases with similar clinical characteristics could help contextualize this study, providing valuable insights into how such approaches could contribute to better cost control and more effective resource management in the DRG framework. This study analyzed hospitalization data of AA patients in Hefei, China, one of the pilot cities for NHSA grouping. First, univariate analysis was conducted to identify factors influencing hospitalization costs for AA patients. Next, multiple stepwise linear regression analysis was used to select predictive factors for the machine learning model. Finally, based on these predictive factors, a decision tree model was developed to estimate hospitalization costs for AA patients, and a grouping scheme was proposed to align with the local healthcare landscape. This study aims to provide theoretical support for assessing the applicability of the DRG prospective payment system.

2 Materials and methods

2.1 Study design and data collection

This cross-sectional study employed stratified random sampling to ensure a representative sample. Hospitals in China are categorized into provincial, municipal, and county-level institutions. One hospital from each level was randomly selected. Medical records and cost data for inpatients diagnosed with acute appendicitis (AA) as the primary diagnosis (ICD-10 code K35) from 2017 to 2019 were extracted from the hospital information system. Patient information, including age, gender, type of surgery, insurance type, LOS, and hospitalization costs, was collected. Cases with incomplete data were excluded from the analysis; however, the proportion of missing data for each variable was not specified. The missing data rate was calculated for each variable, and any variable with more than 10% missing values was excluded from the final analysis to prevent bias. For variables with less than 10% missing data, imputation was performed using the multiple imputation method (MICE) to preserve sample size and ensure robust estimates. The imputation process was conducted under the assumption that data were missing at random (MAR). Sensitivity analysis was conducted to assess the impact of imputation on the results, and no significant differences were found between the complete case and imputed datasets. This approach minimized the potential for bias due to missing data and enhanced the validity of the study findings. After excluding cases with incomplete data, a total of 4,066 cases were included.

2.2 Statistical analysis

First, the Chi-square test was used to analyze differences between groups. The demographic characteristics of inpatients were summarized as rates and percentages according to hospital levels.

Second, the t-test and analysis of variance (ANOVA) were conducted to examine factors influencing hospitalization costs in AA patients. A multiple linear regression model was then established, incorporating statistically significant variables from the univariate analysis as independent factors. Multicollinearity was assessed using the variance inflation factor (VIF) and tolerance. A VIF > 10 or tolerance <0.1 was considered indicative of multicollinearity.

Third, to further explore the interactive relationships between hospitalization costs and demographic or health-related variables, a classification and regression tree (CART) model was employed. This machine learning model is effective in identifying complex interactions among factors that traditional analytical methods may overlook (15). All variables found to be statistically significant in the univariate regression model were included in the CART model. To optimize the classification tree, the exhaustive chi-square automatic interactive detection (E-CHAID) algorithm was used as the growing method. The validity of the grouping was assessed by evaluating heterogeneity between groups and homogeneity within groups based on data distribution. The CART model was selected for this study because of its ability to capture complex interactions between multiple variables and its clear interpretability through decision rules. In contrast to traditional linear models, which assume linear relationships between predictors and outcomes, CART does not require such assumptions and is well-suited for modeling non-linear relationships. This flexibility is particularly useful for healthcare data, where the relationships between factors, such as demographic characteristics, surgical interventions, and hospitalization costs are often non-linear and intricate. While alternative machine learning models, such as Random Forests and Gradient Boosting, were considered, they were ultimately not chosen for this study. These ensemble models typically present improved predictive accuracy, while their “black-box” nature limits their interpretability, which was a key consideration for this study. Since the goal was to generate transparent, interpretable insights into the factors influencing hospitalization costs, the CART model was preferred for its ability to produce easily understandable decision trees. Logistic regression was also evaluated, while was regarded inappropriate for this analysis, as it assumes a binary outcome, whereas hospitalization costs are continuous. Although the CART model effectively captures the relationships in the dataset, future research should compare its performance with other models, such as Random Forests or Gradient Boosting, to assess potential gains in predictive accuracy.

Categorical variables with two levels (e.g., marital status) were entered into the regression model using binary coding. For variables with more than two categories (e.g., hospital level, insurance type), dummy variables were created, with one category designated as the reference group. This approach allowed for the comparison of each category’s effect relative to the reference category while ensuring appropriate model specification. All collected data were entered into Excel 2010 (Microsoft Corporation, Redmond, WA, United States), and statistical analyzes were performed using SPSS 26.0 (SPSS Inc., Chicago, IL, United States). A p-value <0.05 was considered statistically significant.

2.3 Variables

The dependent variable is the hospitalization costs (Y), and the independent variables are gender (X1, male, female), age (X2, years, <11, 11–20, 21–30, 31–40, 41–50, 51–60, 61–70, >71), marital status (X3, married or cohabited, single), insurance type (X4, urban resident basic medical insurance (URBMI), urban employee basic medical insurance (UEBMI), new rural cooperative medical insurance (NRCMI), other), LOS (X5, days, 1–3, 4–6, 7–9, 10–12, >12), presence of complications (X6, yes, no), whether surgery (X7, yes, no), type of surgery (X8, laparoscopic surgery, laparotomy), and hospital level (X9, provincial hospitals, municipal hospitals, and county hospitals). It should be clarified that the term “hospitalization costs” in this study refers to the total charges billed to patients or their insurers during the hospitalization episode, as recorded in the hospital information system. These figures may not necessarily reflect the exact economic cost incurred by the provider but are used here as proxies for resource consumption in DRG classification.

3 Results

3.1 Results of descriptive analysis

A total of 4,066 patients were included in the study, with 1,197 patients (29.4%) from provincial hospitals, 1,200 patients (29.5%) from municipal hospitals, and 1,669 patients (41.1%) from county hospitals. The sample consisted of 2,103 male patients (51.7%) and 1,963 female patients (48.3%). Patient ages ranged from 2 to 95 years, with a mean age of 39.51 years. The majority of patients were married or cohabiting (72.8%), selected basic medical insurance (80.3%) (including UEBMI, URBMI, and NRCMI), and had a LOS of 4–9 days (75.6%). Most patients had simple appendicitis without complications or comorbidities (CC) (75.3%), while 1,006 patients presented with CC (including perforation, peritonitis, peripheral abscess, perforation with localized peritonitis, and perforation with diffuse peritonitis). Surgical intervention was required for 73.6% of AA patients, with 78.7% of these cases undergoing laparoscopic appendectomy.

3.2 Results of single factor analysis

The t-test and analysis of variance (ANOVA) were conducted to perform univariate analysis of hospitalization costs in AA patients. Hospitalization costs increased with age, reaching the highest levels in patients aged >71 years (11,557.56 ± 7,582.89 CNY). Higher costs were also observed among married or cohabiting patients (10,408.92 ± 5,740.97 CNY), those with an LOS > 12 days (18,579.15 ± 11,437.15 CNY), and those who underwent surgery (11,888.28 ± 5,223.94 CNY). Patients using the NRCMI payment method (8,600.93 ± 4,859.97 CNY) and those treated in county-level hospitals (7,583.15 ± 4,031.00 CNY) had lower hospitalization costs. However, no statistically significant correlation was found between gender, CC, type of surgery, and hospitalization costs, as shown in Table 1.

Table 1

Table 1. Single factor analysis of hospitalization costs in acute appendicitis.

3.3 Results of multivariate linear regression analysis

A collinearity diagnostic analysis was conducted before performing multiple linear regression. No collinearity was detected among the variables in this study (Supplementary Table S1). All variables found to be statistically significant in the univariate regression model were included as independent variables in the multiple linear regression model. These variables included age (X2), marital status (X3), insurance type (X4), LOS, (X5), surgery (X7), and hospital level (X9). The results of the multivariate linear regression analysis are shown in Table 2. The multiple linear regression equation is formulated as:

\begin{array}{l} Y = 13447.64 + 2642.58 X 5 - 4298.61 X 8 - 1840.27 X9 \\ + 389.81 X 2 - 476.46 X3 \end{array}

Table 2

Table 2. Multivariate linear regression analysis of hospitalization costs in acute appendicitis.

The model demonstrated statistical significance, with a corrected R² = 0.460, F = 511.778, and p < 0.001. At a significance level of α = 0.05, the multiple linear regression equation was confirmed to be statistically significant.

3.4 Results of classification and regression tree model

A CART model was developed and pruned using the E-CHAID algorithm. Inpatient hospitalization costs were set as the dependent variable, while LOS, type of surgery, hospital level, and age were used as classification nodes. The model parameters were configured as follows: the maximum number of tree layers was set to 3, the minimum sample size for the parent node was 400, the minimum sample size for the child node was 250, and the significance level for node splitting was α = 0.05. The number of DRG groups was determined based on the E-CHAID algorithm’s optimization, which identified 10 distinct groups with significant differences in hospitalization costs. The Kruskal-Wallis rank sum test results indicated that the ten case combinations identified had significantly different hospitalization costs. The CART model results are presented in Figure 1.

Figure 1

Decision tree diagram showing hospitalization costs categorized by type of surgery: laparoscopic, laparotomy, and nonsurgical procedures. Each branch is further divided by age or length of stay, with nodes detailing mean costs, standard deviation, number, and percentage of patients. Key nodes display specific values, such as Node 1 with a mean cost of 12,866.18, and Node 3 with 5,223.94, illustrating statistical differences across categories.

Figure 1. Classification and regression tree model of hospitalization costs in patients with acute appendicitis.

Although alternative group numbers (e.g., 5 or 15) were considered, the 10-group configuration emerged as the most statistically significant, as it best captured the variability in hospitalization costs. Table 3 displays the optimized DRG grouping scheme for AA patients. The DRG 1 group had the highest number of cases, accounting for 15.7% of patients, followed by the DRG 3 group, which comprised 14.3%. The model’s validity was assessed based on heterogeneity between groups and homogeneity within groups, with the coefficient of variation (CV) among the ten DRG groups being less than 0.8, indicating a high degree of cost homogeneity in each group. Future studies will further refine the grouping approach by testing alternative configurations (such as 5 or 15 groups) and using model selection criteria, such as AIC, BIC, or cross-validation to compare and justify the optimal number of groups.

Table 3

Table 3. Results of DRG grouping in patients with acute appendicitis.

4 Discussion

In this study, the majority of AA patients were between 11 and 50 years old, accounting for 78.06% of the cohort. The age range was broad, with the youngest patient being 2 years old and the oldest 95 years old. AA is common across various age groups, a finding consistent with previous research (16). Additionally, county hospitals treated the highest number of cases (1,669 patients, 41.05% of the sample). This study indicated differences in LOS, CC, surgery, and type of surgery among AA patients across different hospital levels. The LOS for most patients ranged from 1 to 9 days, with the majority having simple AA. Surgical intervention is the primary treatment, which aligns with the known characteristics of AA (17, 18). As a common acute condition, AA follows a well-defined treatment pathway, and postoperative recovery is generally favorable following appendectomy (19). This study revealed significant differences among AA patients based on age, marital status, insurance type, LOS, surgical intervention, and hospital level. A negative correlation was identified between hospital level, insurance type, marital status, type of surgery, and hospitalization costs. Patients with the lowest hospitalization costs were those who were single, covered by NRCMI, treated in county hospitals, and underwent open appendectomy. Conversely, age, LOS, CC, surgery, and hospitalization costs showed a positive correlation. Hospitalization costs increased with age, longer LOS, presence of CC, and surgical intervention. A strong association was identified between surgery and the type of surgery in relation to hospitalization costs. However, it is noteworthy to clarify that surgery and the type of surgery are inherently linked variables, and this should not be interpreted as multicollinearity. Rather, this association highlights that different surgical approaches (e.g., laparoscopic vs. open appendectomy) have distinct effects on resource utilization, which may in turn influence hospitalization costs. The correlation can be attributed to both the characteristics of the surgical procedures and the underlying structure of medical payment systems. Extended hospitalization, more complex surgeries, and the presence of comorbidities all contribute to higher resource consumption, ultimately leading to increased costs.

Age was identified as the second-level classification node. As individuals age, physiological functions decline, the prevalence of underlying diseases increases, complications become more frequent, and disease prognosis tends to be slower. Consequently, older patients require more medical resources. Among those aged 60 and above, most have comorbidities that directly contribute to increased resource consumption. In this study, patients over 60 years old had the highest hospitalization costs, a finding consistent with research conducted in Germany and the United States (20, 21). Although age was one of the key splitting variables in the CART model, it was not the sole or primary determinant of DRG grouping; rather, it interacted with other clinically relevant and resource-related factors, such as type of surgery, LOS, and hospital level, supporting a multidimensional rather than age-dominant classification approach for determining provider payments.

The third-level classification node was divided into two factors: hospital level and LOS. Patients receiving treatment at county-level hospitals had the lowest hospitalization costs. Under China’s tiered healthcare system, county hospitals have a higher medical insurance reimbursement ratio than provincial or municipal hospitals. Additionally, patients in county hospitals typically present with less severe conditions and are more likely to receive non-surgical treatments, such as medication or open appendectomy. Among all factors, LOS had the most significant impact on hospitalization costs, aligning with findings from related studies in China (22–24). In the DRG grouping guidelines of developed countries, including the United States, United Kingdom, and Poland, LOS is considered a key factor (25). As widely reported, hospitalization costs increase with longer LOS. This correlation exists because LOS reflects not only medical resource consumption but also disease severity and healthcare efficiency. Therefore, measures should be implemented to reduce LOS, improve bed turnover rates, and lower hospitalization costs. It is important to note that the hospitalization costs analyzed in this study refer to the actual charges recorded in the hospital information system, which may not fully represent the underlying resource-based costs of care. These charges reflect the billed amounts and are subject to regulatory policies, hospital pricing strategies, and insurance reimbursement frameworks. In some cases, charges may be higher or lower than the true economic cost due to subsidies, profit margins, or government-imposed price ceilings. Nonetheless, the use of these data remains valid for DRG optimization, as DRG systems are primarily designed to classify and reimburse cases based on relative resource consumption across homogeneous patient groups, rather than to capture precise economic costs on an individual basis.

The evaluation indicators from the CART model confirm that the DRG grouping scheme for AA in this study is well-structured. Our findings demonstrate significant cost homogeneity within each DRG group and notable cost heterogeneity between groups, aligning with the fundamental principles of DRG classification. While this study proposed a new 10-group DRG classification for AA patients, a comparative analysis with existing DRG systems, such as the CHS-DRG, is warranted. In particular, systems, such as BJ-DRG, CN-DRG, and CHS-DRG typically group AA cases into broader categories based primarily on surgical status and the presence of complications. These existing systems, however, may lack the granularity needed to capture regional variations in hospital practices and resource consumption, which can lead to reduced specificity in cost prediction and reimbursement allocation. In contrast, the proposed 10-group classification provides a more detailed stratification, allowing for more accurate cost predictions by incorporating not only the type of surgery and complications, but also age, LOS, and hospital level. This finer classification may reduce cost heterogeneity within groups, with a CV of less than 0.8, suggesting high-cost homogeneity. The greater stratification aligns with international DRG principles that emphasize balancing complexity and efficiency in grouping to avoid unnecessary fragmentation. Moreover, the CHS-DRG system may not fully reflect the local healthcare context, particularly the differences in hospital resources and practice patterns across varying hospital levels. This study’s approach, by explicitly considering these local factors, may possess advantages in terms of both policy relevance and practical implementation for medical insurance departments in China. The enhanced granularity of our model can support more precise reimbursement schemes and better reflect resource consumption, making it a potential improvement over current DRG systems. While the CHS-DRG system and other national groupers have a broader classification scheme, the optimized model presents a refined alternative that can promote future updates or the development of a more regionally adaptable system. It is noteworthy that in several DRG systems, such as the Australian Refined Diagnosis Related Groups (AR-DRG), the Patient Clinical Complexity Level (PCCL) is used to quantify the overall clinical complexity of a case by integrating the effects of multiple comorbidities and complications (11). The PCCL provides a more granular and standardized measure of resource intensity. However, PCCL values were not available in the dataset used for this study, as the hospital information systems did not record the necessary variables to compute this index. Therefore, we relied on the binary coding of complications and comorbidities (yes/no) as a simplified proxy for patient complexity. While this approach has limitations in capturing nuanced clinical severity, it reflects current data practices in many Chinese hospital systems and provides a pragmatic basis for DRG grouping in this context. Future research should incorporate PCCL or equivalent metrics once the required data infrastructure becomes available. Existing DRG groupers in China (e.g., BJ-DRG, CN-DRG, CHS-DRG) often group AA cases into broad categories based primarily on surgical status and presence of complications (26, 27). However, these systems may lack regional adaptability and granularity needed to reflect local practice patterns and resource variation. Our optimized DRG model introduces a more remarkable stratification, providing improved cost homogeneity and reflecting local hospital-level differences. This evidence-based classification may promote the refinement of current groupers or guide future updates to the national system.

This study presents several advantages. Firstly, the data from Hefei, China, ensure representativeness and minimize regional cost variations. Secondly, machine learning models enable a comprehensive analysis of multiple variables and produce interpretable decision tree diagrams for decision-making. Thirdly, using historical data to estimate standard costs highlights differences in resource consumption, providing valuable insights for future DRG-based payment reforms. While the study did not include a formal clinical severity index, selected predictors, such as age, length of stay, and surgery type serve as practical proxies for disease complexity. The study expands on DRG classifications for AA by creating 10 groups, providing a finer classification for more accurate cost prediction. This can support better reimbursement schemes and help local health authorities design policies that align with resource consumption patterns in AA patients.

However, this study has certain limitations. Firstly, it concentrated on AA patients, and the findings might not be directly applicable to other conditions. Secondly, while inpatient fees were used as a proxy for costs, they might not fully represent the actual economic resources used during treatment. The data, collected from hospitals in Hefei, might also limit the generalizability of the results to other regions. Future research should explore the applicability of this DRG model to other diseases and conduct prospective studies in different settings to validate the model’s broader use.

5 Conclusion

This study provides a theoretical basis for DRG grouping of AA, identifying key variables influencing hospitalization costs and utilizing the CART model for case combination classification. Our findings indicate that LOS, hospital level, and type of surgery serve as primary nodes for DRG grouping. Using a machine learning model, patients were classified into ten DRG groups, with significant cost differences observed between groups, while cost variations within groups remained relatively small. The study validates the applicability of disease grouping based on multivariate statistical analysis and machine learning models in AA patients in Hefei, China, demonstrating the feasibility of case combination classification. Furthermore, our findings provide a useful reference for improving disease diagnosis grouping systems in other regions and countries.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

This study has been approved by the Ethics Committee of the First Affiliated Hospital of Anhui Medical University (No. PJ 2024-12-56) and all patients signed the informed consent form. Ethical principles of the Declaration of Helsinki were adhered to throughout this study.

Author contributions

XG: Investigation, Writing – original draft, Writing – review & editing. NL: Investigation, Writing – review & editing. HW: Funding acquisition, Project administration, Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the National Natural Science Foundation of China (No. 72374004).

Acknowledgments

We would like to thank the participants for their supports in making this study possible.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2025.1581441/full#supplementary-material

References

1. Moris, D, Paulson, EK, and Pappas, TN. Diagnosis and Management of Acute Appendicitis in adults: a review. JAMA. (2021) 326:2299–311. doi: 10.1001/jama.2021.20502

PubMed Abstract | Crossref Full Text | Google Scholar

2. Di Saverio, S, Podda, M, De Simone, B, Ceresoli, M, Augustin, G, Gori, A, et al. Diagnosis and treatment of acute appendicitis: 2020 update of the WSES Jerusalem guidelines. World J Emerg Surg. (2020) 15:27. doi: 10.1186/s13017-020-00306-3

PubMed Abstract | Crossref Full Text | Google Scholar

3. Gal, M, Maya, P, Ofer, K, Mansoor, K, Benyamine, A, and Boris, K. Acute appendicitis in the elderly: a Nationwide retrospective analysis. J Clin Med. (2024) 13:2139. doi: 10.3390/jcm13072139

PubMed Abstract | Crossref Full Text | Google Scholar

4. Snyder, MJ, Guthrie, M, and Cagle, S. Acute appendicitis: efficient diagnosis and management. Am Fam Physician. (2018) 98:25–33.

PubMed Abstract | Google Scholar

5. Kotaluoto, S, Ukkonen, M, Pauniaho, SL, Helminen, M, Sand, J, and Rantanen, T. Mortality related to appendectomy; a population based analysis over two decades in Finland. World J Surg. (2017) 41:64–9. doi: 10.1007/s00268-016-3688-6

PubMed Abstract | Crossref Full Text | Google Scholar

6. Fugazzola, P, Ceresoli, M, Agnoletti, V, Agresta, F, Amato, B, Carcoforo, P, et al. The SIFIPAC/WSES/SICG/SIMEU guidelines for diagnosis and treatment of acute appendicitis in the elderly (2019 edition). World J Emerg Surg. (2020) 15:19. doi: 10.1186/s13017-020-00298-0

PubMed Abstract | Crossref Full Text | Google Scholar

7. Korndorffer, JR, Fellinger, E, and Reed, W. SAGES guideline for laparoscopic appendectomy. Surg Endosc. (2010) 24:757–61. doi: 10.1007/s00464-009-0632-y

PubMed Abstract | Crossref Full Text | Google Scholar

8. GBD 2021 Appendicitis Collaborator Group. Trends and levels of the global, regional, and national burden of appendicitis between 1990 and 2021: findings from the global burden of disease study 2021. Lancet Gastroenterol Hepatol. (2024) 9:825–58. doi: 10.1016/S2468-1253(24)00157-2

Crossref Full Text | Google Scholar

9. Wickramasinghe, DP, Xavier, C, and Samarasekera, DN. The worldwide epidemiology of acute appendicitis: an analysis of the Global Health data exchange dataset. World J Surg. (2021) 45:1999–2008. doi: 10.1007/s00268-021-06077-5

PubMed Abstract | Crossref Full Text | Google Scholar

10. Freeman, JL, Fetter, RB, Park, H, Schneider, KC, Lichtenstein, JL, Hughes, JS, et al. Diagnosis-related group refinement with diagnosis-and procedure-specific comorbidities and complications. Med Care. (1995) 33:806–27. doi: 10.1097/00005650-199508000-00006

PubMed Abstract | Crossref Full Text | Google Scholar

11. Dimitropoulos, V, Yeend, T, Zhou, Q, McAlister, S, Navakatikyan, M, Hoyle, P, et al. A new clinical complexity model for the Australian refined diagnosis related groups. Health Policy. (2019) 123:1049–52. doi: 10.1016/j.healthpol.2019.08.012

PubMed Abstract | Crossref Full Text | Google Scholar

12. Bertoli, P, and Grembi, V. The political economy of diagnosis-related groups. Soc Sci Med. (2017) 190:38–47. doi: 10.1016/j.socscimed.2017.08.006

PubMed Abstract | Crossref Full Text | Google Scholar

13. Jiao, WP. Diagnosis-related groups' payment reform in Beijing. Chin Med J. (2018) 131:1763–4. doi: 10.4103/0366-6999.235869

PubMed Abstract | Crossref Full Text | Google Scholar

14. Ma, W, Qu, J, Han, H, Jiang, Z, Chen, T, Lu, X, et al. Statistical insight into China's indigenous diagnosis-related-group system evolution. Healthcare (Basel). (2023) 11:2965. doi: 10.3390/healthcare11222965

PubMed Abstract | Crossref Full Text | Google Scholar

15. Liu, X, Fang, C, Wu, C, Yu, J, and Zhao, Q. DRG grouping by machine learning: from expert-oriented to data-based method. BMC Med Inform Decis Mak. (2021) 21:312. doi: 10.1186/s12911-021-01676-7

PubMed Abstract | Crossref Full Text | Google Scholar

16. Wagner, M, Tubre, DJ, and Asensio, JA. Evolution and current trends in the Management of Acute Appendicitis. Surg Clin North Am. (2018) 98:1005–23. doi: 10.1016/j.suc.2018.05.006

PubMed Abstract | Crossref Full Text | Google Scholar

17. Borruel Nacenta, S, Ibáñez Sanz, L, Sanz Lucas, R, Depetris, MA, and Martínez, CE. Update on acute appendicitis: typical and untypical findings. Radiologia (Engl Ed). (2023) 65:S81–91. doi: 10.1016/j.rxeng.2022.09.010

PubMed Abstract | Crossref Full Text | Google Scholar

18. Krzyzak, M, and Mulrooney, SM. Acute appendicitis review: background, epidemiology, diagnosis, and treatment. Cureus. (2020) 12:e8562. doi: 10.7759/cureus.8562

PubMed Abstract | Crossref Full Text | Google Scholar

19. Min, S, Pengqian, F, and Xue, B. Analysis of hospitalization expenses and influencing factors of acute appendicitis based on grey correlation analysis. Chin J Hosp Admin. (2018) 34:1022–5. doi: 10.3760/cma.j.issn.1000-6672.2018.12.012

Crossref Full Text | Google Scholar

20. Finnesgard, EJ, Hernandez, MC, Aho, JM, and Zielinski, MD. The American Association for the Surgery of Trauma emergency general surgery anatomic severity scoring system as a predictor of cost in appendicitis. Surg Endosc. (2018) 32:4798–804. doi: 10.1007/s00464-018-6230-0

PubMed Abstract | Crossref Full Text | Google Scholar

21. Stausberg, J, and Kiefer, E. Homogeneity of the German diagnosis-related groups. Health Serv Manag Res. (2010) 23:154–9. doi: 10.1258/hsmr.2010.010002

PubMed Abstract | Crossref Full Text | Google Scholar

22. Ruffolo, C, Fiorot, A, Pagura, G, Antoniutti, M, Massani, M, Caratozzolo, E, et al. Acute appendicitis: what is the gold standard of treatment? World J Gastroenterol. (2013) 19:8799–807. doi: 10.3748/wjg.v19.i47.8799

PubMed Abstract | Crossref Full Text | Google Scholar

23. Téoule, P, Laffolie, J, Rolle, U, and Reissfelder, C. Acute appendicitis in childhood and adulthood. Dtsch Arztebl Int. (2020) 117:764–74. doi: 10.3238/arztebl.2020.0764

PubMed Abstract | Crossref Full Text | Google Scholar

24. Li, ZL, Ma, HC, Yang, Y, Chen, JJ, and Wang, ZJ. Clinical study of enhanced recovery after surgery in laparoscopic appendectomy for acute appendicitis. World J Gastrointest Surg. (2024) 16:816–22. doi: 10.4240/wjgs.v16.i3.816

PubMed Abstract | Crossref Full Text | Google Scholar

25. Fox, KM. EURopean trial on reduction of cardiac events with perindopril in stable coronary artery disease investigators. Efficacy of perindopril in reduction of cardiovascular events among patients with stable coronary artery disease: randomised, double-blind, placebo-controlled, multicentre trial (the EUROPA study). Lancet. (2003) 362:782–8. doi: 10.1016/s0140-6736(03)14286-9

PubMed Abstract | Crossref Full Text | Google Scholar

26. Zhang, YH, He, GP, and Liu, JW. Comparison of medical costs and care of appendectomy patients between fee-for-service and set fee for diagnosis-related group systems in 20 Chinese hospitals. Southeast Asian J Trop Med Public Health. (2016) 47:1055–61.

PubMed Abstract | Google Scholar

27. Quentin, W, Scheller-Kreinsen, D, Geissler, A, and Busse, REuro DRG group. Appendectomy and diagnosis-related groups (DRGs): patient classification and hospital reimbursement in 11 European countries. Langenbeck's Arch Surg. (2012) 397:317–26. doi: 10.1007/s00423-011-0877-5

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: diagnosis-related groups, acute appendicitis, classification and regression tree, hospitalization cost, machine learning model

Citation: Gu X, Li N and Wang H (2025) Optimization of diagnosis-related groups for patients with acute appendicitis using a machine learning model. Front. Public Health. 13:1581441. doi: 10.3389/fpubh.2025.1581441

Received: 22 February 2025; Accepted: 19 August 2025;
Published: 02 September 2025.

Edited by:

Ding Li, Southwestern University of Finance and Economics, China

Reviewed by:

Shumin Ren, Sichuan University, China
Ramkrishna Mondal, All India Institute of Medical Sciences (Patna), India
Alvin Caballes, University of the Philippines Manila, Philippines

Copyright © 2025 Gu, Li and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Heng Wang, d2FuZ2hlbmdfMTk2OUAxNjMuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.