Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 16 May 2025

Sec. Family Medicine and Primary Care

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1509220

This article is part of the Research TopicHospital Management and Healthcare Policy: Financing, Resourcing and Accessibility, Volume IIView all 23 articles

Data-driven segmentation of type 2 diabetes mellitus patients: an observational study on health care utilisation prior to an emergency department visit in Germany

  • 1Department for Infectious Disease Epidemiology, Robert Koch Institute, Berlin, Germany
  • 2Department of Health Care Management, Berlin Centre for Health Economics Research, Technische Universität Berlin, Berlin, Germany
  • 3Emergency and Acute Medicine (CVK, CCM), Charité – Universitätsmedizin Berlin, Berlin, Germany
  • 4Institute of Social Medicine, Epidemiology and Health Economics, Charité – Universitätsmedizin Berlin, Berlin, Germany
  • 5Institute of Clinical Epidemiology and Biometry, University of Würzburg, Würzburg, Germany
  • 6State Institute of Health I, Bavarian Health and Food Safety Authority, Erlangen, Germany
  • 7Department of Epidemiology and Health Monitoring, Robert Koch Institute, Berlin, Germany
  • 8Faculty of Medicine, Institute of General Practice and Interprofessional Care, University Hospital Tübingen, Eberhard Karls Universität Tübingen, Tübingen, Germany

Background: Potentially avoidable hospital admissions (PAHs) due to type 2 diabetes mellitus (T2DM) occur more frequently in Germany than in the rest of Europe. Emergency departments (EDs) play an important role in understanding cross-sectoral health care utilisation resulting in inpatient admissions. Segmenting T2DM patients in homogenous groups according to their health care utilisation may help to understand the population’s needs and to allocate limited resources. The aim of this study was to describe ED use and subsequent inpatient admissions among T2DM patients, and to segment the study population into homogenous subgroups based on disease stage, health care utilisation and process quality of outpatient care prior to an ED visit.

Methods: This study was conducted as part of the INDEED project, comprising data on 56,821 ED visits in 2016 attributable to 40,561 patients with T2DM from 13 German EDs, as well as statutory health insurance claims data from 2014 to 2016 retrospectively linked per patient. Descriptive analyses included patient characteristics, ED admission diagnoses and discharge diagnoses in the case of inpatient admission of T2DM patients to the ED. Latent class analysis was conducted to identify different subgroups of T2DM patients based on disease stage, number of physician contacts and medical examinations prior to the ED visit.

Results: Almost half of the study population had severe comorbidities (44.3%). In addition to T2DM, multiple cardiovascular diagnoses were among the most frequently documented admission and discharge diagnoses. The proportion of hospitalised ED visits for T2DM patients was higher (59%) than that for the INDEED population (42.8%). We identified three latent classes that were characterised as “early disease stage and high utilisation” (36.5% of the study population), “progressing disease stage and low utilisation” (26.1%) and “progressed disease stage and high utilisation” (37.4%).

Conclusion: A substantial share of T2DM patients had not received disease monitoring according to guideline recommendations prior to ED presentation. Improving guideline-adherence in the outpatient sector could help reduce potentially avoidable ED visits. Effective interventions that aim at improving continuity and quality of care as well as reducing the share of PAH need to be identified and evaluated per identified class.

1 Introduction

Type 2 diabetes mellitus (T2DM) is a chronic metabolic disease characterised by elevated blood glucose levels. In Germany, 8.6% (95% CI: 7.4–10.0%) of women and 9.9% (95% CI: 8.6–11.3%) of men aged 18 to 79 are affected by known and unknown diabetes mellitus (DM) (1). The prevalence of known DM in Germany (8.4%; 95% CI: 7.8–8.9%) is thus higher than the EU average of 7.4% (95% CI: 7.3–7.6%) (2, 3). In the adult population, T2DM is the most common type of diabetes mellitus, accounting for more than 90.0% of all diagnosed cases (4, 5). Despite existing treatment guidelines and the high potential for preventing disease progression, DM frequently leads to comorbidities, including diseases of the cardiovascular system, kidney failure, diseases of the eye and amputations of the lower extremities (4). Due to the high burden of disease and the high economic burden on healthcare systems, DM constitutes a substantial challenge for public health (6).

DM is classified as an ambulatory care sensitive condition (ACSC) (7, 8), i.e., DM-related inpatient admissions can be avoided with adequate, guideline-based outpatient treatment (9). Potentially avoidable hospital admissions (PAHs) due to T2DM are defined as a key indicator to measure quality of care in the outpatient sector by the Organisation for Economic Co-operation and Development (OECD) and are monitored as part of the German Diabetes Surveillance (2, 62). With an inpatient admission rate of 206 per 100,000 people, Germany has high levels of PAH due to DM compared to the EU average admission rate of 139 per 100,000 people (2). In addition, DM is also a risk factor for other diseases leading to PAH: In a French cohort of patients with DM, congestive heart failure and chronic obstructive pulmonary disease (COPD) were the two most common causes of inpatient treatment due to an ACSC, while short-term complications due to DM ranked third (10). A Portuguese study on the impact of DM on multiple inpatient admissions showed that inpatients diagnosed with DM had a longer average length of stay and higher costs (11). In Germany, the prevalence of DM in inpatient cohorts is twice as high as in the general population, and inpatient mortality is increased by up to 30% in individuals with T2DM compared to non-diabetics (12).

Continuous, coordinated care across sectors is crucial for preventing PAHs (9). Mapping and improving cross-sectoral care play an important role in the highly fragmented German healthcare system. The financial and organisational separation of the outpatient and inpatient sectors in Germany impairs the coordination and quality of health care services (13). Emergency departments (EDs) form a special interface between the outpatient and inpatient sectors, as they contain components of both sectors (14). While services for patients not admitted as inpatients are billed on an outpatient basis through the associations of statutory health insurance (SHI) physicians organised at the federal state level, services for patients admitted as inpatients are billed directly by health insurance funds (15). With 48.8%, the inpatient admission rate out of all ED visits is significantly higher in Germany than in other European countries (16). EDs thus play a relevant role in understanding cross-sectoral patterns of health care utilisation resulting in inpatient admission. To date, little is known about the cross-sectoral health care utilisation of T2DM patients attending EDs.

To improve cross-sectoral coordination of care and enhance patient management of T2DM patients, it is important to understand health care utilisation resulting in ED visits and inpatient admissions (8). Segmenting the population into homogenous groups according to their health care utilisation contributes to a better understanding of the population’s demand and facilitates the effective allocation of limited resources from a population health perspective (17, 18). Usually, in population segmentation methods, a distinction is made between data-driven and expert-driven approaches. In expert-driven approaches, segments of a population are distinguished by a panel of experts or health care practitioners using predefined criteria established through a literature review or a consensus-building process. Data-driven approaches use statistical analyses on extensive datasets to recognise underlying patterns of factors influencing predefined outcomes (18). Regarding T2DM patients, the most studied objectives to date are health grouping, assessment of diabetes-related complications, and non-diabetic metabolic complications. Segmentation variables at the individual level mainly included sociodemographic, DM-related, and non-DM medical-related variables. Variables relating to the healthcare system utilisation are used less frequently (17). To date, no data-driven segmentation analyses have been conducted on the outpatient health care utilisation of T2DM patients prior to an ED visit.

The overall aim of this analysis was to contribute to a better understanding of cross-sectoral patterns of health care utilisation among T2DM patients prior to ED visits with and without subsequent inpatient admissions via ED in Germany. Specifically, the study aimed to (1) describe their outpatient health care utilisation in the year prior to an ED visit, (2) describe their diagnoses in the ED as well as main hospital diagnoses in case of inpatient admission, and (3) identify homogenous subgroups of T2DM patients regarding disease stage, outpatient health care utilisation and process quality of outpatient health care prior to an ED visit.

2 Methods

2.1 Data source

This study was conducted as part of the explorative multicentre project INDEED (INDEED: Utilisation and cross-sectoral patterns of care of patients in emergency departments in Germany). The overall aim of the INDEED project was to illustrate ED use and to explore cross-sectoral patterns of care for patients admitted to EDs (14, 19). The INDEED dataset used for this study consisted of (1) ED data of SHI insured patients collected from 16 German EDs in 2016 and (2) outpatient claims data from the years 2014–2016. Data were collected and linked for each of the ED patients retrospectively (14). The data set is unique for Germany. A more recent, comparable database with linked ED and outpatient data is not available due to the established data structures and data protection regulations in Germany. Over the last two decades the number of ED cases increased constantly, although this changed during the covid-19 pandemic (20). Nevertheless, the data set is still suitable for investigating cross-sectoral patterns of health care utilisation in Germany.

The ED data included information on patient demographics, ED treatment and ED diagnoses. The ED data were supplemented with the main hospital diagnosis in the case of inpatient admission following the ED visit. Diagnoses were not available for every ED visit or inpatient admission. Three of the 16 EDs did not provide any ED diagnoses and were therefore excluded from further analyses. The outpatient data included information on patient demographics and insurance status, as well as outpatient diagnoses coded according to ICD-10-GM (International Classification of Diseases and Related Health Problems, 10th revision, German Modification) (21), medical procedures coded according to standardised outpatient billing codes (22), and drug prescription data coded according to ATC-codes (Anatomical Therapeutic Chemical Classification) (23). Information on participation in a disease management programme (DMP) appeared to be incomplete, as the numbers of documented participants for all DMPs were significantly lower than the actual numbers recorded by statutory health insurances. It was not possible to ascertain whether this discrepancy was a data artefact or an actual deviation in the study population from the total population, based on the available data. Outpatient data were transmitted from eight associations of SHI physicians in the federal states in which the participating EDs were located (24). One of the eight participating associations of SHI physicians was unable to transmit drug prescription data. Comprehensive information on the participating EDs and associations of SHI physicians as well as the process of data linkage is described elsewhere (19).

The dataset is based on a total of 454,747 ED visits in 2016, attributable to 353,926 patients, who were at least 18 years old in 2014. Furthermore, the dataset only contained ED visits that were billed via the SHI. In 2020, 87% of the German population were covered by statutory health insurance (13). Privately insured patients and visits billed via statutory accident insurance were not included in the dataset (25).

2.2 Study population

2.2.1 Definition

The study population included patients with a validated T2DM diagnosis for which at least one case was documented in the outpatient data. The presence of DM was defined via the ICD-10-GM-Codes (E10.- up to E14.-) (21). In addition, drug prescriptions of insulin and oral antidiabetic drugs (OADs) as well as participation in a DMP (if available) were included for validation of diagnosis and to distinguish between different types of DM. Prescriptions for insulin or OAD were identified via ATC-codes (23). Table 1 provides an overview of the codes considered. Figure 1 illustrates the process of extraction and validation of the study population as outlined below.

Table 1
www.frontiersin.org

Table 1. Overview of the codes used for study population definition.

Figure 1
www.frontiersin.org

Figure 1. Flowchart of patient selection of all patients with validated type 2 diabetes mellitus within the INDEED dataset. DM, Diabetes mellitus; ED, Emergency department; EDD, Emergency department diagnosis; MHD, Main hospital diagnosis; OAD, Oral antidiabetic drug; T2DM, Type 2 diabetes mellitus.

2.2.2 Extraction and validation

The extraction of the study population was performed in three steps in accordance with procedures recommended by the Robert Koch Institute (RKI) and the German Institute for Quality and Efficiency in Health Care (IQWiG) (4, 26). (I) All patients with at least one ICD-10-GM code for diabetes mellitus were identified within the dataset. Outpatient diagnoses two years prior to the ED visit, as well as ED diagnoses and the main hospital diagnosis were considered. The period “two years prior to the ED visit” comprised eight quarters prior to the quarter in 2016 in which the ED visit took place for each patient. In the case of multiple ED visits per patient in 2016, the first visit was used as the calculation basis. Only outpatient diagnoses with the addition of “confirmed” or “condition after” were considered to ensure diagnosis validity (5, 26). Inpatient diagnoses are considered valid in the German healthcare system due to standardised coding guidelines with billing relevance (27). (II) The presence of at least one of the following conditions was used to validate the DM diagnosis: (1) DM was coded at least two quarters in the outpatient data two years prior to the ED visit; (2) DM was coded once in the outpatient data two years prior to the ED visit and once as the ED diagnosis in 2016; (3) DM was diagnosed once in the outpatient data two years prior to the ED visit or as the ED diagnosis in 2016, and there was at least one prescription for insulin or OAD in the outpatient data two years prior to the ED visit; or (4) DM was coded as main hospital diagnosis in case of inpatient admission following the ED visit. The conditions were applied one after another, i.e., if condition one was fulfilled, the following conditions were not applied. Conditions one to three were defined in reference to the well-validated M2Q criterion, which describes the repeated documentation of a diagnosis over the course of a year, usually in two or more different quarters (28, 29). In deviation from the usual definition of the criterion, the presence of at least two quarters with a documented DM diagnosis or a DM-relevant drug prescription within two years, not just one year, was defined as sufficient for diagnosis validation in order to identify all cases. The quarters with DM diagnosis or prescription did not have to be consecutive, and diagnosis and prescription of antidiabetic drugs did not have to be in the same quarter. (III) Finally, the study population was further narrowed to persons with T2DM only, based on an allocation algorithm considering the presence of different combinations of E10.- to E14.- diagnoses, insulin or OAD prescriptions, and participation in a DM-specific DMP (if available, Appendix 1).

2.3 Analysis

Descriptive analyses were performed to summarise patient characteristics, health care utilisation prior to the ED visit, as well as ED diagnoses and main hospital diagnoses. Latent class analysis (LCA) was applied for population segmentation. LCA is a probabilistic, unsupervised clustering method used to determine whether unobserved homogenous subgroups (classes) exist within populations based on measured indicator variables. In contrast to other clustering methods such as k-means, individuals are not assigned to a cluster by distance measures. Instead, maximum likelihood estimations, based on finite mixture modelling of assumed underlying latent classes are used to calculate probabilities of individuals belonging to all latent classes in the model (posterior probability). An LCA is appropriate for large study samples and categorical dependent variables (indicator variables) (30). It is particularly applicable for identifying subgroups that could benefit from similar interventions (31). One of the main prerequisites of LCA is the assumption of “local independence,” i.e., indicator variables are independent of one another. Violations of this assumption can lead to misclassification due to data redundancy and result in low accuracy of fit statistics and overestimation of the true number of classes (30).

2.3.1 Indicator variables

Table 2 describes the selected indicator variables. Age and sex represented patient demographics. The number of documented DM-related complication groups and DM-specific medication indicated the disease stage of T2DM (63). The following variables represent quality markers for T2DM outpatient care. The number of contacts with physicians in the outpatient sector characterises different levels of health care utilisation (32). Accessibility of physicians in the outpatient sector was inversely associated with PAH, although conflicting results exist (33). The differentiation of the medical specialty groups was based on the levels of the German planning scheme for the regional distribution of SHI physicians and was further restricted to disciplines relevant for T2DM patients (Appendix 3) (34). As flat rates are billed on a quarterly basis in outpatient physician care, no statement can be made regarding the actual number of physician contacts per quarter (29). All health care utilisation variables were therefore calculated on a quarterly basis. HbA1c (glycated haemoglobin) and microalbuminuria measurements, as well as ophthalmological fundus examination, are recommended as indicators for outpatient process quality in the treatment of DM (35, 36). In addition, creatinine measurement was defined as a mandatory annual measurement in the German guidelines for DMP for T2DM (37). Since microalbuminuria measurement is particularly relevant for the early detection of diabetic nephropathy and since creatinine is used as an indicator of disease progression, both indicators were considered (38). According to the German guidelines for DMP for T2DM, the HbA1c should be checked quarterly, but at least twice a year. Microalbuminuria and creatinine should be checked annually. The recommended frequency for ophthalmological fundus examination was reduced in 2015 from annually to at least biennially for people without a known risk for retinopathy (37). Regular measurements of HbA1c and microalbuminuria were shown to be negatively associated with inpatient admissions (39).

Table 2
www.frontiersin.org

Table 2. Description of variables used for population segmentation.

The indicators were calculated per patient. The period “one year prior to the ED visit” comprised the four quarters prior to the quarter in 2016 in which the ED visit took place. In the case of multiple ED visits per patient, the first visit was used as the calculation basis. Diagnoses, procedures, or medications were defined as not existing or not performed per patient as soon as the patient was present at least once in the outpatient data, but the corresponding billing code was not used in the defined period. Variable values were collapsed to three to four categories per indicator to ensure interpretability of the classes (31).

2.3.2 Data setup

Patterns of missing data are illustrated. Random forest imputation was conducted with ten iterations to impute missing data (40). Random forest imputation is a fully conditional, non-parametric imputation method that makes no assumptions about normality, linearity of the relation between variables, homoscedasticity, or independence (41). After imputation, collinearity between the indicator variables was checked to validate the assumption of local independence, whereas correlation coefficients under 0.50 were considered (30).

2.3.3 Model selection and validation

The selection of the number of clusters is a challenge in all segmentation methods, since universally valid latent clusters do not exist, and definitions of clusters are strongly influenced by the underlying research objective (40). To determine the number of classes, the use of various indices is recommended (42). Regarding information criteria indicating model fit, the use of the Bayesian information criterion (BIC) is recommended for large samples, with smaller values indicating better model fit (42). For very large samples, it is recommended to apply the elbow heuristic to the BIC, i.e., to find the inflection point in the BIC plot from which the decrease in values is less steep (30). To clearly identify distinct clusters, entropy-based criteria were used, with the integrated completed likelihood (ICL) proving more accurate than scaled relative entropy (42). In addition, the average silhouette width (ASW) can be used to check for within-cluster dissimilarities and cluster separation, with higher values indicating less dissimilarity (40). The ASW is usually applied in distance-based clustering methods. However, simulations have shown that model performance regarding the ASW does not differ between distance-based and probabilistic models (43). Theoretical interpretability should be considered at least as important as statistical fit values (30, 31, 42).

The selection of the final class solution was conducted in accordance with the extended selection strategy proposed by Lezhnina and Kismihók (40). The approach does not rely solely on the model fit indicated by information criteria but also considers cluster separation and partition stability. To determine the number of classes, LCA models were fitted for 1- to 10-class solutions. The selection of class solutions was then conducted under consideration of the BIC elbow heuristic, maximum ASW and minimum ICL to handle the trade-off between cluster separation and model fit (40). The smallest identified latent class in the chosen model was checked to comprise at least 5% of the study sample (31).

For internal validation, partition stability was checked via adjusted Rand index (ARI) and Jaccard index, whose values can range from zero to one and values closer to one indicate better stability (40). Furthermore, an average latent class posterior probability of at least 0.80 per class was used to assess the classification accuracy of the chosen model (31). Model selection was performed only on a proportion of the total sample, using a split ratio of 0.7 to separate training and validation data, to avoid overfitting and improve out-of-sample fit. For validation, the partition stability and classification accuracy of the chosen model were confirmed in the validation data.

2.3.4 Model interpretation

The characteristics of the classes per indicator variable as well as differences in class patterns were analyzed. The ten most frequently coded ED diagnoses and main hospital diagnoses per class were compared. Patient characteristics, recurrent ED visits and inpatient admissions per class were described. Recurrent ED visits and inpatient admissions were coded dichotomously. Recurrent ED visits were classified as present if there was more than one ED visit documented per patient in the participating EDs in 2016. Inpatient admission was classified as present as soon as at least one inpatient admission was documented per patient. Univariate and age-adjusted logistic regressions were performed to detect significant differences in the odds of recurrent ED visits or inpatient admissions between the classes, using the age categories described in Table 2 for age adjustment. Age was already defined as one of the discriminatory variables in the LCA, but showed little discriminatory power within the LCA (see Appendix 8). As age is known to influences both health care utilisation and disease progression, an additional model adjusted for age was created for comparison.

All steps were performed using the statistics program R version 4.3.2. The comorbidity index package was used to estimate the Charlson comorbidity index (44). The missforest package was used for the imputation of missing data (41, 45). The model selection function provided by Lezhnina and Kismihók is based on the VarSelClust function from the VarSelLCM package (40, 46).

2.4 Ethics and data protection

The study was approved by the ethics committee of the Charité – Universitätsmedizin Berlin (application number EA4/086/17). The data protection concept was approved by the data protection working group of the Technology and Methods Platform for Interconnected Medical Research e.V. (TMF) and the data protection officer of Charité – Universitätsmedizin Berlin (registration number 565/17/ST3). INDEED is registered in the German clinical trials registry (registration number DRKS00022969) (14).

3 Results

3.1 Study population

As shown in Figure 1, out of 239,418 eligible patients within the INDEED dataset, 40,561 patients fulfilled the inclusion criterion of having a validated diagnosis of T2DM and at least one documented outpatient case. The median age of the study population was 75 years. Almost half of the study population had severe comorbidities with CCI scores higher than four (44.3%). In 2016, a total of 56,821 visits to the included EDs were recorded. Of the 40,561 patients with T2DM, 30,844 patients had only one recorded visit, while 9,717 had two or more. Overall, 59.0% of all visits led to inpatient admissions. At least one inpatient admission was recorded for 65.1% all patients with T2DM.

Table 3 provides additional information on the study population, categorised into patients with at least one inpatient admission and those without. This distinction is intended to show whether differences in socio-demographic factors, disease severity and the extent of outpatient care prior to the ED visit influence the need for inpatient admission.

Table 3
www.frontiersin.org

Table 3. Summary of study population characteristics.

3.2 Health care utilisation prior to the ED visit

Table 4 provides information on medical outpatient health care utilisation in the year prior to the ED visit. Although 98.5% of patients had contact with a GP, 17.8% of patients did not have their HbA1c measured during the same period.

Table 4
www.frontiersin.org

Table 4. Outpatient health care utilisation one year prior to the index emergency department visit in 2016.

3.3 Emergency department diagnoses and inpatient admissions

For 7,414 of the 56,821 ED visits no ED diagnoses were transmitted, affecting 6,715 patients. Among the remaining 49,407 ED visits, 110,448 diagnoses were coded, with an average of 2.4 diagnoses per ED visit. One of the 13 EDs was notably above average with 9.9 documented diagnoses per ED visit and was excluded from the following analysis of the top ten most frequently coded ED diagnoses (responding to 3,266 excluded patients accounting for 4,373 ED visits). Table 5 represents the ten most frequent ICD-codes documented in the remaining twelve EDs, considering the first three digits of the codes. It should be noted that diagnoses per patient may be coded more than once if they presented repeatedly at the ED in 2016.

Table 5
www.frontiersin.org

Table 5. Top ten coded three-digit ICD-10-codes in the emergency department.

For T2DM patients, who were admitted to the hospital, the inpatient mortality rate was 6.6%. Table 6 represents the ten most frequent ICD-codes documented as main hospital diagnosis in case of inpatient admission. Details on the coded T2DM diagnoses in the ED and as main hospital diagnoses in case of inpatient admission can be found in Appendix 5.

Table 6
www.frontiersin.org

Table 6. Top ten coded three-digit ICD-10-codes as main hospital diagnosis in case of inpatient admission.

3.4 Population segmentation

3.4.1 Data setup

At 20.4%, “medication” had the highest proportion of missing data, whereas all other variables showed proportions of missing values of less than 1 % (Appendix 6). More than 99.0% of the missing medication values occurred in patients from a particular association of SHI physicians, which did not provide drug prescription data. Random forest imputation was conducted. Local independence was checked via a correlation matrix (Appendix 7). A correlation of 0.63 was found between the indicators “HbA1c” and “creatinine measurement.” Therefore, “Creatinine measurement” was excluded from the LCA to avoid bias and misclassification and to reduce data redundancy (30).

3.4.2 Model selection

By applying the combination of the BIC elbow criterion and the maximum ASW indicating model fit as well as the minimum ICL to assess cluster separation, the three-class solution was selected as the final model. This is illustrated in Figure 2, which shows the ASW and BIC values for each of the 1- to 10-class solutions along the x-axis, with the minimum ICL indicated by the dash-dot vertical line.

Figure 2
www.frontiersin.org

Figure 2. Visualisation of 1- to 10-class solutions (A) and cluster selection results (B) assessed for the latent class analysis. The model name shown on the x-axis indicates the number of clusters tested for the respective model.

The “number of quarters with contact with a general specialist” showed the highest discriminatory power in the three-class solution, followed by “medication” (Appendix 8). The partition stability was stable, with an ARI of 0.92 and a Jaccard index of 0.91. At exactly 0.80 for class one, the smallest average latent class posterior probability reached the target value of 0.80 (Appendix 9). Partition stability in the validation dataset was at 0.89 (ARI) and 0.87 (Jaccard). With the smallest average latent class posterior probability at 0.80, the classification accuracy was at the targeted value in the validation dataset as well.

3.4.3 Model interpretation

After model selection and validation, the model was applied to all 40,561 patients. The response distributions of the three identified classes per indicator variable are shown in Figure 3.

Figure 3
www.frontiersin.org

Figure 3. Response probability per indicator per class identified with the latent class analysis.

The first class was named “early disease stage and high utilisation” and represented 36.5% of the study population. Class one was characterised by low average values of indicators reflecting disease progression, with high health care utilisation. The second class, “progressive disease stage and low utilisation,” containing 26.1% of the study population, showed evidence of a more advanced stage of T2DM than class one, with higher values for “medication” in particular. However, compared to classes one and three, class two showed a lower utilisation of specialised care and less frequent measurement of all indicators reflecting process quality of care. The third class “progressed disease stage and high utilisation,” had the highest values of disease progression and health care utilisation, as well as more frequent HbA1c measurements than did the other two classes. Class three represented 37.4% of the study population.

Figure 4 illustrates the average response values per indicator and class and visualises the latent class patterns. It should be noted that classes with a high average value for one variable, for example age, may also include young people and vice versa. The data points per indicator and class should not be interpreted in isolation; instead, the different graph patterns can be compared.

Figure 4
www.frontiersin.org

Figure 4. Average response values per indicator variable per class identified with the latent class analysis.

Although the ICD-GM-code for T2DM (E11.-) remained among the three most frequently coded ED diagnoses and main hospital diagnoses in classes two and three, it was no longer among the top ten in class one. In all three classes, congestive heart failure and cerebral infarction were the two most frequent main hospital diagnoses (Appendix 10).

In the last step, patient characteristics and the occurrence of recurrent ED visits and inpatient admissions per class were analysed (Table 7). Overall, class three had the highest comorbidity scores, with a correspondingly high proportion of recurrent ED visits and inpatient admissions. Class one had slightly higher CCI scores than did class two, which is congruent with higher health care utilisation and a higher average age in this class. However, inpatient admissions occurred more frequently, and recurrent ED visits occurred slightly more frequently in class two than in class one.

Table 7
www.frontiersin.org

Table 7. Patient characteristics and occurrence of recurrent emergency department visits and inpatient admission per class identified by the latent class analysis.

Logistic regression revealed increasing odds of inpatient admissions with increasing average disease stage of T2DM per class (Table 8). The effect can already be seen between classes one and two, although class one had higher values in general comorbidity than class two. Differences were particularly visible in the age-adjusted model, in which the chance of inpatient admission in class two was even higher than that in class three.

Table 8
www.frontiersin.org

Table 8. Logistic regression comparing occurrence of recurrent emergency department visits and inpatient admissions per class.

4 Discussion

4.1 Type II diabetes mellitus patients attending emergency departments

This study was the first to assess the diagnoses of T2DM patients in EDs in Germany using cross-sectoral claims data. With a median age of 75 years, the T2DM study population was older than the underlying ED cohort (25). As the dataset comprised individuals who had visited the ED at least once, it can be assumed that the individuals in the study population were older and at a more advanced stage of the disease compared to all T2DM patients in the general population (47). Additionally, it was already known that the application of the M2Q criterion particularly excludes younger individuals in the early stages of disease (28).

GP utilisation in the year prior to the ED visit was high, 98% of the T2DM patients had contact with a GP at least once in the first two quarters prior to the ED visit. However, for 17.8% of patients, no HbA1c measurement was billed in the year prior to the ED visit. This percentage was considerably higher than the proportion of 4.3% of T2DM patients aged 45 years and older without HbA1c measurements in the last twelve months reported by the National Diabetes Surveillance in Germany for 2021 (48, 49). Similarly, only one third (33.0%) of patients in the study population had an ophthalmological fundus examination in the year before the ED visit, whereas the proportion of T2DM patients with an eye examination in the last 12 months was 64.8%, as reported by the National Diabetes Surveillance in Germany for T2DM patients aged 45 years and older in 2021 (48). When comparing the data, it should be noted that the Diabetes Surveillance data are based on self-report survey data, not claims data (48). In addition, temporal changes between 2016 and 2021 may have contributed to the differences, although there were indications that health care utilisation for specialised care of T2DM patients decreased rather than increased in 2020 due to the starting COVID-19 pandemic (50, 51). Furthermore, the German guideline for DM treatment was adapted in 2015, as part of which the recommended period for eye examinations for people without known risk for retinopathy was adjusted from annually to biennial. This may have contributed to a reduced number of check-ups in the period under review in this study (52). Overall, 84.3% of patients in the study population did not have a microalbuminuria measurement billed in the year prior to the ED visit, but billing codes for creatinine measurement were documented for 87.2%. Similar results on creatinine measurement were reported for patients participating in the DMP for T2DM from the German federal state of North Rhine-Westphalia, ranging from 85.6 to 95.6% between 2014 and 2016 (53).

The proportion of ED visits for T2DM patients admitted as inpatients (59.0%) was notably higher than the proportion of patient visits admitted as inpatients (42.8%) in the underlying ED cohort, which is consistent with previous research (12, 25). Patients with at least one inpatient admission were on average at a more advanced stage of DM with more comorbidities than patients who were treated exclusively as outpatients. Inpatient mortality (6.6%) was about one third higher in the study population than in the underlying cohort (4.3%) (25). T2DM was the second most frequent coded ED diagnosis, as well as the third most frequent main hospital diagnosis in cases of subsequent inpatient admission. In addition, there are several diseases coded under the most frequent ED diagnoses or main hospital diagnoses that can be promoted by DM or similar risk factors as DM, such as hypertension, kidney diseases and cardiovascular diseases. Particularly striking is the high frequency of cardiovascular diagnoses, which emphasises the need for guideline-compliant check-ups for secondary diseases and consistent glycaemic control to avoid microvascular damage (37). Due to the advanced age and high comorbidity in the study population, it should be noted that presumably not all cases defined as having PAH by diagnosis were preventable in the outpatient sector, as health status as well as lifestyle-related factors not represented in the dataset are known as confounders of the relationship between access to primary care and PAH (54).

4.2 Segmentation of T2DM patients prior to an ED visit

Using data-driven segmentation methods, three classes differing in terms of DM-disease progression and health care utilisation were identified. It should be noted, that the labels given to the classes describe only average characteristics of the group, not on every individual person in the respective group (“naming fallacy,” (31)). For example, there might be few individuals in class one, “early disease stage and high utilisation,” with a progressed stage of T2DM, who were assigned to that class due to other characteristics like age or health care utilisation. Furthermore, the assignment of individuals to the classes was based on the information documented in the dataset. It is possible that some individuals in class one were already at a progressed stage of T2DM prior to their ED visit, despite having no visible symptoms or documented diagnoses. The LCA revealed that the variables “number of quarters with contact with a general specialist,” “medication” and “complications” differed across the identified classes (Figure 3; Appendix 8). This indicates that both disease progression and specialist utilisation are important for interpreting the patient groups. These findings highlight the potential for developing targeted care strategies in integrated care settings.

On average, the indicators used to measure the progression of T2DM showed an early stage of disease in the first class. However, since the CCI scores indicated a pronounced level of comorbidity, it can be assumed that this class would benefit from preventive measures targeting risk factors promoting a range of non-communicable diseases. In contrast, class two showed on average an advanced disease stage of T2DM and at the same time the lowest CCI values. Beyond the prevention of primary risk factors, the focus for this group should be on secondary prevention of T2DM in the form of reducing disease progression and associated complications. In both class one and class two, the recommended frequency of quarterly HbA1c measurements was met by only approximately one third of patients in the respective classes. This indicates a high potential for improving process quality in the outpatient care sector in both classes (48). For class three, a high overall morbidity regarding T2DM as well as other diseases could be shown. For this group, the focus should be on ensuring continuity of care and thus on avoiding disruptions of care. Instruments of managed care, e.g., DMPs, GP-centred gatekeeping or case management by home-nursing units, offer a promising approach to both reducing emergency cases and ensuring better cross-sectoral continuity of care in cases of unavoidable ED visits or inpatient admissions (55). For clinicians in the outpatient setting, access to information on patients’ participation in regular check-ups, existing specialist care, and any complications related to T2DM is essential. This information enables the clinician to more accurately assess their patients’ condition and adapt their treatment accordingly. Guideline-based treatment has the potential to prevent the disease from progressing and thus reduce the risk for emergency treatment.

Regarding PAH, the ten most frequently coded main hospital diagnoses per class suggest that in classes two and three PAH occurred more often directly due to T2DM, among other causes, while T2DM had more of an indirect effect on the occurrence of PAH in class one. Beyond that, it could be confirmed that the risk of inpatient admission increased in classes with an average advanced stage of T2DM (12). In comparison, class two with a less pronounced CCI but more advanced T2DM even had a higher risk for inpatient admissions than class one with a slightly more pronounced average CCI but less pronounced T2DM. The higher chance of inpatient admission in class two than in class three was more visible in the age-adjusted model than in the crude model, however, this finding must be interpreted with caution, as age was already included as an indicator variable in class partitioning. Further research is necessary to specify the health care needs of the identified classes in detail to plan and evaluate strategies that can reduce the proportion of PAH patient cases.

From a methodological perspective, the combination of values assessing model fit and values assessing cluster separation ensured interpretability (40). The graphs running parallel in some sections in Figure 4 suggest that the so-called salsa effect may have occurred in the analysis. This effect describes the phenomenon in which different patterns were not identified, but rather the same pattern in varying degrees of severity [“mild, medium and hot salsa,” (30)]. This phenomenon does not necessarily represent a methodological error but must be considered in interpreting the classes (31). The occurrence of the salsa effect was accepted for this study since interactions between disease stages and health care utilisation contribute to the understanding of disease trajectories and health care needs. The slight salsa effect, particularly between the variables of health care utilisation and process quality, can be explained by the fact that no examination would have taken place without prior contact with the responsible physician and that two variables indicating disease progression were included in the model for class partitioning. It can be assumed that the effect would have been smaller if other indicator variables had been included that reflect not only the utilisation of services provided by physicians but also of other professions, such as outpatient home nursing, podiatry, nutrition assistants or specialised outpatient wound care. In addition to other health care levels, further studies should consider class differences in terms of socioeconomic status, lifestyle-related factors, physician density or living in a deprived environment, which are already known to be associated with the utilisation of health care services and the occurrence of PAH (54, 56).

4.3 Potential limitations

The dataset offered the opportunity to map cross-sectoral health care utilisation via the ED as an interface between the inpatient and outpatient sectors. However, several limitations have to be mentioned. First, providers in the outpatient sector show different coding practices due to low-standardised coding guidelines (29, 57). Second, due to the structure of the dataset and the quality of coding diagnoses in EDs in Germany, it was not clear whether DM, if coded as a diagnosis in the ED, was the main cause for the ED visit or whether it was coded as a concomitant disease. The lack of differentiation between principal and secondary diagnoses in the ED dataset was one of the main limitations of the dataset and highlights well-known challenges of data quality in the German outpatient sector (57). This limitation does not apply for the main hospital diagnoses, as only one main hospital diagnosis was documented per inpatient case. Since T2DM and several diseases that are secondary to T2DM were among the top ten inpatient diagnoses, we assume that in many cases DM contributed at least in part to the need for the ED visit.

Third, the lack of complete data on DMP participation in the underlying dataset represented a further limitation and hinders the evaluation of an already well-implemented programme (1, 58, 59). If insights had been available to determine whether a large proportion of T2DM patients were not enrolled in a DMP, this would have clearly highlighted the need to promote participation in such programmes. Fourth, the lack of data on drug prescriptions from one of the participating associations of SHI physicians may have led to the exclusion of T2DM patients from the study population due to the applied criteria for diagnosis validation. Fifth, in addition, the outpatient claims data do not contain any quantitative clinical data, for example on the level of measured HbA1c or other laboratory parameters. Sixth, no lifestyle-related information, for example on physical activity, dietary behaviour or social support as well as socioeconomic status was available to evaluate their association with T2DM healthcare provision in Germany. Socioeconomic and lifestyle-related factors as well as HbA1c levels have already been shown to be relevant factors when segmenting T2DM patients into subgroups (17). This information could have contributed to practical recommendations for improving the quality of outpatient care – such as not only regularly measuring HbA1c, but also adjusting treatment regimens when values fall outside the target range. Further research is necessary to develop data driven recommendations.

Seventh, since not all associations of SHI physicians in Germany transmitted outpatient data, the ED patients may have had contact with a physician billed through one of the associations of SHI physicians not represented in the dataset. Therefore, it was not possible to distinguish whether individuals who did not appear in the outpatient data prior to the ED visit have never had contact with a physician or have had a physician contact that was billed via an association of SHI physicians not included in the study and thus not transmitted to the study site. In addition, no information is known about ED visits or inpatient admissions before 2016 or visits to EDs that are not part of the INDEED study or the utilisation of services that are not covered by the SHI, i.e., have to be paid out of pocket. The dataset therefore did not allow us to draw conclusions about total lack of access to health care prior to the ED visit or first-time diagnoses in the ED. Identification of first-time diagnoses of DM in the ED was additionally prevented by the application of the M2Q criterion for diagnosis validation, as individuals without at least one outpatient diagnosis or DM-medication use prior to the ED visit or DM as the main hospital diagnosis were excluded from the study population.

Eighth, it should be noted that the available data and period under consideration did not allow us to draw conclusions on any causal inference between health care utilisation and disease stage. For example, no conclusions can be drawn as to whether low health care utilisation occurred due to a lack of need in the case of low morbidity, or whether low morbidity occurred due to the lack of documented diagnoses due to low utilisation.

5 Conclusion

The results of the indicators measuring the frequency of medical examinations recommended by guidelines suggest the potential for improving process quality in the outpatient sector to prevent ED utilisation. Our study showed that segmenting the T2DM population according to their health care utilisation contributes to a better understanding of cross-sectoral patterns of care. We identified three latent classes of T2DM characterised as “early disease stage and high utilisation” (36.5% of the study population), “progressing disease stage and low utilisation” (26.1%) and “progressed disease stage and high utilisation” (37.4%). This can contribute to plan and establish health care measures aimed at reducing PAH and to improve health care continuity. Nevertheless, further research is necessary to identify and evaluate respective interventions per identified class. In addition, further research should investigate the role of non-physician health care providers as well as socioeconomic and lifestyle-related factors in cross-sectoral care patterns.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: due to the high sensitivity of the data in the project, it is not possible to make those available to the public. In the central data management of the project the data are already pseudonymised twice. Data are made available exclusively to the analyzing partners of the consortium in anonymised form and only with variables that are matched to the research question. Requests to access these datasets should be directed to YW50amUuZmlzY2hlci1yb3NpbnNreUBjaGFyaXRlLmRl.

Ethics statement

The studies involving humans were approved by Ethics committee of the Charité – Universitätsmedizin Berlin (application number EA4/086/17). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

MR: Conceptualization, Data curation, Formal analysis, Methodology, Software, Writing – original draft, Writing – review & editing. AC: Methodology, Software, Validation, Writing – review & editing. YW: Data curation, Methodology, Supervision, Writing – review & editing. AF-R: Methodology, Project administration, Supervision, Writing – review & editing. AS: Methodology, Project administration, Supervision, Writing – review & editing. DR: Methodology, Supervision, Writing – review & editing. MM: Project administration, Supervision, Writing – review & editing. TK: Project administration, Supervision, Writing – review & editing. LR: Conceptualization, Methodology, Software, Supervision, Validation, Writing – review & editing. CH: Conceptualization, Methodology, Resources, Software, Supervision, Validation, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. The INDEED project was funded by the Innovation Fund of the German Joint Federal Committee (G-BA: Gemeinsamer Bundesausschuss; grant number 01VSF16044).

Acknowledgments

We thank the members of the INDEED study group for their support: Natalie Baier, Dominik Brammen, Johannes Drepper, Patrik Droege, Felix Greiner, Ryan King, Stella Kuhlmann, Björn Kreye, Christian Lüpkes, Thomas Reinhold, Burgi Riens, Stephanie Roll, Marie-Luise Rosenbusch, Felix Staeps, Kristin Schmieder, Anna Schneider, Daniel Schreiber, Dominik von Stillfried, Maike Below, Rainer Röhrig, Thomas Ruhnke, Felix Walcher, and Grit Zimmermann. We are grateful to all participating study partners and their staff. Representing the cooperating emergency departments we would like to thank the respective head physicians: M. Bernhard (University Hospital Düsseldorf), H. J. Busch (University Hospital Freiburg), C. Wrede (Helios Clinical Center Berlin Buch), R. Somasundaram (Charité–Universitaetsmedizin Berlin, Campus Benjamin Franklin), T. Schoepke (Barnim Hospital), E. Weidmann E (Ruppiner Hospital in Neuruppin), B. Flasch (Frankfurt (Oder) Hospital), H. Hoeger-Schmidt (Chemnitz Hospital), A. Gries (University Hospital Leipzig), C. Schwarz (Sana Hospital, Leipzig County), W. Behringer (University Hospital Jena), B. Erdmann (Wolfsburg Hospital), S. Blaschke (University Hospital Goettingen), and S. Wolfrum (University Hospital Luebeck).

Conflict of interest

LR reports having received research grants from the Bundesministerium für Gesundheit outside of the submitted work. AS reports grants from Innovationsfonds des Gemeinsamen Bundesausschuss, during the conduct of the study; grants from Deutsche Forschungsgemeinschaft, grants from Bundesministerium für Bildung und Forschung, grants from Zentralinstitut für die Kassenärztliche Versorgung, grants from Thermo Fisher Scientific, grants from Roche Diagnostics, grants from Bundesministerium für Gesundheit, grants from Berlin University Alliance, outside the submitted work. TK received grants from the Innovationsfonds (Gemeinsamer Bundesausschuss, Germany), the German Federal Ministry of Education and Research, the German Federal Centre for Health Education, and the Bavarian State Ministry of Health and Care (Germany). MM reports grants from Health Care Research Projects (German Ministry of Education and Research; German Innovation).

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that Gen AI was used in the creation of this manuscript. ChatGPT was used as a tool for debugging R code and to specify R functions.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1509220/full#supplementary-material

References

1. Heidemann, C, Du, Y, Paprott, R, Haftenberger, M, Rathmann, W, and Scheidt-Nave, C. Temporal changes in the prevalence of diagnosed diabetes, undiagnosed diabetes and prediabetes: findings from the German Health Interview and Examination Surveys in 1997–1999 and 2008–2011. Diabet Med. (2016) 33:1406–14. doi: 10.1111/dme.13008

PubMed Abstract | Crossref Full Text | Google Scholar

2. OECD/European Union. Health at a Glance: Europe 2022: State of Health in the EU Cycle. Paris: OECD (2022).

Google Scholar

3. Baumert, J, Sarganas, G, Kuhnert, R, Thamm, R, Steppuhn, H, Waldhauer, J, et al. Diabetes mellitus, cardiovascular and chronic respiratory diseases in Germany and Europe – results of the European Health Interview Survey (EHIS 3, 2018 – 2020). J Health Monit. (2024) 9:e12920. doi: 10.25646/12920

PubMed Abstract | Crossref Full Text | Google Scholar

4. DeFronzo, RA, Ferrannini, E, Groop, L, Henry, RR, Herman, WH, Holst, JJ, et al. Type 2 diabetes mellitus. Nat Rev Dis Primers. (2015) 1:1. doi: 10.1038/nrdp.2015.19

PubMed Abstract | Crossref Full Text | Google Scholar

5. Reitzle, L, Ihle, P, Heidemann, C, Paprott, R, Köster, I, and Schmidt, C. Algorithm for the classification of type 1 and type 2 diabetes mellitus for the analysis of routine data. Gesundheitswesen. (2023) 85:S119–26. doi: 10.1055/a-1791-0918

PubMed Abstract | Crossref Full Text | Google Scholar

6. GBD 2021 Diabetes Collaborators. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet. (2023) 402:203–34. doi: 10.1016/S0140-6736(23)01301-6

PubMed Abstract | Crossref Full Text | Google Scholar

7. Sundmacher, L, Fischbach, D, Schuettig, W, Naumann, C, Augustin, U, and Faisst, C. Which hospitalisations are ambulatory care-sensitive, to what degree, and how could the rates be reduced? Results of a group consensus study in Germany. Health Policy. (2015) 119:1415–23. doi: 10.1016/j.healthpol.2015.08.007

PubMed Abstract | Crossref Full Text | Google Scholar

8. Schuettig, W, and Sundmacher, L. Ambulatory care-sensitive emergency department cases: a mixed methods approach to systemize and analyze cases in Germany. Eur J Pub Health. (2019) 29:1024–30. doi: 10.1093/eurpub/ckz081

PubMed Abstract | Crossref Full Text | Google Scholar

9. Lyhne, CN, Bjerrum, M, Riis, AH, and Jørgensen, MJ. Interventions to prevent potentially avoidable hospitalizations: a mixed methods systematic review. Front Public Health. (2022) 10:10. doi: 10.3389/fpubh.2022.898359

PubMed Abstract | Crossref Full Text | Google Scholar

10. Georgescu, V, Green, A, Jensen, PB, Möller, S, Renard, E, and Mercier, G. Primary care visits can reduce the risk of potentially avoidable hospitalizations among persons with diabetes in France. Eur J Pub Health. (2020) 30:1056–61. doi: 10.1093/eurpub/ckaa137

PubMed Abstract | Crossref Full Text | Google Scholar

11. Seringa, J, Marques, AP, Moita, B, Gaspar, C, Raposo, JF, and Santana, R. The impact of diabetes on multiple avoidable admissions: a cross-sectional study. BMC Health Serv Res. (2019) 19:1002. doi: 10.1186/s12913-019-4840-4

PubMed Abstract | Crossref Full Text | Google Scholar

12. Auzanneau, M, Fritsche, A, Icks, A, Siegel, E, Kilian, R, Karges, W, et al. Diabetes in the hospital - A nationwide analysis of all hospitalized cases in Germany with and without diabetes, 2015-2017. Dtsch Arztebl Int. (2021) 118:407–12. doi: 10.3238/arztebl.m2021.0151

PubMed Abstract | Crossref Full Text | Google Scholar

13. Schut, FT, Henschke, C, and Or, Z. Changing roles of health insurers in France, Germany, and the Netherlands: any lessons to learn from Bismarckian systems? Health Econ Policy Law. (2023) 18:362–76. doi: 10.1017/S1744133123000191

PubMed Abstract | Crossref Full Text | Google Scholar

14. Fischer-Rosinský, A, Slagman, A, King, R, Zimmermann, G, Drepper, J, Brammen, D, et al. Der Weg zu Routinedaten aus 16 Notaufnahmen für die sektorenübergreifende Versorgungsforschung: Erfahrungen, Herausforderungen und Lösungsansätze aus der Extraktion pseudonymer Daten für das Projekt INDEED. Medizinische Klinik - Intensivmedizin und Notfallmedizin. (2022) 117:644–53. doi: 10.1007/s00063-021-00879-0

PubMed Abstract | Crossref Full Text | Google Scholar

15. Greiner, F, Slagman, A, Stallmann, C, March, S, Pollmanns, J, Dröge, P, et al. Routine data from emergency departments: varying documentation standards, billing modalities and data custodians at an identical unit of care. Gesundheitswesen. (2020) 82:S72–82. doi: 10.1055/a-0996-8371

PubMed Abstract | Crossref Full Text | Google Scholar

16. Baier, N, Geissler, A, Bech, M, Bernstein, D, Cowling, TE, Jackson, T, et al. Emergency and urgent care systems in Australia, Denmark, England, France, Germany and the Netherlands - Analyzing organization, payment and reforms. Health Policy. (2019) 123:1–10. doi: 10.1016/j.healthpol.2018.11.001

PubMed Abstract | Crossref Full Text | Google Scholar

17. Seng, JJB, Monteiro, AY, Kwan, YH, Zainudin, SB, Tan, CS, Thumboo, J, et al. Population segmentation of type 2 diabetes mellitus patients and its clinical applications - a scoping review. BMC Med Res Methodol. (2021) 21:49. doi: 10.1186/s12874-021-01209-w

PubMed Abstract | Crossref Full Text | Google Scholar

18. Yan, S, Kwan, YH, Tan, CS, Thumboo, J, and Low, LL. A systematic review of the clinical application of data-driven population segmentation analysis. BMC Med Res Methodol. (2018) 18:121. doi: 10.1186/s12874-018-0584-9

PubMed Abstract | Crossref Full Text | Google Scholar

19. Fischer-Rosinský, A, Slagman, A, King, R, Reinhold, T, Schenk, L, Greiner, F, et al. INDEED-utilization and cross-sectoral patterns of care for patients admitted to emergency departments in Germany: Rationale and study design. Front Public Health. (2021) 9:9. doi: 10.3389/fpubh.2021.616857

PubMed Abstract | Crossref Full Text | Google Scholar

20. Schranz, M, Boender, TS, Greiner, T, Kocher, T, Wagner, B, Greiner, F, et al. Changes in emergency department utilisation in Germany before and during different phases of the COVID-19 pandemic, using data from a national surveillance system up to June 2021. BMC Public Health. (2023) 23:799. doi: 10.1186/s12889-023-15375-7

Crossref Full Text | Google Scholar

21. Deutsches Institut für Medizinische Dokumentation und Information. ICD-10-GM. Version 2016. Systematisches Verzeichnis: Internationale statistische Klassifikation der Krankheiten und verwandter Gesundheitsprobleme, 10. Revision, vol. 2016.

Google Scholar

22. Kassenärztliche Bundesvereinigung. Einheitlicher Bewertungsmaßstab (EBM): Stand: 4. Quartal 2025. Berlin: Kassenärztliche Bundesvereinigung (2015).

Google Scholar

23. Wissenschaftlichen Institut der AOK. Anatomisch-therapeutisch-chemische Klassifikation mit Tagesdosen für den deutschen Arzneimittelmarkt: ATC-Index mit DDD-Angaben. Berlin: Wissenschaftlichen Institut der AOK (2016).

Google Scholar

24. Fischer-Rosinský, A, Slagman, A, King, R, et al. Ergebnisbericht: Inanspruchnahme und sektorenübergreifende Versorgungsmuster von Patienten in Notfallversorgungsstrukturen in Deutschland: Konsortialführung. Berlin: Charité - Universitätsmedizin Berlin (2023).

Google Scholar

25. Nimptsch, U, Busse, R, Möckel, M, Fischer-Rosinský, A, Slagman, A, Keil, T, et al. Recording early deaths following emergency department visits in inpatient data: An observational study using data of 16 German hospitals. J. Evid Quality Health Care. (2023) 177:35–40. doi: 10.1016/j.zefq.2022.12.003

PubMed Abstract | Crossref Full Text | Google Scholar

26. Institut für Wirtschaftlichkeit und Qualität im Gesundheitswesen. Bestimmung der GKV-Zielpopulation für die Indikation Diabetes mellitus Typ 2 mittels GKV-Routinedaten. IQWiG-Berichte 763. Köln: Institut für Wirtschaftlichkeit und Qualität im Gesundheitswesen (2019).

Google Scholar

27. Grobe, T, Nimptsch, U, and Friedrich, J. Krankenhausbehandlung In: D Matusiewicz, editor. Routinedaten im Gesundheitswesen: Handbuch Sekundärdatenanalyse: Grundlagen, Methoden, und Perspektiven. 2. Göttingen: Aufl. Verlag Hans Huber (2014). 121–45.

Google Scholar

28. Epping, J, Stahmeyer, JT, Tetzlaff, F, and Tetzlaff, J. M2Q or Something else? The Impact of Varying Case Selection Criteria on the Prevalence Estimation of Chronic Diseases Based on Outpatient Diagnoses in German Claims Data. Gesundheitswesen. (2023) 86:S188–95. doi: 10.1055/a-2052-6477

PubMed Abstract | Crossref Full Text | Google Scholar

29. Horenkamp-Sonntag, D, Linder, R, Wenzel, F, Gerste, B, and Ihle, P. Prüfung der Datenqualität und Validität von GKV-Routinedaten In: D Matusiewicz, editor. Routinedaten im Gesundheitswesen: Handbuch Sekundärdatenanalyse: Grundlagen, Methoden, und Perspektiven. 2. Göttingen: Aufl. Verlag Hans Huber (2014). 314–30.

Google Scholar

30. Sinha, P, Calfee, CS, and Delucchi, KL. Practitioner’s guide to latent class analysis: methodological considerations and common pitfalls. Crit Care Med. (2021) 49:e63–79. doi: 10.1097/CCM.0000000000004710

PubMed Abstract | Crossref Full Text | Google Scholar

31. Weller, BE, Bowen, NK, and Faubert, SJ. Latent class analysis: a guide to best practice. J Black Psychol. (2020) 46:287–311. doi: 10.1177/0095798420930932

Crossref Full Text | Google Scholar

32. Pioch, C, Henschke, C, Lantzsch, H, Busse, R, and Vogt, V. Applying a data-driven population segmentation approach in German claims data. BMC Health Serv Res. (2023) 23:591. doi: 10.1186/s12913-023-09620-3

PubMed Abstract | Crossref Full Text | Google Scholar

33. Rosano, A, Loha, CA, Falvo, R, van der Zee, J, Ricciardi, W, Guasticchi, G, et al. The relationship between avoidable hospitalization and accessibility to primary care: a systematic review. Eur J Pub Health. (2012) 23:356–60. doi: 10.1093/eurpub/cks053

PubMed Abstract | Crossref Full Text | Google Scholar

34. Versorgungsebene und Arztgruppen. (2023) Kassenärztliche Bundesvereingung. Available online at: https://www.kbv.de/html/bedarfsplanung.php#content28374

Google Scholar

35. Landgraf, R, Aberle, J, Birkenfeld, AL, Gallwitz, B, Kellerer, M, Klein, HH, et al. Therapie des Typ-2-Diabetes. Die Diabetol. (2022) 18:623–56. doi: 10.1007/s11428-022-00921-5

Crossref Full Text | Google Scholar

36. Nicolucci, A, Greenfield, S, and Mattke, S. Selecting indicators for the quality of diabetes care at the health systems level in OECD countries. Int J Qual Health Care. (2006) 18:26–30. doi: 10.1093/intqhc/mzl023

PubMed Abstract | Crossref Full Text | Google Scholar

37. Gemeinsamer Bundesausschuss. Beschluss des Gemeinsamen Bundesausschusses über die DMP-Anforderungen-Richtlinie: Erstfassung. Berlin: Gemeinsamer Bundesausschuss (2014).

Google Scholar

38. Wanner, C, and Lopau, K. Diabetische Nephropathie: Die neue Rolle der Niere. Deutsches Ärzteblatt. (2020). doi: 10.3238/PersDia.2020.05.15.01

Crossref Full Text | Google Scholar

39. Comino, EJ, Islam, MF, Tran, DT, Jorm, L, Flack, J, Jalaludin, B, et al. Association of processes of primary care and hospitalisation for people with diabetes: A record linkage study. Diabetes Res Clin Pract. (2015) 108:296–305. doi: 10.1016/j.diabres.2015.02.003

PubMed Abstract | Crossref Full Text | Google Scholar

40. Lezhnina, O, and Kismihók, G. Latent class cluster analysis: selecting the number of clusters. Methods X. (2022) 9:9. doi: 10.1016/j.mex.2022.101747

PubMed Abstract | Crossref Full Text | Google Scholar

41. Golino, HF, and Gomes, CMA. Random forest as an imputation method for education and psychology research: its impact on item fit and difficulty of the Rasch model. Int J Res Method Educ. (2016) 39:401–21. doi: 10.1080/1743727X.2016.1168798

Crossref Full Text | Google Scholar

42. Whittaker, TA, and Miller, JE. Exploring the enumeration accuracy of cross-validation indices in latent class analysis. Struct Equ Model Multidiscip J. (2021) 28:376–90. doi: 10.1080/10705511.2020.1802280

Crossref Full Text | Google Scholar

43. Anderlucci, L, and Hennig, C. The Clustering of categorical data: a comparison of a model-based and a distance-based approach. Commun Stat Theory Methods. (2014) 43:704–21. doi: 10.1080/03610926.2013.806665

Crossref Full Text | Google Scholar

44. Quan, H, Sundararajan, V, Halfon, P, Fong, A, Burnand, B, Luthi, JC, et al. Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care. (2005) 43:1130–9. doi: 10.1097/01.mlr.0000182534.19832.83

PubMed Abstract | Crossref Full Text | Google Scholar

45. Stekhoven, DJ, and Bühlmann, P. Miss Forest - non-parametric missing value imputation for mixed-type data. Bioinformatics. (2012) 28:112–8. doi: 10.1093/bioinformatics/btr597

PubMed Abstract | Crossref Full Text | Google Scholar

46. Marbac, M, and Sedki, M. VarSelLCM: an R/C++ package for variable selection in model-based clustering of mixed-data with missing values. Bioinformatics. (2019) 35:1255–7. doi: 10.1093/bioinformatics/bty786

Crossref Full Text | Google Scholar

47. Köster, I, Huppertz, E, Hauner, H, and Schubert, I. Costs of diabetes mellitus (CoDiM) in Germany, direct per-capita costs of managing hyperglycaemia and diabetes complications in 2010 compared to 2001. Exp Clin Endocrinol Diabetes. (2014) 122:510–6. doi: 10.1055/s-0034-1375675

PubMed Abstract | Crossref Full Text | Google Scholar

48. Tuncer, O, Du, Y, Baumert, J, Reitzle, L, and Heidemann, C. Update of the National Diabetes surveillance. Overview on indicators from the fields of action improving the early detection and treatment of diabetes, reducing the complications of diabetes and reducing the burden and costs of disease. Diabetologie. (2023) 19:561–70. doi: 10.1007/s11428-023-01069-6

Crossref Full Text | Google Scholar

49. Heidemann, C, Du, Y, Mauz, I, Walther, L, Peitz, D, Müller, A, et al. Healthcare and health situation of adults with type 2 diabetes in Germany: The study GEDA 2021/2022-Diabetes. J Health Monitor. (2024) 9:12128. doi: 10.25646/12128

PubMed Abstract | Crossref Full Text | Google Scholar

50. Du, Y, Baumert, J, Damerow, S, Rommel, A, Scheidt-Nave, C, and Heidemann, C. Utilisation of outpatient medical services by people with diagnosed diabetes during the COVID-19 pandemic in Germany. J Health Monitor. (2021) 6:51–8. doi: 10.25646/8333

PubMed Abstract | Crossref Full Text | Google Scholar

51. Narres, M, Claessen, H, Kvitkina, T, Rosenbauer, J, Scheider, M, Morbach, S, et al. Hospitalisation rate and mortality among people with and without diabetes during the COVID-19 pandemic year 2020. Eur J Epidemiol. (2022) 37:587–90. doi: 10.1007/s10654-022-00865-6

PubMed Abstract | Crossref Full Text | Google Scholar

52. Kassenärztliche Bundesvereinigung. Nationale Versorgungsleitlinie: Prävention und Therapie von Netzhautkomplikationen bei Diabetes: Langfassung. Berlin: Kassenärztliche Bundesvereinigung (2015).

Google Scholar

53. Nierenfunktion überprüfen. (2023). Zentralinstitut für die kassenärztliche Versorgung in der Bundesrepublik Deutschland. DMP-Atlas Nordrhein-Westfalen: Diabetes mellitus Typ2. Nierenfunktion überprüfen. Available online at: https://www.zi-dmp.de/dmp-atlas_nrw/dmp_d2_nieren_check.html

Google Scholar

54. Gibson, OR, Segal, L, and McDermott, RA. A systematic review of evidence on the association between hospitalisation for chronic disease related ambulatory care sensitive conditions and primary health care resourcing. BMC Health Serv Res. (2013) 13:336. doi: 10.1186/1472-6963-13-336

PubMed Abstract | Crossref Full Text | Google Scholar

55. Amelung, VE. Managed Care: Neue Wege im Gesundheitsmanagement. 5th ed. Wiesbaden: Gabler Verlag | Springer Fachmedien (2012).

Google Scholar

56. Lampert, T, Richter, M, Schneider, S, Spallek, J, and Dragano, N. Social inequality and health: Status and prospects of socio-epidemiological research in Germany. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. (2016) 59:153–65. doi: 10.1007/s00103-015-2275-6

PubMed Abstract | Crossref Full Text | Google Scholar

57. Grobe, T, and Dräther, H. Ambulante ärztliche Versorgung In: D Matusiewicz, editor. Routinedaten im Gesundheitswesen: Handbuch Sekundärdatenanalyse: Grundlagen, Methoden, und Perspektiven. 2. Göttingen: Aufl. Verlag Hans Huber (2014). 43–62.

Google Scholar

58. Fuchs, S, Henschke, C, Blümel, M, and Busse, R. Disease management programs for type 2 diabetes in Germany: a systematic literature review evaluating effectiveness. Dtsch Arztebl Int. (2014) 111:453–63. doi: 10.3238/arztebl.2014.0453

PubMed Abstract | Crossref Full Text | Google Scholar

59. Mehring, M, Donnachie, E, Bonke, FC, Werner, C, and Schneider, A. Disease management programs for patients with type 2 diabetes mellitus in Germany: a longitudinal population-based descriptive study. Diabetol Metab Syndr. (2017) 9:37. doi: 10.1186/s13098-017-0236-y

PubMed Abstract | Crossref Full Text | Google Scholar

60. Charlson, ME, Pompei, P, Ales, KL, and MacKenzie, CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. (1987) 40:373–83. doi: 10.1016/0021-9681(87)90171-8

PubMed Abstract | Crossref Full Text | Google Scholar

61. Soril, LJJ, Leggett, LE, Lorenzetti, DL, Noseworthy, TW, and Clement, FM. Reducing frequent visits to the emergency department: a systematic review of interventions. PLoS One. (2015) 10:e0123660. doi: 10.1371/journal.pone.0123660

PubMed Abstract | Crossref Full Text | Google Scholar

62. Heidemann, C, Reitzle, R, Ziese, T, and Scheidt-Nave, C. Diabetes surveillance at the Robert Koch Institute – pilot project for the development of an NCD surveillance in Germany. Public Health Forum. (2021) 29:277–81. doi: 10.1515/pubhef-2021-0081

Crossref Full Text | Google Scholar

63. Schuettig, W, and Sundmacher, L. The impact of ambulatory care spending, continuity and processes of care on ambulatory care sensitive hospitalizations. The European Journal of Health Economics (2022) 23:1329–40.

Google Scholar

Keywords: type II diabetes mellitus, emergency department, health care utilisation, avoidable hospital admission, population segmentation, latent class analysis

Citation: Rupprecht M, Campione A, Wu YN, Fischer-Rosinský A, Slagman A, Riedlinger D, Möckel M, Keil T, Reitzle L and Henschke C (2025) Data-driven segmentation of type 2 diabetes mellitus patients: an observational study on health care utilisation prior to an emergency department visit in Germany. Front. Med. 12:1509220. doi: 10.3389/fmed.2025.1509220

Received: 14 October 2024; Accepted: 24 April 2025;
Published: 16 May 2025.

Edited by:

Jay J. Shen, University of Nevada, Las Vegas, United States

Reviewed by:

Adrian Soto-Mota, National Institute of Medical Sciences and Nutrition Salvador Zubirán, Mexico
Xinhang Wang, Stanford University, United States

Copyright © 2025 Rupprecht, Campione, Wu, Fischer-Rosinský, Slagman, Riedlinger, Möckel, Keil, Reitzle and Henschke. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mirjam Rupprecht, UnVwcHJlY2h0TUBya2kuZGU=

These authors have contributed equally to this work and share last authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.