Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Public Health, 09 December 2025

Sec. Infectious Diseases: Epidemiology and Prevention

Volume 13 - 2025 | https://doi.org/10.3389/fpubh.2025.1698923

This article is part of the Research TopicMathematical Modelling and Data Analysis in Infectious DiseasesView all 13 articles

Spatial patterns and socio-environmental determinants of gonorrhea incidence in China

Ke Hu&#x;Ke Hu1Xingjin Yang&#x;Xingjin Yang2Yu Cai&#x;Yu Cai3Chaojie LiChaojie Li4Xing ZhangXing Zhang5Di XiaoDi Xiao6Mingyang Yu
Mingyang Yu7*
  • 1Xiamen Haicang Hospital, Xiamen, Fujian, China
  • 2QianDongNanZhou Center for Disease Control and Prevention, Guizhou, China
  • 3Shanghai Municipal Hospital of Traditional Chinese Medicine, Shanghai, China
  • 4Xingtai Center for Disease Control and Prevention, Xingtai, Hebei, China
  • 5Nanjing Lishui Dongping Street Health Center, Nanjing, Jiangsu, China
  • 6Community Health Service Center of Jiuxian Tongliang District, Chongqing, China
  • 7Fuwai Central China Cardiovascular Hospital, Zhengzhou, Henan, China

Introduction: Gonorrhea is a major sexually transmitted infection in China, showing distinct regional clustering and spatial heterogeneity. Understanding its geographical distribution and influencing factors is crucial for targeted prevention.

Methods: We analyzed 2022 gonorrhea incidence across 31 Chinese provinces using both traditional and spatial statistical approaches, including descriptive statistics, spatial autocorrelation, multiple linear regression (MLR), and spatial error models (SEM). Factors from five categories—economic, demographic, environmental, educational, and healthcare-related—were examined. The Geodetector method was additionally used to assess factor contributions and interactions.

Results: Three key findings emerged: (1) Significant regional disparities in gonorrhea incidence were observed, with high-high clusters detected in southern provinces and low-low clusters in northern regions.; (2) SEM outperformed MLR, confirming stronger effects of illiteracy rate, bed utilization rate, sex ratio, and PM2.5 concentration while demonstrating better model fit (higher R2, log-likelihood; lower AIC); (3) Sex ratio was identified as a core determinant, with interaction effects (particularly bed utilization rate and sex ratio) amplifying individual impacts.

Conclusion: These results support spatially tailored intervention strategies that integrate sociodemographic and environmental factors for effective gonorrhea prevention.

1 Introduction

Gonorrhea, one of the most prevalent sexually transmitted infections in China, reported 96,313 cases in 2022, ranking as the fourth most common notifiable infectious disease nationwide, following viral hepatitis, tuberculosis, and syphilis (1). In China, gonorrhea incidence exhibits significant regional clustering and spatial heterogeneity. Studies have shown that disparities in gonorrhea incidence exist not only at the provincial level but also display aggregated distribution patterns in smaller administrative units (e.g., counties or districts). For instance, data from Zhejiang Province (2016–2020) revealed that gonorrhea incidence was significantly higher in northern and central regions (e.g., Hangzhou, Jiaxing) compared to the south. Spatial autocorrelation analysis (Moran’s I index) further confirmed its spatial clustering, with hotspot areas primarily concentrated in highly urbanized districts (2). This spatial heterogeneity may be closely linked to regional disparities in socio-environmental determinants of health (3, 4).

The epidemiology of gonorrhea is shaped by a complex interplay of multidimensional factors. Socioeconomically, lower per capita GDP is associated with higher incidence rates, likely due to reduced healthcare access in economically disadvantaged regions (5). Urbanization exerts a bidirectional influence, wherein expanded sexual networks may enhance transmission, whereas concentrated medical infrastructure could mitigate disease incidence (2). Demographic analyses reveal that areas with elevated illiteracy rates, indicative of poorer health literacy, demonstrate significantly higher gonorrhea prevalence (68), with additional influences from gender ratio imbalances (2, 9, 10). Healthcare system factors, particularly the density of physicians per 1,000 population and hospital bed utilization rates, critically determine case detection and treatment efficacy, where resource-limited areas often experience surveillance gaps leading to underreporting and treatment delays (11). Environmental determinants, including air pollution levels, have emerged as potential drivers of transmission dynamics (12, 13). The spatial heterogeneity of these interacting factors collectively determines the geographic distribution patterns of gonorrhea across regions.

Spatial regression models (e.g., Spatial Error Model and Spatial Lag Model) effectively address spatial autocorrelation issues neglected by conventional statistical methods. By constructing spatial weight matrices, these models can not only identify the direct effects of influencing factors on local areas but also quantify their spatial spillover effects (i.e., spatial dependence) on neighboring regions (1416). In contrast, the Geodetector method, based on the principle of spatial stratified heterogeneity, demonstrates unique advantages: it can precisely quantify the independent contribution of each risk factor to spatial variations in disease incidence through its factor detector, while effectively identifying interaction effects between different factors (17, 18). Owing to its methodological strengths in analyzing disease spatial transmission mechanisms, Geodetector has been successfully implemented across multiple public health research domains, particularly in disease risk zoning, health resource allocation optimization, and environmental health risk assessment (19).

The spatial heterogeneity of gonorrhea incidence results from the nonlinear interactions of multiple socioeconomic factors, necessitating spatial analytical approaches to uncover underlying geographical patterns. The application of spatial regression models and Geodetector can overcome the limitations of conventional epidemiological analyses, thereby providing scientific evidence for identifying high-risk areas and optimizing resource allocation.

2 Methods

2.1 Data

Building upon the theoretical framework established in the background section and considering data availability, this study analyzed provincial-level data from 31 Chinese mainland administrative units (provinces, autonomous regions, and centrally administered municipalities) by integrating seven socio-environmental indicators in 2022 across five dimensions: economic development, population structure, environmental condition, education level, and healthcare resources (Table 1). Data were systematically compiled from authoritative sources: gonorrhea incidence rates from the most recent 2023 China Health Statistical Yearbook, PM2.5 concentrations from provincial environmental reports, and all remaining explanatory variables from the China Statistical Yearbook.

Table 1
www.frontiersin.org

Table 1. Key factors selected for analysis categorized into economic development, education level, healthcare resources, population structure, and environmental condition.

2.2 Descriptive analyses

The spatial distribution characteristics of gonorrhea incidence in the 31 districts were illustrated through thematic mapping.

2.3 Spatial autocorrelation analysis

The methodological framework for spatial autocorrelation analysis primarily comprises the Global Moran’s I and Local Moran’s I indices.

2.3.1 Global Moran’s I

As a fundamental tool in spatial econometrics, the Global Moran’s I index is principally employed to measure spatial dependence at the aggregate regional level. Its mathematical formulation is expressed as Equation 1 (20):

I = n i = 1 n j = 1 n W ij ( x i x ¯ ) ( x j x ¯ ) i = 1 n j = 1 n W ij i = 1 n ( x i x ¯ ) 2     (1)

The Global Moran’s I index, ranging from −1 to 1, quantifies spatial dependence through the formula incorporating sample size (n), spatial weight matrix elements (Wij), observed values (xi, xj), and their mean ( x ¯ ). Values demonstrate distinct spatial patterns: significant positive autocorrelation (0 < I ≤ 1) reflects clustering of similar values, negative autocorrelation (−1 ≤ I < 0) indicates dispersion of dissimilar values, while I ≈ 0 suggests random distribution. Statistical significance is assessed via standardized Z-scores (21).

2.3.2 Local Moran’s I (LISA)

LISA primarily serves to uncover localized spatial heterogeneity patterns within study regions. By quantifying local spatial autocorrelation, it effectively identifies statistically significant spatial clusters. The computational formula is expressed as Equation 2 (22):

I i = n ( x i x ¯ ) j = 1 n w ij ( n j n ¯ ) j = 1 n ( x j x ¯ ) 2     (2)

LISA adopts the same variable definition framework as the Global Moran’s I index to identify four distinct spatial association patterns: high-high clustering (HH-type) representing agglomeration of high-value areas, low-low clustering (LL-type) indicating concentration of low-value regions, high-low outliers (HL-type) showing isolated high values surrounded by low values, and low-high outliers (LH-type) featuring isolated low values encircled by high values. These spatial patterns, when validated by significance testing and visualized through LISA cluster maps, effectively characterize the spatial heterogeneity of the study region (23).

2.4 Multiple Linear Regression (MLR)

MLR is a multivariate statistical analysis method that quantifies the collective influence of multiple explanatory variables on a single response variable through the construction of a linear mathematical model (24). The standard formulation is expressed as Equation 3:

Y = β 0 + β 1 X 1 + β 2 X 2 + + β p X p + ε     (3)

where Y represents the dependent variable, X1, X2, …, Xp denote the independent variables, β0 is the intercept term, β₁, β₂, …, βp are the regression coefficients, and ϵ stands for the random error term following a normal distribution with mean zero.

The goodness-of-fit of mode was comprehensively evaluated using multiple metrics, including the coefficient of determination (R2), Akaike Information Criterion (AIC), and log-likelihood values. Additionally, to assess potential multicollinearity among the independent variables, diagnostic analysis was performed using variance inflation factors (VIF). Variables with VIF > 5 may exhibit multicollinearity (25).

2.5 Spatial regression models

2.5.1 Spatial error model (SEM)

SEM is a specialized spatial econometric approach designed to address spatial autocorrelation in regression residuals. The model structure is specified as Equations 4, 5:

Y = X β + u     (4)
u = λ W u + ε     (5)

where λ denotes the spatial error coefficient and Wu represents the spatially lagged error term. By incorporating a spatial autocorrelation structure into the error terms, SEM effectively corrects the parameter estimation bias caused by residual spatial dependence in conventional regression analysis (26).

2.5.2 Spatial lag model (SLM)

SLM effectively captures spatial dependence among study units by incorporating a spatial lag term of the dependent variable. The model is expressed as Equation 6 (27):

Y = ρ W Y + X β + ε     (6)

where ρ represents the spatial autoregressive coefficient, W denotes the spatial weight matrix, and WY indicates the spatially lagged dependent variable. This model is particularly suitable for quantifying spillover effects between spatial units.

2.6 Geodetector

The Geodetector is a statistical method based on the principle of spatial stratified heterogeneity, which systematically examines spatial differentiation patterns and their underlying drivers through four integrated analytical modules: the Factor Detector quantifying explanatory power using PD (Power of Determinant)-statistics, the Interaction Detector evaluating synergistic effects among multiple factors, the Risk Detector identifying high-risk spatial zones, and the Ecological Detector testing the significance of factor differences (2830). This comprehensive framework enables rigorous investigation of complex spatial heterogeneity by simultaneously assessing individual factor contributions, interaction effects, risk distribution patterns, and statistical significance of spatial variations.

2.6.1 Factor detector

The Factor Detector module within the Geodetector methodology is grounded in the theory of spatial stratified heterogeneity. It systematically evaluates the independent contribution of each explanatory variable to the spatial differentiation characteristics of the study target by constructing the PD-value (also called q statistic). The computational principle of this method lies in analyzing the difference between within-stratum variance and total variance, thereby achieving a precise measurement of the explanatory power of influencing factors (31). The model is expressed mathematically in Equation 7:

PD = 1 h = 1 L N h σ h 2 N σ 2     (7)

Here, Nh denotes the sample size of the h-th stratum, σh2 represents the variance of the h-th stratum, N is the total sample size, and σ2 is the total variance. The PD-value ranges between [0, 1], and its magnitude is positively correlated with the explanatory power of the variable. Specifically, the PD-value closer to 1 indicates a stronger influence of the factor on spatial differentiation.

2.6.2 Risk detector

The Risk Detector is designed to assess the heterogeneity of spatial risk levels under stratified conditions of different influencing factors. Based on the statistical principle of independent samples t-test, this method identifies high-risk or low-risk areas in spatial distribution by testing the significance of mean differences across stratified sub-regions (28). See Equation 8 for the model formulation:

t y ¯ h 1 y ¯ h 2 = Y ¯ h = 1 Y ¯ h = 2 [ Var ( Y ¯ h = 1 ) n h = 1 + Var ( Y ¯ h = 2 ) n h = 2 ] 1 / 2     (8)

Where Y ¯ h represents the average gonorrhea incidence of layer h, nh is samples, Var represents sample variance, t follows the Student’s-t-test distribution. The null hypothesis as follows (Equation 9):

H 0 : Y ¯ h = 1 = Y ¯ h = 2     (9)

Rejection of the null hypothesis at the α level indicates statistically significant differences in the mean incidence rates across the two study regions.

2.6.3 Ecological detector

The Ecological Detector is grounded in the Analysis of Variance (ANOVA) framework, employing the F-statistic to conduct significance testing on the explanatory power of influencing factors regarding spatial differentiation. Its core principle involves comparing within-stratum variance differences across factors, thereby quantitatively assessing whether each factor’s contribution to the formation of spatial patterns in geographical phenomena is statistically significant. The mathematical formulation of the model is given by Equations 1012:

F = n X 1 ( n x 2 1 ) SS W X 1 n X 2 ( n x 1 1 ) SS W X 2     (10)
SS W X 1 = h = 1 L 1 N h σ h 2     (11)
SS W X 2 = h = 1 L 2 N h σ h 2     (12)

Where nx1 and nx2 represent the samples of two factors x1 and x2, respectively. SSWx1 and SSWx2 represent the sum of the within-strata variance of x1 and x2, respectively; L1 and L2 represent the number of layers of x1 and x2, respectively.

The null hypothesis (H0) of this test assumes that two influencing factors exhibit no significant difference in their explanatory power regarding the spatial differentiation of the dependent variable. By calculating the F-statistic and conducting hypothesis testing: if the observed F-value exceeds the critical value at a given significance level, the null hypothesis is rejected, indicating a statistically significant difference between the two factors in their ability to explain spatial differentiation.

2.6.4 Interaction detector

The Interaction Detector is designed to investigate the synergistic effects of dual factors on geographical spatial differentiation. This method systematically compares the explanatory power (PD-value) of individual factors with the joint explanatory power (PD-value) of factor combinations, enabling precise identification of five characteristic interaction types (19): nonlinear-weakening, univariate-weakening, bivariate enhancement, independent, and nonlinear-enhancement effects (Table 2).

Table 2
www.frontiersin.org

Table 2. Descriptions and classifications of interaction types between variables including weakened nonlinear, weakened univariate, enhanced bivariate, independent, and enhanced nonlinear interactions.

2.7 Classification methods for independent variables

Geodetector requires independent variables to be categorical, making the discretization of continuous variables a crucial preprocessing step. This study employs five mainstream discretization methods (32):

Natural breaks: Optimizes classification boundaries based on data distribution characteristics to maximize inter-category differences.

Quantile classification: Ensures balanced sample sizes across categories, suitable for skewed distributions.

Equal interval classification: Divides the value range into equal-width intervals, ideal for uniformly distributed data.

Geometric interval classification: Sets intervals based on geometric progression, tailored for exponentially distributed data.

Standard deviation classification: Partitions data based on intervals measured by n standard deviations from the mean, optimized for normal distributions.

The discretization optimization follows the principle of maximizing PD-value. By systematically comparing different classification methods with 3 to 8 intervals in factor detection, the optimal discretization scheme is determined. This standardized process significantly improves the model’s explanatory power.

2.8 Software

This study utilized ArcGIS 10.2 software (Esri)1 for the discretization of independent variables, spatial autocorrelation analysis, collinearity diagnosis, and map visualization. GeoDa 1.22 software2 was employed to construct multiple linear regression models and spatial regression models. All spatial analyses and the construction of spatial weight matrices were performed based on the Albers Equal Area Conic projection coordinate system to ensure measurement consistency and the reproducibility of results. The Geodetector analysis was conducted using its official Excel version.3 The foundational map data used in the study were sourced from the National Platform for Common Geospatial Information Services4 [approval number: GS (2024) 0650]. All statistical analyses adopted a two-tailed test, with the significance level set at p < 0.05.

3 Results

3.1 Spatial distribution characteristics and aggregation patterns of Gonorrhea incidence

In 2022, gonorrhea incidence in China showed clear geographical variation, with southern regions generally experiencing higher rates than northern areas (Figure 1). Specifically, Zhejiang Province had the highest incidence rate (20.58 cases per 100,000 population), while Hebei Province had the lowest (0.94 cases per 100,000 population), indicating pronounced regional disparities.

Figure 1
Map of China showing gonorrhea incidence rates per 100,000 people, coded by color: gray (0.97-2.31), yellow (2.32-4.67), green (4.68-7.78), blue (7.79-12.23), red (12.24-20.58). White areas indicate unstudied regions. A scale bar is shown in miles. The inset shows the South China Sea islands.There is also a compass.

Figure 1. Spatial distribution of gonorrhea incidence rates across China in 2022 illustrating geographic heterogeneity and identifying regions with high and low incidence rates.

Global spatial autocorrelation analysis revealed that the Moran’s I index for gonorrhea incidence in China in 2022 was 0.391 (p < 0.001), suggesting significant spatial clustering. LISA analysis results (Figure 2) showed two primary clustering patterns: high-high clusters were concentrated in southern coastal provinces, including Guangdong, Guangxi, and Fujian, whereas low-low clusters were predominantly located in northern provinces, with Hebei Province being the most typical example.

Figure 2
Map of China showing LISA cluster analysis. Most regions are gray, indicating no significance. A high-high cluster is shown in black in the south. A low-low cluster is in blue in the north. High-low outliers are yellow, and low-high outliers are light gray. White areas indicate unstudied regions. A scale bar and compass is included.The inset shows the South China Sea islands.

Figure 2. Local indicators of spatial association cluster map of gonorrhea incidence in China in 2022 revealing statistically significant spatial aggregation patterns including high-high clusters, low-low clusters, and spatial outliers.

3.2 MLR results

Initial collinearity diagnostics revealed strong multicollinearity between GDP per capita and urbanization rate (VIF values of 7.2 and 7.92, respectively). After removing urbanization rate, all remaining variables had VIF values below 5. The preliminary multiple linear regression analysis (Table 3) indicated that illiteracy rate, bed utilization rate, sex ratio, and PM2.5 concentration had statistically significant effects on gonorrhea incidence. Additionally, Robust Lagrange Multiplier tests revealed statistically significant spatial dependence (error term: p = 0.010, lag term: p = 0.039), justifying the application of both SLM and SEM for further analysis.

Table 3
www.frontiersin.org

Table 3. Comparison of model performance between the spatial error model and the multiple linear regression model presenting regression coefficients, statistical significance, r-squared, Akaike information criterion, and log-likelihood values for each predictor variable.

3.3 Spatial regression model results

The study employed SLM and SEM for spatial econometric analysis. Model testing results showed that the spatial lag model’s lag coefficient (WY = 0.054, p = 0.77801) did not reach statistical significance (α = 0.05). In contrast, the spatial error model’s error term coefficient was highly significant (λ = −0.982, p < 0.001). Based on these findings, the spatial error model was selected for further analysis.

Table 3 presents the estimation results of the spatial error model. Compared to MLR, this model not only confirmed the significant effects of illiteracy rate, bed utilization rate, sex ratio, and PM2.5 concentration on gonorrhea incidence (with stronger statistical significance for each variable) but also demonstrated superior model fit—evidenced by higher R2 and log-likelihood values, as well as a lower AIC value. These metrics collectively indicate that the spatial error model provides a better fit for the data.

3.4 Geodetector model results

3.4.1 Discretization of independent variables

Five classification methods discretized seven variables into 3–8 categories. Factor detector calculated PD values for each scheme, and the optimal one (maximum PD value) was selected (Table 4).

Table 4
www.frontiersin.org

Table 4. Results of the factor detection analysis showing the power of determinism values, statistical significance, classification method, and optimal class interval for each independent variable.

3.4.2 Factor detector results

Statistical results in Table 4 show that variables such as per capita GDP (p > 0.05), illiteracy rate (p > 0.05), and urbanization rate (p > 0.05) did not exhibit statistically significant associations with gonorrhea incidence. In contrast, other influencing factors demonstrated varying degrees of explanatory power (PD values ranging from 0.27 to 0.62). Among them, the sex ratio had the strongest explanatory power (PD = 0.62), while the number of physicians per 1,000 population showed relatively weaker explanatory power (PD = 0.27).

3.4.3 Risk detector results

The risk detector revealed the average gonorrhea incidence rates across different strata of each factor and analyzed whether significant differences existed between these strata. For example, the relationship between the sex ratio and gonorrhea incidence is illustrated in Figure 3. As shown in the figure, the sex ratio exhibited a clear interval effect on gonorrhea incidence: in the high sex ratio interval (107.88–113.36), the incidence rate peaked (14.5275 cases per 100,000 population), whereas in the low sex ratio interval (97.25–98.77), the incidence rate dropped to its lowest level (1.33 cases per 100,000 population).

Figure 3
Map of China showing the sex ratio related to gonorrhea incidence rates, with regions color-coded by incidence rates. A color legend to the left indicates incidence rate categories, ranging from low to high. Areas without data are shown in white. The inset shows the South China Sea islands.A scale bar and compass is included.

Figure 3. Geographic distribution of sex ratio intervals and their corresponding gonorrhea incidence rates in China.

Table 5 presents the differences in gonorrhea incidence across various sex ratio intervals using significance testing (Y/N). The study divided the sex ratio into eight gradients, with the eighth stratum (107.88–113.36) representing the highest sex ratio interval and the first stratum (97.25–98.77) representing the lowest. Statistical analysis showed that the highest sex ratio stratum (8th) exhibited significant differences in gonorrhea incidence compared to the first four low sex ratio strata (1st–4th).

Table 5
www.frontiersin.org

Table 5. Statistical significance of pairwise comparisons in mean gonorrhea incidence rates across different strata of the sex ratio factor where y denotes a significant difference and n denotes no significant difference.

Further analysis demonstrated that the risk detector could quantify the quantitative relationship between influencing factors and gonorrhea incidence. As shown in Table 6, each factor had specific risk intervals corresponding to peak incidence levels (33). Epidemiologically, these high-risk intervals can be defined as the primary influence domains of each factor (34). Based on spatial analysis techniques, this study visualized these quantitative relationships in Figure 4.

Table 6
www.frontiersin.org

Table 6. Identified high-risk intervals for key factors and their corresponding maximum gonorrhea incidence rates based on the risk detector analysis.

Figure 4
Four maps of China depicting different metrics with areas marked in red for primary impact: (A) Number of licensed physicians per 1,000 population; (B) Hospital bed utilization rate; (C) Sex ratio; (D) PM2.5 concentrations. Non-primary impact areas are hatched, and unstudiable areas are shown in white. Each map includes a scale bar and compass for orientation. The inset in each map shows the South China Sea islands.

Figure 4. Geographic distribution of the primary impact areas for the key identified risk factors influencing gonorrhea incidence. (A) Number of licensed physicians per 1,000 population, (B) Hospital bed utilization rate, (C) Sex ratio, (D) PM2.5 concentration.

3.4.4 Ecological detector results

This study employed the ecological detector method to conduct pairwise comparative analysis of key factors influencing gonorrhea incidence. As shown in Table 7, statistical results indicated that only two pairs of factors exhibited statistically significant differences in PD values: sex ratio vs. physicians per 1,000 population (p < 0.05, marked as “Y”) and sex ratio vs. PM2.5 concentration (p < 0.05, marked as “Y”). The differences between other factor pairs were not statistically significant (marked as “N”). To further investigate the interaction effects among influencing factors and their synergistic mechanisms on gonorrhea incidence subsequent analysis will employ the interaction detector for in-depth exploration.

Table 7
www.frontiersin.org

Table 7. Significance testing for differences in the power of determinism values between pairs of influential risk factors where y indicates a statistically significant difference and n indicates no significant difference.

3.4.5 Interaction detector results

The interaction detector analysis revealed significant synergistic effects among influencing factors. As shown in Table 8, the interactive PD values for all pairwise factor combinations were substantially higher than the independent effects of single factors (PD > 0.6). Notably, the interaction between bed utilization rate and sex ratio showed the most pronounced effect (PD = 0.96). Of particular interest was the unique nonlinear enhancement characteristic observed in the interaction between number of licensed physicians per 1,000 population and PM2.5, while other variable combinations exhibited typical bivariate enhancement effects. This finding provides critical insights for understanding the multifactorial synergistic mechanisms underlying gonorrhea incidence.

Table 8
www.frontiersin.org

Table 8. Interaction effects between pairs of factors on gonorrhea incidence rate demonstrating the magnitude of their combined influence.

4 Discussion

The analysis reveals that gonorrhea incidence in China exhibits significant spatial clustering, with distinct high-high clusters concentrated in southern coastal provinces and low-low clusters in northern regions. Both MLR and SEM identify sex ratio, bed utilization rate, illiteracy rate, and PM2.5 concentration as significant factors, with SEM demonstrating superior performance. Geodetector results further quantify these relationships, showing sex ratio as the strongest individual determinant and revealing particularly powerful interactive effects, especially between bed utilization rate and sex ratio.

These patterns carry substantial implications for public health planning and intervention strategies. The introduction of the SEM significantly improved both the significance of latent factors and the model goodness-of-fit, confirming the superiority of SEM in addressing spatial issues. Compared to MLR regression, SEM effectively controls for spatial dependency bias and provides more accurate estimations and statistical inferences (26, 35). Spatial analysis revealed a significant negative spatial spillover effect in gonorrhea incidence (λ = −0.982, p < 0.001), indicating that a 1% increase in incidence in neighboring regions may lead to a 0.98% decrease in the local area. This inverse dependency may be attributed to a “siphon effect” caused by concentrated medical resources, which could render traditional joint prevention strategies less effective and thus call for dynamic monitoring and tailored resource allocation.

Notably, compared to SEM, the Geodetector effectively identified the independent influence of four key factors on gonorrhea incidence. By quantifying the driving intensity of each factor through PD-values, Geodetector circumvents the issue of multicollinearity among variables. This characteristic grants Geodetector a unique advantage in analyzing spatially heterogeneous influencing factors, particularly in handling complex spatial data while avoiding the limitations of traditional spatial regression models.

Both MLR and SEM consistently demonstrated a positive association between sex ratio and gonorrhea incidence, with factor detector analysis further identifying sex ratio as exhibiting the highest PD value. This observed relationship may be attributed to two primary factors: males are more likely to develop symptomatic infections that prompt healthcare-seeking behavior (36), and male populations (particularly men who have sex with men) generally exhibit higher prevalence of high-risk sexual behaviors (37, 38). The identification of sex ratio as the dominant driver, coupled with its specific high-risk intervals, highlights the need for targeted interventions in regions with imbalanced sex ratios, potentially focusing on male-specific health education and screening programs.

Notably, the number of physicians per 1,000 population showed no significant association in spatial regression models and yielded the lowest PD value in factor detection analysis, while bed utilization rate demonstrated relatively stronger explanatory power. This discrepancy suggests that bed utilization rate may serve as a more comprehensive indicator reflecting both healthcare facility workload and population health status (39), whereas physician density primarily represents basic healthcare coverage (40). The significant spatial dependence in gonorrhea distribution patterns, particularly evident in high-high clustering regions along the southeastern coast, likely stems from cross-regional healthcare-seeking behavior among gonorrhea patients, leading to case concentration in areas with abundant medical resources. Consequently, this phenomenon results in a decoupling effect between local physician availability and reported incidence rates. These findings carry significant implications for public health planning, suggesting that simply increasing the number of physicians may be insufficient. Instead, health authorities should prioritize enhancing the capacity and efficiency of existing healthcare facilities, as reflected by bed utilization, and establish regional referral networks to manage cross-border patient flows effectively.

This study presents the first evidence of a significant negative correlation between PM2.5 exposure and gonorrhea incidence. The novel finding may be mediated through two plausible mechanisms: first, as proposed in previous literature, air pollution may reduce outdoor activities and social contact opportunities, thereby potentially decreasing sexual transmission risks (41), while pollution-induced respiratory discomfort could further suppress intimate contact behaviors (42). Additionally, other pathways should be considered, such as potential confounding by regional socioeconomic characteristics or healthcare-seeking patterns that co-vary with pollution levels. Notably, industrial areas with severe PM2.5 pollution typically exhibit unique population mobility patterns (43) that could influence case reporting accuracy. While the observed association requires further validation, this discovery not only expands the research paradigm regarding environmental factors in sexually transmitted diseases, but more importantly highlights the necessity to consider atmospheric pollution’s potential regulatory effects on human behavioral patterns in public health research. From a policy perspective, this unexpected association underscores the complexity of disease drivers and suggests that integrated health policies, which consider environmental, social, and behavioral factors simultaneously, may be more effective than siloed approaches.

The observed inverse relationship between illiteracy rate and gonorrhea incidence may stem from data biases, confounding factors, distinctive social structures, or alternative public health interventions. This phenomenon generally reflects context-specific indirect associations that require comprehensive evaluation incorporating cultural, policy, and data quality considerations. Furthermore, all statistical models consistently demonstrated non-significant effects of urbanization rate on gonorrhea incidence distribution (p > 0.05), suggesting its potential pathways might be obscured by other socioeconomic factors or require more granular data for proper validation.

The interaction detector analysis revealed significant synergistic enhancement effects among influencing factors, demonstrating systematically higher PD values for all interaction terms compared to individual factors. Notably, the interaction between bed utilization rate and sex ratio exhibited the strongest synergistic effect. This phenomenon may stem from a positive feedback mechanism between gender-related biological susceptibility (higher in males) and healthcare system pressure (characterized by bed utilization rate): in regions with strained medical resources, delayed diagnosis and treatment in male patients may lead to prolonged infectious periods, thereby accelerating transmission through sexual networks (44). This finding has direct implications for public health practice. It argues against one-size-fits-all interventions and instead supports the development of integrated strategies that concurrently address demographic risks (e.g., through targeted male screening) and strengthen healthcare system capacity (e.g., by reducing diagnostic and treatment delays in high-pressure facilities), particularly in identified high-risk clusters.

This study innovatively integrates spatial regression models with Geodetector methods to systematically elucidate the spatial distribution patterns and driving mechanisms of gonorrhea incidence in China. The approach not only quantifies risk thresholds for key influencing factors but also, for the first time, reveals multifactorial synergistic effects. This coupled analytical framework of multiple models provides methodological innovation for deciphering multiscale mechanisms underlying complex public health issues.

This study has several limitations that should be acknowledged. First, at the data level, the unavailability of certain potential influencing variables may introduce omitted-variable bias into the models. Specifically, key behavioral and demographic factors such as high-risk sexual behavior frequency and migrant population size, which are known to influence gonorrhea transmission dynamics, were not included in our analysis due to data unavailability. The absence of these variables may limit the explanatory power of our current models. More importantly, a key limitation lies in the analytical scale. Our analysis relied on provincial-level data from 31 administrative units in mainland China. While this provides a broad national overview, it inevitably obscures the finer-scale spatial heterogeneity that exists at the prefectural or county levels. These factors could potentially affect the generalizability of the research findings. Therefore, future studies are advised to utilize finer-grained data (e.g., at the prefectural or county scale) to better capture local variations and enhance the model’s explanatory power.

5 Conclusion

In China, Gonorrhea incidence distribution exhibits significant spatial heterogeneity and autocorrelation, with the SEM demonstrating superior performance over MLR in controlling spatial dependence. The study identifies sex ratio as a core determinant of gonorrhea patterns, with male-dominated regions showing significantly higher incidence rates. Notably, bed utilization rate emerges as a stronger predictor of disease risk than physician density, underscoring the need for optimized allocation of specialized medical resources and coordinated regional prevention strategies.

A negative association was observed between PM2.5 exposure levels and gonorrhea incidence, suggesting environmental factors may play an underrecognized role in sexually transmitted disease transmission dynamics. The interaction effects among influencing factors significantly amplified the impact of individual factors, with the synergistic effect between bed utilization rate and sex ratio being particularly prominent. This highlights the critical importance of coordinated planning between demographic structure and healthcare resource allocation in public health policy formulation. These findings provide empirical support for developing spatially adaptive intervention frameworks that account for both sociodemographic drivers and environmental contexts in gonorrhea prevention.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

KH: Writing – original draft, Conceptualization. XY: Writing – original draft, Conceptualization. YC: Writing – original draft, Conceptualization. CL: Writing – original draft, Visualization. XZ: Methodology, Writing – original draft. DX: Writing – original draft, Validation. MY: Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

We thank the National Bureau of Statistics and Department of Ecology and Environment of various regions of China for providing us with data.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2025.1698923/full#supplementary-material

Footnotes

References

1. Zhu, X, Xi, Y, Gong, X, and Chen, S. Ceftriaxone-resistant Gonorrhea - China, 2022. MMWR Morb Mortal Wkly Rep. (2024) 73:255–9. doi: 10.15585/mmwr.mm7312a2,

PubMed Abstract | Crossref Full Text | Google Scholar

2. Zeng, F, Shen, Y, Du, N, Wu, L, Fei, L, Wang, Y, et al. Spatiotemporal epidemiology of the gonorrhea epidemic in relation to neighborhood-level structural factors in an eastern province of China, 2016-2020. Altern Ther Health Med. (2023) 29:350–7.

PubMed Abstract | Google Scholar

3. Zhou, J, Peng, R, Feng, J, Kang, J, Deng, M, Meng, S, et al. Equity of intensive care unit bed resource allocation in ethnic minority areas of China-an empirical analysis based on geographical and population dimensions. BMJ Open. (2024) 14:e083203. doi: 10.1136/bmjopen-2023-083203,

PubMed Abstract | Crossref Full Text | Google Scholar

4. Shi, B, Wang, Y, Bai, X, Lai, Y, Xiang, W, Wu, B, et al. Spatial effects of public health laboratory emergency testing institutions under COVID-19 in China. Int J Equity Health. (2023) 22:88. doi: 10.1186/s12939-023-01871-0,

PubMed Abstract | Crossref Full Text | Google Scholar

5. Chen, X, Chen, S, Li, C, Shi, L, Zhu, Y, and Yao, Y. Analysis and prediction of the incidence and prevalence trends of gonorrhea in China. Hum Vaccin Immunother. (2023) 19:2256907. doi: 10.1080/21645515.2023.2256907,

PubMed Abstract | Crossref Full Text | Google Scholar

6. Sulinskaite, K, Zagurskiene, D, and Blazeviciene, A. Patients' health literacy and health behaviour assessment in primary health care: evidence from a cross-sectional survey. BMC Prim Care. (2022) 23:223. doi: 10.1186/s12875-022-01809-5,

PubMed Abstract | Crossref Full Text | Google Scholar

7. Marshall, A, Mackson, G, Kaighobadi, F, Hoffman, ND, Ciarleglio, A, and Sandfort, TGM. Health literacy among sexual and gender-diverse adolescents in new York City. Health Educ Behav. (2025) 52:289–98. doi: 10.1177/10901981241307616,

PubMed Abstract | Crossref Full Text | Google Scholar

8. Xu, W, Li, H, Zhao, P, Wang, J, Liang, P, and Wang, C. Trends of chlamydia and gonorrhea infections by anatomic sites among men who have sex with men in South China: a surveillance analysis from 2018 to 2022. BMC Public Health. (2024) 24:2484. doi: 10.1186/s12889-024-19987-5,

PubMed Abstract | Crossref Full Text | Google Scholar

9. Pollock, ED, Clay, PA, Kreisel, KM, and Spicknall, IH. Estimated incidence and prevalence of Gonorrhea in the United States, 2006-2019. Sex Transm Dis. (2023) 50:188–95. doi: 10.1097/olq.0000000000001763,

PubMed Abstract | Crossref Full Text | Google Scholar

10. Tao, G, and Gift, TL. High volume and high positivity of Chlamydia and Gonorrhea tests by anatomic site from a large National Laboratory in the United States, 2019-2023. Sex Transm Dis. (2025) 52:523–8. doi: 10.1097/olq.0000000000002165,

PubMed Abstract | Crossref Full Text | Google Scholar

11. Liu, K, Xue, Y, Chen, Z, and Miao, Y. The spatiotemporal evolution and influencing factors of urban green innovation in China. Sci Total Environ. (2023) 857:159426. doi: 10.1016/j.scitotenv.2022.159426,

PubMed Abstract | Crossref Full Text | Google Scholar

12. Lin, Y, Meng, H, He, Y, Liang, W, Niu, Y, Liu, Z, et al. Short-term effects of air pollution on the infectious disease spectrum in Shanghai, China: a time-series analysis from 2013 to 2019. Front Public Health. (2025) 13:1454809. doi: 10.3389/fpubh.2025.1454809,

PubMed Abstract | Crossref Full Text | Google Scholar

13. Singer, M. Is pollution the primary driver of infectious Syndemics? Pathogens. (2024) 13:370. doi: 10.3390/pathogens13050370,

PubMed Abstract | Crossref Full Text | Google Scholar

14. Lyu, Y, Wang, W, Wu, Y, and Zhang, J. How does digital economy affect green total factor productivity? Evidence from China. Sci Total Environ. (2023) 857:159428. doi: 10.1016/j.scitotenv.2022.159428,

PubMed Abstract | Crossref Full Text | Google Scholar

15. Sheikh, MR, Khan, SU, Ahmed, M, Ahmad, R, Abbas, A, and Ullah, I. Spatial spillover impact of determinants on child mortality in Pakistan: evidence from spatial Durbin model. BMC Public Health. (2023) 23:1612. doi: 10.1186/s12889-023-16526-6,

PubMed Abstract | Crossref Full Text | Google Scholar

16. Deng, W, and Zhang, Z. Environmental regulation intensity, green finance, and environmental sustainability: empirical evidence from China based on spatial metrology. Environ Sci Pollut Res Int. (2023) 30:66228–53. doi: 10.1007/s11356-023-26946-x,

PubMed Abstract | Crossref Full Text | Google Scholar

17. Zhang, S, Wang, X, Li, H, and Wang, H. The geographical pension gap: understanding the causes of inequality in China's pension funds. PLoS One. (2023) 18:e0288754. doi: 10.1371/journal.pone.0288754,

PubMed Abstract | Crossref Full Text | Google Scholar

18. Li, W, Du, FJ, and Ruan, O. Analysis of spatial pattern and influencing factors of private clinics in the main urban area of Guiyang in China from 2021 to 2022 based on multi-source data. Arch Public Health. (2023) 81:52. doi: 10.1186/s13690-023-01068-5,

PubMed Abstract | Crossref Full Text | Google Scholar

19. Wu, Y, Hu, K, Han, Y, Sheng, Q, and Fang, Y. Spatial characteristics of life expectancy and geographical detection of its influencing factors in China. Int J Environ Res Public Health. (2020) 17:17. doi: 10.3390/ijerph17030906,

PubMed Abstract | Crossref Full Text | Google Scholar

20. Duque, AM, Peixoto, MV, Lima, S, Goes, MAO, Santos, AD, Araujo, K, et al. Analysis of the relationship between life expectancy and social determinants in a north-eastern region of Brazil, 2010-2017. Geospat Health. (2018) 13:345–52. doi: 10.4081/gh.2018.702

Crossref Full Text | Google Scholar

21. Wang, S, and Luo, K. Life expectancy impacts due to heating energy utilization in China: distribution, relations, and policy implications. Sci Total Environ. (2018) 610-611:1047–56. doi: 10.1016/j.scitotenv.2017.08.195,

PubMed Abstract | Crossref Full Text | Google Scholar

22. Hu, K, Zhang, X, Yang, X, and Yu, M. A study on the spatial distribution of life expectancy and its air pollution factors in China based on geographically weighted regression. Front Public Health. (2025) 13:1565744. doi: 10.3389/fpubh.2025.1565744,

PubMed Abstract | Crossref Full Text | Google Scholar

23. Sridharan, S, Tunstall, H, Lawder, R, and Mitchell, R. An exploratory spatial data analysis approach to understanding the relationship between deprivation and mortality in Scotland. Soc Sci Med. (2007) 65:1942–52. doi: 10.1016/j.socscimed.2007.05.052,

PubMed Abstract | Crossref Full Text | Google Scholar

24. Ellsworth, SG, van Rossum, PSN, Mohan, R, Lin, SH, Grassberger, C, and Hobbs, B. Declarations of Independence: how embedded multicollinearity errors affect Dosimetric and other complex analyses in radiation oncology. Int J Radiat Oncol Biol Phys. (2023) 117:1054–62. doi: 10.1016/j.ijrobp.2023.06.015,

PubMed Abstract | Crossref Full Text | Google Scholar

25. Taylor-Haas, JA, Garcia, MC, Rauh, MJ, Peel, S, Paterno, MV, Bazett-Jones, DM, et al. Cadence in youth long-distance runners is predicted by leg length and running speed. Gait Posture. (2022) 98:266–70. doi: 10.1016/j.gaitpost.2022.09.085,

PubMed Abstract | Crossref Full Text | Google Scholar

26. Suerungruang, S, Sornlorm, K, Laohasiriwong, W, and Mahato, RK. Spatial association and modelling of under-5 mortality in Thailand, 2020. Geospat Health. (2023) 18. doi: 10.4081/gh.2023.1220

Crossref Full Text | Google Scholar

27. Khedhiri, S. A spatiotemporal analysis of the COVID-19 pandemic in North Africa. Geohealth. (2022) 6:e2022GH000630. doi: 10.1029/2022gh000630,

PubMed Abstract | Crossref Full Text | Google Scholar

28. Wang, JF, Xin-Hu, L, George, C, Yi-Lan, L, Tin, Z, Xue, G, et al. Geographical detectors-based health risk assessment and its application in the neural tube defects study of the Heshun region, China. Int J Geogr Inf Sci. (2010) 24:107–27. doi: 10.1080/13658810802443457

Crossref Full Text | Google Scholar

29. Wang, J-F, Zhang, T-L, and Fu, B-J. A measure of spatial stratified heterogeneity. Ecol Indic. (2016) 67:250–6. doi: 10.1016/j.ecolind.2016.02.052

Crossref Full Text | Google Scholar

30. Wang, J, Robert, H, Tonglin, Z, Chengdong, X, Maogui, H, Qian, Y, et al. Statistical modeling of spatially stratified heterogeneous data. Ann Assoc Am Geogr. (2024) 114:499–519. doi: 10.1080/24694452.2023.2289982

Crossref Full Text | Google Scholar

31. Wang, JF, Wang, Y, Zhang, J, Christakos, G, Sun, JL, Liu, X, et al. Spatiotemporal transmission and determinants of typhoid and paratyphoid fever in Hongta district, Yunnan province, China. PLoS Negl Trop Dis. (2013) 7:e2112. doi: 10.1371/journal.pntd.0002112,

PubMed Abstract | Crossref Full Text | Google Scholar

32. Cao, F, Yong, G, and Wang, J-F. Optimal discretization for geographical detectors-based risk assessment. GISci Remote Sens. (2013) 50:78–92. doi: 10.1080/15481603.2013.778562

Crossref Full Text | Google Scholar

33. Zha, X, Tian, Y, Gao, X, Wang, W, and Yu, C. Quantitatively evaluate the environmental impact factors of the life expectancy in Tibet, China. Environ Geochem Health. (2019) 41:1507–20. doi: 10.1007/s10653-018-0211-z,

PubMed Abstract | Crossref Full Text | Google Scholar

34. Bai, L, Jiang, L, Yang, D-y, and Liu, Y-b. Quantifying the spatial heterogeneity influences of natural and socioeconomic factors and their interactions on air pollution using the geographical detector method: a case study of the Yangtze River Economic Belt, China. J Clean Prod. (2019) 232:692–704. doi: 10.1016/j.jclepro.2019.05.342

Crossref Full Text | Google Scholar

35. Baptista, EA, and Queiroz, BL. Spatial analysis of cardiovascular mortality and associated factors around the world. BMC Public Health. (2022) 22:1556. doi: 10.1186/s12889-022-13955-7,

PubMed Abstract | Crossref Full Text | Google Scholar

36. Aboud, S, Buhalata, SN, Onduru, OG, Chiduo, MG, Kwesigabo, GP, Mshana, SE, et al. High prevalence of sexually transmitted and reproductive tract infections (STI/RTIs) among patients attending STI/outpatient department clinics in Tanzania. Trop Med Infect Dis. (2023) 8:62. doi: 10.3390/tropicalmed8010062,

PubMed Abstract | Crossref Full Text | Google Scholar

37. Tran, NK, Welles, SL, Roy, JA, Brennan, DJ, Chernak, E, and Goldstein, ND. Self-reported PrEP use and risk of bacterial STIs among Ontarian men who are gay or bisexual or have sex with men. Ann Fam Med. (2024) 22:437–43. doi: 10.1370/afm.3152,

PubMed Abstract | Crossref Full Text | Google Scholar

38. Bruzzesi, E, Galli, L, Poli, A, Bossolasco, S, Cernuschi, M, Spagnuolo, V, et al. Prevalence and risk factors of anal HPV infection in MSM living with HIV: identifying the target groups to prioritize for immunization. J Acquir Immune Defic Syndr. (2022) 91:226–31. doi: 10.1097/qai.0000000000003057,

PubMed Abstract | Crossref Full Text | Google Scholar

39. Liu, Y, Mai, L, Huang, F, and Zeng, Z. Regional healthcare resource allocation and decision-making: evaluating the effectiveness of the three-stage super-efficiency DEA model. Heliyon. (2024) 10:e40312. doi: 10.1016/j.heliyon.2024.e40312,

PubMed Abstract | Crossref Full Text | Google Scholar

40. Koh, H, Kwon, S, and Cho, B. Association of Primary Care Physician Supply with population mortality in South Korea: a pooled cross-sectional analysis. Korean J Fam Med. (2024) 45:105–15. doi: 10.4082/kjfm.23.0156,

PubMed Abstract | Crossref Full Text | Google Scholar

41. Wang, JX, and Liu, XQ. Climate change, ambient air pollution, and students' mental health. World J Psychiatry. (2024) 14:204–9. doi: 10.5498/wjp.v14.i2.204,

PubMed Abstract | Crossref Full Text | Google Scholar

42. Yoo, H, and Kim, N. Development of respiratory infection prevention and control self-efficacy scale for adults. J Multidiscip Healthc. (2024) 17:1943–52. doi: 10.2147/jmdh.S456939,

PubMed Abstract | Crossref Full Text | Google Scholar

43. Li, C, Xu, X, Wang, W, Yu, X, Wang, J, Meister, S, et al. The attraction effect of cleaning air on migrants in China: a comparative analysis with the contribution of wage and house prices. iScience. (2025) 28:111532. doi: 10.1016/j.isci.2024.111532,

PubMed Abstract | Crossref Full Text | Google Scholar

44. Traeger, MW, Guy, R, Taunton, C, Chow, EPF, Asselin, J, Carter, A, et al. Syphilis testing, incidence, and reinfection among gay and bisexual men in Australia over a decade spanning HIV PrEP implementation: an analysis of surveillance data from 2012 to 2022. Lancet Reg Health West Pac. (2024) 51:101175. doi: 10.1016/j.lanwpc.2024.101175,

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: gonorrhea incidence, spatial autocorrelation, multiple linear regression, spatial error model, Geodetector

Citation: Hu K, Yang X, Cai Y, Li C, Zhang X, Xiao D and Yu M (2025) Spatial patterns and socio-environmental determinants of gonorrhea incidence in China. Front. Public Health. 13:1698923. doi: 10.3389/fpubh.2025.1698923

Received: 04 September 2025; Revised: 10 November 2025; Accepted: 17 November 2025;
Published: 09 December 2025.

Edited by:

Sarafa Iyaniwura, Fred Hutchinson Cancer Center, United States

Reviewed by:

Hoiman Ng, Kiang Wu Hospital, Macao SAR, China
Shuang Ma, Shenyang Ligong University, China

Copyright © 2025 Hu, Yang, Cai, Li, Zhang, Xiao and Yu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mingyang Yu, MjI5Njk5MTE0MEBxcS5jb20=

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.