Your new experience awaits. Try the new design now and help us make it even better

CORRECTION article

Front. Environ. Sci.

Sec. Land Use Dynamics

Revealing Urban–Rural Sustainability Dynamics Based on Multi-Source Data and Machine Learning: Spatio-Temporal Hotspots, Driving Mechanisms, and Non-Linear Thresholds in Suzhou, China

Provisionally accepted
Wenquan  GanWenquan Gan1,2*Haoqi  WangHaoqi Wang3Pengcheng  LiPengcheng Li4Jiana  LiuJiana Liu5*Jinliu  ChenJinliu Chen5
  • 1School of Art and Design, Zhejiang Sci-Tech University, Hangzhou, China
  • 2Zhejiang Sci-Tech University, Hangzhou, China
  • 3The University of Hong Kong, Hong Kong, China
  • 4Suzhou University of Science and Technology, Suzhou, China
  • 5Suzhou City University, Suzhou, China

The final, formatted version of the article will be published soon.

Urbanisation is one of the primary drivers of land-use transformation in contemporary society, reshaping both urban and rural landscapes throughout the 20th century (Nuissl and Siedentop, 2021). Although urbanisation drives economic growth and infrastructure advancement, it simultaneously leads to extensive transitions from rural to urban land uses, inevitably posing profound challenges to the sustainable development of land-use systems, particularly in peri-urban areas (Fu et al., 2021). These transitional zones, situated between urban and rural systems, integrate land, ecological resources, and human activities related to production and living. They are critical components of regional development (Zhang et al., 2020) and have significant impacts on human health and well-being (Barton, 2009). In the Chinese context, these processes unfold against a longstanding institutional separation between urban and rural areas, shaped by the household registration (hukou) system, differentiated land-use practices, economic structures and socio-cultural characteristics (Li, 2012).Rapid urbanisation has accelerated land-use transformations in peri-urban areas, shifting landscapes from primarily agricultural functions toward residential, industrial, and commercial uses. Development models focused on economic growth, to some extent, overlook the land-use conflicts resulting from such expansion and exacerbate their negative impacts (He et al., 2020).Existing studies on urban-rural land-use transitions can be broadly categorised into two perspectives. The first perspective examines how land-use change in peri-urban areas reshapes economic, ecological and social outcomes. Empirical work has assessed the efficiency and effectiveness of land-use transformation (Fu et al., 2021), the ecological consequences of land-use changes and land-use intensity (Wu et al., 2024;Xu et al., 2020), and broader sustainability implications, including environmental degradation, habitat fragmentation and the loss of cultural landscape identity (Shao et al., 2020;Xu et al., 2020). International research further highlights that unsustainable land-use transformations in rural areas during urbanisation can lead to displacement (Salem et al., 2020), infrastructure deficits, and widening socioeconomic disparities (Dadashpoor and Ahani, 2019). Taken together, this perspective demonstrates that peri-urban land-use transitions are crucial to the sustainability of urban-rural regions and have a direct impact on human well-being. However, most studies address economic, ecological and social dimensions separately or treat them as loosely connected outcome indicators, paying limited attention to how these dimensions interact and co-evolve over time. As a result, the coupled dynamics of "coordination-conflict" between land-use transitions and multi-dimensional sustainability outcomes remain insufficiently understood, particularly in rapidly urbanising contexts such as China.A second perspective focuses on governance arrangements, planning instruments and assessment frameworks for managing land-use change in urban-rural interfaces. Scholars have highlighted that current landuse transformation processes are often governed by fragmented and overlapping institutional regimes, leading to overlapping jurisdictions, inefficient decision-making and weak coordination across administrative boundaries (Perrin et al., 2018). Additionally, Follmann et al. (2021) and Ahani and Dadashpoor (2021) note that existing peri-urban land-use assessment and management frameworks fail to address the dynamic realities of urbanisation, creating loopholes for informal land speculation, unauthorised urban expansion, and environmental degradation. Although this line of research has significantly advanced the understanding of governance deficits and institutional complexity, it tends to emphasise qualitative institutional analysis or singledimensional indicators. However, the foundation of sustainable development lies in the coordination between economic, social, and environmental factors, with the core being the balance between ecological considerations and other dimensions (Gan et al., 2020;Li et al., 2022). There is a lack of attention to operational, quantitative frameworks that can systematically link land-use structures to economic vitality, ecological quality, and social well-being, and that can identify when and how urban-rural systems shift from coordinated development to conflictual outcomes (Buzási and Csete, 2017;Keith et al., 2023). Consequently, despite a substantial body of work on land-use transformation, ecological impacts, planning and control mechanisms, a comprehensive evaluation framework that integrates these dimensions to support sustainable development remains lacking (Salem et al., 2025).In addition to these theoretical gaps, there are notable limitations in the methodological approaches employed in the existing research. Currently, three categories of quantitative approaches are commonly used to evaluate urban-rural sustainability and land-use dynamics. The first category comprises statistical and spatial econometric models, including fixed-effects and random-effects models, as well as spatial panel specifications.The Spatial Durbin Model (SDM) has been widely used to identify spatial spillover effects and the impacts of neighbouring areas, serving as a key tool for evaluating the efficiency and driving factors of land use structure. (Luge et al., 2025). These models offer clear parameter interpretations and a solid theoretical foundation for analysing directional relationships and spillover effects. However, their reliance on linearity and global stationarity limits their capacity to capture non-linear interactions, high-order effects and complex spatial heterogeneity, and model stability can be compromised when the underlying processes are strongly nonstationary.The second category centres on geographically weighted regression (GWR) and its extensions, such as multi-scale regression methods (MGWR), which are used to address spatial non-stationarity by allowing coefficient estimates to vary across space (Wu et al., 2025). These approaches are effective for uncovering localised spatial mechanisms and scale-dependent relationships and have been used to explore the spatial heterogeneity of land-use drivers in urban-rural regions. Yet, they are less suited to handling high-dimensional variables and intricate interaction structures, and their performance often depends on subjective choices of bandwidths and kernel functions. As noted by Lessani and Li (2024), many extensions of GWR still have these limitations and thus only partially resolve the challenges of modelling complex, multi-dimensional land systems.The third category encompasses predictive models based on remote sensing and machine learning, including random forests, support vector machines, XGBoost, and deep learning. These models have demonstrated strong capabilities in extracting complex spatiotemporal patterns from large-scale remote sensing and multi-source big data, and can achieve high predictive accuracy in land-use classification, change detection and risk assessment. However, they often suffer from limited interpretability and weak connections to established theoretical frameworks in land-use and sustainability research (Argota Sánchez-Vaquerizo and Helbing, 2025). Recent advances in explainable artificial intelligence (XAI), including SHAP and partial dependence plots (PDPs), have enhanced the interpretability of such models by quantifying global and local feature contributions and interactions, leading to composite frameworks such as XGBoost-SHAP. These approaches have helped to bridge, to some extent, the gap between "spatial processes" and "black-box predictions" (Alvarez-Garcia et al., 2025;Cui et al., 2025). Nonetheless, their application in urban-rural sustainability assessment remains fragmented, and XAI tools have rarely been embedded within systematic evaluation frameworks that integrate economic, ecological and social dimensions. Overall, existing approaches either prioritise statistical inference under restrictive assumptions or emphasise predictive performance at the expense of interpretability, with few attempts to integrate their respective strengths into a coherent analytical workflow.Building on the above review, it is clear that existing research has made significant contributions to understanding the spatial patterns, structures, and efficiency of urban-rural land use, as well as the governance challenges of peri-urban transformation. However, several key gaps remain. First, at the conceptual and analytical levels, there is a lack of systematic exploration of the coupling mechanisms between land-use transitions and their impacts on economic vitality, ecological quality, and social well-being. This results in an absence of a quantitative basis for evaluating the dynamic transformation processes of "coordination-conflict" outcomes in urban-rural regions, and for identifying when development pathways remain sustainable or become problematic. Second, the non-linear threshold relationships and interaction effects among urban-rural land-use factors have not been clearly identified, making it difficult to determine how different land use structures and combinations influence sustainability outcomes, and under what critical conditions marginal changes in landuse configurations can trigger regime shifts. Third, at the methodological level, the integration of multi-source spatial data and machine learning models into urban-rural sustainability research frameworks remains underdeveloped. While single methods have been fruitfully applied, there is a lack of integrated workflows that can simultaneously capture multi-dimensional coupling, spatial heterogeneity and non-linear threshold effects, and translate these insights into policy-relevant indicators for sustainable land-use governance.In response to these gaps and the associated practical needs, this study addresses the following core research question: How do land-use transitions in evolving urban-rural systems interact and co-evolve across the economic, ecological and social dimensions, and under what critical conditions do these processes shift from coordination to conflict? To answer this question, three interrelated research objectives are proposed: (1) To develop an Urban-Rural Sustainability Index (URSI) that integrates economic, ecological, and social dimensions, thereby quantifying the multi-dimensional coupling characteristics of urban-rural sustainability; (2) To employ the CRITIC method and the Coupling Coordination Model (CCM) to characterise the spatial coordination mechanisms between human activity intensity and ecological sensitivity, and to identify the linear effects of different land use types and structures on urban-rural sustainability; (3) To develop a quantitative analytical framework for urban-rural sustainability centred on "multi-dimensional coupling-nonlinear driversthreshold identification". Building on the interpretability of traditional panel models, the framework incorporates a Random Effects Model (REM) together with an XGBoost-SHAP approach to form a composite analytical pathway that combines statistical explanation, non-linear pattern recognition and segmented effect profiling.Correspondingly, the study makes three main contributions. First, at the theoretical level, it presents an analytical framework that reveals the mechanisms of "multi-dimensional coupling and non-linear segmentation" within urban-rural systems. This framework extends existing discussions beyond single-dimensional linear effects to include cross-dimensional coupling and critical threshold mechanisms. Second, at the methodological level, it establishes a quantitative workflow that links statistical models (panel and random effects models, CCM) with machine learning models (XGBoost and SHAP), offering an operational technical route for identifying nonlinear relationships and threshold effects in complex land systems. Third, at the practical level, using Suzhou as an empirical case, the study identifies policy-relevant land use thresholds and key driving mechanisms. It provides measurable and testable evidence that supports spatial optimisation and sustainable governance in rapidly urbanising regions, with potential applications to other urban-rural contexts. Urban-rural land-use transformation is both a consequence of urbanisation and a critical driver of urban and rural development (Somanje et al., 2020). In this process, a dynamic system emerges as people, capital, resources, and land shift between urban and rural areas, with these factors continuously interacting over space and time (Zhao and Wan, 2021;Yan et al., 2018;He et al., 2017). Historically, this transformation, driven by rural labour migration, profit-seeking capital, and technology diffusion, has often increased urban-rural disparities by skewing the allocation of resources toward cities (Yang et al., 2021). Baffoe et al. (2021) argued that sustainable urban-rural land use should instead optimise the spatial distribution of capital, labour, technology, and industry, fostering balanced development on both sides of the urban-rural spectrum. Achieving this balance requires maintaining a dynamic equilibrium across economic, ecological, and social dimensions, thereby narrowing long-standing gaps between urban and rural areas.To monitor progress toward this goal, it is necessary to develop a comprehensive framework for assessing the sustainability of land use in integrated urban and rural contexts. In this regard, the Urban-Rural Sustainability Index (URSI) serves as a crucial tool for assessing the sustainability of regional land-use patterns. The construction of the URSI in this study is based on the human-land system coupling and coordination theory (Long, 2022) and the three-dimensional framework of sustainable development (UN., 2015). This theory posits that sustainable development in urban and rural systems is the result of a dynamic balance between economic vitality, ecological carrying capacity, and social well-being, and should be understood as a continuously adjusting and evolving process (Yang et al., 2021). These three dimensions interact and constrain each other, collectively determining the level of coordination and sustainability in regional development. Economic vitality drives growth, ecological capacity ensures sustainable resource use, and social well-being ensures quality of life and social equity. Only through the coordination of these three components can sustainable urban-rural integration be achieved (Liu et al., 2020).Building upon this theoretical foundation, this study defines the logical path for the URSI evaluation as follows: from the Sustainable Development Goals to System Dimensions and Indicator Factors. The sustainable development goal in this context is to achieve efficient resource allocation, ecological security, and improvements in living standards within the framework of urban-rural integration. This goal focuses not only on economic development but also on the balanced advancement of ecological protection and social equity.Therefore, based on the urban-rural sustainable development goal, the index is divided into three dimensions: economic, ecological, and social. The economic dimension reflects the development dynamics and competitiveness of urban and rural areas, the ecological dimension focuses on environmental quality, resource utilisation efficiency, and environmental resilience, while the social dimension measures social welfare, the accessibility of public services, and social equity (Liu, 2018;Long and Tu, 2018).Therefore, from a measurable perspective, this study selects 17 multi-source data indicators that reflect the characteristics of the three dimensions to assess the URSI. The selection of indicators is guided by three principles. First, the theoretical relevance indicates that each indicator corresponds to a specific dimension and goal within the sustainable development framework. Second, regarding data availability, the indicators are based on publicly accessible or remotely sensed multi-source data, such as Points of Interest (POI), nightlight imagery, and Normalised Difference Vegetation Index (NDVI). This approach ensures repeatability and spatiotemporal comparability. Third, to address non-redundancy, the study employs the CRITIC method to evaluate the variability and correlation of the indicators. This ensures that each indicator in the system is independent and contributes valuable information. As shown in Table 1, each indicator is linked to economic, ecological or social dimensions, and to the overarching goal of urban-rural sustainable integration. The economic dimension of the URSI assesses the capacity for regional economic development and its sustainability, focusing on the transitions in land use between urban and rural areas. It is operationalised through four target layers, including economic vitality, industrial agglomeration, innovation and production, business mobility and the nighttime economy, a set of seven indicators (A1-A7). Economic vitality is measured by the presence of retail facilities, financial institutions, and food and beverage outlets. Existing studies highlight that retail facilities such as shopping malls, supermarkets and convenience stores reflect the vibrancy of urban and rural consumer markets and their responsiveness to demand for goods and services (Zhou and Li, 2020).Similarly, restaurants, cafés and related food and beverage outlets are indicative of the consumption-driven economy and its potential for growth (Zikirya et al., 2021). Financial institutions, such as banks, securities firms, insurance companies, and ATMs, represent the economic services infrastructure and financial accessibility, which are crucial for economic integration between urban and rural areas (Ma et al., 2020). Industrial agglomeration, innovation and production are captured through the distribution of industrial parks and office premises, encompassing high-tech industrial zones, development zones, corporate headquarters, and office buildings. These indicators jointly reflect the degree of industrial concentration, innovation capacity and production sustainability, as well as the extent of economic diversification and the ability of a region to adapt to evolving industrial structures (Hu and Han, 2019). Furthermore, accommodation facilities (e.g., hotels, guesthouses) are included as proxies for business mobility and the region's capacity to support economic activities such as tourism and business conferences (Hidalgo, 2024). Finally, the nighttime economy is represented by nighttime facilities, such as night markets, bars, and 24-hour convenience stores, complemented by night-time light data, which provide an additional layer of evidence on economic vibrancy after dark (Shi et al., 2020).The ecological dimension of URSI assesses the sustainability of land use in terms of ecological infrastructure, biodiversity, and environmental governance capacity, using four indicators (B1-B4). Ecological infrastructure is represented by parks, green spaces, and bodies of water. These include urban parks, street greenery, wetland parks, lakes and rivers, which are key indicators for assessing the extent of urban greening and its role in improving air quality, biodiversity, and overall environmental health (Pan et al., 2023;Pan et al., 2024). Protected natural areas, including scenic and forest parks, are vital for maintaining ecological functions and providing ecosystem services like carbon sequestration and water filtration, reflecting the region's biodiversity (Alvarez and Larkin, 2010). Additionally, environmental protection facilities, such as monitoring stations, waste sorting systems, and wastewater treatment plants, reflect the region's ecological governance capacity and its efforts in pollution control and resource recycling. These facilities are integral to maintaining environmental quality in the face of urbanisation and industrialisation (Fu et al., 2021).The social dimension of URSI encompasses three interrelated aspects of social sustainability: residents' quality of life, public governance and social welfare, and the diversity of cultural and community life, represented by six indicators (C1-C6). Quality of life is primarily reflected in the accessibility of basic public service facilities, especially education and healthcare services, which directly affect residents' everyday well-being and health security (Chen et al., 2022;Zhang, 2025). Social welfare facilities (e.g., community service centres, nursing homes and disability service stations) and public service facilities (e.g., government offices, police stations, and post offices) reflect the region's social governance capacity and the depth of welfare provision (Yang et al., 2021). These facilities are core elements of social equity in the process of urban-rural integration (Saraswat et al., 2025). Furthermore, religious and cultural venues (e.g., churches, temples, and mosques) and cultural and recreational venues (e.g., libraries and cultural centres) are treated as key indicators of the diversity of cultural and community life (Ives and Kidewll, 2019;Yang et al., 2021). Instead of directly measuring religiosity, the presence and density of these venues are used as indicators of community organisations and shared spaces for interaction. These places serve as crucial venues for forming and maintaining social connections, as well as for expressing cultural and religious practices. Literature on social sustainability and social capital consistently highlights that these venues contribute to social cohesion, foster a sense of belonging, and promote inclusive community development (Sumi et al., 2025;Maselko et al., 2011). Therefore, they play a vital role in supporting the social foundations necessary for sustainable integration between urban and rural areas.Unlike other studies on the sustainability of urban-rural land use, the URSI proposed in this research encompasses 17 evaluation indicators across the economic, ecological, and social dimensions, which can dynamically reflect the core mechanisms of urban-rural integration to some extent. Furthermore, this new assessment framework can serve as a systematic diagnostic tool to identify the strengths and weaknesses in the sustainability of urban-rural land use. The results derived from this assessment framework systematically reflect the overall elements and structural functions of urban-rural areas, providing policymakers with valuable decision-making support to promote sustainable development. Building on the foregoing review of literature on urban-rural land-use sustainability, this study develops a comprehensive framework that combines economic, ecological and social dimensions to evaluate the Urban-Rural Sustainability Index (URSI). FIGURE 1 Research framework. This study selected Suzhou City in Jiangsu Province, China, as the research case. As of 2023, Suzhou administers six districts (Gusu, Wuzhong, Xiangcheng, Wujiang, High-tech, and Suzhou Industrial Park) and four county-level cities (Changshu, Zhangjiagang, Kunshan, and Taicang), with a permanent population exceeding 10.68 million and an urbanisation rate of 82.48% (Figure 2). Suzhou is a central city in the Yangtze River Delta and one of the earliest regions in China to undergo rapid urbanisation. It has long been characterised by high-intensity factor concentration and spatial expansion (Ma et al., 2007). Since the reform and opening-up, Suzhou has expanded through the integration of towns and the establishment of township development zones, leading to the reconstruction of its urban functions and spatial structure. Agricultural land has been continuously converted into residential, industrial, and service land, resulting in significant changes in vegetation cover and landscape patterns (Zhang et al., 2003). As a key node in China's national urban agglomeration and metropolitan FIGURE 2 Boundaries and administrative divisions of the study area, Suzhou, China.From the perspectives of urbanisation intensity, the complete urban-suburban-rural gradient, and multi- The CRITIC weighting method is an objective weighting approach proposed by Diakoulaki et al. (1995).It quantifies the relative importance of indicators by considering their variability (contrast intensity) and interdependence (conflict). This method is particularly suited for Multi-Criteria Decision Analysis (MCDA), offering a systematic approach to objectively determine indicator weights through the following steps:Step 1: Data StandardisationThe original data matrix □□ = (□□ !" ) #×% (with m samples and n indicators) is standardised using min-max normalisation, defined as follows:For positive indicators (where higher values indicate better performance):□□ □□□□ = □□ □□□□ -□□□□□□(□□ □□ ) □□□□□□+□□ □□ , -□□□□□□(□□ □□ )For negative indicators (where lower values indicate better performance):□□ #$ = □□□□□□+□□ $ , -□□ #$ □□□□□□+□□ $ , -□□□□□□(□□ $ )Step 2: Calculation of Contrast Intensity (Standard Deviation)The contrast intensity reflects the variability of each indicator and is computed as follows:□□ $ = 5 1 □□ 7(□□ #$ -□□̅ $ ) % & #'(Where □□̅ " is the average value of indicator □□.Step 3: Determination of Conflict (Intercriteria Correlation) □□ $ = 7(1 -|□□ $) |) * )'(Step 4: Computation of the Information Content (□□ $ )The information content integrates both the variability and independence of indicators, defined as:□□ $ = □□ $ × □□ $Larger values of □□ " indicate a greater amount of unique information provided by the corresponding indicator.Step 5: Determination of Weights (□□ $ )Finally, indicator weights are computed by normalising the information content:□□ $ = □□ $ ∑ □□ ) * )'( Building upon the URSI assessment framework, two remote sensing data indicators, radiance-calibrated nightlight and Normalised Difference Vegetation Index (NDVI), were incorporated as supplementary measures in the construction of the Human Activity (HA) and Ecology Sensitivity (ES) indices, respectively. The Coupling Coordination Model (CCM) is adopted to measure the interactive relationship between HA and ES.Table 2 shows that the index evaluates HA using economic facilities, economic activities, and social facilities, while ecological facilities and quality are used for assessing ES. □□□□ = □□ □□ × □□□□□□ + □□ □□ × □□□□ + □□ □□ × □□□□ □□□□ = □□ □□ × □□□□□□ + □□ □□ × □□□□The coupling degree □□ between HA and ES is calculated based on physics-derived coupling concepts: The coupling degree □□ ranges between [0,1]. A value approaching 0 indicates imbalance and increasing deviation, whereas a value approaching 1 signifies stronger and improving coordination between the two systems. However, the coupling degree alone cannot effectively distinguish between scenarios of low-level yet highly coupled systems and those of high-level coordination. Therefore, this study employs the CCM indicator, adapted from relevant existing literature (Shen et al., 2023), which is calculated as follows:□□□□□□ = √□□ × □□ □□ = □□□□ + □□□□ To link the Urban-Rural Sustainability Index (URSI) and its coupling coordination measure (CCM) with land-use characteristics in a statistically robust manner, this study constructs a spatial panel dataset at the level of regular grid units. Specifically, the entire study area of Suzhou is divided into 1 km x 1 km grid cells, and for each grid cell, we calculate the URSI/CCM and associated land-use, ecological and socio-economic indicators for each year from 2012 to 2024. Using a grid-based design offers several advantages over traditional administrative units. First, it minimises the modifiable areal unit problem, which can skew data analysis. Second, it enables a more detailed identification of variations within districts and highlights localised conflict hotspots.Lastly, a grid-based approach remains consistent over time, making it essential for effectively analysing dynamic land-use changes.In panel data analysis, to simultaneously account for the dual dimensions of time-series and cross-sectional individual characteristics, this study employs the Random Effects Model (REM) to examine the relationship between the Urban-Rural Sustainability Index and land use characteristics. Unlike the Fixed Effects Model, which assumes that individual effects are fixed constants, the REM posits that individual effects are not fixed but are random variables drawn from a larger population. This assumption enables the model to account for both temporal variation and random individual differences, making it particularly suitable for research settings with large sample sizes or when the goal is to capture variations within and between regions (Wooldridge, 2010).The general form of the Random Effects Model used in this study can be expressed as:□□ □□□□ = □□ + □□□□ □□□□ + □□ □□ + □□ □□□□where, □□ !' represents the dependent variable (the CCM value derived from the URSI framework) for grid unit □□ in year □□; □□ !' is the vector of explanatory variables for grid □□ in year □□, including land-use structure indicators (e.g., proportions of built-up land, farmland and water bodies), ecological, and socio-economic indicators; □□ is the constant term, □□ is the vector of regression coefficients, reflecting the marginal impact of each explanatory variable on the CCM; □□ ! represents the grid-specific random effect, assumed to follow an independent normal distribution □□ □□ ∼ □□(□□, □□ □□ □□ ), capturing time-invariant unobserved heterogeneity across grid units; and □□ !' is the random error term, assumed to follow □□ !' ∼ □□(0, □□ + , ), representing time-varying shocks specific to unit □□ and year □□.The core assumption of the REM is:Cov(□□ □□□□ , □□ □□ ) = □□This implies that the unobserved individual-specific effects are uncorrelated with the explanatory variables.Under this assumption, the REM yields consistent and more efficient estimates than the fixed effects model, and allows the estimation of both within-unit (over time) and between-unit (across grid cells) variation, thereby supporting broader generalisation to similar urban-rural contexts. In practice, the model parameters are estimated using the Generalised Least Squares (GLS), which provides efficient estimates in the presence of heteroskedasticity and serial correlation in panel data. Furthermore, to test the appropriateness of the REM, aHausman test is conducted to compare the results from the Fixed Effects and Random Effects Models, ensuring that the REM is suitable for the data at hand. To systematically quantify the complex, non-linear, and interactive influences of land-use composition on the Urban-Rural Sustainability Index (URSI)-based conflict, this study employed the Extreme Gradient Boosting (XGBoost) modelling in conjunction with SHapley Additive exPlanations (SHAP). Compared to conventional linear models, XGBoost has demonstrated superior predictive performance by effectively capturing higher-order interactions and non-linear threshold effects between variables (Yang et al., 2025;Li and Miao, 2025). This methodological strength makes it well-suited for exploring the complex and multidimensional relationships associated with the research question in this study.XGBoost iteratively constructs an ensemble of decision trees, optimized via a regularised objective function defined as:□□(□□) = 7 □□[□□ □□ , □□ \ □□ (□□) ] + 7 □□(□□ □□ ) □□ □□'□□ □□ □□'□□with the regularisation term defined as:□□(□□ □□ ) = □□□□ + □□ □□ □□||□□|| □□Here, □□(. ) denotes the prediction loss function, □□ ! and □□ > ! (') represent the actual and predicted values, respectively, for the □□ -□□ℎ observation at the □□ -□□ℎ iteration, □□ is the number of leaves, and □□ indicates leaf weights. Regularisation parameters (□□, □□) control model complexity, thereby enhancing generalization performance. By iteratively fitting residuals from previous stages, XGBoost incrementally refines its predictive accuracy.In this study, the input variables consisted of areas from five distinct land-use categories: farmland, forest, water, built-up, and other land. The dependent variable was the CCM results evaluated based URSI framework.Model predictive performance was assessed by the coefficient of determination (R2) and root mean squared error (RMSE).To address the interpretability challenge associated with complex machine learning models, SHAP was utilised. This method applies principles from cooperative game theory to provide a transparent decomposition of each individual prediction into feature-level contributions (Xi et al., 2024). Specifically, the SHAP explanation for the □□ -□□ℎ prediction can be represented as:□□ \ □□ = □□ □□ + 7 □□ □□ (□□) □□ □□'□□Where □□ / represents the baseline model prediction, and □□ " (!) is the marginal contribution (SHAP value) of feature □□ " to the prediction for observation □□. By analysing the mean absolute SHAP values, the key global drivers of land use were identified. Additionally, SHAP dependence and interaction plots offered valuable insights into the non-linear marginal effects and interactions between different land-use variables. This study utilises multiple datasets for assessing the sustainability of urban-rural land use, including remote sensing data, publicly accessible datasets, the Gaode Map platform, and supplementary auxiliary data (Table 3).Data In this study, two composite indices, Human Activity (HA) and Ecological Sensitivity (ES), were constructed to objectively reflect urban-rural land-use dynamics in Suzhou City during the period of urban expansion from 2012 to 2024. The HA index comprised three indicators: economic facilities density (EMF), economic activity intensity (EA), and social facilities density (SF), while the ES index included ecological facilities density (EGF) and ecological quality (EQ). The spatio-temporal distribution characteristics of these indicators during the study period are illustrated in Figure 3.Within the HA indicator system, the EMF showed a clear spatial clustering in 2012, primarily concentrated in the core urban area of Gusu District. This centralisation gradually expanded outward in subsequent years.From 2015 to 2018, a notable diffusion trend was observed, with the density of EMF in peripheral regions, such as county-level cities Kunshan and Changshu, experiencing significant increases. However, in 2021, despite the overall spatial configuration of the city remaining largely unchanged, EMF density experienced a marked decline. This reduction is likely attributable to the impact of the COVID-19 pandemic, aligning with findings from previous studies on Chinese cities during the pandemic (Han et al., 2024). By 2024, EMFs were more evenly distributed across Suzhou, forming a multi-centre structure. This trend reflects the city's shift from a centralised economy development model to a more dispersed model, with multiple focal points across the region.Similarly, EA also exhibited a spatial expansion pattern. In 2012, the initial EA hotspots were primarily concentrated along the east-west axis, centred on Gusu District, Suzhou Industrial Park (SIP), and Kunshan Taken together, these indicators collectively reflect the multidimensional developments and transitions across economic, social, and ecological domains in Suzhou, laying the groundwork for establishing an Urban-Rural Sustainable Index (URSI). To objectively determine the influence weights of different indicators, the CRITIC method was utilised, effectively avoiding subjective bias by scientifically identifying key indicators.The calculated weights for each indicator fluctuated across the study periods, as summarised in Table 4. Within the HA domain, EA contributes more than 4.5% to the total HA weighting, while EQ accounts for over 9.5% within the ES weighting system, marking them as the most influential indicators within their respective dimensions. Empirically, the HA index demonstrated a volatile yet upward trajectory, whereas the ecological sensitivity index remained relatively stable from 2012 to 2024, suggesting that Suzhou's ecosystems retained a degree of stability and resilience amid urban-rural land transformations. These two-dimensional indicators were then introduced into the Coupling Coordination Model (CCM) to derive the URSI, which is subsequently used to examine the evolving coordination-conflict dynamics throughout the land transformation process. This study employed the Coupling Coordination Model (CCM) to quantitatively evaluate the URSI for Suzhou from 2012 to 2024, revealing the spatial distribution and temporal evolution of urban-rural land-use conflict in the context of rapid urban expansion (Figure 4). Utilising Jenks' natural breaks classification, URSIbased conflicts were categorised into five intensity levels: Low conflict (0.00<CCM<0.19), Relatively low conflict (0.20<CCM<0.28), Medium conflict (0.29<CCM<0.39), Relatively high conflict (0.40<CCM<0.53),and High conflict (0.54<CCM<1.19).In 2012, high-conflict regions were primarily concentrated within the urban core, particularly around the Gusu District, indicating a clear monocentric spatial pattern. By 2015, conflict hotspots had gradually diffused outward along major transportation corridors toward suburban areas, and new hotspots had emerged within secondary urban centres and newly developed regions, such as Kunshan, Changshu, and Zhangjiagang counties.This spatial diffusion aligns with the concept of urban-rural transition zones (Yang et al., 2021;Zhang, 2025), illustrating a shifting land-use conflict pattern driven by accelerated urban expansion (Zheng et al., 2022). By 2018, high-conflict areas (CCM > 0.54) had notably increased in central urban areas, including Gusu, the Hightech zone, and SIP districts, as well as Kunshan County, with an expansion of 8.4% compared to 2015. industrial relocation, population migration, and infrastructural expansion (Xu et al., 2020;Zhu and He, 2025;Pan et al., 2023). Overall, the evolution from a monocentric to polycentric conflict pattern in Suzhou illustrates the cumulative and diffusive nature of URSI-based conflicts, providing critical insights into the dynamic interplay between urban economic growth, spatial expansion and sustainable development. This highlights the necessity of adopting spatially differentiated, multi-nodal urban planning and ecological governance strategies. The Hausman test was conducted to compare the Fixed-effects (FE) estimator with the Random-effects (RE) estimator in order to select the appropriate model. As reported in Table 5, the test yields a statistic of -21,956.937 with 5 degrees of freedom and a p-value of 1.000 (p≥0.05). The large p-value indicates that there is no statistically detectable difference between the FE and RE estimates. Hence, we do not reject the orthogonality assumption, implying that RE is consistent for our data. Conditional on this result, the RE model was selected for the main analysis due to its ability to achieve greater efficiency than the FE model, thereby improving the precision of the estimated effects. Table 6 presents RE estimates for two specifications: URSI-based conflict level (Model 1) and its temporal change (Model 2). In the baseline model (Model 1), the Built-up area shows the largest positive coefficient among land-use categories (0.615***), indicating that more urbanised regions are associated with increased urban-rural conflict. This finding is consistent with agglomeration-driven land competition and pressure on shared resources (Zheng et al., 2022). By contrast, Farmland (-0.149***) and Water (-0.051***) present a significant negative sign, indicating that agricultural and ecological stocks are associated with lower levels of conflict, theoretically through buffering, livelihood support and ecological services (Yang et al., 2018). Forest also shows a small but significant negative association (-0. This research re-estimates the models after excluding the top and bottom 5% of samples in the URSI-based conflict index (see Supplementary Table S1). The signs and significance remain consistent, confirming the robustness of the findings. Specifically, in the baseline model, Built-up area remains strongly positive To further explore the impact of spatial heterogeneity, REM tests were conducted in nine districts and counties under the jurisdiction of Suzhou City, and the results are presented in Supplementary Tables 2 and3. These findings confirm that increasing urban development intensity remains the primary driver of escalating URSI-based conflict, echoing and expanding upon prior studies by Zheng et al. (2022) and Shen et al. (2023). Importantly, this research highlights that different types of land use, especially ecological and agricultural areas, play a significant role in reducing conflict levels. These land-use categories demonstrate a buffering effect and emphasise their value in balancing both conservation efforts and production needs.Moreover, in the short run, adjustments to water bodies may coincide with heightened conflict, which highlights the central role of blue-green infrastructure in urban-rural sustainability strategies and aligns with recent theoretical arguments that nature-based solutions is significant for mitigating urban ecological pressures (He et al., 2024). Collectively, these results reveal a dual driver of the URSI-based conflict, structural composition and temporal transformation, and provide an empirical basis for the subsequent non-linear interaction and segmented effect analysis. To further validate and enhance the results derived from REM, the XGBoost modelling and ShapleyAdditive exPlanations (SHAP) analysis were employed to explore non-linear thresholds and pairwise interaction effects among different land-use variables. Figure 5 illustrates the importance of variables and their contributions based on two model specifications: the cross-sectional model (Figure 5a) and the change model (Figure 5b).Specifically, the bar plots on the left display the ranked SHAP values, arranged from highest to lowest. These plots highlight the average contribution of each land-use variable to the model predictions. In contrast, the SHAP beeswarm plots on the right illustrate the distribution and direction of local impacts for individual observations. The cross-sectional model (Figure 6a) revealed a strong association between the URSI-based conflict and the proportion of built-up area, as well as significant segmentation effects between built-up land and other landuse categories. The built-up area shows a monotonic increase in SHAP values up to approximately 0.60, after which the curve flattens, indicating diminishing marginal contributions beyond this level. The steepest rise occurs in the 0.40-0.60 interval. Taken together, these patterns suggest an empirical operating threshold near 0.60 for planning attention, with particularly pronounced sensitivity as the built-up area moves through the 0.40-0.60 range. Interaction-coloured plots further reveal a turning point for the farmland area around a Z-score of approximately 0.40 (i.e., 0.40 standard deviations above its mean). Below this level, SHAP values rise sharply, indicating that low farmland availability fails to buffer urban pressure and amplifies conflict as the built-up area increases. Above a Z-score of approximately 0.40, the slope attenuates, implying that maintaining at least a moderate, contiguous farmland share mitigates conflict, an effect that is more pronounced in highly urbanised areas. By contrast, the water share curve is smooth and generally conflict-reducing, with diminishing returns as area increases. This profile is consistent with a stable but moderate buffering role: expanding water areas contribute to conflict alleviation, but the marginal benefit tapers as coverage grows.The change model (Figure 6b) shows that year-to-year variation in the conflict index arises from the joint influence of land-use category, the rate of land-use change, and baseline urbanisation. Built-up expansion remains the dominant driver. The SHAP dependence curves display a V-shaped response around zero growth, with contributions being lowest near zero and rising as the growth rate departs from zero in either direction. The increase is steeper for expansion than for contraction. Within the intervals -0.25 ≤ built-up in z-standardised units ≤ 0, and 0 <built-up in z-standardised units≤ 0.50, SHAP values increase approximately linearly with the absolute growth rate, indicating that moderate acceleration in built-up change already elevates conflict risk.Changes in water area are asymmetric. Small losses have a limited effect, but larger losses exacerbate conflict.Moderate gains provide the most effective mitigation, particularly when paired with sufficient farmland, which enhances the buffering capabilities of the blue-green system. In contrast, significant increases in farmland (greater than 0.25 in z-standardised units) are linked to higher levels of conflict. This suggests that there are transitional frictions, such as reallocation, access issues, and compensation challenges, when agricultural land is rapidly reconfigured. Forest cover change shows relatively minor effects throughout its range, and significant negative impacts primarily arise during periods of rapid forest gains, aligning with challenges in transition management. Therefore, these findings highlight the need for context-sensitive land-use management, where sequencing is required, rather than relying on a single ecological mitigation measure.These findings quantify model-implied marginal contributions, including the 60% limit for built-up areas, 15% coverage for water, and 20% preservation levels for farmland serve as referable targets for sustainable development and the integration of urban and rural areas. However, it is essential to note that the results derived from the integration of XGBoost and SHAP represent the segmental effects of features on model predictions, revealing nonlinear relationships and correlation thresholds. These results should not be interpreted as a causal relationship. Furthermore, the identified segmental effects may be influenced by geographical, social, and institutional contexts and should be validated against local planning practices. In the 21st century, addressing the uneven and fragmented urban-rural development has become one of the most pressing challenges for China's social and policy challenges (Yang et al., 2021). In this context, the effective integration of economic, ecological, and social resources between urban and rural areas is therefore a critical strategy for achieving sustainable regional development. The Urban-Rural Sustainability Index (URSI) developed in the study not only provides a comprehensive quantitative tool for identifying different patterns of coordination and conflict, but also offers a technical basis for formulating differentiated, place-based land-use management strategies.Using Suzhou as a case study, the URSI-based conflict mapping reveals a clear evolution between 2012 and 2024: the spatial pattern of conflicts has shifted from a monocentric conflict hotspot to a polycentric, multinodal configuration. This spatial transformation reveals that as urban spillover persists and multi-core development advances, urban-rural conflicts are no longer confined to the traditional urban core. Instead, these conflicts are now appearing as overlapping and multi-scalar patterns across major urban districts, county-level sub-centres, and the surrounding rural areas. Based on the intensity of conflicts, the regions in Suzhou can be classified into three categories. First, high-conflict core urban areas, represented by Gusu, Wuzhong and High-Tech Zone. Second, county-level urban centres such as Kunshan, Changshu, and Zhangjiagang, which form medium-conflict peripheral sub-centres. Third, low-intensity rural areas and ecological core zones, represented by Wujiang District. Under such conditions, traditional control measures based solely on administrative boundaries or single indicators are insufficient to respond to heterogeneous regional challenges. By contrast, the URSI evaluation framework enables the construction of a resultsoriented, tiered governance system, supporting a governance pathway that links problem identification, zoning, regulation, and monitoring. Drawing upon the proposed URSI evaluation framework and empirical findings, targeted land-use management and planning measures can be designed along three main aspects to alleviate URSI-based conflicts. Land-use control strategies can be differentiated according to the three spatial categories identified by the URSI assessment. First, in Category 1 high-conflict core urban areas, urbanisation is already mature, socioeconomic structures are relatively stable, and the share of built-up land exceeds the critical threshold of around 60% revealed by the URSI analysis. Land use in these areas is characterised by high intensity and density, but also by pronounced spatial pressures and sustainability conflicts. In such contexts, further outward expansion of construction land is likely to exacerbate conflicts. Therefore, these areas should prioritise improving the quality and efficiency of the existing built environment as the main measure of conflict mitigation. In operational terms, additional development demand should, in principle, be met through urban renewal, redevelopment of underutilised land, and vertical intensification, while strictly controlling further outward expansion of urban construction land, an approach supported by the findings of Xu et al. (2020). Moreover, URSI impact assessments can be integrated into the planning and development approval processes. Major projects must demonstrate that the URSI-based conflict for the relevant unit does not increase after implementation. If there are negative impacts, these should be mitigated through measures such as demolishing low-efficiency buildings or increasing green and blue spaces. Differentiated land prices, taxation, and floor-area ratio policies can help discourage land-intensive, low-value activities while promoting the clustering of knowledge-intensive and innovation-driven industries. When these measures are paired with digital transformation initiatives, such as the digital upgrading of traditional manufacturing and data-driven production organisation, they can enhance economic output and social benefits for each unit of construction land (Kim and Feng, 2024).Second, Category 2 county-level peripheral areas are characterised by rapid economic growth, strong pressure for urban expansion and the continued presence of agricultural and ecological spaces. For these areas, the primary task is to control the scale of construction land growth. The expansion of construction land exhibits diminishing marginal returns: once the scale of built-up land exceeds certain thresholds, further expansion may not generate additional economic benefits and can even lead to a decline in economic performance (Xu et al., 2020). Accordingly, these regions should focus on developing ecologically compatible industries and infrastructure, aligning industrial development models with local ecological conditions. The empirical analysis indicate that maintaining approximately 20% of the area as farmland and increasing the proportion of water bodies by 0-15% can significantly mitigate URSI-based conflicts. In implementation terms, this implies that explicit targets for the preservation of farmland and water bodies should be incorporated as semi-binding requirements in spatial plans. At the same time, when new construction land is designated, land consolidation, land-use swaps and quota trading can be employed within or across jurisdictions to ensure that overall land-use composition remains within the "safe ranges" identified by the URSI. In addition, planning interventions in these areas should address the balanced and equitable spatial distribution of social infrastructure, such as education, healthcare and recreational facilities, in peri-urban neighbourhoods. Improving access to such services is an indispensable component of enhancing the URSI (Baffoe et al., 2021).Third, Category 3 rural and ecological core areas (e.g. Wujiang District) are characterised by low land-use intensity and high ecological sensitivity, and they play a key role in safeguarding regional ecological security and ecosystem service provision. In these areas, new construction should be strictly controlled. Rural housing improvements and infrastructure provision should be achieved as far as possible through village consolidation, in-situ upgrading and the reuse of idle homesteads, rather than through the conversion of forests, water bodies or high-quality farmland. Remote sensing and long-term monitoring data can be used to establish ecological monitoring indicators, such as forest cover, farmland, water body areas, and composite ecological quality indices.These indicators can then be integrated with the URSI assessment to define early-warning thresholds. When the URSI or its ecological sub-component shows a sustained decline, ecological restoration programs and stricter development control measures should be implemented. The empirical analysis shows that increases in water bodies, forests and agricultural land have a significant positive effect in alleviating URSI-based conflicts. Prioritising blue-green infrastructure in urban-rural planning and development is therefore a key strategy for mitigating conflicts and enhancing resilience. However, the focus of interventions should differ across the three types of regions. In Category 1 core urban areas, blue-green infrastructure should be advanced primarily through nature-based solutions embedded in stock redevelopment.URSI-based conflict hotspot maps can effectively pinpoint high-conflict streets and blocks for prioritised bluegreen retrofitting. In these units, interventions such as the creation of pocket parks, the installation of permeable pavements, the introduction of rainwater harvesting systems and the construction of green roofs should be implemented first, which is consistent with the findings of Chen et al. (2024). At the same time, blue-green infrastructure indicators should be incorporated into the planning conditions and approval criteria for urban renewal and regeneration projects, with explicit requirements that each project achieve a net increase in blue and green space within its footprint.In Category 2 peripheral urban areas, the emphasis should be on constructing ecological networks.Operationally, rivers, canals, lakes and agricultural water systems can serve as the backbone for delineating riparian buffer zones and blue-green corridors in territorial spatial plans. Within these corridors, land use should be restricted to low-impact activities such as ecological conservation, recreation and eco-agriculture, thereby reducing the negative ecological impacts of outward urban expansion. In addition, URSI-based conflict results can be used to prioritise segments where conflicts are most intense and ecological fragmentation is most severe.These segments should be prioritised for restoration and connectivity projects, as modest investments can lead to significant reductions in land-use conflicts. In Category 3 rural and ecological core areas, blue-green infrastructure development should focus on ecological restoration and the conservation of traditional farmland and water systems. Restoring fragmented farmland, wetlands, and riparian zones should be a top priority. It's essential to re-establish traditional composite patterns of farmland, water systems, and woodlands to support agro-ecosystems that offer significant ecological benefits. Blue-green infrastructure projects in these areas should be closely integrated with rural revitalisation initiatives. Through eco-tourism, landscape agriculture, and similar activities, ecological restoration can also provide a new source of income for farmers and local governments. Urban-rural sustainable development relies not only on spatial land use structure but also on the organized movement of essential factors like land, capital, technology, and industry across various spatial units. Currently, factor flows between urban and rural areas in China are still dominated by the movement of population and land, whereas the flows of technology and capital remain relatively limited (Yang et al., 2021) and industries from the core, under the condition that overall land-use configurations remain within sustainable ranges. Policy-guided coordination of factor flows between core and peripheral areas, and strengthened industrial linkages and collaboration along value chains, are important means of promoting sustainable development at the scale of the entire urban region (Su et al., 2024). Given that Category 3 areas function as providers of ecological assets and ecosystem services for the broader region, an ecological compensation mechanism should be established to ensure that a share of land value increments and fiscal revenues generated in Category 1 and 2 areas is reinvested in Category 3. These funds can be used to support ecological restoration, infrastructure improvement and the provision of basic public services. At the same time, policies should facilitate the movement of surplus labour from core urban areas to Category 3 regions where appropriate, for example through the development of eco-agriculture, eco-tourism or decentralised service activities. This can help to prevent excessive depopulation and hollowing-out in Category 3 areas, which would otherwise undermine ecological management and the operation of local infrastructure.Overall, the URSI evaluation framework developed in this study establishes a systematic governance pathway that spans conflict identification, spatial zoning, control measures and monitoring. It thus provides a replicable technical route and policy reference for optimising urban-rural spatial structures and advancing sustainable governance in rapidly urbanising regions. Although this study has made significant progress in both theoretical contributions and practical applications, several limitations remain that warrant further exploration and improvement in future research.Firstly, this study uses Suzhou as a single case study. While Suzhou is one of the key cities in the Yangtze River Delta region and features a comprehensive urban-rural structure, ranging from high-density urban cores to districts and county-level areas, the generalisability of the conclusions may be limited by spatial scale and regional context. Future research could apply this framework to different urban agglomerations with varying development backgrounds and spatial scales, such as the Beijing-Tianjin-Hebei regions and the Guangdong-Hong Kong-Macao Greater Bay Area, to test its applicability and generalisability under varying regional policy contexts. Particularly, given the differences in economic development stages, policy support, and regional resource endowments, urban-rural sustainability models may vary significantly across regions. Thus, conducting cross-regional comparative studies will help enrich the theoretical framework and broaden the practical application scenarios of this research.Secondly, this study employs a static approach to set indicator weights using the CRITIC method. This approach somewhat overlooks the dynamic importance of each indicator during the urban-rural integration process. As urbanisation progresses, the sustainability goals and key factors at different stages of urban-rural development may shift, leading to changes in the influence of specific indicators on sustainability assessments.Consequently, this study did not fully capture this dynamic variation. Future research could consider the adoption of dynamic entropy-weight methods or Bayesian dynamic weighting techniques to more accurately reflect the temporal evolution of indicator weights (Almulhim, 2024), thereby improving the flexibility and precision of the model. Finally, while this study focuses on the impact of different land-use types on urban-rural sustainability conflicts, it remains an open question how to leverage these findings to offer differentiated governance solutions for various regions. Future research could incorporate additional policy variables into the analytical framework to explore how policy interventions affect urban-rural sustainability pathways. This would further enhance the specificity and operationality of policy recommendations, making them more applicable to diverse contexts and facilitating more effective governance for sustainable urban-rural integration. This study develops an integrated analytical framework that combines economic, ecological, and social dimensions to assess land-use conflict and coordination within urban-rural sustainable development. In contrast to earlier work that examines sustainability from a single perspective, the framework proposed in this study delivers a more comprehensive and systematic understanding of how land systems shape sustainability outcomes and tensions. Methodologically, this study combines Random-Effects regression with Extreme Gradient Boosting (XGBoost) and Shapley Additive exPlanations (SHAP), to unite prediction and interpretation. This creatively hybrid design identifies the complex mechanisms through which land-use categories influence the URSI-based conflict index, including variable importance, non-linear responses, interaction effects, and segmented effects. The approach enhances both explanatory clarity and empirical traction relative to conventional linear models alone.Applying the framework to Suzhou over 2012-2024 demonstrates its validity and identifies three key

Keywords: land-use change, Non-linear threshold and interaction effects, Sustainable evaluation, Urban-rural sustainable index, XGBoost-SHAP approach

Received: 28 Jan 2026; Accepted: 09 Feb 2026.

Copyright: © 2026 Gan, Wang, Li, Liu and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Wenquan Gan
Jiana Liu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.