Multi-agent evolutionary game analysis on the implementation of municipal solid waste classification policy

With the rapid development of economy and the improvement of people's living standard, the domestic waste is greatly increasing. Based on the multi-agent evolutionary game theory, an evolutionary game model among the recycling industry, urban management officer and government is established to address the environmental issues of rising municipal solid waste production and low recycling utilization rates in China. Through numerical example analysis, the relationship between relevant factors on the evolutionary stable state of the system is studied. The findings demonstrate that when the likelihood of severe oversight by the government and of significant enforcement by urban management officers grows, the recycling industry becomes more ready to select waste separation strategy. Additionally, the government is far more affected by the likelihood of significant enforcement by urban management officers than the recycling industries are. In addition, effective acceleration of waste sorting may be achieved by giving the recycling industries the proper government subsidies and increasing the recycling industries' profits from garbage sorting. Moreover, the government's increased fines for the urban management officers can significantly lower the incidence of bribery between the recycling industries and the urban management officers.


. Introduction
The problem of "garbage siege" has become more prevalent as a result of China's economy and society developing so quickly (Tang et al., 2022;Zhang et al., 2022). Modernizing urban growth has been severely hampered by the outdated domestic waste management system and trash disposal practices now in use (Yu, 2012). Therefore, it has become crucial for economic and social growth to figure out how to reduce and recycle household garbage (Fan and Wang, 2021;Gupta et al., 2022). According to estimates from the National Bureau of Statistics on the amount of domestic rubbish cleared, as of December 2021, China had removed 248.692 million tons of waste, up 5.8% over the previous year. China has worked hard to boost domestic garbage sorting and recycling over the past 10 years . Major cities like Shanghai and Beijing enacted garbage separation regulations in 2020. The globe will enter an age of energy conservation and CO 2 emission reduction in the same year that carbon neutrality was initially suggested during the UN General Assembly's 75th session. Thus, it follows that the future's primary research and development focus will be on how to successfully promote resource recycling, implement sustainable development methods, and encourage trash separation and recycling. Separating municipal waste involves a game between the government, the urban management officer and the recycling industry. In some literature studies on the influencing factors of waste separation and recycling in recent years, most academics' research mostly concentrate on the behavioral strategies of residents' waste separation and recycling with a single dimension. Few scholars have studied waste separation from multiple factors and perspectives that affect the awareness of each participating subject. It is significant to note that the urban management officer's stringent enforcement and inspection is a crucial assurance for urban domestic waste classification. In reality, many cities in China actively carry out "strict enforcement of urban management officer" to further strengthen administrative penalties for violations and to promote responsible units to fulfill their obligations of domestic waste separation. The majority of the present study on government incentives for citizens participating in trash separation and recycling is concentrated on residents (Guo et al., 2017;Chen F. et al., 2019;Kuang and Lin, 2021;Peng et al., 2021), with less attention being paid to urban management officers and recycling industries. In reality, government subsidies to the recycling industry can better promote the recycling of recyclable resources after separation, reduce the amount of waste generated and solve the problem of waste siege. Therefore, it is necessary to consider multiple participating subjects and focus on the interactive perspective of multiple influencing factors to explore strategies to promote waste separation policies and provide reasonable countermeasures and suggestions for all parties involved in waste separation and recycling.
In addition, the subject of rubbish categorization and recycling study is now somewhat constrained. The majority of waste research focuses on a single waste product, such as construction waste, waste electronics, etc., (Lauridsen and Jørgensen, 2010;Peng et al., 2022), and little attention is paid to the classification and recycling of different wastes in daily life, especially against the backdrop of China's garbage siege, which lacks common research. Therefore, the research object of this paper is urban household waste. Currently, the majority of research on waste sorting and recycling is qualitative, with some quantitative study thrown in for good measure. Additionally, qualitative research lacks a quantitative analysis and verification that is particular and scientific, and theoretical research has long lagged behind the growth of practice. In view of this, this paper considers the possible bribery behavior between the recycling industry and the urban management officer, constructs a tripartite evolutionary game model between the government, the urban management officer and the recycling industry, analyzes the strategy stability of each game subject and the influence of each influencing factor on the evolutionary stability state of the system, and verifies the validity of the evolutionary stability analysis of this paper through numerical case analysis, and then proposes countermeasures and suggestions to promote the effective implementation of waste separation based on the analysis conclusion. The results of this study are conducive to maximizing resource utilization, reducing total economic energy consumption and carbon emissions, and effectively distributing the responsibilities, powers and benefits of relevant parties in the process of recyclable waste processing and recycling, forming a win-win mechanism with shared responsibilities and benefits for multiple parties. At the same time, it promotes the secondary use of renewable resources by enterprises, reduces production costs, improves the financial performance of recycling enterprises, promotes the overall green transformation of the economic society, and ultimately forms the development model of a circular economic society with Chinese characteristics. Thus, it contributes to the construction of ecological civilization, the realization of high-quality development, the construction of resource-saving and environment-friendly "twooriented society, " and the ultimate realization of the long-term goal of "no waste country." . Garbage classification and tripartite game process Waste separation enhances resource usage and promotes a healthy, eco-friendly, green, and ecologically conscious way of living, all of which are beneficial to humankind (Ma and Zhu, 2021). Municipal household waste has considerable value in terms of resources utilization and holds great wealth (Meng et al., 2018). Recyclable, hazardous, kitchen, and other garbage are the four basic categories of household waste. According to the circular economy theory of "resource-product-renewable resource, " waste separation and recycling can effectively alleviate the current situation of garbage siege and promote the recycling of resources and sustainable development. As shown in Figure 1, non-recyclable hazardous waste is safely disposed of by special means. Other wastes such as sludge and ceramics can be disposed of by sanitary landfill. For recyclable domestic waste, it will be further sorted and reprocessed by the recycling industry and eventually used as a new raw material for product production. For instance, food waste (Wei et al., 2021) may be composted, and waste paper is utilized to create recycled paper (Mansikkasalo et al., 2014). Additionally, plastic packaging can be recycled, waste glass packaging may be processed and utilized to create crafts and other items, and scrap steel can be used in the creation of new steel. In the process of recycling, the urban management officer supervises the recycling process of the recycling industry, and the government, through the constraints of economic policies, promotes the recycling industry to separate waste for recycling and urges the urban management officer to enforce the law seriously.
. Literature review Waste separation and recycling are progressively catching the attention of specialists and academics due to the significant impact resource and environmental issues have on the economy and society. The advancement of trash separation and recycling is a cutting-edge topic researched to encourage resource recycling and is essential to nations' attempts to promote sustainable economic growth. Research on trash separation, recycling, and treatment is currently widespread both domestically and internationally, but it primarily focuses on two levels: examining efficient waste separation management strategies and behavioral aspects impacting waste separation. At the level of waste separation management policy, Guman et al. (2020) studied the current issues in the area of household waste treatment in certain countries across the world and proposed some solutions to the problems. In the book <Effective Front-End Strategies to Reduce Waste on Construction Projects>, Rundle's research proposes implementing a coordinated construction waste management system through regulations to reduce construction waste and promote recovery, reuse and recycling (Rundle et al., . /frsc. .

FIGURE
The main process of municipal domestic waste recycling and related game subjects.
2019). The book <Electronic Waste: Recycling and Reprocessing for a Sustainable Future> gives readers a thorough review of the situation of e-waste creation, disposal, laws, recycling technology, and describes practices in both developed and developing countries (Holuszko et al., 2022). Although all of the aforementioned research used qualitative analytic techniques, they lacked rigorous quantitative analysis and verification, which limited the conclusions drawn from them. Therefore, in order to solve the shortcomings of this analysis method, subsequent scholars used quantitative analysis to improve the stability and credibility of studies related to garbage classification. Evolutionary games as a quantitative research method have been widely used in the field of environmental protection. For instance, Wang and Fu (2020) reviewed recent advances in evolutionary game theory in the direction of ecological dilemmas. Habib et al. (2022) developed two game models for PV and CF power generation systems, respectively. According to the research, government subsidies are successful in encouraging the production of electricity by having a considerable impact on the evolutionary tendency of the evolutionary game strategy. In order to determine what policies are most likely to promote the adoption of low-carbon production methods, Zheng and Zhou (2020) constructed an evolutionary game model of corporate production strategy choice. A three-way evolutionary game model of the government, waste producers, and waste recyclers was created by Su (2020). According to the study, government regulation and laws are essential for raising the percentage of businesses that employ recycling tactics. As the recycling industry develops and becomes more lucrative, government participation in businesses will progressively decline until it ceases altogether.

. Evolutionary game model
This section first presents the tripartite evolutionary game model involving government, urban management officers and recycling industries. And then, the stability conditions are derived according to the Lyapunov stability theory.

. . Assumptions of game model
Due to the complexity, diversity and uncertainty of social reality, the government, urban management officers and recycling industries cannot grasp complete information to choose the best strategy, that is, the government, urban management officers and recycling industries can only make satisfactory decisions based on limited information. Therefore, this paper assumes that the government, urban management officers and the recycling industries are all bounded rationality. They can constantly learn and adapt to environmental changes and adjust their strategies to maximize their own interests.
In the tripartite dynamic game of implementing garbage sorting policy, the government censors the urban management officers while the urban management officers regulate the recycling industries. However, in order to gain more revenue, some recycling industries may abandon the garbage sorting policy and engage in bribery behavior to the city administration. The relationship between the government, city administration and recycling industries is shown in Figure 2.
Based on the above tripartite game relationship, some complex conditions are simplified without changing the nature of the problem, and the following assumptions are made.
(1) Government. The benefits to the government are influenced by economic benefits, social benefits and fiscal policy. When the recycling industry adopts the waste separation policy, it will produce certain economic and social benefits. Conversely, if it does not adopt policies, economic and social benefits will not be generated. In order to ensure economic and social benefits, the government adopts economic policies to promote the implementation of waste separation policies. The government's Frontiers in Sustainable Cities frontiersin.org . /frsc. . behavioral strategy set is {strict review, lenient review}. When the government strictly regulates, it rewards the recycling industry as S a if it adopts a waste separation policy, and rewards city officials who take their regulatory duties seriously as S b . While if the recycling industry fails to separate waste, the government will fine it F a , and those who neglect their duties and practice fraud will be fined F b . On the contrary, if the government is lax in regulation, it cannot obtain relevant information and does not reward or punish relevant departments. In addition, if the recycling industry fails to separate garbage and the city administration fails to supervise carefully. When the government scrutinizes strictly, to maintain the social environment, the political cost to the government is F g ; and when the government scrutinizes leniently, it will be pursued by the relevant higher regulatory authorities, setting an administrative penalty of F n . (2) Urban management officers. The behavioral strategy of urban management officers is {serious law enforcement, dereliction of duty}. When the urban management officer is carefully supervising, he will not collect any funds given privately by the recycling industry, and the fine imposed by the urban management department on the recycling industry that does not carry out garbage classification is F c . But when the urban management officer is negligent, he will collect funds from the recycling industry. When the urban management officer does not seriously enforce the regulation, it will incur speculative costs of C f , which mainly includes costs such as falsifying inspection records. (3) Recycling industry. The behavioral strategy set of the recycling industry is {garbage sorting and recycling, not garbage sorting and recycling}. The use of renewable resources eliminates the cost of purchasing natural materials, so it is assumed that the profit of products made from renewable resources is greater than that of products made from natural materials, that is, R 1 > R 2 . In addition, the quality of the products is also related to how carefully recycling workers sort the waste. Hence, let τ be the degree of fineness in executing the waste sorting strategy, and 0 < τ ≤ 1, the larger τ is, the finer the classification is. When the recycling industry does not separate waste for recycling, funds will be given to the urban management officer, and the behavior of the recycling industry does not carry out garbage classification and recovery will generate speculative cost C t , mainly including the management expenses such as forging the classification and recovery records.
Assuming the proportion of the recycling industry choosing garbage classification and recovery strategy is x, while the proportion that it does not choose this strategy is 1 − x. The proportion of urban management officer who takes law enforcement seriously is y, and the probability that urban management officer falsifies and neglects his duties is 1 − y. The probability of the government using strict scrutiny is z and the probability of the government using lenient scrutiny is 1 − z.
The parameters involved in the model are shown in Table 1.

. . Replicator dynamic evolutionary game analysis
The perceived benefits to the recycling industry of choosing and not choosing a waste separation strategy, and the average perceived benefit to the recycling industry are shown in E 11 , E 12 and E 1 .
According to the above perceived benefits, the replicating dynamic equation of recycling industries is as follows: Then, the respective perceived benefits to the urban management officers of choosing conscientious enforcement and falsification and negligence strategy, and the average perceived benefit to the urban management officers are shown in E 21 , E 22 and E 2 .
Similarly, the replicating dynamic equation of urban management officers is as follows: Finally, the respective perceived benefits of the government's strict scrutiny, lenient scrutiny strategy, and the average perceived benefit of the government are shown in E 31 , E 32 and E 3 .
. . Stability analysis of evolutionary games . . . Analysis of the evolutionary stability of the recycling industry When z = z * , F (x) = 0 holds, at which point for any x is a stable strategy for the recycling industry. When z = z * , let F (x) = 0, there Frontiers in Sustainable Cities frontiersin.org . /frsc. . are x = 0 and x = 1 two evolutionary stability points. By virtue of the stability theorem of the differential equation, it follows that when F ′ (x) < 0, x is the stable point of the evolutionary game. If z > z * , then F ′ (0) > 0, F ′ (1) < 0, x = 1 is the evolutionary stability point of the recycling industry. When the probability of the government choosing strict review is higher, the government strengthens control, takes the lead in boosting waste recycling, subsidizes enterprises that take measures to separate recycling, and punishes enterprises that do not take measures to separate recycling, which can effectively regulate the behavior of the recycling industry, prompt its employees to form the awareness of saving resources and protecting the environment, and increase the profit of the recycling industry, so the recycling industry will choose the strategy of separating waste for recycling. While if z < z * , then F ′ (0) < 0, F ′ (1) > 0, x = 0 is the evolutionary stability point of the recycling industry. When the probability of the government choosing strict review is relatively small, in the context of the slow development of China's recycling industry, immaturity, backward technology and lack of talent, the recycling industry does not choose waste separation to generate more revenues, and thus at this time, the recycling industry chooses not to carry out the recycling strategy of waste separation.
. . . Analysis of the evolutionary stability of the city's management department When z = z * , F y = 0 holds, at which point for any y is a stable strategy for the urban management officer. When z = z * , let F y = 0, there are y = 0 and y = 1 two evolutionary stable points. Based on the stability theorem of the differential equation, it follows that when F ′ y < 0, y is the stable point of the evolutionary game. If z > z * , then F ′ (0) > 0, F ′ (1) < 0, y = 1 is the evolutionary stability point of the city administration. When the government chooses a higher probability of strict scrutiny, the government strengthens supervision, regulates the enforcement behavior of the urban management officer, gives financial rewards to the urban management officer who does his duty and is meticulous and sanctions the urban management officer who falsifies and neglects his duties, so the urban management officer tends to choose a serious enforcement strategy at this time. While if z < z * , then F ′ (0) < 0, F ′ (1) > 0, y = 0 is the evolutionary stability point for the recycling industry. When the probability of the government choosing strict scrutiny is relatively small, the gains from conscientious enforcement by the urban management officer are smaller than the income from bribes, so some poorly qualified urban management officer enforcement officers tend to adopt the strategy of negligence and favoritism.

. . . Evolutionary stability analysis of government
x = x * , F (z) = 0 holds, then for any z is the government's stabilization strategy. When x = x * , let F(z) = 0, there are z = 0 and z = 1 two evolutionary stable points. By the stability theorem of differential equations, it follows that when F ′ (z) < 0, z is the stable point of the evolutionary game. If x > x * , thenF ′ (0) < 0, F ′ (1) > 0, z = 0 is the evolutionary stability point of the government. When the probability of the recycling industry choosing the waste separation strategy is high, it means that the profit of resource recycling is gradually increasing, and the employees of the recycling industry generally have the awareness of saving resources and can take the initiative to take the responsibility of environmental protection, so the recycling industry may choose the waste separation policy spontaneously even without strict government review. While if x < x * , then F ′ (0) > 0, F ′ (1) < 0, z = 1 is the government's evolutionary stability point. When the probability of the recycling industry choosing the waste separation strategy is small, the social environment will be damaged to a certain extent, which seriously hinders the development process of China's urban modernization and is not conducive to the construction of ecological civilization, and the government has to pay extra remediation costs, for which the government will take strict review measures in this case.
Based on the method proposed by FRIEDMAN (1991), the evolutionary stability strategy of the system of differential equations can be obtained through the local stability analysis of the Jacobi matrix of the system, and the equilibrium point that satisfies all the eigenvalues of the Jacobi matrix are non-positive is the evolutionary stability strategy of the system. The Jacobian matrix J is shown in (5) and the details are in Appendix.
The eigenvalues of the Jacobi matrix corresponding to each equilibrium point are shown in Table 2.

Equilibrium point
Eigenvalue When all eigenvalues of the Jacobi matrix are non-positive, the equilibrium is an evolutionary stable strategy. To analyze the positive and negative signs of the corresponding eigenvalues and to determine the stability of each equilibrium, it is assumed that: • The sign assumes that all values in the table are positive • F a + F b + F n > C c . . . Stability analysis in di erent contexts Scenario 1: The system has a unique evolutionary stability point (1, 1, 0) when R 1 − τ C 0 + F c − R 2 + C t > 0. The corresponding evolutionary stability strategy is (waste sorting, careful enforcement, lenient censorship).
Scenario 2: The system has a unique evolutionary stability point (0, 0, 1) when the corresponding evolutionary stability strategy is (no classification, neglect of duty, and strict scrutiny).
Scenario 3: The system has a unique evolutionary stability point (0, 1, 0) when C 2 − C f < F c < R 2 + τ C 0 − R 1 − C t , F a < S b + C c and its corresponding evolutionary stability strategy is (no classification, careful enforcement, lenient review).
Scenario 4: The system has a unique evolutionary stability point (0, 1, 1) when and its corresponding evolutionary stability strategy is (no classification, careful enforcement, and strict scrutiny).

. Evolutionary simulation studies
This section verifies the validity of the evolutionary stability analysis of the article for the two more ideal stability states of the system (1,1,0) (Case 1) and (0,0,1) (Case 2), and analyzes the effect of the recycling industry's revenue from adopting a waste separation strategy, the level of fineness of waste separation, different initial ratios, the strength of government penalties on urban management officers, and the strength of government subsidies to the recycling industry on the evolution of the system for Case 1. This section uses MATLAB2018a software for numerical simulations. Array 1:C c = 20, C 0 = 50, S a = 15, τ = 0.9, F a = 45, S b = 6, F b = 50, F c = 45, R 1 = 155, R 2 = 105, C 2 = 10, F g = 15, R 3 = 60, F n = 35, C t = 5, C f = 5. Array 2:C c = 20, C 0 = 50, S a = 1, τ = 0.9, F a = 1,

. . Model testing
To test the validity of the evolutionary stability analysis of the system in this paper, array 1 and array 2 are brought into the model for simulation, and the results of the runs are shown in Figures 3, 4, respectively.
From Figures 3, 4, the system under the array 1 condition eventually evolves to the point (1, 1, 0), which is consistent with the conclusion of Case 1. The system under the array 2 condition will eventually evolve to the point (0, 0, 1), which is consistent with the conclusion of case 2. The simulation analysis is consistent with the conclusion of the evolutionary stability analysis of the system in the  article, and the model is valid and has important practical guidance for the government, urban management and recycling enterprises.

. . Simulation analysis
First, to analyze the effect of the change of R 1 on the choice of the parties' participation strategies, let R 1 equals to 105, 155, and 205, respectively, and run the simulation results as shown in Figure 5. Let τ equals to 0.1, 0.5, and 0.9, respectively, and run the simulation results as shown in Figure 6. Figure 5 shows that during the gradual evolution of the system to a point of stability, an increase in the revenue of the recycling industry that chooses a waste separation strategy accelerates the evolution of its choice of waste separation strategy. The probability of the recycling industry choosing a waste separation strategy increases and the probability of strict government regulation decreases with Frontiers in Sustainable Cities frontiersin.org . /frsc. .  the increase of R 1 . For the recycling industry, recycling valuable products in the supply chain is a good business with a huge market potential (Kerr and Ryan, 2001), which helps to form a "natural resource-product-recycled resource" circular economy loop in the whole society, which is energy saving and environmental friendly, and can bring more economic benefits to the society. Therefore, the government can take appropriate measures to ensure the revenue of the recycling industry, which in turn will increase the motivation of the recycling industry to choose the waste separation strategy and ensure the recycling of resources. From Figure 6, it can be seen that in the evolution process, the probability of choosing waste separation strategy by Frontiers in Sustainable Cities frontiersin.org . /frsc. . recycling industry increases and the probability of serious enforcement by city administration and strict review by government decreases with decreasing τ . The government can promote the recycling industry to choose waste separation strategy more often by training the employees of the recycling industry, promoting the idea of waste separation to them and raising their awareness of energy saving and environmental protection (Kurniawan et al., 2021), or raising the awareness of responsibility of the recycling industry through social media disclosure, etc., to promote the recycling industry to choose waste separation strategy more often while ensuring the quality rate of waste separation. To investigate the effect of different initial ratios on the evolution of the system, let x, y and z be 0.1, 0.5, and 0.9, respectively, with other parameters held constant, and the evolutionary path of each party is shown in Figure 7.
As can be seen in Figure 7, as the probability of the recycling industry choosing a waste sorting strategy increases, the probability of the urban management officer choosing serious enforcement decreases and eventually evolves toward 1; the probability of the government choosing lenient review gradually increases and accelerates, eventually evolving to stabilize at 0.
As the probability of urban management officers' serious enforcement increases, the probability that the recycling industry chooses the waste separation strategy increases and eventually evolves toward 1. The probability that the government chooses a lenient review also gradually increases and accelerates and eventually evolves toward 0. The change in the probability of serious enforcement by .
/frsc. . the urban management officer has a much greater impact on the government than on the recycling industry, so the increase in the probability of serious enforcement by the city government facilitates the system to achieve evolutionary stability.
As the probability of strict government scrutiny increases, the recycling industry's choice of waste separation strategies and the city administration's choice of serious enforcement gradually accelerate. This illustrates that strict government scrutiny is conducive to the recycling industry and the city administration to act in accordance with the law, and to the construction of a resource-saving and environment-friendly two-type society.
In order to explore the effect of the amount of government punishment to the urban management officer on the speed of strategy choice of each party involved, let F b equals to 10, 30, and 50, respectively, the evolutionary path of each party is shown in Figure 8.
The government plays an important role in garbage sorting (Chen J. et al., 2019). The phenomenon of "bad money driving out good money" (Yang and Hu, 2012) often occurs when individual efforts vary greatly and do not yield the corresponding results. In order to avoid this phenomenon, the government should enact laws and regulations to ensure strict enforcement by urban management officers. As can be seen in Figure 8, the increase in the amount of government penalties on the urban management officer accelerates the rate at which the recycling industry and the urban management officer reach equilibrium, and its impact on the urban management officer is much greater than that of the recycling industry. However, the increase in the amount of government penalties on the urban management officer increases the government's incentive to scrutinize and slows down the government's rate of reaching equilibrium. Therefore, the government should establish appropriate penalties to promote the recycling industry's choice of waste separation policy and to prompt urban management officers to enforce the law carefully, in order to promote scientific, rational and efficient waste separation and recycling, and to achieve harmony between people and nature.
In order to explore the influence of government incentives to recycling industry on the evolutionary game process and results, S a is assigned to 0, 15, and 30, respectively. The evolutionary paths of all parties are shown in Figure 9.
As shown in Figure 9, the probability of strict government scrutiny decreases and the probability of serious enforcement by city officials decreases as the evolutionary stability process S a increases. It can be concluded that although government incentives for the recycling industry can motivate them to choose a waste separation policy, it is not conducive to the performance of the urban management officers. For this reason, the government needs to develop an appropriate subsidy strategy to promote recycling businesses for secondary use of recyclables while ensuring that urban management officers are diligent in their duties. According to the industry life cycle theory (Lumpkin and Dess, 2001), the government can increase subsidies and incentives in the early stages of recycling business development, but as the recycling industry reaches a mature stage, the government can reduce subsidies to effectively promote waste separation.

. Conclusions
This paper constructs a three-way evolutionary game model of government, urban management officer, and recycling industry based on the current environmental problems in China, such as the yearly increase in municipal waste generation and low resource recycling rate, and draws the following conclusions.
(1) From the perspective of the recycling industry, the final strategy choice of the recycling industry is mainly influenced by the strength of government subsidies, penalties, revenue from the adoption of waste separation, and the quality rate of waste separation. The simulation results show that the rate of evolutionary stability of the recycling industry is accelerated with the increase of revenue from the adoption of waste separation strategies. Technology changes life, and the recycling industry can actively research and develop to innovate the traditional waste sorting and recycling mode, improve the quality of waste sorting and recycling, reduce the cost of recycling, and increase the profit, so as to promote more recycling industry to choose waste sorting strategy and form a "burden reduction effect" for government departments.
(2) From the standpoint of urban management officer, government incentives and sanctions have a significant impact on the ultimate strategy decision. The government may provide the right incentives to encourage the local administration to take legal compliance seriously and behave legally. According to the simulation results, there is a considerably higher chance that municipal administration will choose to execute the law strictly as the likelihood of severe government scrutiny and the severity of punishment grows. The government should make relevant laws and regulations to ensure urban management officers to strictly enforce the law, and improve the administrative accountability system to give severe administrative punishment to those urban management officers who falsify and play favorites. (3) From the government's point of view, the government's final strategy choice is influenced by the strength of its subsidies, but also by the strength of the penalties imposed by the higher regulatory authorities. From the simulation results, it is clear that the evolution rate of the government's choice of lenient censorship policy gradually accelerates as the recycling industry chooses the waste separation policy and the probability of urban management officer choosing serious enforcement increases. The government should unquestionably take the lead in waste separation. For the time being, the government should make greater investments in the environmental sector, develop a focused environmental talent pool, and use economic constraints to motivate local authorities and the recycling sector. In addition, the government can also create a favorable social atmosphere by means of media exposure and other means to morally condemn the bad behavior of companies that fail to separate garbage and urban management officers who neglect their duties and play favorites.

Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.