Assessing Ecological Recovery of Reclaimed Well Sites: A Case Study From Alberta, Canada

Empirical evaluations of reclamation success are critical for understanding the speed of ecosystem recovery and improving best practices. In this study, we provide a quantitative evaluation of the effectiveness of past (pre-1995) and current (2010) reclamation criteria in creating functioning forest ecosystems on former industrial sites in boreal Alberta, Canada. We compared ecological indicators of ecosystem recovery (vegetation structure and soil properties) on mineral surface leases (MSLs) certified to the pre-1995 or 2010 reclamation criteria with nearby reference areas recovering from harvest (CUT) or fire (FIRE) disturbances. Six CUT and FIRE sites were chosen to compare six 1995MSLs and five 2010MSLs. Averaging 8 years since reclamation, most of the 2010MSLs were characterized by many of the same vegetation structure and soil properties as the FIRE and CUT sites. The 1995MSLs tended to support more agronomic species, notably grasses and non-native forbs, and fewer shrubs, trees, and native forbs than CUT or FIRE sites. Sites with the greatest coverage of herbaceous species (native and non-native grasses as well as non-native forbs) were the most ecologically impaired sites, based on the extreme deviation from reference site conditions. Based on these results, 2010 reclamation criteria appear to be more effectively promoting ecosystem recovery on reclaimed industrial sites than the pre-1995 criteria. While this case study illustrated the potential benefits of straightforward changes to reclamation criteria in terms of including metrics around soil quality and conservation, woody stem requirements and native plant coverage, there is ultimately always room for improvement. For jurisdictions where the objective of the criteria is to restore a forest ecosystem, including criteria geared toward tree establishment would likely be of value in ensuring the speedy return to a forest canopy state. Adding criteria with measures of native plant species diversity may also be of utility as it is well understood that having plant diversity is also a beneficial metric in creating a more resilient vegetation community.


INTRODUCTION
Governments around the world are experiencing mounting public concern over issues of environmental stewardship. These issues include how to restore lands where ecological integrity is lost or compromised, mainly by anthropogenic disturbances. In response, a number of regulatory frameworks have been developed over the last two decades on how best to restore the ecological integrity of affected lands (Baker and Eckerberg, 2016; Government of Alberta [GoA]., 1999; Environment and Sustainable Resource Development [ESRD]., 2013). This includes developing new reclamation criteria or initiating changes to refine existing criteria using jurisdiction-specific land management goals. Reclamation criteria are standards used to evaluate whether site restoration activities have achieved certain minimum standards. They define expectations and outline requirements regarding professional practices within the area of land reclamation. Having effective reclamation criteria is important for environmental stewardship because the magnitude and intensity of human disturbance have long been considered the primary cause of biodiversity loss and ecosystem degradation worldwide (Arbogast, 2002;Skousen et al., 2012;Rayner et al., 2017;Gerwing et al., 2021).
As in many other jurisdictions around the world, the province of Alberta in Canada has a long history of oil and gas extraction throughout the boreal forest region with nearly 8 to 20 well sites, each 0.8-1.4 hectares in size, established per square kilometer (Frerichs et al., 2017). As of April 2020, Alberta had more than 400,000 well sites (Government of Alberta [GoA]., 1999, 2020) and associated facilities, including borrow pits and sumps. Reclamation criteria in the oil and gas sector have evolved in the last 40 years. Since the early 1980s, there have been changes in the reclamation criteria toward requiring the restoration of biodiversity and ecosystem function (Powter et al., 2012; Environment and Sustainable Resource Development [ESRD]., 2013). This is in contrast to early criteria (pre-1995) that were focused primarily on revegetation of the site and did not typically require specific species to be established or that any plant structural targets be achieved. The goal of early reclamation practices was to establish a plant cover that would reduce soil erosion concerns (Alberta Environment and Protection [AEP]., 1995); however, this resulted in widespread utilization of agronomic, often non-native graminoid seeding on these sites. The widespread use of agronomic species in reclamation has potentially impeded natural colonization by both coniferous and deciduous tree species, leaving long-term legacy effects on sites reclaimed using pre-1995 standards (K. Kemball, personal communication, May 3, 2020).
In 2010, there was a significant regulatory adjustment in Alberta whereby the reclamation criteria were updated to include requirements for woody vegetation (trees or shrubs) and two layers of vegetation strata (overstory and understory). The 1995 reclamation criteria for forested lands do not explicitly require tree cover, which may raise questions regarding the potential speed and trajectory of ecological recovery to a forested state and conditions resembling those present before disturbance (Table 1). Further, rigorous empirical evidence of the rate of recovery in sites reclaimed to these more recent standards is lacking. Thus, a knowledge gap exists as to whether these new criteria are creating forests that appear to be on track to becoming functioning forest ecosystems (Rowland et al., 2009). Given that reclamation criteria will continue to evolve in the decades to come, it is important to evaluate the effectiveness of both past and current reclamation criteria in creating functioning ecosystems to inform future criteria changes locally and provided empirical evidence for other jurisdictions undertaking similar efforts.
To evaluate the rate of ecological recovery, it is essential to determine if the ecological characteristics of interest in the reclaimed ecosystem are similar to those present in healthy ecosystems in the region. A common means of evaluation is through comparison with adjacent forests which are often at a much later seral stage than the reclaimed site. Lupardus et al. (2019), in a study looking at the ecological recovery on reclaimed sites in Alberta, found that vegetation can be dissimilar to undisturbed, natural systems for up to 48 years after reclamation. In our study, because the evaluation is being made 5 to 20 years after reclamation, we believed that it was unlikely that the ecological condition of reclaimed areas would replicate an intact, mature forest simply due to the different seral stages, as previously noted by Redente et al. (1983).
Therefore, we elected to pursue an alternative approach whereby the ecological condition of reclaimed sites is compared with sites that have similarly experienced stand-replacing disturbances such as being burned, blown down, or harvested. It has been suggested that directly comparing the conditions of young post-disturbance ecosystems provides a better evaluation of ecosystem development than comparison with an undisturbed forest. Indeed, there is evidence to suggest that areas recovering from natural disturbances often have understory vegetation and similar trajectories of recovery to young native forests (Carleton and Maclellan, 1994;Bergeron et al., 2001;Dhar et al., 2018;Sasaki et al., 2018). This approach is previously recommended (Chambers et al., 1994;Rowland et al., 2009;Lupardus et al., 2019), and is based on a general assumption that the constituent plant and animal species are extremely adapted to current disturbances and would eventually colonize the area from elsewhere (Carleton and Maclellan, 1994). However, despite this general, and acceptance of the importance of integrating disturbance-type-specific considerations into the evaluation of ecosystem recovery, assessments of already conducted restoration projects from this perspective are not common.
Measures of vegetation structure and soil physical, chemical, and elemental properties are often used as an indicator of function (Ruiz-Jaen and Aide, 2005;Wortley et al., 2013). Previously, comprehensive analyses have demonstrated vegetation structure and soil physical, chemical, and elemental properties as our response variables (Ruiz-Jaen and Aide, 2005;Wortley et al., 2013). The selection of these variables as ecological indicators of choice was based on current experimental evidence demonstrating that vegetation and soil processes are inextricably linked (Vanhala et al., 2005;Lafleur et al., 2015) and most impacted during oil and gas operations (Dhar et al., 2018(Dhar et al., , 2020. Vegetation structure is useful for predicting the direction of plant succession (Salinas and Guirado, 2002;Prach et al., 2019;Rydgren et al., 2020), and is often evaluated through ground cover estimates, woody plant density, and species diversity. Soil physical, chemical and elemental properties are important because this information gives an indirect measure of nutrient cycling (Chambers et al., 1994;Wortley et al., 2013), and provides insights into the resilience of the restored ecosystem.
Given that boreal forest reclamation will continue in the decades to come, it is important to ascertain the degree to which current criteria are creating ecosystems that are representative of those in the surrounding landscape. We used a range of ecological indicators from reference harvest and fire areas to define "potential recovery, " for comparison with those on mineral surface leases certified with either the pre-1995 or 2010 reclamation criteria. We logically expect to observe differences in outcomes due to changes in criteria and it may be a reasonable expectation that the pre-1995 criteria will be most distinct concerning less severely disturbed early successional sites. Nevertheless, it is important to quantify the magnitude and direction of these similarities and differences to gauge the relative effectiveness that changes in criteria and practice have had on ecological recovery. We addressed ecological recovery by asking three questions in this study: (1) Have key changes in criteria led to quantifiable improvements in reclamation outcomes? (2) Which vegetation and soil properties match between reclaimed sites and reference areas, and are thus considered useful indicators of reclamation ecosystem recovery? (3) Can previously harvested cut blocks and forest fire impacted sites of comparable age provide reasonable analogs to assess ecosystem recovery (as opposed to the current practice of referencing adjacent and often mature forests)? •None (for forested sites in the Green Area) •Topsoil depth and distribution; •Soil texture; •Consistence and structure; and •Rooting restrictions. Vegetation Parameters: Vegetation Parameters: •% cover of plants, litter, and woody debris.

Study Region
The study area lies within the Central Mixedwood, Dry Mixedwood, and Lower Boreal Highlands natural sub-regions of Alberta (Natural Regions Committee [NRC], 2006). The general topography of the is consists of large and gently undulating plains, flat terrains, hummocky uplands, and extensive wetlands, with an average elevation of 354 m above mean sea level. The area has a continental climate, which is defined by short and warm summers and long and cold winters, providing a growing season of approximately 96 days between May and September. The average annual temperature ranges from −2 to 2.4 • C, with 400-439 mm of rainfall (Environment Canada Climate Normals 1981-2010). 1 Gray Luvisolic soils are dominant in this landscape in addition to Brunisolic soils.

Site Selection
The sites examined were mineral surfaces leases (MSLs) that have been certified as successfully reclaimed after oil and gas disturbance in north-central Alberta ( (Beckingham and Archibald, 1996). Prior to disturbance, all sites were mesic/medium (ecotype d) on the edatopic grid, with different proportions of native tree species including Populus tremuloides, Populus balsamifera, Picea glauca, Pinus banksiana (Rowe, 1972;Beckingham and Archibald, 1996). A total of 17 MSLs sites were selected based on their accessibility and known certification status; these sites ranged from 7 to 20 years since reclamation ( Table 2). Of the 17 sites, 12 were certified with the 1995 criteria and 5 with the 2010 criteria. For the current study, only 6 out of the 12 1995MSLs we sampled were included in the analysis to better represent a balanced study design, unless otherwise stated. Normally, the protocols described in the 2010 reclamation criteria require the use of undisturbed, natural systems as reference sites for comparison, but in this study, we chose 12 reference sites that represented two disturbance regimes: forest fires (6 sites) and forest harvest blocks (6 sites). Initially, historical information and a list of potential sites were collected for MSLs from Alberta Energy Regulator and Alberta Environment and Park's databases for the sub-regions to be sampled. A reconnaissance trip to potential sites was conducted to ensure sites were accessible and met selection criteria (i.e., all certified MSLs located in upland forest, and were mesic/medium (modal) on the edatopic grid) (Beckingham and Archibald, 1996). The final 29 sites used in this study were selected after confirming landowners' or companies' approval to measure (Figures 1A-D; Supplementary Figures 1.1-1 used to determine if MSLs were likely to have been modal predisturbance with typical vegetation including aspen and white spruce. Once this was confirmed, the location for sampling plot installation within a site was randomly selected using a random number table.

Sampling Design
We measured a suite of parameters relating to ecosystem structure and function in the certified MSLs and reference sites. Supplementary Table 1 contains the list of attributes and sampling protocols we considered. We created these protocols considering cost, ease of use, method sensitivity, and sampling season timing. Variables relating to ecosystem parameters were measured at growing season peak biomass ( (ii) total height of the tallest woody plant (for each species). Two 0.5 m × 0.5 m subplots were also established within each 10 m × 10 m plot to measure the percent ground coverage of each herbaceous and woody species. Soils were sampled from four pits that were located along the diagonal of the 150 m × 200 m measurement plot. The pits shared the same inter-plot distance as the vegetation survey protocol. Pits were dug to 60 cm to expose soil horizons where the depths of the organic (LFH) and the A + B mineral soil layers were recorded. Soil samples for soil chemistry analysis were collected from two depths in the mineral soil, 0-10 and 20-30 cm, using a 3.2 cm diameter core, thereby excluding the fresh litter, fibric and humified surficial organic matter (i.e., mulch or LFH) layer. Samples for bulk density were collected at each of these two depths (0-10 cm and 20-30 cm), by inserting three metal rings of 106 cm 3 volume into the soil. The composited samples were placed in plastic bags and refrigerated (∼4 • C) until further analysis.

Lab Procedure
Fresh soil samples were sieved to pass a 2 mm screen (# 10 U.S. Standard Sieve) and separated into two subsamples. One subsample was stored in a −20 • C freezer for chemistry analysis, while the other subsample was air-dried at room Frontiers in Forests and Global Change | www.frontiersin.org temperature (∼20-25 • C) in preparation for total carbon and nitrogen analyses. Soil pH and electrical conductivity (EC) were measured using a pH meter and an EC meter with a 1:2 (m:v) soil-to-water ratio (Kalra and Maynard, 1991). A portion of the oven-dried sample was ground using a ball mill (MM200, Retsch GmbH, Haan, Germany) and used for measurement of total C and total N, using an automated elemental analyzer (NA-1500 series, Carlo Erba, Milan, Italy). To determine bulk density, samples taken with the metal rings were oven-dried at 105 • C to constant mass and weighed. The bulk density of the soil was then calculated by dividing dry mass by metal ring volume.

Statistics
All data analyses and visualizations were performed using the R programming environment (R Core Team, 2020). Depending on the nature of the outcome variable, e.g., concentrations, fractions, percentages, or zero-inflated data, the appropriate distributions and functions were chosen for model fitting (see Supplementary  Table 1). The function gls() from the package nlme was used for all statistical models involving fixed and random effects (Pinheiro, 2017). The gls technique can model non-constant variance among factors by specifying 'weights = varIndent(). Count data with many zeros were analyzed with the function glm. nb() from the MASS package (Ripley et al., 2013), while the function gamlss() from the gamlss package was used for zero and one inflated percentage data (Rigby and Stasinopoulos, 2005). Shannon-Weiner diversity index is calculated in R using vegan package version 2.4-4 (Oksanen et al., 2008). Model assumptions were checked with diagnostic plots of fitted and residual values and a histogram of residuals. Post-hoc comparisons were conducted with the Fisher's protected least significant difference (LSD) test if the means were significantly different with ANOVA. Given that the study design depended on the availability of reclamation sites that had several confounding factors including small numbers of treatment sites, age since reclamation, age at the time of planting, and operational practices (e.g., equipment used, prior stockpiling), the risk of a type II error in the analysis was considered to be high. Consequently, a P-value of 0.10 was used to assess the significance to reduce the type II error.
We used an NMDS ordination to characterize and compare plant community diversity using vegetation cover between the site types. All 29 sites were fitted onto NMDS ordination using metaMDS function from the vegan package (Oksanen et al., 2008) as it is proven to have a significant advantage for analyzing data from this unbalanced experimental design by rapidly producing visually informative results (Rowland et al., 2009). Their centroids and 95% confidence interval ellipses were used to represent each site type on the biplots. With NMDS, space-sample relationships are based on ranked dissimilarity in compositional space, and thus free from assumptions of normality, dimensionality, linearity, and the shape of speciesresponse curves to gradients. The output solution with the lowest dissimilarity between ordination and Bray-Curtis distances were selected as the final model. Also, the resulting two-dimensional image output from the ggplot2 package produces a visual aid to interpreting similarity among entities, those more similar being clustered closer together. Subsequently, we completed a permutational analysis of variance (PERMANOVA) with the function adonis() (R vegan package) to determine if the vegetation cover of the different sites could be explained by species groupings variables including coniferous trees, hardwood trees, shrubs, forbs, grasses, and mosses and for comparison against the NMDS ordination to validate our findings. Bray-Curti's dissimilarities distance was used during the analysis.

Soil Physical-Chemical Characteristics
Soil bulk density (F = 1.84; p = 0.02) in the 0-10 cm layer of the 2010MSLs (0.97) was not different from that in the CUT (0.71) or FIRE (0.65); however, the 1995MSLs had higher bulk density (1.25 g cm −3 ) than the CUT and FIRE sites ( Table 3). Soil bulk density at 20-30 cm depth ranged from 1.34 to 1.49 g cm −3 and was similar (F = 0.25, p = 0.61) between the different site types. While soil pH (F = 6.51; p = 0.03) in the 0-10 cm layer varied between 1995MSLs and reference sites, no difference was observed between 2010MSLs and FIRE sites. Soil pH in the 20-30 cm layer (F = 5.99; p < 0.001) was higher on the reclaimed sites compared with on reference sites. The electrical conductivity (F = 4.21; p < 0.001) in the 20-30 cm depth was higher in the 1995MSLs compared with the reference sites; no differences were observed among the site types within the 0-10 cm soil layer (F = 0.41; p = 0.58). Soil C stock in the 0-10 cm depth did not vary (F = 10.77; p = 0.12) among the site types. On the other hand, soil C stock in the 10-30 cm layer was significantly different (F = 6.21; p = 0.02); it was the highest in 2010MSLs and lowest in CUT sites. While differences in total N stock in the 10-30 cm soil layer did not differ (F = 1.39; p = 0.26) among the site types, the 1995 and 2010MSLs at the 0-10 cm layer had an average a higher N stock (F = 0.54; p = 0.09) than the CUT site. The soil C:N ratios at the 0-10 cm (F = 3.64; p = 0.52) and 10-30 cm (F = 1.64; p = 0.13) depth did not differ among the site types (Table 3). LFH development (F = 21.39; p < 0.001) on the reclaimed sites had not yet reached a thickness similar to those found on reference sites ( Table 4). The average thickness of the LFH layer was 2.5 times thicker on reference sites (7.9) than on reclaimed sites (2.5 cm). The thickness of the mineral soil layer (F = 15.71; p = 0.002) on 2010MSLs was comparable to the FIRE site but significantly thicker than those found on 1995MSLs and CUT site (Table 4).

Vegetation Structure
Native forb cover (F = 2.21; p = 0.07) varied 1995MSLs and reference sites; no differences were observed between 2010MSLs and FIRE or CUT site (Figure 2A). Non-native forb cover (F = 7.64; p < 0.001) was greater on 1995MSLs than the other three site types ( Figure 2B); this change was driven by the initial establishment of Trifolium spp., Melilotus spp., and Medicago sativa in the 1995 criteria. Again, the 1995MSLs were associated with a substantially higher cover of native grasses (F = 13.88; p < 0.001), more than 20%, whereas the cover of native grasses was 10% or less for the other three site types ( Figure 2C). Nonnative grass coverage (F = 16.32; p < 0.001) was twice as high on the 1995MSLs than 2010MSLs, while no non-native grass was recorded on either the CUT or FIRE sites 5 years after disturbance ( Figure 2D). There was also an increase in moss cover (F = 4.11; p = 0.04) and total vegetation cover (F = 10.53; p < 0.001) on reclaimed sites, with the 1995MSLs having a higher coverage of each than the other site types (Figure 2E,F). Due to a lack of sample size (i.e., a low number of measurements), no statistical analyses were conducted for the non-native grass cover in FIRE and CUT sites. Coniferous tree stem densities (F = 9.64; p < 0.01) on the reclaimed sites were extremely low, presumably, these sites were not planted with conifers and natural recovery was lacking ( Figure 3A). Hardwood tree density (F = 2.65; p = 0.01) in CUT sites was significantly higher than all other site types at 35,000 stems ha −1 through both FIRE and 2010MSLs showed similar densities at ∼10,000 stems ha −1 ; of note, 1995MSLs almost entirely lacked hardwood tree species (Figure 3B). Tall shrub density (F = 4.25; p = 0.06) was 36% lower in 1995MSLs than the other site types (Figure 3C). Medium shrubs (F = 8.42; p = 0.04) were similarly lacking in 1995MSLs while the 2010MSLs showed densities comparable to the CUT sites but not the FIRE sites ( Figure 3D). The net effect for tree and shrub stem counts was that the 2010MSLs were comparable to both reference site types while the 1995MSLs were significantly lower than both reference site types (F = 2.73; p = 0.02; Figure 3E). Species richness and Shannon's diversity index tended to be lower on reclaimed sites compared to the CUT and FIRE sites (Figure 4). However, the 2010MSLs were more similar in species diversity to the reference sites than the 1995MSLs. The differences in diversity were due to significant differences in species richness, not to differences in species evenness. The 2010MSLs richness averaged 6.7 species per plot, whereas the 1995MSLs averaged 3.7 species per plot. The CUT sites averaged 12.1 species per plot and the FIRE sites averaged 11.2 species per plot (Figure 4).

Community Description
The final NMDS two-dimensional output had produced a stress value of 0.2, instability < 0.001, and R2 of 0.653 (axis 1 = 0.524, axis 2 = 0.129; data not shown). The reclaimed and reference sites were overlain on the biplot, and it was apparent that the species assemblages of these site types occupy different regions of the ordination plot (Figure 5). The 2010MSL plant communities 3 | Bulk soil chemical and physical properties (means with standard errors in parentheses) at the 0-10 and 10-20 cm depth at each test site (n = 5-12). shared a greater degree of similarity to the FIRE and CUT sites than the 1995MSLs. The 2010MSLs and CUT and FIRE sites were characterized by hardwood trees, tall and medium shrubs, and native forbs. The 1995MSLs tended to be dominated by a herbaceous cover, especially native and non-native grasses. The 2010MSLs did support some non-native grass cover, but it was significantly lower than the 1995MSLs. It is noteworthy that non-native grass development was a unique feature of all MSLs as these species were not found on the FIRE or CUT sites. Results of the PERMANOVA analysis confirmed those of NMDS. The PERMANOVA showed significant differences in coverage of most of the species' groupings (hardwood trees, medium shrub, native and non-native grasses, and non-native forbs) between the 1995MSLs and either of the CUT and FIRE sites ( Table 5).
Medium shrub and native forbs were the only species groupings that were significantly distinct between the 2010MSLs and either of the reference sites.

DISCUSSION
In the present study, ecological recovery on reclaimed sites (1995 and 2010MSLs) is assumed to follow the typical early recovery process in the natural boreal forests if the range of ecological indicators measured in the reclaimed site mirror that in the reference areas . Five years after disturbance, the range of most soil values measured in FIRE and CUT reference sites remained within the range of natural variability of the region (Schmidt et al., 1996;Howat, 2000;Sorenson et al., 2011). Soil pH, bulk density, EC, and total carbon stock differed significantly between the reclaimed and reference sites. The deviations from the reference sites were consistent with patterns found in other studies where this change was related to the long-term effects of soil stripping, stockpiling and redistribution which is a common feature of these types of industrial disturbance . Soil pH and bulk density can be reasonable predictors of vegetation community structure due to their ability to affect soil nutrient availability . High pH levels outside the natural range of variability are not uncommon on former industrial sites in northern Alberta. This is a product of two things: elevated calcium sulfate in lower soil horizons which is naturally occurring throughout the region and the fact that stripping of topsoil during initial site development invariably results in these deeper (higher pH) soils being brought closer to the surface, especially when stripped soils are stockpiled and then redistributed. When the soils are replaced during reclamation, the topsoil will be a mixture of LFH, A and some B horizon, depending on the experience and expertise of the operators. Effectively this mixing of surface and subsurface soil substrates may have contributed to the higher pH at most reclamation sites when compared to the reference areas. This finding is consistent with the results of Jamro et al. (2014), who reported pH increases of 180% when LFH was mixed with mineral substrates. In our study, five (41%) of the 1995MSLs sites had pH values > 8.5, which is outside the acceptable range for most vascular plants in the region which is cited at a pH range from 3.5 to 8.0 (Larcher, 1980). These included both young (5-7 years) and more mature (16-18 years) sites. Our high pH sites also had unusually low storage of carbon and nitrogen stored in soils and were comprised of grasses with non-native forbs. This lingering industrial reclamation effect on pH recovery is important to acknowledge as it may cause other processes such as nutrient cycling to recover at a slower rate or become permanently inhibited (Zhalnina et al., 2015;Neina, 2019). It may be possible to alter the pH conditions on these sites by introducing native species that may have a higher tolerance for elevated pH and through time, the impact of these species may lower pH through leaf litter inputs and other belowground processes effectively creating a positive feedback loop (e.g., organic matter accumulation, nutrient cycling, microbial symbiont activity). The requirements of the 2010 criteria appeared to result in decreased bulk density (and decreased compaction) compared to sites certified under the 1995 criteria, even though sites certified under the 2010 criteria have had a relatively shorter time to recover since reclamation.
The apparent improvement in bulk densities on 2010MSL soils may be due to the specified requirements around topsoil placement, soil physical characteristics, as well as the criteria specifying rooting restrictions (Environment and Sustainable Resource Development [ESRD]., 2013). These criteria likely facilitated changes in soil handling and replacement practices. For example, it is possible that sites were more consistently decompacted (particularly the lower soil horizons that received repeated industrial traffic) to better ensure the rooting restriction criteria were met. Even if the differences in bulk density in 20-30 cm of soil layer are small, they may be important because tree growth can be affected by even small differences in soil bulk density. For instance, in a study of the impacts of forest harvesting operations on soil characteristics, Binkley and Fisher (2019) found that tree growth started to decline when bulk density reached approximately 1.4 g cm −3 . Since the 2010 MSLs averaged 1.34 g cm −3 and the 1995MSLs averaged 1.49 g cm −3 at the 20-30 cm soil depth, it is reasonable to suggest that the 2010 criteria are enabling increased growth of desirable species simply due to decreased soil compaction, at least on some sites. Other studies are consistent in identifying 1.4 g cm −3 as an important bulk density threshold above which reduced growth can be expected on fine-textured soils, similar to those found on most of our study sites (Mitchell et al., 1982;Da Silva et al., 1994;Senyk, 2001).
Greater quantities of LFH can be a good indicator of ecosystem recovery as the thickness is positively correlated with soil water and nutrient status (Beckingham and Archibald, 1996;Rowland et al., 2009). The LFH thickness in 2010MSLs was similar to those observed in 1995MSLs. They showed average thicknesses of 2.9 cm in 2010MSLs and 3.4 cm in 1995MSLs, which were comparable to those observed (2.0-3.6 cm) in reclaimed sites, ranging in age from 16 to 33 years (Sorenson et al., 2011). The fact that the 2010MSLs had a similar thickness to the 1995MSLs suggests their forest floor is developing faster where this change could come from the adjustments in the 2010 criteria requiring thicker top and subsoil cover and establishment of the woody canopy. Forest floor development is modulated by two opposing processing: litter inputs from vegetation and decomposition outputs. When woody cover surpasses a threshold value of 30%, significant increases in litter inputs emerged, which can have direct positive impacts on early forest floor development (Sorenson et al., 2011). In comparison, the average LFH thickness in the reclaimed sites had not yet reached a thickness similar to those found in FIRE (7.3 cm) and CUT (8.5) sites. For each centimeter of LFH present on the reclaimed sites, the CUT and FIRE sites had 2.5 times as much LFH by approximately the same age since disturbance. Mineral soil depth in the 2010MSLs was comparable to the reference sites and was substantially higher than 1995MSLs; this difference again may be attributable to the inclusion of key soil criteria as well as potential changes in soil handling practices where both the subsoil and topsoil were required to be conserved (stockpiled) (Environment and Sustainable Resource Development [ESRD]., 2013).
Most of the 2010MSLs sites demonstrated similar woody plant density and diversity to reference areas, with corresponding declines in the 1995MSLs sites. These parameters are considered important components of plant community structure and are good indicators for assessing the direction of plant succession (Salinas and Guirado, 2002;Prach et al., 2019;FIGURE 4 | Least squares means (lsmeans) of species richness (species per plot) (A), Shannon's diversity index (B), and Pielou's evenness index (C) by reclaimed (1995 and 2010MSL) and reference (CUT and FIRE) site types. Letters indicate significant differences between treatments after Tukey adjustment (P < 0.05). Mean (±SE) values with different lowercase letter(s) were significantly different among the site types at P < 0.10.
FIGURE 5 | Non-metric multidimensional scaling (NMDS) analysis of vegetation cover for the corresponding test sites. Ellipses constructed around each site indicate 95% confidence intervals.
Frontiers in Forests and Global Change | www.frontiersin.org Rydgren et al., 2020). The total woody plant density of the 1995MSLs averaged 6,250 stems ha-1 after 12-20 years, while the 2010MSLs sites averaged 50,000 stems ha-1 after 7-10 years. Most of the species found on the 2010MSLs sites were earlyto-mid successional hardwood trees, with a mix of shrubs were (medium and tall), and some coniferous trees, whereas those on the 1995MSLs were primarily tall shrubs (>65%). This mixture of woody species on the 2010MSLs sites follows the typical early successional pattern or recovery process of the Central Mixedwood, Dry Mixedwood and Lower Boreal Highlands Natural Sub-Regions (Rowe, 1972;Beckingham and Archibald, 1996). Total herbaceous cover on the 1995MSLs sites was high (75%), with almost 40% coming from both non-native forbs and grasses. The presence of high amounts of non-native herbaceous cover on the 1995MLs and related sites assessed by Forsch et al. (2021) is not necessarily a positive indicator of ecosystem function. These non-native species may be an indicator of ecological impairment in some systems, as these species negatively impact vegetation diversity and ecosystem processes (K. Kemball, personal communication, May 3, 2020.) thereby leaving the site in a state of arrested development (Kurek et al., 2013;Chowdhury et al., 2017;Dhar et al., 2020).
Vegetation richness, diversity, and evenness have been recognized as strong indicators of ecological recovery from disturbance, as well as ecosystem resilience (Ruiz-Jaen and Aide, 2005). It was encouraging that plant community evenness resulted from both the 1995 and 2010 criteria were similar to the FIRE and CUT plant communities. However, despite these similarities, plant richness and diversity appeared to be depressed on the 1995MSLs compared with the CUT and FIRE sites. As for the 2010MSLs, plant richness and diversity were lower compared with the CUT sites but not always statistically distinct from the FIRE sites. A stronger presence of woody plants and tree species on the 2010MSLs may be facilitating the improvement of species richness and diversity observed from the 1995 to 2010MSL site types; Pensa et al. (2008) similarly suggested the recovery of plant richness and diversity was associated with the ingress of woody plants replacing early successional herbaceous species on abandoned mine land sites after 10-30 years in succession. From the viewpoint of vegetation classes, we found evidence to suggest that most of our certified, 2010MSL sites appear to be in transition and were moving toward a more similar compositional mixture of vegetation classes as that observed in both reference site types. As for the 1995MSLs, there is so far little evidence that suggests these sites are becoming more similar in community structure to that of the reference sites. Both the NMDS ordination and PERMANOVA test provided some support for this assertion. The NMDS showed a clear separation between the site types; the CUT and FIRE areas, while also distinct from each other, were clustered, with the 2010MSLs sites sitting adjacent to both reference site types and a certain degree overlapping with the CUT site type in particular. The 1995MSLs were more distant from all other site types and shared no common features with the reference site types. The NMDS also revealed that agronomic species form a higher proportion of plant cover on most of the 1995MSLs; this has also been reported on other certified reclaimed sites in the region where the grasses and forbs sowed initially were still evident after 5 (Forsch et al., 2021), 34 (Rowland et al., 2009), and 48  years post-certification. Due to the striking coverage of agronomic species, most of these sites are suspected to be in a state of arrested succession, thereby leaving long-term legacy effects on the landscape (Kurek et al., 2013;Chowdhury et al., 2017;Dhar et al., 2020).
Our approach of comparing a range of ecological indicators on former industrial sites that have been reclaimed within the last 20 years with those in areas recovering from harvest and fire disturbances appears to have been a useful exercise. This enabled us to evaluate whether either reclamation criteria (pre-1995 vs. 2010) were creating forest ecosystems that could be considered similar to those that develop after other types of stand-replacing disturbances. We do appreciate that a true likewith-like comparison with sites of the same age would have been ideal to evaluate this question. However, since the reclamation criteria changed over time and we restricted our study sites to similar edatopic grid positions (at least pre-treatment), it was not possible to find reclaimed, burned, and harvested sites of the same age in the same geographic area that were accessible. At the time of measurement, the s1995MSL sites averaged 16 years since reclamation (range: 12-20 years), whereas the 2010MSL sites averaged 8.8 years (range: 7-10 years) since reclamation. It was also interesting to see that the 2010MSLs were characterized by many of the same native species as the FIRE and CUT sites; there would be value in revisiting the sites in the current investigation in future to determine whether they continue progressing toward the reference sites as they progress in age.

CONCLUSION AND SUGGESTIONS FOR FUTURE WORK
To strengthen the present analysis, we recommend that additional 2010MSLs be evaluated because of the small and variable sample size; it has also now been more than 10 years since these criteria were established and there are likely many more sites that have been reclaimed-certified to these criteria compared with our investigation which was conducted between 2015 and 2016. These efforts should also broaden the geographic area and include a wider progression of site ages, which was not possible to do in the present investigation. Similar efforts should continue to use CUT and FIRE sites of similar seral stages or ages, as they were found to be very useful analogs for comparison, and they will be important in target setting. In the long term, additional inventories from nearby intact, mature forests would also help to improve our ability to detect whether reclamation success has been achieved.
Lastly, while this case study illustrated the potential benefits of relatively straightforward changes to reclamation criteria in terms of including indicators around soil quality and conservation, woody stem requirements and native plant coverage, there is ultimately always room for improvement. For jurisdictions where the objective of the criteria is to restore a forest ecosystem, including criteria specific to tree stems (rather than woody stems in general) and additional considerations regarding tree growth and site occupation of trees (such as stocking) would likely be of value in ensuring the speedy return to a forest canopy state. Adding criteria with measures of native plant species diversity, perhaps with targets grounded in known or expected diversity incomparable aged reference sites may also be of utility as it is well understood that having plant diversity is also a beneficial metric in creating a more resilient vegetation community.

DATA AVAILABILITY STATEMENT
The original contributions presented in this study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

AUTHOR CONTRIBUTIONS
AS and MD conceived and planned the experiment. AS, SS, EF, and MD carried out the experiment and supervised the findings of this work. MB-A wrote the manuscript with support from AS, SS, EF, and MD. All authors contributed to the article and approved the submitted version.

FUNDING
Funding for this work was provided by grants from the Government of Alberta and the Natural Sciences and Engineering Research Council of Canada (NSERC). data collection, and Kera Yucel and Jasmeen Kaur for their contributions during interim reporting on the results of this work. We also thank the forestry companies who allowed us access to their management areas and the two reviewers for their constructive comments that greatly improved the manuscript.