<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2023.1152432</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Inversion models of aboveground grassland biomass in Xinjiang based on multisource data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>R. P.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhou</surname>
<given-names>J. H.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2187036"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Guo</surname>
<given-names>J.</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Miao</surname>
<given-names>Y. H.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>L. L.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>College of Ecology and Environment, Xinjiang University</institution>, <addr-line>Urumqi</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Key Laboratory of Oasis Ecology, Xinjiang University</institution>, <addr-line>Urumqi</addr-line>, <country>China</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>Xinjiang Academy Forestry</institution>, <addr-line>Urumqi</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Fujiang Hou, Lanzhou University, China</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Yunlong Wang, Taiyuan University of Technology, China; Jianjun Chen, Guilin University of Technology, China</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: J. Guo, <email xlink:href="mailto:guojing7227279@163.com">guojing7227279@163.com</email>
</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Functional Plant Ecology, a section of the journal Frontiers in Plant Science</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>13</day>
<month>03</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1152432</elocation-id>
<history>
<date date-type="received">
<day>27</day>
<month>01</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>23</day>
<month>02</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Zhang, Zhou, Guo, Miao and Zhang</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Zhang, Zhou, Guo, Miao and Zhang</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Grassland biomass monitoring is essential for assessing grassland health and carbon cycling. However, monitoring grassland biomass in drylands based on satellite remote sensing is challenging.Statistical regression models and machine learning have been used for the construction of grassland biomass models, but the predictive power for different grassland types is unclear. Additionally, the selection of the most appropriate variables to construct a biomass inversion model for different grassland types must be explored. Therefore,1201 ground-truthed data points collected from 2014-2021,including 15 Moderate Resolution Imaging Spectroradiometer (MODIS) vegetation indices,geographic location and topographic data,and meteorological factors and vegetation biophysical indicators were screened for key variables using principal component analysis (PCA). The accuracy of multiple linear regression models, exponential regression models, power function models, support vector machine (SVM) models, random forest (RF) models, and neural network models was evaluated for the inversion of three types of grassland biomass. The results were as follows: (1) The biomass inversion accuracy of single vegetation indices was low, and the optimal vegetation indices were the soil-adjusted vegetation index (SAVI) (R2 = 0.255), normalized difference vegetation index (NDVI) (R2 = 0.372) and optimized soil-adjusted vegetation index (OSAVI) (R2 = 0.285). (2)Grassland above-ground biomass (AGB) was affected by various factors such as geographic location,topography, and meteorological factors, and the inverse models using a single environmental variable had large errors. (3) The main variables used to model biomass in the three types of grasslands were different. SAVI, aspect, slope, and precipitation (Prec.) were selected for desert grasslands; NDVI,shortwave infrared 2 (SWI2), longitude, mean temperature, and annual precipitation were selected for steppe;and OSAVI, phytochrome ratio (PPR), longitude, precipitation, and temperature were selected for meadows. (4) The non-parametric meadow biomass model was superior to the statistical regression model. (5) The RF model was the best model for the inversion of grassland biomass in Xinjiang, and this model had the highest accuracy for grassland biomass inversion (R2 = 0.656, root mean square error (RMSE) = 815.6 kg/ha),followed by meadow (R2 = 0.610, RMSE = 547.9 kg/ha) and desert grassland (R2 = 0.441, RMSE = 353.6 kg/ha).</p>
</abstract>
<kwd-group>
<kwd>grassland</kwd>
<kwd>principal component analysis</kwd>
<kwd>biomass</kwd>
<kwd>machine learning</kwd>
<kwd>vegetation index</kwd>
<kwd>Xinjiang</kwd>
</kwd-group>
<counts>
<fig-count count="5"/>
<table-count count="2"/>
<equation-count count="2"/>
<ref-count count="53"/>
<page-count count="15"/>
<word-count count="7921"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<label>1</label>
<title>Introduction</title>
<p>Grassland ecosystems are among the most widely distributed terrestrial ecosystems,The grassland aboveground biomass (AGB) is a key indicator to evaluate the regional carbon budget and the sustainability of grassland ecosystems and is also an important material basis for the development of animal husbandry (<xref ref-type="bibr" rid="B37">Scurlock and Hall, 1998</xref>; <xref ref-type="bibr" rid="B32">Piao et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B51">Zhang et&#xa0;al., 2019b</xref>). Therefore, the accurate characterization of grassland biomass and its trends is of great importance for grassland management, grassland livestock-carrying capacity analysis, grassland growth status assessment, and ecological protection (<xref ref-type="bibr" rid="B28">Liu et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B26">Liang et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B43">Xu et&#xa0;al., 2020</xref>). At present, monitoring grassland biomass mainly involves ground-based monitoring and remote sensing monitoring (<xref ref-type="bibr" rid="B26">Liang et&#xa0;al., 2016</xref>). Limited by labor and material resources, for ground-based monitoring, the large-scale monitoring, high-efficiency monitoring, and whole-process monitoring of grassland biomass are challenging (<xref ref-type="bibr" rid="B9">Catchpole and Wheeler, 1992</xref>; <xref ref-type="bibr" rid="B24">Lehtonen et&#xa0;al., 2007</xref>) while remote sensing monitoring is the most effective method for estimating grassland biomass in long time series and over large areas (<xref ref-type="bibr" rid="B12">Craine and Nippert, 2014</xref>; <xref ref-type="bibr" rid="B13">Eisfelder et&#xa0;al., 2012</xref>).</p>
<p>Using the vegetation index for grassland biomass inversion is a common method of remote sensing monitoring. The normalized difference vegetation index (NDVI) has been widely used for grassland biomass inversion since it was proposed in 1974 (<xref ref-type="bibr" rid="B36">Rouse et&#xa0;al., 1974</xref>); However, NDVI is susceptible to the influence of many factors. Atmospheric effects include molecular and aerosol scattering and absorption by gases, such as water vapor, ozone, oxygen and aerosols (<xref ref-type="bibr" rid="B25">Liang et&#xa0;al., 2001</xref>). However, in addition to the influence of the atmosphere, the NDVI spectrum is easily affected by the soil background value, especially in places with sparse vegetation (<xref ref-type="bibr" rid="B22">Huete and Jackson, 1988</xref>; <xref ref-type="bibr" rid="B49">Zandler et&#xa0;al., 2015</xref>). Sparse and senescent vegetation may result in weak or blurred spectral responses, and the effects of soil background can lead to partial loss of vegetation information (<xref ref-type="bibr" rid="B13">Eisfelder et&#xa0;al., 2012</xref>). To eliminate the influence of soil background values, Huete proposed the soil-adjusted vegetation index (SAVI), The effects of vegetation indices are independent and the degradation of the atmosphere is similar in all soil contexts. Within the range of soil and atmospheric conditions examined here, the magnitude of soil effects on vegetation indices was similar to that attributed to the atmosphere. (<xref ref-type="bibr" rid="B22">Huete and Jackson, 1988</xref>). To further eliminate the effects of atmospheric attenuation and soil background, the modified soil-adjusted and atmospherically resistant vegetation index (MSARVI) was proposed (<xref ref-type="bibr" rid="B21">Huete et&#xa0;al., 1994</xref>; <xref ref-type="bibr" rid="B35">Qi et&#xa0;al., 1994</xref>). In the enhanced vegetation index (EVI), because the reflectivity of the blue light band is included in the calculation, the vegetation inversion effect of using EVI is better for the high vegetation cover areas (<xref ref-type="bibr" rid="B17">Garroutte et&#xa0;al., 2016</xref>). In addition to the vegetation indices mentioned above, other vegetation indices are also used by researchers for grassland biomass inversion (<xref ref-type="bibr" rid="B21">Huete et&#xa0;al., 1994</xref>; <xref ref-type="bibr" rid="B49">Zandler et&#xa0;al., 2015</xref>),Such as color adjustment index(RI),first order derivative of reflectanceand ratio (FDR).Although this type of model is simple and employs easily obtainable parameters, it is affected by factors such as sensor spectral characteristic information and environmental factors and still has uncertainties such as poor stability, low accuracy, and large regional differences in estimation results (<xref ref-type="bibr" rid="B26">Liang et&#xa0;al., 2016</xref>). These limitations are especially prominent for low vegetation cover areas because vegetation indices are strongly affected by the soil background values, i.e., weak or ambiguous spectral responses caused by sparse and aging vegetation; therefore, the use of a single factor to invert indicators such as vegetation biomass has great limitations(<xref ref-type="bibr" rid="B49">Zandler et&#xa0;al., 2015</xref>).</p>
<p>Relevant studies in recent years have demonstrated that in a grassland biomass inversion by a single vegetation index, geographic location and topography data, meteorological factors, vegetation biophysical indicators, and soil indicators are also used as important variables to construct grassland biomass, and the stability, versatility, and accuracy of the grassland biomass inversion models are improved(<xref ref-type="bibr" rid="B26">Liang et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B30">Meng et&#xa0;al., 2017</xref>). used the multiple regression analysis methods to study the grassland biomass and plant spectral response characteristics at different growth stages in central Montana, USA, and found that during the grassy stages of pasture herbage, there is a moderate correlation between the measured grassland biomass and biomass predicted by the multivariate regression model based on NDVI obtained from Landsat data. (<xref ref-type="bibr" rid="B33">Porter et&#xa0;al., 2014</xref>). (<xref ref-type="bibr" rid="B26">Liang et&#xa0;al., 2016</xref>) used multiple indicators, such as longitude, latitude, and meteorological factors, to construct a multivariate inversion model for grassland biomass on the Qinghai-Tibet Plateau, which was more accurate than the optimal model based on a single vegetation index (<xref ref-type="bibr" rid="B26">Liang et&#xa0;al., 2016</xref>). Previous studies have proposed vegetation biomass inversion models using multiple vegetation indices and meteorological and vegetation biophysical variables, but it requires considerable effort to select multiple variables, and there may be high information overlap and high autocorrelation among some variables (<xref ref-type="bibr" rid="B31">Penuelas et&#xa0;al., 1995</xref>; <xref ref-type="bibr" rid="B14">Feliciano et&#xa0;al., 2009</xref>). Therefore, how to screen suitable indicators and solve the problem of information overlap between variables still needs further exploration (<xref ref-type="bibr" rid="B46">Yang et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B43">Xu et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B53">Zhou et&#xa0;al., 2021</xref>).</p>
<p>At present, there are two main types of models for vegetation biomass inversion: statistical regression models and machine learning methods. Scholars have performed considerable research on vegetation biomass inversion using these two types of models; however, the choice of model for each grassland type is inconclusive (<xref ref-type="bibr" rid="B46">Yang et&#xa0;al., 2018</xref>). Previous studies have found that stepwise multiple regression models outperform machine learning methods in grassland biomass inversion. (<xref ref-type="bibr" rid="B43">Xu et&#xa0;al., 2020</xref>) used a simple linear regression model, a stepwise multiple regression model, a random forest (RF) model, and an artificial neural network model to simulate the grassland AGB in northern China and found that the performance of the stepwise multiple regression model was higher than that of the other three models (<xref ref-type="bibr" rid="B16">Gao et&#xa0;al., 2016</xref>). Other scholars have found similar findings. However, most studies have found that machine learning methods outperform statistical regression models in grassland biomass inversion (<xref ref-type="bibr" rid="B30">Meng et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B46">Yang et&#xa0;al., 2018</xref>). Currently, there is no consensus on the best model chosen for grassland biomass inversion,Some studies have found that the backpropagation (BP) neural network model has the highest accuracy in grassland biomass inversion (<xref ref-type="bibr" rid="B46">Yang et&#xa0;al., 2018</xref>), and other studies have found that RF and other machine learning methods have the highest accuracy (<xref ref-type="bibr" rid="B2">Adam et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B30">Meng et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B53">Zhou et&#xa0;al., 2021</xref>). There are many types of natural grassland, and the grassland area is often large and has high spatial heterogeneity. The use of one model type to invert biomass in large areas with many grassland types remains controversial.</p>
<p>At small scales, where high-resolution satellite observations are inadequate, <italic>normalin situ</italic> observations are feasible. At large scales, the utilization of high-resolution satellite imagery is often limited by cost and weather conditions. In addition, field surveys are further limited. The temporal resolution of satellites with high orbits is often not high, such as Landsat data (<xref ref-type="bibr" rid="B23">Kearney et&#xa0;al., 2022</xref>) and Sentinel data (<xref ref-type="bibr" rid="B27">Lin et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B23">Kearney et&#xa0;al., 2022</xref>), which only provide data for any point on the Earth every 15-30 days. Grasslands in Xinjiang are located in arid and semiarid regions, with low vegetation cover, and are easily affected by soil background during grassland biomass inversion (<xref ref-type="bibr" rid="B41">Townshend and Justice, 1986</xref>). Moderate Resolution Imaging Spectroradiometer (MODIS) images have been used as remote sensing images for grassland biomass inversion in many studies; these images can not only cover large areas but also have a high temporal resolution, making them more suitable for large-scale areas (<xref ref-type="bibr" rid="B26">Liang et&#xa0;al., 2016</xref>). Determining how best to use remote sensing methods to improve the accuracy of grassland biomass inversion in arid and semiarid regions, especially for grasslands in areas with low vegetation cover, is an important problem for vegetation remote sensing (<xref ref-type="bibr" rid="B5">Barati et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B44">Yang et&#xa0;al., 2012</xref>). Therefore, systematically studying the multivariate inversion of grassland biomass in Xinjiang is of great scientific value.</p>
<p>Based on the above analysis, this study pursued the following aims: (1) to analyze the correlations between three types of grassland biomass and 15 vegetation indices extracted by MODIS remote sensing, geographic location and topography data, meteorological variables, and vegetation biophysical variables; (2) using principal component analysis (PCA), to screen geographic location and topography data, meteorology, vegetation biophysical variables, and MODIS remote sensing vegetation indices to determine the key variables for constructing models for three types of grassland biomass; (3) according to the screened key variables of the three types of grasslands, to compare and analyze the accuracy of the nonparametric and parametric models, based on which the best inversion models for the three types of grassland biomass in Xinjiang were finally selected.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Data and methods</title>
<sec id="s2_1">
<label>2.1</label>
<title>Study area</title>
<p>Xinjiang is located in the middle of the Eurasian continent at 34&#xb0;22-49&#xb0;33&#x2032; N, 73&#xb0;22&#x2032;-96&#xb0;21&#x2032; E. Xinjiang has a unique topography of &#x201c;three mountains and two basins&#x201d; :it is surrounded by high mountains&#x2014;the Kunlun Mountains and the Altai Mountains in the north and south, respectively&#x2014;and the Tian Shan Mountains stretch across the entire territory of Xinjiang from east to west.Xinjiang has a typical temperate continental arid climate. Due to the unique topographic conditions, geographical location and arid climate, Xinjiang's ecosystem is extremely fragile with low vegetative cover, rare plant species and simple population structures The total area of grassland in Xinjiang is approximately 572,600 km<sup>2</sup>, accounting for 34.4% of the area of Xinjiang, and the grassland area accounts for 86% of the total area of green vegetation in Xinjiang. There are many types of grassland in Xinjiang. According to the <italic>Criteria for the Classification of Grassland Types in China and the Chinese Grassland Type Classification System</italic>, there are 11 main types of grassland in Xinjiang. These grasslands can be broadly divided into three groups according to vegetation type: steppe, desert, and meadow. Steppes include alpine grasslands, temperate meadow grasslands, temperate grasslands, and temperate desert grasslands; desert grasslands include alpine deserts, temperate deserts, and steppe deserts; and meadows mainly include alpine meadows, mountain meadows, and lowland meadows(<xref ref-type="bibr" rid="B51">Zhang et&#xa0;al., 2019b</xref>).</p>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Grassland measurement and meteorological observation data</title>
<p>The field investigation period was the forage growing seasons in 2014-2021. According to the topography and the spatial distribution characteristics of grassland types in Xinjiang, the sample lands were mainly selected in areas with a relatively uniform spatial distribution of grassland vegetation and gentle slopes. The size of the sample plots was approximately 500 m &#xd7; 500 m, and the plots were arranged according to the five-site sampling method. The center of the land was taken as the first plot, and then four corner points were selected as the remaining four plots. The size of the herbal plot was 1 m &#xd7; 1 m, and that of the dwarf shrub and tall herb plots was 5 m &#xd7; 5 m. The plots were intended to fully reflect and represent the real situation of grassland vegetation in the sampled grassland. During grassland monitoring, the characteristic data, such as grass height, grass cover, and ABG; the administrative region, grassland type, slope, aspect, grassland utilization status; and the longitude, latitude, and elevation of the observation point were recorded. Photographs of sample plots and landscapes were taken. Given that an abnormal value for the ground measurement data may affect the accuracy of the estimation model, the biomass data from ground observation sample points within an image element corresponding to the same geographical location of the remote sensing image were combined, and their average value was used to represent the AGB of the ground measurement and the image element. The AGB of the grasses in the characteristic data was measured by drying the fresh above-ground grasses (harvested flush) at 65&#xb0;C for 48 hours in an oven until a stable weight was reached to obtain the dry matter yield (<xref ref-type="fig" rid="f1">
<bold>Figure 1</bold>
</xref>).</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>Correlations of vegetation indices and AGB of different types of grasslands in Xinjiang.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-14-1152432-g001.tif"/>
</fig>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Environmental variables</title>
<p>The digital elevation model (DEM) data used in this study were downloaded from the sharing website (<ext-link ext-link-type="uri" xlink:href="http://srtm.csi.cgiar.org/">http://srtm.csi.cgiar.org/</ext-link>); the data spatial resolution is 90 m, and the data format is GeoTIFF. For China&#x2019;s administrative boundary data, the national 1:4 million administrative division data released by the National Geomatics Center of China was adopted; the World Geodetic System 1984 (WGS-84) was used for map projection, and the ellipsoid was WGS-84. To carry out subsequent statistical analysis, ArcGIS software was used to extract the longitude, latitude, elevation, slope, and aspect of the field monitoring points of grassland biomass.</p>
<p>The ANUSPLIN software method is an extension of the thin-plate smoothing spline method (<xref ref-type="bibr" rid="B6">Bates et&#xa0;al., 1987</xref>) that introduces multiple covariate submodels to perform meteorological spatial interpolation of multiple surfaces simultaneously.</p>
</sec>
<sec id="s2_4">
<label>2.4</label>
<title>MODIS data</title>
<p>The MODIS surface reflectance product (MOD09GA) was used in this study. The data were obtained from the Earth Observing System Data and Information System (EOSDIS) website (<ext-link ext-link-type="uri" xlink:href="https://earthdata.nasa.gov/">https://earthdata.nasa.gov/</ext-link>) of the National Aeronautics and Space Administration (NASA); the format is EOS-HDF, and the spatial resolution is 500 m. The MOD09GA product is the daily surface reflectance estimate, including the reflectance data for MODIS bands 1-7. The daily MOD09GA product covering the entire Xinjiang region requires six scenes, and the track numbers are h23v04, h24v04, h25v04, h23v05, h24v05, and h25v05. The MOD09GA images of grassland during the peak production period (July-August) were downloaded for eight years (2014-2021).</p>
<p>The data preprocessing included the following main steps: (1) Using the MODIS projection conversion tool MODIS Reprojection Tool (MRT) software, the daily MOD09GA reflectance data from July to August in the Xinjiang region from 2014 to 2021 were processed by projection conversion and splicing. The sinusoidal projection was converted into the Albers map projection, the ellipsoid was WG-S84, and the nearest neighbor method was used for resampling. The final output image file format was GeoTIFF, and the daily reflectance data for MOD09GA band 1-7 were obtained. (2) Using the ArcGIS spatial analysis method and MODIS reflectance data, 15 daily vegetation indices closely related to biomass, such as NDVI, EVI, SAVI, modified SAVI (MSAVI), soil-adjusted total vegetation index (SATVI), optimized SAVI (OSAVI), reflectance index 1 (RI1), plant pigment ratio (PPR), phosphorous buffer index (PBI), thermal and shortwave infrared 2 (SWI2), global vegetation index (GVI), radar vegetation index (RVI), B7/B2, B7/B5, and B2/B1, were calculated (<xref ref-type="bibr" rid="B34">Price et&#xa0;al., 2002</xref>; <xref ref-type="bibr" rid="B49">Zandler et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B26">Liang et&#xa0;al., 2016</xref>). The maximum value composites (MVC) method was used to generate the monthly maximum vegetation index image data in July and August from 2012 to 2021.</p>
</sec>
<sec id="s2_5">
<label>2.5</label>
<title>Construction and evaluation of the grassland biomass model</title>
<sec id="s2_5_1">
<label>2.5.1</label>
<title>Univariate model</title>
<p>Taking grassland AGB as the dependent variable, geographic location and topography(longitude, latitude, and elevation), meteorological factors (annual mean temperature, annual precipitation), vegetation biophysical indicators (grass cover, grass height), and 15 MODIS vegetation indices corresponding to ground-measured sample points were used as independent variables to analyze the correlations between grassland AGB and 15 MODIS vegetation indices, longitude, latitude, elevation, annual mean temperature, annual mean precipitation, grassland cover, and grass height.</p>
</sec>
<sec id="s2_5_2">
<label>2.5.2</label>
<title>PCA of the main variables of grassland biomass</title>
<p>Grassland biomass models use variables that can either be single variables or combine information from multiple variables. To select a variety of variables reasonably and solve the problem of high information overlap and autocorrelation between variables, this study adopted the PCA method. PCA initially selects the internal structure of each variable and converts multiple variables into a few comprehensive variables, and these variables are independent of each other and contain most of the information of the original variables, reducing data dimensionality (<xref ref-type="bibr" rid="B20">Hossain et&#xa0;al., 2011</xref>). In this study, PCA was adopted for variables that passed the significance test, such as the MODIS vegetation indices, geographical location and topography, meteorological variables, and vegetation biophysical variables, to remove the correlations between the variables, and the main information was concentrated on the principal variables.</p>
</sec>
<sec id="s2_5_3">
<label>2.5.3</label>
<title>Construction of the grassland biomass model</title>
<p>According to the key variables of PCA and the grassland AGB from 2014 to 2021, a database was established, including a total of 1201 records. In this paper, the 10-fold cross-validation method is used to evaluate the performance ability of univariate parameter models (<xref ref-type="bibr" rid="B29">Liu et&#xa0;al., 2017</xref>), SPSS 26 software was used to randomly select 90% of the records for grassland biomass modeling and 10% of the data for accuracy verification,repeat the selection of the training and test sets 10 times until all samples appear in the test and training sets (<xref ref-type="bibr" rid="B30">Meng et&#xa0;al., 2017</xref>). Among them, there are 383 desert grasslands, with 345 modeling data points and 38 validation data points; 562 steppes, with 506 modeling data points and 56 validation data points; and 256 meadows, with 230 modeling data points and 26 validation data points. In this study, multivariate regression analysis and machine learning methods were used to construct grassland AGB models.</p>
<p>(1) SPSS 26 was used to build multivariate regression models (linear, exponential, logarithmic and power) (<xref ref-type="bibr" rid="B18">Ge et&#xa0;al., 2018</xref>);</p>
<p>(2) Three types of machine learning models, including backpropagation-artificial neural network (BP-ANN), support vector machine (SVM), and RF, were used as the multivariate nonparametric models, and MATLAB software was used to construct multivariate nonparametric models using different factors and their combinations that are significantly correlated with the grassland AGB.</p>
<p>The SVM regression model is an algorithm based on supervised learning; its core algorithm is to construct a set of hyperplanes in a high-dimensional or infinite latitude space, based on which it performs classification and regression (<xref ref-type="bibr" rid="B45">Yang et&#xa0;al., 2016</xref>). The SVM model is not sensitive to the sample size of the training set, and compared to other machine learning methods, it can produce comparable accuracy using a smaller training sample size (<xref ref-type="bibr" rid="B8">Camps-Valls et&#xa0;al., 2006</xref>). SVM regression can be implemented using the &#x201c;LIBSVM&#x201d; package in MATLAB (R2019b) (<xref ref-type="bibr" rid="B10">Veraverbeke et al., 2012</xref>; <xref ref-type="bibr" rid="B10">Chang and Lin, 2011</xref>). The SVM type is epsilon-SVR, and the kernel function type is the radial basis function (RBF). When using the SVM model to estimate the grassland AGB, the same training set and test set of the multivariate regression models were used to construct the SVM regression model.</p>
<p>The BP-ANN is composed of an input layer, one or more hidden layers, and an output layer. In the linkage of each layer, the information transmission process is a one-way transmission, i.e., the information is first input in the input layer and processed in the hidden layers and finally passed to the output layer (<xref ref-type="bibr" rid="B47">Yuan et&#xa0;al., 2017</xref>). In this study, the Levenberg-Marquardt algorithm was selected for model training. The final two parameters in the ANN regression model are the number of neurons and hidden layers. The more neurons and hidden layers there are, the higher the learning accuracy and the weaker the generalization ability of the model. In this study, the numbers of hidden layers and neurons were obtained by trial and error, and the establishment and verification of the BP-ANN model were implemented based on the neural network toolbox in MATLAB (R2019b) (<xref ref-type="bibr" rid="B40">Tiryaki and Aydin, 2014</xref>).</p>
<p>RF is a nonparametric nonlinear model construction method that improves prediction accuracy by applying a series of training trees, and its theoretical basis lies in the classification tree algorithm. The RF regression model adopts the bootstrap sampling method. The samples extracted each time are used to construct a decision tree, multiple decision trees are formed, and the final prediction result is obtained by voting (<xref ref-type="bibr" rid="B7">Breiman, 2001</xref>). Therefore, one advantage of using RF to build a model is that there is no overfitting (<xref ref-type="bibr" rid="B19">Han, 2001</xref>). In this study, both the construction and accuracy verification of the RF model were performed in MATLAB using the RF_MexStandalone-v0.02 toolkit.</p>
</sec>
</sec>
<sec id="s2_6">
<label>2.6</label>
<title>Model validation</title>
<p>The accuracy of the model was evaluated based on the coefficient of determination (R<sup>2</sup>) and the root mean square error (RMSE) between the measured value and the corresponding simulated value. The R<sup>2</sup> ranges from 0 to 1. The closer the R<sup>2</sup> value is to 1, the higher the accuracy of the model and the higher the reliability. RMSE is used to measure the deviation between the predicted value and the measured value, and the smaller the value, the better the fit of the constructed grassland AGB model. According to the model accuracy and error size, the grassland biomass inversion model in the study area was finally determined. The constructed biomass model was verified by using field-measured grassland data from 2014 to 2021, including 38 desert grasslands, 56 steppes, and 26 meadows. RMSE and R<sup>2</sup> were calculated as follows:</p>
<disp-formula>
<label>(1)</label>
<mml:math display="block" id="M1">
<mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>R</mml:mi>
<mml:mi>M</mml:mi>
</mml:mstyle>
<mml:mi>S</mml:mi>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>E</mml:mi>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>i</mml:mi>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>n</mml:mi>
</mml:mstyle>
</mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>Z</mml:mi>
</mml:mstyle>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>x</mml:mi>
</mml:mstyle>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>Z</mml:mi>
</mml:mstyle>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>y</mml:mi>
</mml:mstyle>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>n</mml:mi>
</mml:mstyle>
</mml:mfrac>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula>
<label>(2)</label>
<mml:math display="block" id="M2">
<mml:mrow>
<mml:msup>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>R</mml:mi>
</mml:mstyle>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>i</mml:mi>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>n</mml:mi>
</mml:mstyle>
</mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>Z</mml:mi>
</mml:mstyle>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>i</mml:mi>
</mml:mstyle>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>Z</mml:mi>
</mml:mstyle>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>y</mml:mi>
</mml:mstyle>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>i</mml:mi>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>n</mml:mi>
</mml:mstyle>
</mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>Z</mml:mi>
</mml:mstyle>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>i</mml:mi>
</mml:mstyle>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>Z</mml:mi>
</mml:mstyle>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>x</mml:mi>
</mml:mstyle>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Where Z<sub>x</sub> and Z<sub>y</sub> represent the actual observed value and predicted value, respectively, and n is the number of samples used for validation.</p>
</sec>
</sec>
<sec id="s3" sec-type="results">
<label>3</label>
<title>Results and analysis</title>
<sec id="s3_1" sec-type="results">
<label>3.1</label>
<title>Results and analysis</title>
<p>
<xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref> shows the correlations between grassland AGB during the period of peak production from 2014 to 2021 (July to mid-August) and the NDVI, EVI, SAVI, MSAVI, SATVI, OSAVI, RI1, PPR, of PBI, SWI2, GVI, RVI B7/B2, B7/B5, and B2/B1 calculated based on MODIS band 1-7 reflectance data. <xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref> indicates that in the Xinjiang desert grasslands, the SAVI has the best correlation with the grassland AGB (R<sup>2</sup> = 0.255, <italic>p</italic>&lt; 0.01), followed by MSAVI, NDVI, OSAVI, B2/B1, SATVI, GVI, EVI, RVI, B7/B5, B7/B2, RI1, SWI2, PBI, and PPR. The correlation between the NDVI and temperate steppe AGB was the highest (R<sup>2</sup> = 0.372, <italic>p&lt;</italic> 0.01), followed by MSAVI, OSAVI, SAVI, EVI, GVI, RI1, SATVI, B7/B2, RVI, B7/B5, B7/B2, PPR, SWI2, PBI, etc. The alpine grassland AGB shows the highest correlation with the OSAVI (R<sup>2</sup> = 0.285, <italic>p</italic>&lt; 0.01), followed by SAVI, MSAVI, NDVI, GVI, SATVI, EVI, B2/B1, RI1, B7/B5, B7/B2, PPR, RVI, SWI2, PBI, and so on. The linear regression models between the PPR, SWI2, and PBI indices and desert grassland AGB did not pass the significance test, and other vegetation indices passed the F-test at the significance level of 0.05 or 0.01. In summary, when using a single vegetation index to invert the AGB of desert grasslands, steppes, and meadows in Xinjiang, the SAVI, NDVI, and OSAVI should be selected.</p>
<fig id="f2" position="float">
<label>Figure&#xa0;2</label>
<caption>
<p>Correlations of environmental factors and AGB of different types of grasslands in Xinjiang.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-14-1152432-g002.tif"/>
</fig>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Comparative analysis of univariate AGB monitoring models of alpine grassland</title>
<p>
<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref> shows the linear regression analysis results between environmental variables and AGB of three types of grasslands: desert grassland, steppe, and meadow. From the perspective of geographical location and topographical factors, the AGB of the three types of grasslands was not highly correlated with geographical location and topographical factors. Among them, the correlations between the AGB of desert grassland and the aspect, elevation, and slope of the observation points are extremely significant (<italic>p</italic>&lt; 0.01), and the correlations with longitude and latitude are not significant (<italic>p</italic> &gt; 0.01); the correlations between the steppe AGB and the longitude, elevation, and slope of the observation points are extremely significant (<italic>p&lt;</italic> 0.01), and the correlations with slope and latitude are not significant (<italic>p</italic> &gt; 0.01); The correlation between the above-ground biomass of meadows and the longitude, latitude and elevation of the observation sites reached a highly significant level (<italic>P</italic>&lt; 0.01), while the correlation with slope and gradient was not significant (<italic>P</italic>&gt; 0.01). From the perspective of climatic factors, the correlations between AGB of desert grasslands and annual precipitation are extremely significant (<italic>p&lt;</italic> 0.01, R<sup>2</sup> = 0.180), but the correlation with temperature is not significant (<italic>p</italic> &gt; 0.01); The correlations between the AGB of meadows and annual precipitation and temperature are extremely significant (<italic>p&lt;</italic> 0.01); the correlations between meadow AGB and temperature and precipitation are extremely significant (<italic>p&lt;</italic> 0.01). From the perspective of vegetation biophysical indicators, except that the AGB of the desert grasslands is not significantly correlated with the grass cover, the AGB of the three types of grasslands is extremely significantly correlated with the grass cover and the grass height. Overall, among the nine environmental factors, the AGB of the desert grasslands has the highest correlation with the grass height (R<sup>2</sup> = 0.182), followed by annual precipitation (R<sup>2</sup> = 0.180), grass cover (R<sup>2</sup> = 0.047), and aspect (R<sup>2</sup> = 0.043); the steppe AGB has the highest correlation with grass height (R<sup>2</sup> = 0.344), followed by annual precipitation (R<sup>2</sup> = 0.261), grass cover (R<sup>2</sup> = 0.196), and annual temperature (R<sup>2</sup> = 0.170); the meadow AGB has the highest correlation with the annual mean temperature of the observation points (R<sup>2</sup> = 0.305), followed by grass height (R<sup>2</sup> = 0.261), temperature (R<sup>2</sup> = 0.146), and grass cover (R<sup>2 =</sup> 0.126). It can be seen that, except for the vegetation index factor, the correlations of the above-ground biomass of grasses in the three types of grass at the peak grass stage were significantly different from the geographical location of the observation sites, topographic factors, climatic factors, and vegetation biophysical indicators.</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Biomass model construction index screening</title>
<p>The above analysis reveals that the single vegetation index or environmental variable that is most closely correlated with AGB can only reflect 25.47% of the AGB of desert grasslands, 37.17% of the AGB of temperate grasslands, and 28.50% of the AGB of alpine grasslands in Xinjiang (<xref ref-type="fig" rid="f2">
<bold>Figures&#xa0;2</bold>
</xref>, <xref ref-type="fig" rid="f3">
<bold>3</bold>
</xref>). Therefore, biomass inversion models that simply use the MODIS vegetation indices or other environmental variables are prone to great errors and uncertainties. To avoid the poor accuracy with univariate biomass inversion models, this study explored the multivariate AGB monitoring models with factors that are closely correlated with AGB and independent of each other as independent variables. PCA was used to screen the vegetation indices and environmental variables with an extremely significant correlation with the grassland AGB. The results indicated that the KMO values of desert grassland, steppe, and meadow were all greater than 0.8, and the <italic>p-value</italic> was less than 0.0001, reaching the extremely significant level of 0.01, indicating that the selected variables met the requirements of PCA. <xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref> shows the principal variables with a cumulative contribution rate over 85%. SAVI, aspect, slope, and Prec were selected for desert grassland; NDVI, SWI2, longitude, mean temperature, and annual precipitation were selected for steppe; and OSAVI, PPR, longitude, precipitation, and temperature were selected for meadow.</p>
<fig id="f3" position="float">
<label>Figure&#xa0;3</label>
<caption>
<p>Cumulative contribution rate of variables for the AGB of different types of grasslands in Xinjiang.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-14-1152432-g003.tif"/>
</fig>
<fig id="f4" position="float">
<label>Figure&#xa0;4</label>
<caption>
<p>Distribution map of grassland sampling points <bold>(A)</bold> and meteorological stations <bold>(B)</bold> in Xinjiang from 2014 to 2021.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-14-1152432-g004.tif"/>
</fig>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Biomass inversion based on the multivariate regression method</title>
<p>Linear, logarithmic, power, and exponential models were analyzed based on the selected principal components of the three types of grasslands: desert grassland, steppe, and meadow (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>). For desert grasslands, the exponential model is the best, with an R<sup>2</sup> of 0.42 and RMSE of 419.56 kg/ha. For steppe and meadow, the power function model is the best, with R<sup>2</sup> values of 0.56 and 0.50 and RMSE values of 811.99 kg/ha and 737.90 kg/ha, respectively. Therefore, the exponential model is more suitable for biomass inversion of desert grasslands, and the power function model is more suitable for biomass inversion of steppe and meadows.</p>
<table-wrap id="T1" position="float">
<label>Table&#xa0;1</label>
<caption>
<p>Accuracy evaluation of multivariate regression models.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Type</th>
<th valign="top" align="left">Model type</th>
<th valign="top" colspan="3" align="left">Training set</th>
<th valign="top" colspan="2" align="left">Test set</th>
</tr>
<tr>
<th valign="top" align="left"/>
<th valign="top" align="center"/>
<th valign="top" align="left">Formula</th>
<th valign="top" align="left">R<sup>2</sup>
</th>
<th valign="top" align="left">RMSE</th>
<th valign="top" align="left">R<sup>2</sup>
</th>
<th valign="top" align="left">RMSE</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" rowspan="4" align="left">Desert (n=383)</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">y=-0.026 + 0.026slope + 0.013aspect + 0.266precipitation_20150 + 0.402SAVI</td>
<td valign="top" align="left">0.41</td>
<td valign="top" align="left">347.01</td>
<td valign="top" align="left">0.40</td>
<td valign="top" align="left">432.72</td>
</tr>
<tr>
<td valign="top" align="left">Logarithm</td>
<td valign="top" align="left">y=0.408-0.005 *LN(slope) + 0.004 * LN(aspect) + 0.006 * LN(precipitation _20150) + 0.114 * LN(SAVI)</td>
<td valign="top" align="left">0.31</td>
<td valign="top" align="left">373.47</td>
<td valign="top" align="left">0.04</td>
<td valign="top" align="left">1775.12</td>
</tr>
<tr>
<td valign="top" align="left">Power</td>
<td valign="top" align="left">y=0.860 * (slope0.041) * (aspect0.024) * (precipitation _201500.554) * (SAVI0.535)</td>
<td valign="top" align="left">0.42</td>
<td valign="top" align="left">343.36</td>
<td valign="top" align="left">0.41</td>
<td valign="top" align="left">426.39</td>
</tr>
<tr>
<td valign="top" align="left">Exponential</td>
<td valign="top" align="left">y=0.069 * e(0.252 * slope) * e(0.158 * aspect) * e(1.180 * precipitation _20150) * e(1.438 * SAVI)</td>
<td valign="top" align="left">0.39</td>
<td valign="top" align="left">352.29</td>
<td valign="top" align="left">0.42</td>
<td valign="top" align="left">419.56</td>
</tr>
<tr>
<td valign="top" rowspan="4" align="left">Steppe (n=562)</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">y=-0.113-0.053 * Y + 0.259 * avg_201509 + 0.328 * precipitation _20150 + 0.036 * SWI2 + 0.194 * NDVI</td>
<td valign="top" align="left">0.54</td>
<td valign="top" align="left">865.38</td>
<td valign="top" align="left">0.52</td>
<td valign="top" align="left">928.26</td>
</tr>
<tr>
<td valign="top" align="left">Logarithmic</td>
<td valign="top" align="left">y=0.262 + 0.004 * LN(Y) + 0.006 * LN(avg_201509) + 0.018 * LN(precipitation _20150) + 0.009 * LN(SWI2) + 0.014 * LN(NDVI)</td>
<td valign="top" align="left">0.10</td>
<td valign="top" align="left">1216.74</td>
<td valign="top" align="left">0.29</td>
<td valign="top" align="left">1178.26</td>
</tr>
<tr>
<td valign="top" align="left">Power</td>
<td valign="top" align="left">y=0.950 * (Y-0.012) * (avg_2015090.851) * (precipitation _201500.630) * (SWI20.181) * (NDVI0.239)</td>
<td valign="top" align="left">0.55</td>
<td valign="top" align="left">837.46</td>
<td valign="top" align="left">0.56</td>
<td valign="top" align="left">811.99</td>
</tr>
<tr>
<td valign="top" align="left">Exponential</td>
<td valign="top" align="left">y=0.040 * e(-0205 * Y) * e(1.325 * avg_201509) * e(1.331 * precipitation _20150) * e(0.215 * SWI2) * e(0.804 * NDVI)</td>
<td valign="top" align="left">0.56</td>
<td valign="top" align="left">814.17</td>
<td valign="top" align="left">0.55</td>
<td valign="top" align="left">827.87</td>
</tr>
<tr>
<td valign="top" rowspan="4" align="left">Meadow (n=256)</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">y=-0.026 + 0.026slope + 0.013aspect + 0.266 precipitation _20150 + 0.402SAVI</td>
<td valign="top" align="left">0.41</td>
<td valign="top" align="left">347.01</td>
<td valign="top" align="left">0.40</td>
<td valign="top" align="left">432.72</td>
</tr>
<tr>
<td valign="top" align="left">Logarithm</td>
<td valign="top" align="left">y=0.408-0.005 *LN(slope) + 0.004 * LN(aspect) + 0.006 * LN(precipitation _20150) + 0.114 * LN(SAVI)</td>
<td valign="top" align="left">0.31</td>
<td valign="top" align="left">373.47</td>
<td valign="top" align="left">0.04</td>
<td valign="top" align="left">1775.12</td>
</tr>
<tr>
<td valign="top" align="left">Power</td>
<td valign="top" align="left">y=0.860 * (slope0.041) * (aspect0.024) * (precipitation _201500.554) * (SAVI0.535)</td>
<td valign="top" align="left">0.42</td>
<td valign="top" align="left">343.36</td>
<td valign="top" align="left">0.41</td>
<td valign="top" align="left">426.39</td>
</tr>
<tr>
<td valign="top" align="left">Exponential</td>
<td valign="top" align="left">y=0.069 * e(0.252 * slope) * e(0.158 * aspect) * e(1.180 * precipitation _20150) * e(1.438 * SAVI)</td>
<td valign="top" align="left">0.39</td>
<td valign="top" align="left">352.29</td>
<td valign="top" align="left">0.42</td>
<td valign="top" align="left">419.56</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>* in the text means multiply.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s3_5">
<label>3.5</label>
<title>Nonparametric-based biomass inversion</title>
<p>Using the 10-fold cross-validation method, BP-ANN, SVM, and RF models for grassland AGB estimation of the three grassland types of desert grassland, steppe, and meadow were constructed using the selected principal components (<xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>). Comparing the results of <xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref> with those of <xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>, it is known that BP-ANN, SVM and RF significantly outperformed the multifactor-based linear and nonlinear regression model in inverting the three grassland types in the study area. <xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref> lists the accuracy evaluation results of the SVM, RF, and BPNN regression models for the biomass inversion of the three grassland types. The accuracy evaluation results indicate that the dry weight of desert grasslands predicted by the SVM regression model is the best (R<sup>2</sup> = 0.43, RMSE = 356.62 kg/ha), and the accuracy of the dry weight of the meadow (R<sup>2</sup> = 0.64, RMSE = 503.10 kg/ha) and steppe (R<sup>2</sup> = 0.65, RMSE = 763.33 kg/ha) predicted by the RF models is higher than that of other two machine learning methods.</p>
<table-wrap id="T2" position="float">
<label>Table&#xa0;2</label>
<caption>
<p>Accuracy of the multivariate machine learning regression models for three grassland types evaluated by the 10-fold cross-validation method.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Grass type</th>
<th valign="top" align="center">Model</th>
<th valign="top" align="center">R<sup>2</sup>
</th>
<th valign="top" align="center">RMSE</th>
<th valign="top" align="center">R<sup>2</sup>
</th>
<th valign="top" align="center">RMSE</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="middle" rowspan="4" align="left">Desert (n=383)</td>
<td valign="top" align="left"/>
<td valign="top" colspan="2" align="left">training set</td>
<td valign="top" colspan="2" align="left">test set</td>
</tr>
<tr>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">0.51</td>
<td valign="top" align="left">334.92</td>
<td valign="top" align="left">0.43</td>
<td valign="top" align="left">356.62</td>
</tr>
<tr>
<td valign="top" align="left">RF</td>
<td valign="top" align="left">0.89</td>
<td valign="top" align="left">179.05</td>
<td valign="top" align="left">0.42</td>
<td valign="top" align="left">356.98</td>
</tr>
<tr>
<td valign="top" align="left">BPNN</td>
<td valign="middle" align="left">0.54</td>
<td valign="middle" align="left">314.05</td>
<td valign="middle" align="left">0.42</td>
<td valign="middle" align="left">368.46</td>
</tr>
<tr>
<td valign="middle" rowspan="4" align="left">Meadow (n=256)</td>
<td valign="top" align="left"/>
<td valign="top" colspan="2" align="left">training set</td>
<td valign="top" colspan="2" align="left">test set</td>
</tr>
<tr>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">0.61</td>
<td valign="top" align="left">543.66</td>
<td valign="top" align="left">0.56</td>
<td valign="top" align="left">537.55</td>
</tr>
<tr>
<td valign="top" align="left">RF</td>
<td valign="top" align="left">0.91</td>
<td valign="top" align="left">279.86</td>
<td valign="top" align="left">0.64</td>
<td valign="top" align="left">503.10</td>
</tr>
<tr>
<td valign="top" align="left">BPNN</td>
<td valign="middle" align="left">0.71</td>
<td valign="middle" align="left">456.45</td>
<td valign="middle" align="left">0.51</td>
<td valign="middle" align="left">604.02</td>
</tr>
<tr>
<td valign="middle" rowspan="4" align="left">Steppe (n=562)</td>
<td valign="top" align="left"/>
<td valign="top" colspan="2" align="left">training set</td>
<td valign="top" colspan="2" align="left">test set</td>
</tr>
<tr>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">0.62</td>
<td valign="top" align="left">797.33</td>
<td valign="top" align="left">0.58</td>
<td valign="top" align="left">831.31</td>
</tr>
<tr>
<td valign="top" align="left">RF</td>
<td valign="top" align="left">0.93</td>
<td valign="top" align="left">372.49</td>
<td valign="top" align="left">0.65</td>
<td valign="top" align="left">763.33</td>
</tr>
<tr>
<td valign="top" align="left">BPNN</td>
<td valign="middle" align="left">0.67</td>
<td valign="middle" align="left">739.45</td>
<td valign="middle" align="left">0.60</td>
<td valign="middle" align="left">817.75</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>In each evaluation, except for BPNN, the machine learning methods use 10% of the samples as the test set and the rest as the training set; for BPNN, 80% of the samples constitute the training set, and the remaining 20% of the samples are used as the test set and validation set.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>When using the SVM regression model to predict the dry weight of desert grassland, meadow, and steppe, the prediction accuracy of the three grassland types is in the following order: steppe &gt; meadow &gt; desert (<xref ref-type="fig" rid="f5">
<bold>Figures&#xa0;5A, D, G</bold>
</xref>). When using RF model (<xref ref-type="fig" rid="f5">
<bold>Figures&#xa0;5B, E, H</bold>
</xref>) and the BPNN model (<xref ref-type="fig" rid="f5">
<bold>Figures&#xa0;5C, F, I</bold>
</xref>) to predict the dry weight of three different grassland types, the prediction accuracy follows the same order: steppe &gt; meadow &gt; desert.</p>
<fig id="f5" position="float">
<label>Figure&#xa0;5</label>
<caption>
<p>The relationships between the dry weight of the three grassland types predicted by machine learning models using the test set and the measured dry weight. <bold>(A&#x2013;C)</bold> are the relationships between the dry weight of the desert predicted by SVM, RF, and BPNN models, respectively, using the test set and the measured dry weight; <bold>(D&#x2013;F)</bold> are the relationships between the dry weight of the meadow predicted by SVM, RF, and BPNN models, respectively, using the test set and the measured dry weight; and <bold>(G&#x2013;I)</bold> are the relationships between the dry weight of the steppe predicted by SVM, RF, and BPNN models, respectively, using the test set and the measured dry weight.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-14-1152432-g005.tif"/>
</fig>
</sec>
</sec>
<sec id="s4" sec-type="discussion">
<label>4</label>
<title>Discussion</title>
<sec id="s4_1">
<label>4.1</label>
<title>Accuracy analysis of the grassland AGB inversion models based on the remote sensing vegetation indices</title>
<p>In this study, we analyzed the correlation between 15 vegetation indices such as MODIS NDVI, SAVI, and EVI or wave combinations and above-ground biomass of grassland. The grassland AGB has the highest correlation with NDVI (R<sup>2</sup> = 0.372), and other vegetation indices with high correlations are MSAVI, OSAVI, and SAVI. The SAVI is the best vegetation index for the AGB inversion of desert grassland (R<sup>2</sup> = 0.255), and other vegetation indices with good inversion effects are MSAVI, NDVI, and OSAVI, which is consistent with the results of Veraverbeke et&#xa0;al. (2012); that is, when performing grassland biomass inversion in sparsely vegetated areas, SAVI is better than other single vegetation indices. The best vegetation index for meadow AGB inversion is OSAVI (R<sup>2</sup> = 285), and other indices, such as SAVI, MSAVI, and NDVI, are also good for meadow AGB inversion. Although the best indices for AGB inversion of the three grassland types are different, the best vegetation indices are generally NDVI, SAVI, MSAVI, and OSAVI. Due to the sparse vegetation in Xinjiang, when a pixel is composed of green vegetation and soil background, soil-adjusted indices, such as SAVI, MSAVI, and OSAVI, can eliminate the influence of the soil background value, and the inversion effect is better (<xref ref-type="bibr" rid="B4">Bannari et&#xa0;al., 1995</xref>; <xref ref-type="bibr" rid="B39">Silleos et&#xa0;al., 2006</xref>). When only using a single vegetation index for the grassland biomass inversion, the grassland biomass inversion accuracy of desert, steppe, and meadow is different; steppe has the highest accuracy, followed by meadow, and the desert grassland has the lowest accuracy. Due to the low vegetation cover in most of the desert area, the leaves of the vegetation are small, and with low chlorophyll concentration; therefore, the &#x201c;contamination&#x201d; of the target signal by background information is prone to occur in the process of detecting vegetation spectral information by satellite remote sensing sensors, and the sensitivity of sensors to detect vegetation spectral information in desert areas is reduced, making the vegetation spectral information obtained from satellite remote sensing images extremely weak or even difficult to detect by satellite remote sensing sensors (<xref ref-type="bibr" rid="B41">Townshend and Justice, 1986</xref>; <xref ref-type="bibr" rid="B42">Vanselow and Samimi, 2014</xref>), which could lead to predicted values that are higher or lower than actual values during grassland biomass inversion.</p>
<p>In view of the low accuracy of using a single vegetation index for grassland AGB inversion in Xinjiang, other environmental factors related to the grassland AGB, such as meteorological factors, geographic location and topography, and vegetation biological indicators, must be added to the grassland AGB inversion models. This study found that the AGB of desert grasslands and steppes has the highest correlation with grass height, with R<sup>2</sup> values reaching 0.182 and 0.344, respectively, and precipitation also has a high correlation with the AGB of desert grasslands and steppes; for the AGB of meadows, the index with the highest correlation is temperature, R<sup>2</sup> = 0.305, followed by the grass height. Since there is less precipitation in desert grasslands and steppes, precipitation is one of the most critical factors in determining grassland biomass. (<xref ref-type="bibr" rid="B38">Shoshany and Karnibad, 2011</xref>) also found that in large-scale arid areas with sparse vegetation, precipitation is a key indicator to construct a biomass model (<xref ref-type="bibr" rid="B38">Shoshany and Karnibad, 2011</xref>). For meadows, the elevation is high, and the temperature is low, and temperature is a key factor in determining meadow AGB(<xref ref-type="bibr" rid="B50">Zhang et&#xa0;al., 2019a</xref>), which indicates that for different vegetation types, the responses to climatic factors are inconsistent due to different growth environments. Our study found that the correlation between a single environmental variable and the biomass of the three grassland types is not high, but the correlation of most variables is extremely significant (<italic>p&lt;</italic> 0.01). (<xref ref-type="bibr" rid="B46">Yang et&#xa0;al., 2018</xref>) found similar results in studying grassland biomass in the Sanjiangyuan area. Several scholars have also found that there are many uncertain factors in the grassland biomass inversion using a single vegetation index or environmental index (<xref ref-type="bibr" rid="B46">Yang et&#xa0;al., 2018</xref>). Similarly, other scholars have found similar results (<xref ref-type="bibr" rid="B49">Zandler et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B30">Meng et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B53">Zhou et&#xa0;al., 2021</xref>). Therefore, it is necessary to comprehensively consider the vegetation indices and environmental variables when constructing grassland biomass models. </p>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Screening of indicators for constructing biomass models</title>
<p>Although grassland biomass inversion by a single vegetation index or environmental variable is insufficient, the comprehensive consideration of these variables could provide more information. Of course, some indicators may have high information overlap and high autocorrelation. Using PCA can eliminate the interactions between the evaluation variables. After PCA, the principal components that are independent of each other are formed, and PCA can reduce the workload of selecting the independent variables for biomass inversion (<xref ref-type="bibr" rid="B14">Feliciano et&#xa0;al., 2009</xref>). For the selection of independent variables to construct a grassland biomass inversion model, this study selected the PCA method for the first time and screened out a small number of variables to maximally reflect the information of the original variables to ensure that the loss of original information was small, and the number of variables was as small as possible. Therefore, in this study, correlation analysis and PCA were used to screen the vegetation indices, geographical location and topography, meteorological, and vegetation biophysical indices for the biomass inversion of the three grassland types of desert grassland, steppe, and meadow. The cumulative contribution rate of the selected principal variables of desert grassland, steppe, and meadow is over 85%, which indicates that the information provided by all the variables is included in the eigenvalues of the principal variables, i.e., most of the information is contained. This study revealed that the principal variables of desert grassland are aspect, precipitation, total grass cover, mean grass height, and OSAVI; the principal variables of steppe are Y, aspect, temperature, precipitation, OSAVI, and PBI; and the principal variables of meadow are elevation, grass height, grass cover, precipitation, OSAVI, and PBI. Notably, among the 29 single vegetation indices used in regression models, NDVI has the highest correlation with steppe AGB, and SAVI has the highest correlation with the AGB of desert grassland. However, in the process of screening variables by PCA, NDVI and SAVI were not screened out, and the screened index was OSAVI, because it is an improved vegetation index based on NDVI; OSAVI has a good linear correlation with NDVI and SAVI and can reflect a large amount of information contained in certain indices, such as the unadjusted index NDVI and soil-adjusted indices SAVI, EVI, MSAVI, and SATVI (<xref ref-type="bibr" rid="B34">Price et&#xa0;al., 2002</xref>). Green and red NDVI (GRNDVI), RVI, green&#x2013;red vegetation index (GRVI), and green chlorophyll index (GCI) were screened out in the variable screening, and these vegetation indices have better accuracy for biomass fitting in the univariate regression model. The PBI is one of important variables for steppe and meadow, but the accuracy of the univariate model using PBI was not high; the EVI, GVI, and other high-precision variables were not selected, and the interaction and complementarity of various vegetation indices are an important reason for this phenomenon. The PRI is a color-adjusted index (<xref ref-type="bibr" rid="B11">Coppin and Bauer, 1994</xref>) and is more sensitive to plants with &#x201c;greener&#x201d; leaves, and PBI is a principal variable for steppe and meadow after PCA. The leaf color for desert grassland vegetation is darker and more consistent with the background color, while the leaf color of steppe and meadow is obviously different from the background color; therefore, when performing the biomass inversion of meadows and steppes, using the PBI is beneficial for the biomass inversion (<xref ref-type="bibr" rid="B15">Gamon et&#xa0;al., 1997</xref>).</p>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Comparative analysis of biomass models</title>
<p>Model selection is a key step in accurately estimating grassland biomass. The parametric and nonparametric models for the biomass inversion of desert, steppe, and meadow were compared and analyzed. The inversion accuracy of the parametric models was low, and the logarithmic models had the highest accuracy in estimating steppe and meadow biomass, while the linear function model was the best model for estimating the biomass of desert grassland, with R<sup>2</sup> = 0.323, which can only be used for a rough estimation of biomass over a large grassland area in the study area. Compared with traditional parametric models, nonparametric models can significantly improve the accuracy of grassland biomass estimation, and machine learning algorithms are more suitable for more complex operations, which can better filter and combine variables and greatly improve the accuracy of grassland AGB estimation models (<xref ref-type="bibr" rid="B30">Meng et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B3">Anderson et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B46">Yang et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B52">Zhao et&#xa0;al., 2018</xref>). RF is the model with the highest biomass inversion accuracy for the three types of grasslands, but the accuracy is not the same, with the highest accuracy found for steppe (R<sup>2</sup> = 0.656), followed by meadow (R<sup>2</sup> = 0.61), with desert grassland having the worst accuracy (R<sup>2</sup> = 0.441). Desert grasslands are in arid environments, and due to the influence of the soil background and leaves, the biomass inversion of desert grasslands by remote sensing still needs further exploration. In addition to the RF model, the SVM model performs well for the biomass inversion of meadows and steppes but performs the worst for the biomass inversion of desert grasslands. Therefore, RF should be selected for grassland biomass inversion in Xinjiang. Meng compared and analyzed the parametric and nonparametric models for the estimation of alpine grassland biomass in southern Gansu and found that RF is the optimal grassland biomass inversion model (<xref ref-type="bibr" rid="B30">Meng et&#xa0;al., 2017</xref>), which is essentially consistent with the findings of this study. However, Simple regression models, stepwise multiple regression models, RF models and ANN models have been used for comparison of grassland biomass estimations in the mixed agropastoral zone of northern China, and stepwise multiple regression models were found to be the best models for grassland inversion, which may be related to the number of data samples used to build the models. When the model is built with fewer samples, the parametric model is better than the nonparametric model (<xref ref-type="bibr" rid="B1">Abrougui et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B43">Xu et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B43">Xu et&#xa0;al., 2020</xref>).</p>
</sec>
<sec id="s4_4">
<label>4.4</label>
<title>Factors affecting the accuracy of grassland biomass inversion models</title>
<p>Although parametric or nonparametric models have a high inversion accuracy, some factors still affect the accuracy of grassland biomass inversion. First, there may be spatiotemporal inconsistencies between field sampling ranges and satellite data (<xref ref-type="bibr" rid="B13">Eisfelder et&#xa0;al., 2012</xref>). In terms of spatial consistency, the sampling points are relatively small (i.e., 1 m &#xd7; 1 m or 5 m &#xd7; 5 m). Although each sample land has five plots, each NDVI pixel covers a square of 500 m, which is substantially larger than the sampling point, and this difference inevitably creates modeling errors (<xref ref-type="bibr" rid="B48">Yuan et&#xa0;al., 2016</xref>). For grasslands in high-elevation and desert areas, there are fewer sampling points due to impassability, which could inevitably have a certain impact on the results of grassland biomass inversion. This study demonstrated the feasibility of using PCA for screening indicators for desert grassland, steppe, and meadow, and for grasslands with more sampling points, the machine learning method is the optimal method for grassland biomass inversion. Of course, the machine learning method requires a large amount of ground-measured data. The amount of data sampled in this paper (desert: 383; steppe: 562; and meadow: 256) can support the parameter model to simulate grassland biomass. This study provides a basis for how to select data, models, and predictors and successfully constructed a model with high accuracy for grassland biomass inversion. However, some grassland biomass indicators selected in this study are not easily obtained. For example, the accuracy of the inversion model using the grass height by remote sensing is extremely poor. However, some of the grassland biomass indicators selected in this study are not easily available, such as the extremely poor accuracy of the grassland height remote sensing inversion model, and this indicator is not yet available for automated observation, thus lacking operability in practice and cannot be applied yet.</p>
</sec>
</sec>
<sec id="s5" sec-type="conclusions">
<label>5</label>
<title>Conclusion</title>
<p>This study collected 1201 grassland AGB data points in Xinjiang from 2014 to 2021 and compared the univariate and multivariate AGB inversion models and their accuracy. The main conclusions are as follows:</p>
<list list-type="simple">
<list-item>
<p>(1) Of 15 vegetation indices (NDVI, EVI, SAVI, MSAVI, SATVI, OSAVI, RI1, PPR, PBI, SWI2, GVI, RVI, B7/B2, B7/B5, and B2/B1), except for PPR, PBI, and SWI2, the other vegetation indices have extremely significant correlations with the AGB of desert grasslands (<italic>p</italic>&lt; 0.01), among which SAVI has the strongest correlation. The 15 vegetation indices used in this study all have extremely significant correlations with the AGB of steppes and meadows, among which NDVI has the highest correlation with the steppe AGB and MSAVI has the highest correlation with the meadow AGB.</p>
</list-item>
<list-item>
<p>(2) Grassland AGB is significantly affected by geographical location and topography, climate, and vegetation biophysical indicators, and the accuracy of grassland biomass inversion models based on a single environmental variable is poor.</p>
</list-item>
<list-item>
<p>(3) In the biomass models of three types of grasslands constructed after PCA, the principal variables are different. SAVI, aspect, slope, and Prec are selected for desert grassland; NDVI, SWI2, longitude, mean temperature, and annual precipitation are selected for steppe; and OSAVI, PPR, longitude, precipitation, and temperature are selected for meadow.</p>
</list-item>
<list-item>
<p>(4) The accuracy of biomass models of the three types of grassland constructed by principal variables is significantly improved, and the nonparametric models are all better than the parametric models. The RF model is better than the other models for the biomass inversion of desert grassland, steppe, and meadow. However, for desert grasslands with extremely low vegetation cover, there are great uncertainties in multivariate inversion by remote sensing.</p>
</list-item>
</list>
</sec>
<sec id="s6" sec-type="data-availability">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.</p>
</sec>
<sec id="s7" sec-type="author-contributions">
<title>Author contributions</title>
<p>All authors contributed significantly to this manuscript. Specifically, RZ and JZ designed this study. RZ wrote the main manuscript text and analyzed the data and prepared figures. YM, JG and LZ conducted field surveys. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<sec id="s8" sec-type="funding-information">
<title>Funding</title>
<p>This research was supported by the Xinjiang Uygur Autonomous Region Key Research and Development Program (2022B01012-2) and the National Natural Science Foundation of China (31860145).</p>
</sec>
<ack>
<title>Acknowledgments</title>
<p>Acknowledgement for the data support from the Earth Observing System Data and Information System (EOSDIS) (<ext-link ext-link-type="uri" xlink:href="https://earthdata.nasa.gov/">https://earthdata.nasa.gov/</ext-link>) of the National Aeronautics and Space Administration (NASA). We also appreciate the editor&#x2019;s and reviewers&#x2019; constructive suggestions to greatly improve the paper.</p>
</ack>
<sec id="s9" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s10" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abrougui</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Gabsi</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Mercatoris</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Khemis</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Amami</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Chehaibi</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Prediction of organic potato yield using tillage systems and soil properties by artificial neural network (ANN) and multiple linear regressions (MLR)</article-title>. <source>Soil Tillage Res.</source> <volume>190</volume>, <fpage>202</fpage>&#x2013;<lpage>208</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.still.2019.01.011</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adam</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Mutanga</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Abdel-Rahman</surname> <given-names>E. M.</given-names>
</name>
<name>
<surname>Ismail</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Estimating standing biomass in papyrus (Cyperus papyrus l.) swamp: exploratory of <italic>in situ</italic> hyperspectral indices and random forest regression</article-title>. <source>Int. J. Remote Sens.</source> <volume>35</volume> (<issue>2</issue>), <fpage>693</fpage>&#x2013;<lpage>714</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/01431161.2013.870676</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Anderson</surname> <given-names>K. E.</given-names>
</name>
<name>
<surname>Glenn</surname> <given-names>N. F.</given-names>
</name>
<name>
<surname>Spaete</surname> <given-names>L. P.</given-names>
</name>
<name>
<surname>Shinneman</surname> <given-names>D. J.</given-names>
</name>
<name>
<surname>Pilliod</surname> <given-names>D. S.</given-names>
</name>
<name>
<surname>Arkle</surname> <given-names>R. S.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>Estimating vegetation biomass and cover across large plots in shrub and grass dominated drylands using terrestrial lidar and machine learning</article-title>. <source>Ecol. Indic.</source> <volume>84</volume>, <fpage>793</fpage>&#x2013;<lpage>802</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ecolind.2017.09.034</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bannari</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Morin</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Bonn</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Huete</surname> <given-names>A. R.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>A review of vegetation indices</article-title>. <source>Remote Sens. Rev.</source> <volume>13</volume> (<issue>1-2</issue>), <fpage>95</fpage>&#x2013;<lpage>120</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/02757259509532298</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barati</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Rayegani</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Saati</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Sharifi</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Nasri</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Comparison the accuracies of different spectral indices for estimation of vegetation cover fraction in sparse vegetated areas</article-title>. <source>Egyptian J. Remote Sens. Space Sci.</source> <volume>14</volume> (<issue>1</issue>), <fpage>49</fpage>&#x2013;<lpage>56</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ejrs.2011.06.001</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bates</surname> <given-names>D. M.</given-names>
</name>
<name>
<surname>Lindstrom</surname> <given-names>M. J.</given-names>
</name>
<name>
<surname>Wahba</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Yandell</surname> <given-names>B. S.</given-names>
</name>
</person-group> (<year>1987</year>). <article-title>Gcvpack routines for generalized cross validation</article-title>. <source>Commun. Statistics-Simulation Comput.</source> <volume>16</volume> (<issue>1</issue>), <fpage>263</fpage>&#x2013;<lpage>297</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/03610918708812590</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Breiman</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Random forests</article-title>. <source>Mach. Learn.</source> <volume>45</volume> (<issue>1</issue>), <fpage>5</fpage>&#x2013;<lpage>32</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Camps-Valls</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Bruzzone</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Rojo-Alvarez</surname> <given-names>J. L.</given-names>
</name>
<name>
<surname>Melgani</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Robust support vector regression for biophysical variable estimation from remotely sensed images</article-title>. <source>IEEE Geosci. Remote Sens. Lett.</source> <volume>3</volume> (<issue>3</issue>), <fpage>339</fpage>&#x2013;<lpage>343</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/lgrs.2006.871748</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Catchpole</surname> <given-names>W. R.</given-names>
</name>
<name>
<surname>Wheeler</surname> <given-names>C. J.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>Estimating plant biomass: A review of techniques</article-title>. <source>Australian Journal of Ecology</source>. <volume>17</volume> <issue>(2)</issue>, <fpage>121</fpage>&#x2013;<lpage>131</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1442-9993.1992.tb00790.x</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chang</surname> <given-names>C.-C.</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>C.-J.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>LIBSVM: A library for support vector machines</article-title>. <source>ACM Trans. Intelligent Syst. Technol.</source> <volume>2</volume> (<issue>3</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.1145/1961189.1961199</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Coppin</surname> <given-names>P. R.</given-names>
</name>
<name>
<surname>Bauer</surname> <given-names>M. E.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Processing of multitemporal landsat Tm imagery to optimize extraction of forest cover change features</article-title>. <source>IEEE Trans. Geosci. Remote Sens.</source> <volume>32</volume> (<issue>4</issue>), <fpage>918</fpage>&#x2013;<lpage>927</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/36.298020</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Craine</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Nippert</surname> <given-names>J. B.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Cessation of Burning Dries Soils Long Term in a Tallgrass Prairie</article-title>. <source>Ecosystems</source> <volume>17</volume>(<issue>1</issue>), <fpage>54</fpage>&#x2013;<lpage>65</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s10021-013-9706-8</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eisfelder</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Kuenzer</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Dech</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Derivation of biomass information for semi-arid areas using remote-sensing data</article-title>. <source>Int. J. Remote Sens.</source> <volume>33</volume> (<issue>9</issue>), <fpage>2937</fpage>&#x2013;<lpage>2984</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/01431161.2011.620034</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Feliciano</surname> <given-names>R. P.</given-names>
</name>
<name>
<surname>Bravo</surname> <given-names>M. N.</given-names>
</name>
<name>
<surname>Pires</surname> <given-names>M. M.</given-names>
</name>
<name>
<surname>Serra</surname> <given-names>A. T.</given-names>
</name>
<name>
<surname>Duarte</surname> <given-names>C. M.</given-names>
</name>
<name>
<surname>Boas</surname> <given-names>L. V.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>Phenolic content and antioxidant activity of moscatel dessert wines from the setubal region in Portugal</article-title>. <source>Food Analytical Methods</source> <volume>2</volume> (<issue>2</issue>), <fpage>149</fpage>&#x2013;<lpage>161</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s12161-008-9059-7</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gamon</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Serrano</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Surfus</surname> <given-names>J. S.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>The photochemical reflectance index: an optical indicator of photosynthetic radiation use efficiency across species, functional types, and nutrient levels</article-title>. <source>Oecologia</source> <volume>112</volume> (<issue>4</issue>), <fpage>492</fpage>&#x2013;<lpage>501</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s004420050337</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Guo</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Ganjurjav</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Wan</surname> <given-names>Y.</given-names>
</name>
<etal/>
</person-group>. (<year>2016</year>). <article-title>Climate change and its impacts on vegetation distribution and net primary productivity of the alpine ecosystem in the qinghai-Tibetan plateau</article-title>. <source>Sci. Total Environ.</source> <volume>554</volume>, <fpage>34</fpage>&#x2013;<lpage>41</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.scitotenv.2016.02.131</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Garroutte</surname> <given-names>E. L.</given-names>
</name>
<name>
<surname>Hansen</surname> <given-names>A. J.</given-names>
</name>
<name>
<surname>Lawrence</surname> <given-names>R. L.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Using NDVI and EVI to map spatiotemporal variation in the biomass and quality of forage for migratory elk in the greater Yellowstone ecosystem</article-title>. <source>Remote Sens.</source> <volume>8</volume> (<issue>5</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.3390/rs8050404</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ge</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Meng</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Gao</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>Modeling alpine grassland cover based on MODIS data and support vector machine regression in the headwater region of the huanghe river, China</article-title>. <source>Remote Sens. Environ.</source> <volume>218</volume>, <fpage>162</fpage>&#x2013;<lpage>173</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2018.09.019</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Han</surname> <given-names>L. C.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>A method of modifying error for non-synchronicity of grass yield remote sensing estimation and measurement</article-title>. <source>Int. J. Remote Sens.</source> <volume>22</volume> (<issue>17</issue>), <fpage>3363</fpage>&#x2013;<lpage>3372</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/01431160010006421</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hossain</surname> <given-names>M. B.</given-names>
</name>
<name>
<surname>Patras</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Barry-Ryan</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Martin-Diana</surname> <given-names>A. B.</given-names>
</name>
<name>
<surname>Brunton</surname> <given-names>N. P.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Application of principal component and hierarchical cluster analysis to classify different spices based on <italic>in vitro</italic> antioxidant activity and individual polyphenolic antioxidant compounds</article-title>. <source>J. Funct. Foods</source> <volume>3</volume> (<issue>3</issue>), <fpage>179</fpage>&#x2013;<lpage>189</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jff.2011.03.010</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huete</surname> <given-names>A. R.</given-names>
</name>
<name>
<surname>Jackson</surname> <given-names>R. D.</given-names>
</name>
</person-group> (<year>1988</year>). <article-title>Soil and atmosphere influences on the spectra of partial canopies</article-title>. <source>Remote Sens. Environ.</source> <volume>25</volume> (<issue>1</issue>), <fpage>89</fpage>&#x2013;<lpage>105</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/0034-4257(88)90043-0</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huete</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Justice</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>H.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Development of vegetation and soil indexes for modis-eos</article-title>. <source>Remote Sens. Environ.</source> <volume>49</volume> (<issue>3</issue>), <fpage>224</fpage>&#x2013;<lpage>234</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/0034-4257(94)90018-3</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kearney</surname> <given-names>S. P.</given-names>
</name>
<name>
<surname>Porensky</surname> <given-names>L. M.</given-names>
</name>
<name>
<surname>Augustine</surname> <given-names>D. J.</given-names>
</name>
<name>
<surname>Gaffney</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Derner</surname> <given-names>J. D.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Monitoring standing herbaceous biomass and thresholds in semiarid rangelands from harmonized landsat 8 and sentinel-2 imagery to support within-season adaptive management</article-title>. <source>Remote Sens. Environ.</source> <volume>271</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2022.112907</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lehtonen</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Cienciala</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Tatarinov</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Makipaa</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Uncertainty estimation of biomass expansion factors for Norway spruce in the Czech republic</article-title>. <source>Ann. For. Sci.</source> <volume>64</volume> (<issue>2</issue>), <fpage>133</fpage>&#x2013;<lpage>140</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1051/forest:2006097</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liang</surname> <given-names>S. L.</given-names>
</name>
<name>
<surname>Fang</surname> <given-names>H. L.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>M. Z.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Atmospheric correction of landsat ETM+ land surface imagery - part I: Methods</article-title>. <source>IEEE Trans. ON Geosci. AND Remote Sens.</source> <volume>39</volume> (<issue>11</issue>), <fpage>2490</fpage>&#x2013;<lpage>2498</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/36.964986</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liang</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>X.</given-names>
</name>
<etal/>
</person-group>. (<year>2016</year>). <article-title>Multi-factor modeling of above-ground biomass in alpine grassland: A case study in the three-river headwaters region, China</article-title>. <source>Remote Sens. Environ.</source> <volume>186</volume>, <fpage>164</fpage>&#x2013;<lpage>172</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2016.08.014</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Zhu</surname> <given-names>A. X.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>The refined spatiotemporal representation of soil organic matter based on remote images fusion of sentinel-2 and sentinel-3</article-title>. <source>Int. J. Appl. Earth Observation Geoinformation</source> <volume>89</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jag.2020.102094</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Bi</surname> <given-names>J. W.</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>Z. P.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Multi-class sentiment classification: The experimental comparisons of feature selection and machine learning algorithms</article-title>. <source>Expert Syst. Appl.</source> <volume>80</volume>, <fpage>323</fpage>&#x2013;<lpage>339</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.eswa.2017.03.042</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Long</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Shang</surname> <given-names>Z.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Evaluation method of ecological services function and their value for grassland ecosystems</article-title>. <source>Acta Prataculturae Sin.</source> <volume>20</volume> (<issue>1</issue>), <fpage>167</fpage>&#x2013;<lpage>174</lpage>. doi: 10.1004-5759(2011)20
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meng</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Ge</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Gao</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>Q.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>Evaluation of remote sensing inversion error for the above-ground biomass of alpine meadow grassland based on multi-source satellite data</article-title>. <source>Remote Sens.</source> <volume>9</volume> (<issue>4</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.3390/rs9040372</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Penuelas</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Filella</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Gamon</surname> <given-names>J. A.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Assessment of photosynthetic radiation-use efficiency with spectral reflectance</article-title>. <source>New Phytol.</source> <volume>131</volume> (<issue>3</issue>), <fpage>291</fpage>&#x2013;<lpage>296</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1469-8137.1995.tb03064.x</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Piao</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Ciais</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Lomas</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Beer</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Fang</surname> <given-names>J.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>). <article-title>Contribution of climate change and rising CO2 to terrestrial carbon balance in East Asia: A multi-model analysis</article-title>. <source>Global Planetary Change</source> <volume>75</volume> (<issue>3-4</issue>), <fpage>133</fpage>&#x2013;<lpage>142</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.gloplacha.2010.10.014</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Porter</surname> <given-names>T. F.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Long</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Lawrence</surname> <given-names>R. L.</given-names>
</name>
<name>
<surname>Sowell</surname> <given-names>B. F.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Estimating biomass on CRP pastureland: A comparison of remote sensing techniques</article-title>. <source>Biomass Bioenergy</source> <volume>66</volume>, <fpage>268</fpage>&#x2013;<lpage>274</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.biombioe.2014.01.036</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Price</surname> <given-names>K. P.</given-names>
</name>
<name>
<surname>Guo</surname> <given-names>X. L.</given-names>
</name>
<name>
<surname>Stiles</surname> <given-names>J. M.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Optimal landsat TM band combinations and vegetation indices for discrimination of six grassland types in eastern Kansas</article-title>. <source>Int. J. Remote Sens.</source> <volume>23</volume> (<issue>23</issue>), <fpage>5031</fpage>&#x2013;<lpage>5042</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/01431160210121764</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qi</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Chehbouni</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Huete</surname> <given-names>A. R.</given-names>
</name>
<name>
<surname>Kerr</surname> <given-names>Y. H.</given-names>
</name>
<name>
<surname>Sorooshian</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>A modified soil adjusted vegetation index</article-title>. <source>Remote Sens. Environ.</source> <volume>48</volume> (<issue>2</issue>), <fpage>119</fpage>&#x2013;<lpage>126</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/0034-4257(94)90134-1</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rouse</surname> <given-names>Jr. J. W.</given-names>
</name>
<name>
<surname>Haas</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Schell</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Deering</surname> <given-names>D.</given-names>
</name>
</person-group> (<year>1974</year>). <article-title>Monitoring vegetation systems in the great plains with ERTS</article-title>. <source>NASA special Publ.</source> <volume>351</volume>, <fpage>309</fpage>.</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Scurlock</surname> <given-names>J. M. O.</given-names>
</name>
<name>
<surname>Hall</surname> <given-names>D. O.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>The global carbon sink: a grassland perspective</article-title>. <source>Global Change Biology</source>. <volume>4</volume> (<issue>2</issue>), <fpage>229</fpage>&#x2013;<lpage>233</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1046/j.1365-2486.1998.00151.x</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shoshany</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Karnibad</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Mapping shrubland biomass along Mediterranean climatic gradients: The synergy of rainfall-based and NDVI-based models</article-title>. <source>Int. J. Remote Sens.</source> <volume>32</volume> (<issue>24</issue>), <fpage>9497</fpage>&#x2013;<lpage>9508</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/01431161.2011.562255</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Silleos</surname> <given-names>N. G.</given-names>
</name>
<name>
<surname>Alexandridis</surname> <given-names>T. K.</given-names>
</name>
<name>
<surname>Gitas</surname> <given-names>I. Z.</given-names>
</name>
<name>
<surname>Perakis</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Vegetation indices: Advances made in biomass estimation and vegetation monitoring in the last 30 years</article-title>. <source>Geocarto Int.</source> <volume>21</volume> (<issue>4</issue>), <fpage>21</fpage>&#x2013;<lpage>28</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/10106040608542399</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tiryaki</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Aydin</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>An artificial neural network model for predicting compression strength of heat treated woods and comparison with a multiple linear regression model</article-title>. <source>Construction Building Materials</source> <volume>62</volume>, <fpage>102</fpage>&#x2013;<lpage>108</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.conbuildmat.2014.03.041</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Townshend</surname> <given-names>J. R. G.</given-names>
</name>
<name>
<surname>Justice</surname> <given-names>C. O.</given-names>
</name>
</person-group> (<year>1986</year>). <article-title>Analysis of the dynamics of African vegetation using the normalized difference vegetation index</article-title>. <source>Int. J. Remote Sens.</source> <volume>7</volume> (<issue>11</issue>), <fpage>1435</fpage>&#x2013;<lpage>1445</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/01431168608948946</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vanselow</surname> <given-names>K. A.</given-names>
</name>
<name>
<surname>Samimi</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Predictive mapping of dwarf shrub vegetation in an arid high mountain ecosystem using remote sensing and random forests</article-title>. <source>Remote Sens.</source> <volume>6</volume> (<issue>7</issue>), <fpage>6709</fpage>&#x2013;<lpage>6726</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/rs6076709</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Su</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Jin</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>Q.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>Estimation of degraded grassland aboveground biomass using machine learning methods from terrestrial laser scanning data</article-title>. <source>Ecol. Indic.</source> <volume>108</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ecolind.2019.105747</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Xie</surname> <given-names>H.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Modeling grassland above-ground biomass based on artificial neural network and remote sensing in the three-river headwaters region</article-title>. <source>Remote Sens. Environ.</source> <volume>204</volume>, <fpage>448</fpage>&#x2013;<lpage>455</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2017.10.011</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Jia</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>X.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Comparison of four machine learning methods for generating the GLASS fractional vegetation cover product from MODIS data</article-title>. <source>Remote Sens.</source> <volume>8</volume> (<issue>8</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.3390/rs8080682</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Weisberg</surname> <given-names>P. J.</given-names>
</name>
<name>
<surname>Bristow</surname> <given-names>N. A.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Landsat remote sensing approaches for monitoring long-term tree cover dynamics in semi-arid woodlands: Comparison of vegetation indices and spectral mixture analysis</article-title>. <source>Remote Sens. Environ.</source> <volume>119</volume>, <fpage>62</fpage>&#x2013;<lpage>71</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2011.12.004</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuan</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Tian</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Luo</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>X.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Estimation of above-ground biomass using MODIS satellite imagery of multiple land-cover types in China</article-title>. <source>Remote Sens. Lett.</source> <volume>7</volume> (<issue>12</issue>), <fpage>1141</fpage>&#x2013;<lpage>1149</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/2150704x.2016.1219458</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuan</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Yu</surname> <given-names>H.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>Retrieving soybean leaf area index from unmanned aerial vehicle hyperspectral remote sensing: Analysis of RF, ANN, and SVM regression models</article-title>. <source>Remote Sens.</source> <volume>9</volume> (<issue>4</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.3390/rs9040309</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zandler</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Brenning</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Samimi</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Quantifying dwarf shrub biomass in an arid environment: comparing empirical methods in a high dimensional setting</article-title>. <source>Remote Sens. Environ.</source> <volume>158</volume>, <fpage>140</fpage>&#x2013;<lpage>155</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2014.11.007</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Guo</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>Q.</given-names>
</name>
</person-group> (<year>2019</year>a). <article-title>Grassland vegetation phenological variations and responses to climate change in the xinjiang region, China</article-title>. <source>Quaternary Int.</source> <volume>513</volume>, <fpage>56</fpage>&#x2013;<lpage>65</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.quaint.2019.03.010</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Guo</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Xie</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Aimaiti</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>2019</year>b). <article-title>Grassland dynamics in response to climate change and human activities in xinjiang from 2000 to 2014 (vol 8, 2888, 2018)</article-title>. <source>Sci. Rep.</source> <volume>9</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41598-019-41390-z</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Suarez</surname> <given-names>J. C.</given-names>
</name>
<name>
<surname>Garcia</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Londo</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Utility of multitemporal lidar for forest and carbon monitoring: Tree growth, biomass dynamics, and carbon flux</article-title>. <source>Remote Sens. Environ.</source> <volume>204</volume>, <fpage>883</fpage>&#x2013;<lpage>897</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2017.09.007</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Xie</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Nie</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Du</surname> <given-names>Z.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>Remote sensing inversion of grassland aboveground biomass based on high accuracy surface modeling</article-title>. <source>Ecol. Indic.</source> <volume>121</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ecolind.2020.107215</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>