<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
	<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
		<front>
			<journal-meta>
				<journal-id journal-id-type="publisher-id">Front. Big Data</journal-id>
				<journal-title>Frontiers in Big Data</journal-title>
				<abbrev-journal-title abbrev-type="pubmed">Front. Big Data</abbrev-journal-title>
				<issn pub-type="epub">2624-909X</issn>
				<publisher>
					<publisher-name>Frontiers Media S.A.</publisher-name>
				</publisher>
			</journal-meta>
			<article-meta>
				<article-id pub-id-type="publisher-id">528441</article-id>
				<article-id pub-id-type="doi">10.3389/fdata.2020.528441</article-id>
				<article-categories>
					<subj-group subj-group-type="heading">
						<subject>Big Data</subject>
						<subj-group>
							<subject>Original Research</subject>
						</subj-group>
					</subj-group>
				</article-categories>
				<title-group>
					<article-title>Ensemble Machine Learning Approach Improves Predicted Spatial Variation of Surface Soil Organic Carbon Stocks in Data-Limited Northern Circumpolar Region</article-title>
					<alt-title alt-title-type="left-running-head">Mishra et al.</alt-title>
					<alt-title alt-title-type="right-running-head">Machine Learning to Predict Soil Carbon</alt-title>
				</title-group>
				<contrib-group>
					<contrib contrib-type="author" corresp="yes">
						<name>
							<surname>Mishra</surname>
							<given-names>Umakant</given-names>
						</name>
						<xref ref-type="aff" rid="aff1">
							<sup>1</sup>
						</xref>
						<xref ref-type="corresp" rid="c001">
							<sup>&#x2a;</sup>
						</xref>
						<uri xlink:href="http://loop.frontiersin.org/people/891727/overview"/>
					</contrib>
					<contrib contrib-type="author">
						<name>
							<surname>Gautam</surname>
							<given-names>Sagar</given-names>
						</name>
						<xref ref-type="aff" rid="aff1">
							<sup>1</sup>
						</xref>
						<uri xlink:href="http://loop.frontiersin.org/people/955318/overview"/>
					</contrib>
					<contrib contrib-type="author">
						<name>
							<surname>Riley</surname>
							<given-names>William J.</given-names>
						</name>
						<xref ref-type="aff" rid="aff2">
							<sup>2</sup>
						</xref>
						<uri xlink:href="http://loop.frontiersin.org/people/69321/overview"/>
					</contrib>
					<contrib contrib-type="author">
						<name>
							<surname>Hoffman</surname>
							<given-names>Forrest M.</given-names>
						</name>
						<xref ref-type="aff" rid="aff3">
							<sup>3</sup>
						</xref>
						<uri xlink:href="http://loop.frontiersin.org/people/548163/overview"/>
					</contrib>
				</contrib-group>
				<aff id="aff1">
					<label>
						<sup>1</sup>
					</label>Bioscience Division, Sandia National Laboratory, Livermore, <addr-line>CA</addr-line>, <country>United States</country>
				</aff>
				<aff id="aff2">
					<label>
						<sup>2</sup>
					</label>Earth and Environmental Sciences, Lawrence Berkeley National Lab, <addr-line>Berkeley</addr-line>, <addr-line>CA</addr-line>, <country>United States</country>
				</aff>
				<aff id="aff3">
					<label>
						<sup>3</sup>
					</label>Climate Change Institute, Oak Ridge National Laboratory, <addr-line>Oak Ridge</addr-line>, <addr-line>TN</addr-line>, <country>United States</country>
				</aff>
				<author-notes>
					<fn fn-type="edited-by">
						<p>
							<bold>Edited by:</bold> Rasmus Houborg, Planet Labs Inc., United States</p>
					</fn>
					<fn fn-type="edited-by">
						<p>
							<bold>Reviewed by:</bold> Elchin Jafarov, Los Alamos National Laboratory (DOE), United States</p>
						<p>Bin Peng, University of Illinois at Urbana-Champaign, United States</p>
						<p>Roland Baatz, Helmholtz Association of German Research Centers (HZ), Germany</p>
					</fn>
					<corresp id="c001">&#x2a;Correspondence: Umakant Mishra, <email>unmishr@sandia.gov</email>
					</corresp>
					<fn>
						<p>This article was submitted to Data-driven Climate Sciences, a section of the journal Frontiers in Big Data</p>
					</fn>
				</author-notes>
				<pub-date pub-type="epub">
					<day>28</day>
					<month>10</month>
					<year>2020</year>
				</pub-date>
				<pub-date pub-type="collection">
					<year>2020</year>
				</pub-date>
				<volume>3</volume>
				<elocation-id>528441</elocation-id>
				<history>
					<date date-type="received">
						<day>20</day>
						<month>01</month>
						<year>2020</year>
					</date>
					<date date-type="accepted">
						<day>22</day>
						<month>09</month>
						<year>2020</year>
					</date>
				</history>
				<permissions>
					<copyright-statement>Copyright &#x00A9; 2020 Mishra, Gautam, Riley and Hoffman</copyright-statement>
					<copyright-holder>Mishra, Gautam, Riley and Hoffman</copyright-holder>
					<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
						<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
					</license>
				</permissions>
				<abstract>
					<p>Various approaches of differing mathematical complexities are being applied for spatial prediction of soil properties. Regression kriging is a widely used hybrid approach of spatial variation that combines correlation between soil properties and environmental factors with spatial autocorrelation between soil observations. In this study, we compared four machine learning approaches (gradient boosting machine, multinarrative adaptive regression spline, random forest, and support vector machine) with regression kriging to predict the spatial variation of surface (0&#x2013;30&#xa0;cm) soil organic carbon (SOC) stocks at 250-m spatial resolution across the northern circumpolar permafrost region. We combined 2,374 soil profile observations (calibration datasets) with georeferenced datasets of environmental factors (climate, topography, land cover, bedrock geology, and soil types) to predict the spatial variation of surface SOC stocks. We evaluated the prediction accuracy at randomly selected sites (validation datasets) across the study area. We found that different techniques inferred different numbers of environmental factors and their relative importance for prediction of SOC stocks. Regression kriging produced lower prediction errors in comparison to multinarrative adaptive regression spline and support vector machine, and comparable prediction accuracy to gradient boosting machine and random forest. However, the ensemble median prediction of SOC stocks obtained from all four machine learning techniques showed highest prediction accuracy. Although the use of different approaches in spatial prediction of soil properties will depend on the availability of soil and environmental datasets and computational resources, we conclude that the ensemble median prediction obtained from multiple machine learning approaches provides greater spatial details and produces the highest prediction accuracy. Thus an ensemble prediction approach can be a better choice than any single prediction technique for predicting the spatial variation of SOC stocks.</p>
				</abstract>
				<kwd-group>
					<kwd>soil organic carbon</kwd>
					<kwd>spatial prediction</kwd>
					<kwd>machine learning</kwd>
					<kwd>permafrost soils</kwd>
					<kwd>environmental controllers</kwd>
				</kwd-group>
				<contract-num rid="cn001">DE-AC02-06CH11357</contract-num>
				<contract-sponsor id="cn001">Argonne National Laboratory<named-content content-type="fundref-id">10.13039/100006224</named-content>
				</contract-sponsor>
				<counts>
					<page-count count="0"/>
				</counts>
			</article-meta>
		</front>
		<body>
			<sec id="s1">
				<title>Introduction</title>
				<p>High latitude permafrost region soils store large stocks of soil organic carbon (SOC) due to multiple cryopedogenic processes operating over long time scales (<xref ref-type="bibr" rid="B45">Ping et al., 2008</xref>; <xref ref-type="bibr" rid="B58">Tarnocai et al., 2009</xref>; <xref ref-type="bibr" rid="B20">Hugelius et al., 2014</xref>; <xref ref-type="bibr" rid="B47">Ping et al., 2015</xref>). Enhanced rate of climate warming at high latitudes is causing widespread degradation and thawing of permafrost soils and subsequent release of greenhouse gases such as CO<sub>2</sub> and CH<sub>4</sub> to the atmosphere (<xref ref-type="bibr" rid="B48">Romanovsky et al., 2010</xref>; <xref ref-type="bibr" rid="B49">Rowland et al., 2010</xref>; <xref ref-type="bibr" rid="B4">Biskaborn et al., 2019</xref>). As a significant portion of permafrost region SOC stocks has the potential to be emitted as greenhouse gases under changing climate (<xref ref-type="bibr" rid="B29">McGuire et al., 2016</xref>; <xref ref-type="bibr" rid="B30">McGuire et al., 2017</xref>), permafrost region SOC stocks are a vulnerable component of the global carbon cycle. Current earth system models show large uncertainty both in baseline SOC stock representations and their release to the atmosphere under changing climate (<xref ref-type="bibr" rid="B36">Mishra et al., 2013</xref>; <xref ref-type="bibr" rid="B50">Schuur et al., 2015</xref>; <xref ref-type="bibr" rid="B29">McGuire et al., 2016</xref>). Reliable estimates of the magnitude and spatial variation of permafrost region SOC stocks are essential to better understand the environmental controls and to reduce the uncertainty in predicting permafrost region carbon -climate feedbacks. The magnitude of SOC stored in the soil per unit of land area is highly variable in permafrost region soils (<xref ref-type="bibr" rid="B40">Mishra and Riley, 2015</xref>; <xref ref-type="bibr" rid="B35">Mishra et al., 2017</xref>), as SOC stocks depend on various environmental factors such as soil type, land use, topographic features, and climatic conditions, which are site specific. Knowledge of soil and site-specific environmental controllers is essential to make reliable spatial predictions of SOC stocks.</p>
				<p>In spatial prediction of soil properties, mathematical or statistical relationships are usually developed using limited number of soil observations and environmental predictors. The derived relationship is then applied with environmental predictors across the study area to produce spatially-explicit estimates of soil properties. A number of spatial prediction approaches have been used to predict the spatial variation of SOC stocks depending upon the available data density and environmental data of soil-forming factors (<xref ref-type="bibr" rid="B37">Mishra and Lal, 2010</xref>; <xref ref-type="bibr" rid="B34">Minasny et al., 2013</xref>). Spatial prediction techniques can broadly be categorized into three groups that use: 1) environmental correlation between soil C and environmental factors (<xref ref-type="bibr" rid="B28">Martin et al., 2011</xref>; <xref ref-type="bibr" rid="B63">Zhang et al., 2011</xref>); 2) spatial autocorrelation among soil C observations (<xref ref-type="bibr" rid="B38">Mishra et al., 2009</xref>; <xref ref-type="bibr" rid="B8">Cambule et al., 2013</xref>); and 3) hybrid approaches that combine environmental correlation and spatial autocorrelation (<xref ref-type="bibr" rid="B27">Martin et al., 2014</xref>; <xref ref-type="bibr" rid="B32">Meng, 2014</xref>). Among spatial prediction approaches used to predict the spatial variation of SOC stocks, multiple linear regressions (group 1 that uses environmental correlation) and ordinary kriging (group 2 that uses spatial autocorrelation) are the most commonly used techniques in the literature, primarily because of their simplicity in interpretation and ease of use. However, the most accurate predictions (lowest prediction errors) have been achieved through the use of hybrid approaches [e.g., regression kriging (<xref ref-type="bibr" rid="B18">Hengl et al., 2007</xref>; <xref ref-type="bibr" rid="B34">Minasny et al., 2013</xref>; group 3] that combined environmental correlation and spatial autocorrelation.</p>
				<p>In addition to the above-mentioned three groups of spatial prediction, methods with increasing computational complexity are being used to predict the spatial variation of soil properties. For example, machine-learning based spatial modeling techniques such as random forest (<xref ref-type="bibr" rid="B55">Sreenivas et al. 2016</xref>; <xref ref-type="bibr" rid="B53">Siewert 2018</xref>), neural networks (<xref ref-type="bibr" rid="B26">Li et al., 2013</xref>), and rule-based models (<xref ref-type="bibr" rid="B60">Viscarra Rossel and Webster, 2012</xref>; <xref ref-type="bibr" rid="B25">Lacoste et al., 2014</xref>) have been used to capture non-linear relationships between soil C and environmental factors. These machine learning approaches are being increasingly applied for predicting soil properties including SOC stocks. More recently, ensembles of multiple approaches are also being applied to improve the spatial prediction of SOC stocks (<xref ref-type="bibr" rid="B59">Vasat et al., 2017</xref>; <xref ref-type="bibr" rid="B10">Chen et al., 2020</xref>). The use of average or median predictions from ensemble of different approaches improves spatial prediction of soil properties and the inter quartile range of ensemble predictions provides estimates of uncertainty ranges due to different model structures (<xref ref-type="bibr" rid="B29">McGuire et al., 2016</xref>; <xref ref-type="bibr" rid="B30">McGuire et al., 2017</xref>; <xref ref-type="bibr" rid="B52">Shi et al., 2018</xref>). Further, the spatial distribution of uncertainty estimates can also inform future sampling locations to reduce the existing uncertainty.</p>
				<p>Permafrost affected soils show vast spatial and vertical heterogeneity of soil properties (<xref ref-type="bibr" rid="B22">Johnson et al., 2011</xref>; <xref ref-type="bibr" rid="B54">Siewert et al., 2015</xref>; <xref ref-type="bibr" rid="B3">Beer, 2016</xref>), and therefore areal estimates of permafrost region soil properties, including SOC stocks, could benefit from advanced spatial modeling approaches. However, application of geospatial approaches in the permafrost region has been limited due to low sample density and limited availability of spatially resolved environmental datasets (<xref ref-type="bibr" rid="B36">Mishra et al., 2013</xref>; <xref ref-type="bibr" rid="B53">Siewert, 2018</xref>). Recently, spatial predictions of soil properties using geospatial and remote sensing information have been applied at local to regional scales to account for and better represent the spatial variation of permafrost affected soil properties (<xref ref-type="bibr" rid="B44">Pastick et al., 2014</xref>; <xref ref-type="bibr" rid="B2">Bartsch et al., 2016</xref>; <xref ref-type="bibr" rid="B13">Ding et al., 2016</xref>; <xref ref-type="bibr" rid="B53">Siewert, 2018</xref>). These high-resolution predictions using a variety of geospatial techniques have demonstrated promising results in the permafrost terrain.</p>
				<p>Multiple studies have documented that the regression kriging approach produces lower prediction errors (<xref ref-type="bibr" rid="B18">Hengl et al., 2007</xref>; <xref ref-type="bibr" rid="B24">Kumar et al., 2012</xref>; <xref ref-type="bibr" rid="B32">Meng, 2014</xref>) in comparison to other spatial prediction approaches. We designed this study to compare the prediction accuracy of regression kriging with different machine learning approaches. We hypothesized that because regression kriging approach captures both spatial autocorrelation and environmental correlation, it will produce lower prediction errors in comparison to machine learning approaches, which capture mainly non-linear relations between soil properties and environmental factors. The specific objectives in this study are to 1) compare prediction accuracy of machine learning approaches with regression kriging, 2) determine the importance of environmental predictors across different spatial prediction approaches, 3) evaluate the accuracy of individual and combined (ensemble) ML approaches, and (4) create a high-resolution estimate of surface (0&#x2013;30&#xa0;cm) northern circumpolar region SOC stocks using an ensemble machine learning approach.</p>
			</sec>
			<sec sec-type="materials|methods" id="s2">
				<title>Materials and Methods</title>
				<sec id="s2-1">
					<title>Spatial Variation in Environmental Factors of the Permafrost Region</title>
					<p>A digital elevation model with 250-m spatial resolution was obtained from the US Geological Survey (<xref ref-type="bibr" rid="B12">Danielson and Gesch, 2011</xref>). Elevations ranged from sea level to 6,130&#xa0;m in the northern circumpolar region. The digital elevation model was used to calculate seven major topographic attributes (elevation, slope, aspect, flow accumulation, topographic wetness index, sediment transport index, and stream power index) to evaluate their use in predicting the spatial variation of surface SOC stocks. Average annual (1960&#x2013;1990) precipitation and temperature data at 1-km spatial resolution were obtained from the global climate data of <xref ref-type="bibr" rid="B19">Hijmans et al. (2005)</xref>. This interpolated dataset was generated for global land surfaces using latitude, longitude and elevation as independent variables. In the northern circumpolar region, average annual precipitation ranged from 52&#xa0;mm in the Russian Arctic Desert to 2,956&#xa0;mm in southeast Greenland. Average annual temperatures were lowest in northern Canada and Greenland (&#x2212;28&#xb0; to &#x2212;20&#xb0;C) and highest in southern Canada (3&#xb0; to &#x2013;10&#xb0;C).</p>
					<p>Global land cover data at 250-m spatial resolution were obtained from the European Space Agency (<xref ref-type="bibr" rid="B16">Glob cover, 2009</xref>). Of the total land area in the northern circumpolar region, Needleleaf mixed forest covered 31%, sparse vegetation covered 27%, permanent snow and ice covered 11%, and shrub land covered 7%. The bedrock geology data was obtained from the global lithological map produced by <xref ref-type="bibr" rid="B17">Hartmann and Moosdorf (2012)</xref>. In the northern circumpolar region, the largest proportion of land area had mixed sedimentary rocks (24.7%), followed by siliciclastic sedimentary rocks (19%), metamorphic rocks (12.4%), and ice and glaciers (12.1%). The smallest proportions of land area were underlain by evaporates (0.05%), pyroclastics (0.37%), and acid volcanic rocks (0.73%). For this study, we resampled all the environmental data into a common spatial resolution of 250&#xa0;m. Continuous environmental variables were resampled using bilinear interpolation and categorical variables were resampled using nearest neighbor resampling technique by using the resample function of ArcGIS (ArcGIS version 10.4, Environmental Systems Research Institute, Inc., Redlands, CA, United States). The soil type information of the study area was obtained from the soil order map of <xref ref-type="bibr" rid="B58">Tarnocai et al. (2009)</xref>. The largest soil area in the study domain was under Gelisols (57%), followed by Histosols (5%), and remaining mineral soils Spodosols, Inceptisols, Mollisols, Entisols, Alfisols, Andisols, and Aridisols covered 38% soil area.</p>
				</sec>
				<sec id="s2-2">
					<title>Soil Organic Carbon Profile Observations and Their Distribution Across Environmental Factors</title>
					<p>We compiled and updated the existing SOC data for permafrost affected soils from various sources. In addition to the SOC data used by <xref ref-type="bibr" rid="B33">Michaelson et al. (2013)</xref>, <xref ref-type="bibr" rid="B20">Hugelius et al. (2014)</xref>, <xref ref-type="bibr" rid="B43">Palmtag et al. (2015)</xref>, <xref ref-type="bibr" rid="B54">Siewert et al. (2015)</xref>, and <xref ref-type="bibr" rid="B61">Vitharana et al. (2017)</xref>, we collected additional georeferenced SOC profile observations from individual investigators from Canada, Russia, South Korea, and Sweden. <xref ref-type="fig" rid="F1">Figure 1</xref> shows the spatial distribution of SOC profile observations across the study area.</p>
					<fig id="F1" position="float">
						<label>FIGURE 1</label>
						<caption>
							<p>Study area and distribution of soil samples (<italic>n</italic> &#x3d; 2,374). Green dots show distribution of soil organic carbon (SOC) observations across different permafrost types of the Northern circumpolar permafrost region.</p>
						</caption>
						<graphic xlink:href="fdata-03-528441-g001.tif"/>
					</fig>
					<fig id="F2" position="float">
						<label>FIGURE 2</label>
						<caption>
							<p>Histogram and descriptive statistics of surface soil organic carbon stocks (0&#x2013;30&#xa0;cm) observations used in this study (<italic>n</italic> &#x3d; 2,374).</p>
						</caption>
						<graphic xlink:href="fdata-03-528441-g002.tif"/>
					</fig>
					<p>The collected soil observations are broadly representative of the heterogeneity of environmental conditions of the northern circumpolar region permafrost affected soils. The SOC profile observations represented 13 different land cover types. The largest number of samples were from the Needleleaf forest land cover type (34.3%), followed by sparse vegetation (25.5%), and mosaic forest shrubland vegetation (10.5%). The smallest number of samples were from broadleaf deciduous forest (0.12%), followed by broadleaf evergreen forest (0.17%), and shrublands (0.21%). The SOC observations captured a large range of climatic factors: mean annual precipitation ranged from 820 to 1,625&#xa0;mm and mean annual temperature ranged from &#x2212;20&#xb0; to 6&#xb0;C. The SOC observations were distributed from sea level to 3,000&#xa0;m in elevation and captured a large range of slope angles (0.5 to 44.5&#xb0;). SOC samples covered 11 of 14 bedrock geology types in the circumpolar region. The largest number of samples were from siliciclastic sedimentary rocks (34.5%), mixed sedimentary rocks (25.3%), and unconsolidated sediments (8.7%). The smallest number of samples were from basic plutonic rocks (0.25%), intermediate plutonic rocks (0.3%), and acid volcanic rocks (1%).</p>
				</sec>
				<sec id="s2-3">
					<title>Spatial Prediction of SOC Stocks</title>
					<sec id="s2-3-1">
						<title>Regression Kriging</title>
						<p>Regression Kriging is a widely used spatial interpolation technique in soil science, which combines a linear regression of dependent variable such as SOC stocks with environmental variables with kriging of the regression residuals (<xref ref-type="bibr" rid="B18">Hengl et al., 2007</xref>; <xref ref-type="bibr" rid="B23">Keskin and grunwald, 2018</xref>; <xref ref-type="bibr" rid="B62">Wu et al., 2019</xref>). In this method, the SOC stocks at an unsampled location are predicted by adding the interpolated regression residuals into the regression predicted SOC stocks. This approach can be summarized by:<disp-formula id="equ1">
								<mml:math>
									<mml:mrow>
										<mml:msubsup>
											<mml:mtext>z</mml:mtext>
											<mml:mrow>
												<mml:mtext>LRK</mml:mtext>
											</mml:mrow>
											<mml:mo>&#x5e;</mml:mo>
										</mml:msubsup>
										<mml:mrow>
											<mml:mo>(</mml:mo>
											<mml:msub>
												<mml:mtext>s</mml:mtext>
												<mml:mrow>
													<mml:mn>0</mml:mn>
													<mml:mtext>&#xa0;</mml:mtext>
												</mml:mrow>
											</mml:msub>
											<mml:mo>)</mml:mo>
										</mml:mrow>
										<mml:mo>&#x3d;</mml:mo>
										<mml:msubsup>
											<mml:mtext>m</mml:mtext>
											<mml:mrow>
												<mml:mtext>MLR</mml:mtext>
											</mml:mrow>
											<mml:mo>&#x5e;</mml:mo>
										</mml:msubsup>
										<mml:mrow>
											<mml:mo>(</mml:mo>
											<mml:msub>
												<mml:mtext>s</mml:mtext>
												<mml:mn>0</mml:mn>
											</mml:msub>
											<mml:mo>)</mml:mo>
										</mml:mrow>
										<mml:mo>&#x2b;</mml:mo>
										<mml:msubsup>
											<mml:mtext>e</mml:mtext>
											<mml:mrow>
												<mml:mtext>OK</mml:mtext>
											</mml:mrow>
											<mml:mo>&#x5e;</mml:mo>
										</mml:msubsup>
										<mml:mrow>
											<mml:mo>(</mml:mo>
											<mml:msub>
												<mml:mtext>s</mml:mtext>
												<mml:mn>0</mml:mn>
											</mml:msub>
											<mml:mo>)</mml:mo>
										</mml:mrow>
									</mml:mrow>
								</mml:math>
							</disp-formula>Where, <inline-formula id="inf1">
								<mml:math>
									<mml:mrow>
										<mml:msubsup>
											<mml:mi>z</mml:mi>
											<mml:mrow>
												<mml:mi>L</mml:mi>
												<mml:mi>R</mml:mi>
												<mml:mi>K</mml:mi>
											</mml:mrow>
											<mml:mo>&#x5e;</mml:mo>
										</mml:msubsup>
										<mml:mrow>
											<mml:mo>(</mml:mo>
											<mml:mrow>
												<mml:msub>
													<mml:mi>s</mml:mi>
													<mml:mn>0</mml:mn>
												</mml:msub>
											</mml:mrow>
											<mml:mo>)</mml:mo>
										</mml:mrow>
									</mml:mrow>
								</mml:math>
							</inline-formula> is the estimated SOC stocks at location<inline-formula id="inf2">
								<mml:math>
									<mml:mrow>
										<mml:mo>&#xa0;</mml:mo>
										<mml:msub>
											<mml:mi>s</mml:mi>
											<mml:mn>0</mml:mn>
										</mml:msub>
									</mml:mrow>
								</mml:math>
							</inline-formula>, <inline-formula id="inf3">
								<mml:math>
									<mml:mrow>
										<mml:msubsup>
											<mml:mi>m</mml:mi>
											<mml:mrow>
												<mml:mi>M</mml:mi>
												<mml:mi>L</mml:mi>
												<mml:mi>R</mml:mi>
											</mml:mrow>
											<mml:mo>&#x5e;</mml:mo>
										</mml:msubsup>
									</mml:mrow>
								</mml:math>
							</inline-formula>is the value predicted from multiple linear regressions (MLR), and <inline-formula id="inf4">
								<mml:math>
									<mml:mrow>
										<mml:msubsup>
											<mml:mi>e</mml:mi>
											<mml:mrow>
												<mml:mi>O</mml:mi>
												<mml:mi>K</mml:mi>
											</mml:mrow>
											<mml:mo>&#x5e;</mml:mo>
										</mml:msubsup>
									</mml:mrow>
								</mml:math>
							</inline-formula> is the kriged values of the MLR residuals at point <inline-formula id="inf5">
								<mml:math>
									<mml:mrow>
										<mml:msub>
											<mml:mi>S</mml:mi>
											<mml:mn>0</mml:mn>
										</mml:msub>
									</mml:mrow>
								</mml:math>
							</inline-formula> using ordinary kriging. In summary, in this study the forward stepwise multiple linear regression was used to identify the statistically significant predictors of SOC stocks of the study area. Then Ln-transformed SOC stocks model residuals were calculated for the sample locations and covariance structure of the model residuals was fitted using a variogram model. Regression residuals were then interpolated using ordinary kriging and added to the estimated Ln-transformed SOC stocks regression model surface (<xref ref-type="bibr" rid="B41">Mishra et al., 2012</xref>). Several recent studies have also applied a variant of this technique using a geographically weighted regression approach to model the spatially varying regression relationships between the SOC stocks and its environmental controllers (<xref ref-type="bibr" rid="B63">Zhang et al., 2011</xref>; <xref ref-type="bibr" rid="B40">Mishra and Riley, 2015</xref>; <xref ref-type="bibr" rid="B42">Mitran et al., 2018</xref>).</p>
					</sec>
					<sec id="s2-3-2">
						<title>Machine Learning Approaches</title>
						<p>Machine learning approach is a family of algorithms which do not assume any mechanistic nature to the data and instead seek to &#x201c;learn&#x201d; a function that best maps input parameters to an output. We used gradient boosting machine (GBM), multinarrative adaptive regression spline (MARS), support vector machine (SVM), and random forests (RF) machine learning approaches to predict the surface SOC stocks which were previously used to predict soil properties in a variety of environments. Individual predictions from these machine learning techniques and their ensemble median were compared with the SOC stocks predicted by the regression kriging approach. The GBM algorithm which was originally proposed by <xref ref-type="bibr" rid="B15">Friedman (2001)</xref>, uses simple regression model &#x201c;weak learners&#x201d; and iteratively combine this simple model to obtain &#x201c;strong learner&#x201d; with improved accuracy by reducing the bias and the variance. GBM model include two major user defined parameters; number of tree and tree depth. The tree depth of 3 and number of trees of 150 were used based on the minimum root mean squared error (RMSE) of prediction. MARS, which was introduced by <xref ref-type="bibr" rid="B14">Friedman (1991)</xref>, computes the underlying nonlinear patterns hidden in the data. It builds the relationship between the response and dependent variable using distinct set of coefficients and the function which are controlled by the regression. MARS optimization is a two-step process. In the first step, a large number of basis functions (connected splines) are constructed to overfit the data and in the second step the basis functions are selected based on best fit. The tuning parameters for MARS include the nprune and degree. The nprune value of 18 and degree value of 1 were used based on the minimum RMSE of prediction. The SVM, originally proposed by <xref ref-type="bibr" rid="B11">Cortes and Vapnik (1995)</xref>, sets up a decision boundary in the feature space to separate different classes. Mathematically, it creates best fit hyperplanes between the classes to minimize errors. The objective function intends to select the best hyperplane with largest margin between the classes, where margin is the sum of distance between the separating hyperplane and nearest points of different class in either side of the hyperplane. The tuning parameters for SVM includes sigma and C. The sigma value of 0.14 and C value of 1 were used based on the minimum RMSE of prediction. RF is a tree-based machine learning approach that works by building a set of regression trees and averaging the results for final prediction (<xref ref-type="bibr" rid="B6">Breiman, 2001</xref>). RF works on a rationale that the combination of learning models (tree-based ensemble) increases the prediction accuracy. It consists of an ensemble of randomized classification and regression trees (CART, <xref ref-type="bibr" rid="B7">Breiman et al., 1984</xref>), where many decision trees are built using a random subsample of the available environmental factors. The final result is a single prediction constructed as a weighted average over all these individually suboptimal trees. In the RF approach, the model parameters that needed specification were: 1) the number of trees to grow in the forest (ntree &#x3d; 500), 2) the number of randomly selected predictor variables at each node (mtry &#x3d; 5), and 3) the minimum number of observations at the terminal nodes of the trees (nodesize &#x3d; 5). RF has been reported to have high predictive performance, low correlation of individual trees, and small bias and provides information on the relative importance of predictors (<xref ref-type="bibr" rid="B6">Breiman, 2001</xref>; <xref ref-type="bibr" rid="B57">Svetnik et al., 2003</xref>; <xref ref-type="bibr" rid="B1">Arun and Langmead, 2005</xref>).</p>
					</sec>
				</sec>
				<sec id="s2-4">
					<title>Evaluation of Prediction Accuracy of Different Geospatial Approaches</title>
					<p>We calculated four validation indices that demonstrate different quality attributes of predicted SOC stock maps: 1) the measure of inaccuracy expressed as the RMSE, 2) the measure of bias expressed as the mean estimation error (MEE), 3) the measure of linear relationship between observed and predicted SOC stock values expressed as Pearsons&#x2019;s correlation coefficient (r), and 4) the ratio of performance to deviation (RPD), which is the ratio of the standard deviation of the observed SOC stock values to the RMSE of the predictions. The larger the RPD, the more accurate the prediction. To calculate these validation indices, we split the SOC stock dataset into four different spatially balanced calibration and validation datasets (70/30, 75/25, 85/15, 90/10) using &#x201c;create subset&#x201d; function of ArcGIS (ArcGIS version 10, Environmental Systems Research Institute, Inc., Redlands, CA, United States). We reported average validation indices obtained from these four validation subsets. The predicted SOC stock values from all the prediction approaches were extracted at validation data sites and the following equations were applied:<disp-formula id="equ2">
							<mml:math>
								<mml:mrow>
									<mml:mi>M</mml:mi>
									<mml:mi>E</mml:mi>
									<mml:mi>E</mml:mi>
									<mml:mo>&#x3d;</mml:mo>
									<mml:mo>&#xa0;</mml:mo>
									<mml:mfrac>
										<mml:mn>1</mml:mn>
										<mml:mi>n</mml:mi>
									</mml:mfrac>
									<mml:munderover>
										<mml:mstyle displaystyle="true">
											<mml:mo>&#x2211;</mml:mo>
										</mml:mstyle>
										<mml:mrow>
											<mml:mi>i</mml:mi>
											<mml:mo>&#x3d;</mml:mo>
											<mml:mn>1</mml:mn>
										</mml:mrow>
										<mml:mi>n</mml:mi>
									</mml:munderover>
									<mml:mrow>
										<mml:mo>[</mml:mo>
										<mml:mrow>
											<mml:mi>S</mml:mi>
											<mml:mrow>
												<mml:mover accent="true">
													<mml:mi>O</mml:mi>
													<mml:mo>&#x5e;</mml:mo>
												</mml:mover>
											</mml:mrow>
											<mml:mi>C</mml:mi>
											<mml:mrow>
												<mml:mo>(</mml:mo>
												<mml:mrow>
													<mml:msub>
														<mml:mi>x</mml:mi>
														<mml:mi>i</mml:mi>
													</mml:msub>
												</mml:mrow>
												<mml:mo>)</mml:mo>
											</mml:mrow>
											<mml:mo>&#x2212;</mml:mo>
											<mml:mo>&#xa0;</mml:mo>
											<mml:mi>S</mml:mi>
											<mml:mi>O</mml:mi>
											<mml:mi>C</mml:mi>
											<mml:mrow>
												<mml:mo>(</mml:mo>
												<mml:mrow>
													<mml:msub>
														<mml:mi>x</mml:mi>
														<mml:mi>i</mml:mi>
													</mml:msub>
												</mml:mrow>
												<mml:mo>)</mml:mo>
											</mml:mrow>
										</mml:mrow>
										<mml:mo>]</mml:mo>
									</mml:mrow>
								</mml:mrow>
							</mml:math>
						</disp-formula>
						<disp-formula id="equ3">
							<mml:math>
								<mml:mrow>
									<mml:mi>R</mml:mi>
									<mml:mi>M</mml:mi>
									<mml:mi>S</mml:mi>
									<mml:mi>E</mml:mi>
									<mml:mo>&#x3d;</mml:mo>
									<mml:mo>&#xa0;</mml:mo>
									<mml:msqrt>
										<mml:mrow>
											<mml:mfrac>
												<mml:mn>1</mml:mn>
												<mml:mi>n</mml:mi>
											</mml:mfrac>
											<mml:munderover>
												<mml:mstyle displaystyle="true">
													<mml:mo>&#x2211;</mml:mo>
												</mml:mstyle>
												<mml:mrow>
													<mml:mi>i</mml:mi>
													<mml:mo>&#x3d;</mml:mo>
													<mml:mn>1</mml:mn>
												</mml:mrow>
												<mml:mi>n</mml:mi>
											</mml:munderover>
											<mml:msup>
												<mml:mrow>
													<mml:mrow>
														<mml:mo>[</mml:mo>
														<mml:mrow>
															<mml:mi>S</mml:mi>
															<mml:mrow>
																<mml:mover accent="true">
																	<mml:mi>O</mml:mi>
																	<mml:mo>&#x5e;</mml:mo>
																</mml:mover>
															</mml:mrow>
															<mml:mi>C</mml:mi>
															<mml:mrow>
																<mml:mo>(</mml:mo>
																<mml:mrow>
																	<mml:msub>
																		<mml:mi>x</mml:mi>
																		<mml:mi>i</mml:mi>
																	</mml:msub>
																</mml:mrow>
																<mml:mo>)</mml:mo>
															</mml:mrow>
															<mml:mo>&#x2212;</mml:mo>
															<mml:mo>&#xa0;</mml:mo>
															<mml:mi>S</mml:mi>
															<mml:mi>O</mml:mi>
															<mml:mi>C</mml:mi>
															<mml:mrow>
																<mml:mo>(</mml:mo>
																<mml:mrow>
																	<mml:msub>
																		<mml:mi>x</mml:mi>
																		<mml:mi>i</mml:mi>
																	</mml:msub>
																</mml:mrow>
																<mml:mo>)</mml:mo>
															</mml:mrow>
														</mml:mrow>
														<mml:mo>]</mml:mo>
													</mml:mrow>
												</mml:mrow>
												<mml:mn>2</mml:mn>
											</mml:msup>
										</mml:mrow>
									</mml:msqrt>
								</mml:mrow>
							</mml:math>
						</disp-formula>where <inline-formula id="inf6">
							<mml:math>
								<mml:mrow>
									<mml:mi>S</mml:mi>
									<mml:mi>O</mml:mi>
									<mml:mi>C</mml:mi>
									<mml:mrow>
										<mml:mo>(</mml:mo>
										<mml:mrow>
											<mml:msub>
												<mml:mi>x</mml:mi>
												<mml:mi>i</mml:mi>
											</mml:msub>
										</mml:mrow>
										<mml:mo>)</mml:mo>
									</mml:mrow>
								</mml:mrow>
							</mml:math>
						</inline-formula> is the measured SOC, <inline-formula id="inf7">
							<mml:math>
								<mml:mrow>
									<mml:mi>S</mml:mi>
									<mml:mrow>
										<mml:mover accent="true">
											<mml:mi>O</mml:mi>
											<mml:mo>&#x5e;</mml:mo>
										</mml:mover>
									</mml:mrow>
									<mml:mi>C</mml:mi>
									<mml:mrow>
										<mml:mo>(</mml:mo>
										<mml:mrow>
											<mml:msub>
												<mml:mi>x</mml:mi>
												<mml:mi>i</mml:mi>
											</mml:msub>
										</mml:mrow>
										<mml:mo>)</mml:mo>
									</mml:mrow>
								</mml:mrow>
							</mml:math>
						</inline-formula> the estimated SOC, and <italic>n</italic> is the number of validation observations (<italic>n</italic> &#x3d; 714). For optimal predictions, MEE and RMSE values should approach zero. <xref ref-type="bibr" rid="B9">Chang and Laird (2002)</xref> defined three classes of RPD; models that have high predictive ability (RPD &#x3e; 2), models that have intermediate predictive ability that can be possibly improved (RPD between 1.4 and 2), and models that have no predictive ability (RPD &#x3c; 1.4). In addition to predict the spatial variation of SOC stocks at 250-m spatial resolution across the study area, we also calculated the coefficient of variation of surface SOC stocks across different permafrost zones of the Northen circumpolar permafrost region.</p>
				</sec>
			</sec>
			<sec sec-type="results" id="s3">
				<title>Results</title>
				<sec id="s3-1">
					<title>Descriptive Statistics of Soil Organic Carbon Observations</title>
					<p>Statistical properties of the surface SOC profile observations at calibration and validation sites are summarized in <xref ref-type="fig" rid="F2">Figure 2</xref>. The average surface SOC stock of northern circumpolar region was 12.5&#xa0;kg&#xa0;m<sup>&#x2212;2</sup>, ranging from 0.25&#x2013;90 kg&#xa0;m<sup>&#x2212;2</sup>. The observed SOC stocks showed unimodal (kurtosis &#x3d; 2) and positively skewed (coefficient of skewness &#x3d; 2.1) distributions. Among total SOC observations, 2% of the samples had SOC stocks less than 1 kg&#xa0;m<sup>&#x2212;2</sup>, and about 6% of the samples had SOC stocks larger than 30 kg&#xa0;m<sup>&#x2212;2</sup>. The majority of samples (92%) had SOC stocks between 1 and 30 kg&#xa0;m<sup>&#x2212;2</sup>. The SOC stock values of validation samples were within the range of calibration samples. <xref ref-type="fig" rid="F3">Figure 3</xref> shows the linear relations between SOC stocks and different environmental factors used in this study.</p>
					<fig id="F3" position="float">
						<label>FIGURE 3</label>
						<caption>
							<p>Pearson&#x2019;s correlation coefficients between SOC stocks and environmental variables used in this study. The insignificant correlations (<italic>p</italic> value &#x3e;0.05) are blank (white).</p>
						</caption>
						<graphic xlink:href="fdata-03-528441-g003.tif"/>
					</fig>
				</sec>
				<sec id="s3-2">
					<title>Predicted Spatial Variation of Surface Soil Organic Carbon Stocks</title>
					<p>Due to lowest prediction errors of surface SOC stocks obtained, we used the results of the ensemble median of the machine learning approaches to describe the magnitude and spatial variation of surface SOC stocks (<xref ref-type="fig" rid="F4">Figure 4</xref>). Predicted median surface SOC content showed moderate spatial variation (CV &#x3d; 26%), ranging from 0.5 to 37.5 kg&#xa0;m<sup>&#x2212;2</sup>, with an average circumpolar region surface SOC content of 12.3 kg&#xa0;m<sup>&#x2212;2</sup>. Among different permafrost regions, the discontinuous permafrost region showed highest SOC content (12 kg&#xa0;m<sup>&#x2212;2</sup>, with lower and upper quartiles of 11.0 and 13.5 kg&#xa0;m<sup>&#x2212;2</sup>, respectively), followed by the sporadic permafrost region (10.5 kg&#xa0;m<sup>&#x2212;2</sup>, with lower and upper quartiles of 9.0 and 12.0 kg&#xa0;m<sup>&#x2212;2</sup>, respectively), and the continuous permafrost region (10.0 kg&#xa0;m<sup>&#x2212;2</sup>, with lower and upper quartiles of 8.6 and 11.0 kg&#xa0;m<sup>&#x2212;2</sup>, respectively). Lowest surface SOC content was found in isolated permafrost region soils (9.0 kg&#xa0;m<sup>&#x2212;2</sup>, with lower and upper quartiles of 8.0 and 10.0 kg&#xa0;m<sup>&#x2212;2</sup>, respectively).</p>
					<fig id="F4" position="float">
						<label>FIGURE 4</label>
						<caption>
							<p>Predicted spatial variation of surface (0&#x2013;30&#xa0;cm) SOC stocks (median predictions; <bold>(B)</bold> of the northern circumpolar permafrost region using ensemble machine learning approach, with its lower quartile <bold>(A)</bold> and upper quartile <bold>(C)</bold>. Areas in black show water surface or perennial ice, urban, and barren land with consolidated materials.</p>
						</caption>
						<graphic xlink:href="fdata-03-528441-g004.tif"/>
					</fig>
					<p>The ensemble median of machine learning approaches predicted total SOC stock <inline-formula id="inf8">
							<mml:math>
								<mml:mrow>
									<mml:msubsup>
										<mml:mrow>
											<mml:mn>218</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2212;</mml:mo>
											<mml:mn>26</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2b;</mml:mo>
											<mml:mn>22</mml:mn>
										</mml:mrow>
									</mml:msubsup>
								</mml:mrow>
							</mml:math>
						</inline-formula> Pg C in 0&#x2013;30&#xa0;cm depth of the northern circumpolar region. Out of this total, the continuous permafrost region contained 54.5% (<inline-formula id="inf9">
							<mml:math>
								<mml:mrow>
									<mml:msubsup>
										<mml:mrow>
											<mml:mn>119</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2212;</mml:mo>
											<mml:mn>14</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2b;</mml:mo>
											<mml:mn>12</mml:mn>
										</mml:mrow>
									</mml:msubsup>
								</mml:mrow>
							</mml:math>
						</inline-formula> Pg C), the discontinuous permafrost region contained 18% (<inline-formula id="inf10">
							<mml:math>
								<mml:mrow>
									<mml:msubsup>
										<mml:mrow>
											<mml:mn>39</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2212;</mml:mo>
											<mml:mn>4</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2b;</mml:mo>
											<mml:mn>4</mml:mn>
										</mml:mrow>
									</mml:msubsup>
								</mml:mrow>
							</mml:math>
						</inline-formula> Pg C), sporadic permafrost region contained 14% (<inline-formula id="inf11">
							<mml:math>
								<mml:mrow>
									<mml:msubsup>
										<mml:mrow>
											<mml:mn>31</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2212;</mml:mo>
											<mml:mn>4</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2b;</mml:mo>
											<mml:mn>3</mml:mn>
										</mml:mrow>
									</mml:msubsup>
								</mml:mrow>
							</mml:math>
						</inline-formula> Pg C), and the isolated permafrost region contained 13% (<inline-formula id="inf12">
							<mml:math>
								<mml:mrow>
									<mml:msubsup>
										<mml:mrow>
											<mml:mn>29</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2212;</mml:mo>
											<mml:mn>3</mml:mn>
										</mml:mrow>
										<mml:mrow>
											<mml:mo>&#x2b;</mml:mo>
											<mml:mn>4</mml:mn>
										</mml:mrow>
									</mml:msubsup>
								</mml:mrow>
							</mml:math>
						</inline-formula> Pg C) of the total surface SOC stocks. The largest spatial variation in predicted surface SOC stocks was found in continuous permafrost region soils (CV &#x3d; 61%), followed by isolated permafrost region (CV &#x3d; 50%). Both sporadic and discontinuous permafrost region soils showed similar spatial variation (CV &#x3d; 39%) in the surface SOC stocks (<xref ref-type="table" rid="T1">Table 1</xref>).</p>
					<table-wrap id="T1" position="float">
						<label>TABLE 1</label>
						<caption>
							<p>Average SOC content and predicted total SOC stocks in different permafrost zones within the circumpolar permafrost region.</p>
						</caption>
						<table frame="hsides" rules="groups">
							<thead>
								<tr>
									<th>Permafrost types</th>
									<th align="center">Average SOC content (kg m<sup>&#x2212;2</sup>)</th>
									<th align="center">Coefficient of variation (%)</th>
									<th align="center">Total SOC stock (Pg C)</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td>Continuous</td>
									<td align="center">10.0 (8.6&#x2013;11)</td>
									<td align="center">61.0</td>
									<td align="center">119.0 (105&#x2013;131)</td>
								</tr>
								<tr>
									<td>Discontinuous</td>
									<td align="center">12.0 (11&#x2013;13.5)</td>
									<td align="center">39.0</td>
									<td align="center">39.0 (35&#x2013;43)</td>
								</tr>
								<tr>
									<td>Sporadic</td>
									<td align="center">10.5 (9&#x2013;12)</td>
									<td align="center">39.0</td>
									<td align="center">31.0 (27&#x2013;34)</td>
								</tr>
								<tr>
									<td>Isolated</td>
									<td align="center">9.0 (8&#x2013;10)</td>
									<td align="center">50.0</td>
									<td align="center">29.0 (25&#x2013;32)</td>
								</tr>
							</tbody>
						</table>
						<table-wrap-foot>
							<fn>
								<p>Values in parentheses are the lower and upper quartiles.</p>
							</fn>
						</table-wrap-foot>
					</table-wrap>
					<p>In general, we observed an inverse spatial relationship between the magnitude of SOC stocks and its uncertainty (expressed as a percent calculated using the lower and upper quartile values), i.e., the areas that stored more SOC stocks (<xref ref-type="fig" rid="F4">Figure 4</xref>) were least uncertain and the areas that stored less SOC stocks were most uncertain. The uncertainty in surface SOC stocks was less than 20% in about half of the study area, shown by blue color in <xref ref-type="fig" rid="F5">Figure 5</xref>. Areas with high uncertainty (&#x3e;50% uncertainty; purple color in <xref ref-type="fig" rid="F5">Figure 5</xref>) in predicted SOC stocks was observed in small patches in Southern Alaska and Iceland, and in larger areas of Southern and Western Russian permafrost region. Our results showed that 7% of the total study area had high uncertainty in surface SOC stocks. Areas with intermediate uncertainty (20&#x2013;49% uncertainty; shown by green color in <xref ref-type="fig" rid="F5">Figure 5</xref>) in surface SOC stocks covered about 43% of the study area.</p>
					<fig id="F5" position="float">
						<label>FIGURE 5</label>
						<caption>
							<p>Distribution of uncertainties in the predicted surface soil organic carbon stocks of the northern circumpolar permafrost region. Blue color shows areas with &#x3c;20% uncertainty, green color shows areas with 20&#x2013;49% uncertainty and purple color shows areas with &#x3e;50% uncertainty in the predicted SOC stocks. Areas in black color show water surface or perennial ice, urban, and barren land with consolidated materials.</p>
						</caption>
						<graphic xlink:href="fdata-03-528441-g005.tif"/>
					</fig>
				</sec>
				<sec id="s3-3">
					<title>Environmental Predictors of Surface Soil Organic Carbon Stocks</title>
					<p>We applied multiple environmental predictors in different prediction approaches, and the number and importance of environmental predictors differed among prediction approaches (<xref ref-type="fig" rid="F6">Figure 6</xref>). In machine learning approaches where we applied all environmental predictors, temperature, land cover types, slope, and elevation had higher impacts and soil types, bedrock geology types, aspect, and sediment transport index had lower impacts on the predicted variation of SOC stocks. In the RF approach, which produced the highest prediction accuracy (lowest RMSE) among machine learning approaches, average annual temperature and precipitation, latitude, and elevation were the most important environmental predictors of surface SOC stocks. Similarly, soil types, bedrock geology types, and surface hydrology attributes (e.g., stream power index and flow accumulation) were less important predictors in the RF approach. In contrast, in the regression kriging approach where we applied statistically significant environmental predictors, soil types, land cover types, stream power index, and sediment transport index were the most important predictors of surface SOC stocks. Likewise, the bedrock geology type was the least important but a statistically significant predictor of surface SOC stocks in the regression kriging approach.</p>
					<fig id="F6" position="float">
						<label>FIGURE 6</label>
						<caption>
							<p>The importance of different environmental predictors differs in different machine learning approaches. Number in the horizontal bars shows the relative importance of environmental predictors of soil organic carbon stocks expressed as percent.</p>
						</caption>
						<graphic xlink:href="fdata-03-528441-g006.tif"/>
					</fig>
				</sec>
				<sec id="s3-4">
					<title>Comparison of Prediction Accuracy in Different Approaches</title>
					<p>The predicted SOC stocks using an ensemble median of machine learning approaches showed lowest prediction errors (r &#x3d; 0.64 and RMSE &#x3d; 6.75 kg&#xa0;m<sup>&#x2212;2</sup>; <xref ref-type="fig" rid="F7">Figure 7</xref>) among all the spatial prediction approaches we evaluated. MLR and MARS produced highest prediction errors (r &#x3d; 0.33 and 0.36, and RMSE &#x3d; 9.0 and 8.25 kg&#xa0;m<sup>&#x2212;2</sup>) among all the approaches we evaluated. Similarly, RF and regression kriging produced comparable prediction accuracies (comparable r and RMSE values; <xref ref-type="fig" rid="F7">Figure 7</xref>). The average error of prediction was largest in the MLR approach followed by the MARS approach. On average, all prediction approaches showed positive bias (positive MEE values) and over predicted surface SOC stocks. The MLR and SVM techniques showed largest biases in SOC stock predictions, and RF showed smallest bias among all the prediction approaches. The RPD results showed that the SOC stock predictions from the ensemble median of machine learning approaches had an RPD value of 1.8. This moderate predictive accuracy is higher than any individual approach we applied (<xref ref-type="bibr" rid="B9">Chang and Laird, 2002</xref>; <xref ref-type="bibr" rid="B60">Viscarra Rossel and Webster, 2012</xref>). Other individual spatial prediction approaches showed lower predictive ability.</p>
					<fig id="F7" position="float">
						<label>FIGURE 7</label>
						<caption>
							<p>Average validation indices calculated for four different randomly selected calibration and validation datasets (70/30, 75/25, 85/15, 90/10) across the study area.</p>
						</caption>
						<graphic xlink:href="fdata-03-528441-g007.tif"/>
					</fig>
				</sec>
			</sec>
			<sec sec-type="discussion" id="s4">
				<title>Discussion</title>
				<p>We compared multiple spatial prediction techniques to predict the surface SOC stocks of the northern circumpolar region. Calculated validation errors showed comparable prediction accuracies of GBM, RF, and regression kriging approaches. Prediction errors obtained from the ensemble median predictions of machine learning approaches were lowest in comparison to the regression kriging and other individual machine learning approaches. Our results show the distribution of the magnitude of uncertainty in SOC stocks across the northern circumpolar region, which can be used to guide future sampling efforts in order to reduce the modeled climate carbon feedback predictions.</p>
				<p>Using a different thematic upscaling approach, <xref ref-type="bibr" rid="B58">Tarnocai et al. (2009)</xref> and <xref ref-type="bibr" rid="B20">Hugelius et al. (2014)</xref> predicted 191 and 217 Pg C in the surface soils of the northern circumpolar region. Our estimate of total SOC stocks (<inline-formula id="inf13">
						<mml:math>
							<mml:mrow>
								<mml:msubsup>
									<mml:mrow>
										<mml:mn>218</mml:mn>
									</mml:mrow>
									<mml:mrow>
										<mml:mo>&#x2212;</mml:mo>
										<mml:mn>26</mml:mn>
									</mml:mrow>
									<mml:mrow>
										<mml:mo>&#x2b;</mml:mo>
										<mml:mn>22</mml:mn>
									</mml:mrow>
								</mml:msubsup>
							</mml:mrow>
						</mml:math>
					</inline-formula> Pg C) is consistent with these previous estimates of the northern circumpolar region surface SOC stocks. However, our results showed different spatial distribution of SOC stocks across the study area. Our results showed 54% of SOC stocks reside in the continuous permafrost region, compared with 58% in <xref ref-type="bibr" rid="B58">Tarnocai et al. (2009)</xref>. We report 18% SOC stocks in the discontinuous permafrost region, compared with 13% estimate in <xref ref-type="bibr" rid="B58">Tarnocai et al. (2009)</xref>. Both our and <xref ref-type="bibr" rid="B58">Tarnocai et al. (2009)</xref> SOC stock estimates were similar in the sporadic permafrost region (14% SOC stocks). Our estimates showed similar SOC stocks in the isolated permafrost region as reported in <xref ref-type="bibr" rid="B58">Tarnocai et al. (2009)</xref> (&#x3c;2%). <xref ref-type="bibr" rid="B20">Hugelius et al. (2014)</xref> used a different physiographic categorization to describe the spatial distribution of SOC stocks, and do not provide distributions of SOC stocks in different types of permafrost regions. In contrast to these previous estimates of surface SOC stocks, our approach provides greater spatial details and captures a larger range in predicted SOC stocks, primarily due to the larger number of field observations available to us and different geospatial approaches we used. Our ensemble median machine learning approach also elucidated impacts of different environmental variables representing various soil-forming factors, however, while both <xref ref-type="bibr" rid="B58">Tarnocai et al. (2009)</xref> and <xref ref-type="bibr" rid="B20">Hugelius et al. (2014)</xref> showed the impact of soil types only.</p>
				<p>Areas with high uncertainty (&#x3e;50%) in predicted SOC stocks have higher elevation and slope positions. Areas with low predicted uncertainty in SOC stocks have lower elevation and slope positions (<xref ref-type="table" rid="T2">Table 2</xref>). On average, areas with high uncertainty in SOC stocks receive lower precipitation and have drier soils. We note that these areas also have very few field observations (&#x223c;2% of total samples). The areas with medium and low uncertainty ranges have 37 and 61% of the observational samples, respectively. Our results of uncertainty distributions are consistent with findings of <xref ref-type="bibr" rid="B51">Shelef et al. (2017)</xref>, who also reported that northern circumpolar region areas with high SOC stock uncertainty are areas with hillslope topography. The environmental characteristics and sample numbers of areas with different uncertainty ranges in the predicted surface SOC stocks of the northern circumpolar region are provided in <xref ref-type="table" rid="T2">Table 2</xref>. In order to reduce the existing uncertainty in surface SOC stocks, future sampling efforts should focus in the areas represented by green and purple colors in the <xref ref-type="fig" rid="F5">Figure 5</xref> (<xref ref-type="table" rid="T2">Table 2</xref>).</p>
				<table-wrap id="T2" position="float">
					<label>TABLE 2</label>
					<caption>
						<p>Average values of environmental factors and number of samples in areas with different uncertainty ranges in the predicted surface SOC stocks of the northern circumpolar region.</p>
					</caption>
					<table frame="hsides" rules="groups">
						<thead>
							<tr>
								<th rowspan="2">Environmental factors</th>
								<th colspan="3" align="center">Uncertainty ranges</th>
							</tr>
							<tr>
								<th align="center">Low (&#x3e;20%)</th>
								<th align="center">Medium (20&#x2013;9%)</th>
								<th align="center">High (&#x3e;50%)</th>
							</tr>
						</thead>
						<tbody>
							<tr>
								<td>Elevation (m)</td>
								<td align="center">408.0 (0&#x2013;4,707)</td>
								<td align="center">575.0 (0&#x2013;4,711)</td>
								<td align="center">687.0 (0&#x2013;4,086)</td>
							</tr>
							<tr>
								<td>Slope (&#xb0;)</td>
								<td align="center">4.0 (0.5&#x2013;71)</td>
								<td align="center">5.5 (0.5&#x2013;72.2)</td>
								<td align="center">7.0 (0.5&#x2013;64.6)</td>
							</tr>
							<tr>
								<td>Temperature (<sup>o</sup>C)</td>
								<td align="center">&#x2212;7.7 (&#x2212;25.5&#x2013;9.5)</td>
								<td align="center">&#x2212;8.0 (&#x2212;25.65&#x2013;9.7)</td>
								<td align="center">&#x2212;8.5 (&#x2212;25.6&#x2013;9.5)</td>
							</tr>
							<tr>
								<td>Precipitation (mm)</td>
								<td align="center">407.0 (54&#x2013;1751)</td>
								<td align="center">415.0 (54&#x2013;2044)</td>
								<td align="center">395.0 (57&#x2013;2,861)</td>
							</tr>
							<tr>
								<td>Soil wetness index</td>
								<td align="center">9.5 (4.9&#x2013;13)</td>
								<td align="center">9.0 (4.4&#x2013;13)</td>
								<td align="center">8.0 (4.8&#x2013;12.7)</td>
							</tr>
							<tr>
								<td>Sample number</td>
								<td align="center">1,006</td>
								<td align="center">620</td>
								<td align="center">34</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<fn>
							<p>Values in parentheses show the range of environmental factors.</p>
						</fn>
					</table-wrap-foot>
				</table-wrap>
				<p>In this study, we used data of environmental factors that provided spatially-explicit information of major soil-forming factors across the study area. The impact of these environmental factors on soil formation is well documented in soil science literature (<xref ref-type="bibr" rid="B21">Jenny, 1941</xref>; <xref ref-type="bibr" rid="B31">McBratney et al., 2003</xref>; <xref ref-type="bibr" rid="B39">Mishra and Riley, 2012</xref>; <xref ref-type="bibr" rid="B61">Vitharana et al., 2017</xref>). For example, average annual precipitation and temperature data provided information of the average climate of the study area. Land cover types provided information on biota properties. Various topographic attributes that we used in this study provided information on relief. Soil types and bedrock geology types data provided information about different kinds of parent materials that impacted soil properties. In addition, in the permafrost domain, soil formation is also governed by cryopedogenic processes where the role of cold temperatures and ice formation are important for SOC stock accumulation and decomposition (<xref ref-type="bibr" rid="B5">Bockheim, 2007</xref>; <xref ref-type="bibr" rid="B46">Ping et al., 2013</xref>; <xref ref-type="bibr" rid="B47">Ping et al., 2015</xref>).</p>
				<p>The regression kriging approach combines both environmental correlation and spatial autocorrelation to predict SOC stocks. As a result, regression kriging usually produces lower prediction errors in comparison to other approaches (<xref ref-type="bibr" rid="B18">Hengl et al., 2007</xref>; <xref ref-type="bibr" rid="B41">Mishra et al., 2012</xref>; <xref ref-type="bibr" rid="B34">Minasny et al., 2013</xref>). Our results suggest that some machine learning approaches, such as GBM and RF, that capture the non-linear relations between environmental controllers and SOC stocks, can produce similar prediction accuracy to that of regression kriging. We note that not all the machine learning approaches (for example, SVM and MARS) produced comparable prediction accuracy as was obtained from the regression kriging approach. However, the ensemble median prediction of machine learning approaches convincingly decreased the prediction errors and resulted in the most accurate predictions of surface SOC stocks but did not allow attribution of importance of individual environmental factors. Future studies should focus on 1) deriving the non-linear relationships between soil properties and the environmental factors and (2) incorporating the spatial autocorrelation function into machine learning approaches to achieve greater prediction accuracies.</p>
			</sec>
			<sec id="s5">
				<title>Summary</title>
				<p>We compared multiple spatial prediction approaches to predict the surface SOC stocks of the northern circumpolar permafrost region. Using a larger number of samples than previously available, we compared the prediction accuracy of the regression kriging approach with four machine learning approaches. We found that SOC stock predictions from two machine learning approaches (GBM and RF) and regression kriging have comparable prediction accuracies. Prediction errors obtained from the ensemble median predictions of machine learning approaches were lowest in comparison to the regression kriging and other individual machine learning approaches. The number and importance of environmental predictors differed among different prediction approaches. Among machine learning approaches, temperature, latitude, land cover types, slope, and elevation had higher impacts on the predicted spatial variation of surface SOC stocks. Soil types were also important predictors in the regression kriging approach. We found an inverse spatial relationship between the magnitude of SOC stocks and its uncertainty. The uncertainty in surface SOC stocks was less than 20% in about half of the study area. Areas with high uncertainty (&#x3e;50% uncertainty) in predicted SOC stocks were observed in small patches in Southern Alaska and Iceland, and in larger areas of Southern and Western Russian permafrost region. Because different machine learning approaches make use of different environmental predictors, the ensemble approach provides greater spatial details, and would probably provide improved estimates of SOC changes as it captures the non-linear relations between SOC stocks and its environmental predictors.</p>
			</sec>
			<sec id="s6">
				<title>Data Availability Statement</title>
				<p>The data used in this study are available in supporting information section. Additional data and codes can be requested from the lead author.</p>
			</sec>
			<sec id="s7">
				<title>Author Contributions</title>
				<p>UM and SG designed research, conducted analysis and wrote the manuscript. WR and FH assisted in discussion of the results and preparation of manuscript.</p>
			</sec>
			<sec id="s8">
				<title>Funding</title>
				<p>This study was supported by the Director, Office of Science, Office of Biological and Environmental Research of the U.S. Department of Energy under Argonne National Laboratory contract No. DE-AC02-06CH11357. Efforts of WR were supported by the RUBISCO Scientific Focus Area in the Regional Global Climate Modeling Program by the Director, Office of Science, Office of Biological and Environmental Research, of the U.S. Department of Energy under contract DE-AC02-05CH11231 to Berkeley Lab.</p>
			</sec>
			<sec id="s9">
				<title>Disclaimer</title>
				<p>This paper describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the paper do not necessarily represent the views of the U.S. Department of Energy or the United States Government.</p>
			</sec>
			<sec sec-type="COI-statement" id="s10">
				<title>Conflict of Interest</title>
				<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
			</sec>
		</body>
		<back>
			<ack>
				<p>This research was performed for the Reducing Uncertainties in Biogeochemical Interactions through Synthesis and Computation Science Focus Area (RUBISCO SFA), which is sponsored by the Regional and Global Model Analysis (RGMA) activity of the Earth Environmental Systems Modeling (EESM) Program in the Earth and Environmental Systems Sciences Division (EESSD) of the Office of Biological and Environmental Research (BER) in the US Department of Energy Office of Science. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International, Inc., for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-NA-0003525. Lawrence Berkeley National Laboratory (LBNL) is managed by the Regents of the University of California for the U.S. Department of Energy under Contract No. DE-AC02-05CH11231. Oak Ridge National Laboratory (ORNL) is managed by UT-Battelle, LLC, for the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. Thanks to A. Lupachev, S. Smith, C. Shaw, and J. Y. Jung for providing access to some of the SOC profile data.</p>
			</ack>
			<ref-list>
				<title>References</title>
				<ref id="B1">
					<citation citation-type="book">
						<person-group person-group-type="author">
							<name>
								<surname>Arun</surname>
								<given-names>K.</given-names>
							</name>
							<name>
								<surname>Langmead</surname>
								<given-names>C. J.</given-names>
							</name>
						</person-group> (<year>2005</year>). <source>Structure based chemical shift prediction using random forest non-linear regression</source>. <publisher-loc>Pittsburgh, PA</publisher-loc>: <publisher-name>CarnegieMellon University</publisher-name>, <comment>CMU-CS-05-163</comment>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="http://reports-archive.adm.cs.cmu.edu/anon/2005/abstracts/05-163">http://reports-archive.adm.cs.cmu.edu/anon/2005/abstracts/05-163</ext-link>
						</comment>
					</citation>
				</ref>
				<ref id="B2">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Bartsch</surname>
								<given-names>A.</given-names>
							</name>
							<name>
								<surname>Widhalm</surname>
								<given-names>B.</given-names>
							</name>
							<name>
								<surname>Kuhry</surname>
								<given-names>P.</given-names>
							</name>
							<name>
								<surname>Hugelius</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Palmtag</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Siewert</surname>
								<given-names>M. B.</given-names>
							</name>
						</person-group> (<year>2016</year>). <article-title>Can C-band synthetic aperture radar be used to estimate soil organic carbon storage in tundra?</article-title>
						<source>Biogeosciences</source>
						<volume>13</volume> (<issue>9</issue>), <fpage>5433</fpage>&#x2013;<lpage>5470</lpage>. <pub-id pub-id-type="doi">10.5194/bg-13-5453-2016</pub-id>
					</citation>
				</ref>
				<ref id="B3">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Beer</surname>
								<given-names>C.</given-names>
							</name>
						</person-group> (<year>2016</year>). <article-title>Permafrost sub-grid heterogeneity of soil properties key for 3-D soil processes and future climate projections</article-title>. <source>Front. Earth Sci.</source>
						<volume>4</volume> (<issue>81</issue>), 81. <pub-id pub-id-type="doi">10.3389/feart.2016.00081</pub-id>
					</citation>
				</ref>
				<ref id="B4">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Biskaborn</surname>
								<given-names>B. K.</given-names>
							</name>
							<name>
								<surname>Smith</surname>
								<given-names>S. L.</given-names>
							</name>
							<name>
								<surname>Noetzli</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Matthes</surname>
								<given-names>H.</given-names>
							</name>
							<name>
								<surname>Vieira</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Streletskiy</surname>
								<given-names>D. A.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2019</year>). <article-title>Permafrost is warming at a global scale</article-title>. <source>Nat. Commun.</source>
						<volume>10</volume>, <fpage>264</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-018-08240-4</pub-id>
					</citation>
				</ref>
				<ref id="B5">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Bockheim</surname>
								<given-names>J. G.</given-names>
							</name>
						</person-group> (<year>2007</year>). <article-title>Importance of cryoturbation in redistributing organic carbon in permafrost-affected soils</article-title>. <source>Soil Sci. Soc. Am. J.</source>
						<volume>71</volume> (<issue>4</issue>), <fpage>1335</fpage>&#x2013;<lpage>1342</lpage>. <pub-id pub-id-type="doi">10.2136/sssaj2006.0414N</pub-id>
					</citation>
				</ref>
				<ref id="B6">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Breiman</surname>
								<given-names>L.</given-names>
							</name>
						</person-group> (<year>2001</year>). <article-title>Random forests</article-title>. <source>Mach. Learn.</source>
						<volume>45</volume> (<issue>1</issue>), <fpage>5</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id>
					</citation>
				</ref>
				<ref id="B7">
					<citation citation-type="book">
						<person-group person-group-type="author">
							<name>
								<surname>Breiman</surname>
								<given-names>L.</given-names>
							</name>
							<name>
								<surname>Friedman</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Olshen</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Stone</surname>
								<given-names>C.</given-names>
							</name>
						</person-group> (<year>1984</year>). <source>Classification and regression trees</source>. <publisher-loc>Belmont, CA</publisher-loc>: <publisher-name>Wadsworth International Group</publisher-name>.</citation>
				</ref>
				<ref id="B8">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Cambule</surname>
								<given-names>A. H.</given-names>
							</name>
							<name>
								<surname>Rossiter</surname>
								<given-names>D. G.</given-names>
							</name>
							<name>
								<surname>Stoorvogel</surname>
								<given-names>J. J.</given-names>
							</name>
						</person-group> (<year>2013</year>). <article-title>A methodology for digital soil mapping in poorly-accessible areas</article-title>. <source>Geoderma</source>
						<volume>192</volume>, <fpage>341</fpage>&#x2013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2012.08.020</pub-id>
					</citation>
				</ref>
				<ref id="B9">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Chang</surname>
								<given-names>C. W.</given-names>
							</name>
							<name>
								<surname>Laird</surname>
								<given-names>D. A.</given-names>
							</name>
						</person-group> (<year>2002</year>). <article-title>Near-infrared reflectance spectroscopic analysis of soil C and N</article-title>. <source>Soil Sci.</source>
						<volume>167</volume> (<issue>2</issue>), <fpage>110</fpage>&#x2013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1097/00010694-200202000-00003</pub-id>
					</citation>
				</ref>
				<ref id="B10">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Chen</surname>
								<given-names>S.</given-names>
							</name>
							<name>
								<surname>Mulder</surname>
								<given-names>V. L.</given-names>
							</name>
							<name>
								<surname>Heuvelink</surname>
								<given-names>G. B. M.</given-names>
							</name>
							<name>
								<surname>Poggio</surname>
								<given-names>L.</given-names>
							</name>
							<name>
								<surname>Caubet</surname>
								<given-names>M.</given-names>
							</name>
							<name>
								<surname>Dobarco</surname>
								<given-names>M. R.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2020</year>). <article-title>Model averaging for mapping topsoil organic carbon in France</article-title>. <source>Geoderma</source>
						<volume>366</volume>, <fpage>114237</fpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2020.114237</pub-id>
					</citation>
				</ref>
				<ref id="B11">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Cortes</surname>
								<given-names>C.</given-names>
							</name>
							<name>
								<surname>Vapnik</surname>
								<given-names>V.</given-names>
							</name>
						</person-group> (<year>1995</year>). <article-title>Support-vector networks</article-title>. <source>Mach. Learn.</source>
						<volume>20</volume> (<issue>3</issue>), <fpage>273</fpage>&#x2013;<lpage>297</lpage>. <pub-id pub-id-type="doi">10.1007/BF00994018</pub-id>
					</citation>
				</ref>
				<ref id="B12">
					<citation citation-type="book">
						<person-group person-group-type="author">
							<name>
								<surname>Danielson</surname>
								<given-names>J. J.</given-names>
							</name>
							<name>
								<surname>Gesch</surname>
								<given-names>D. B.</given-names>
							</name>
						</person-group> (<year>2011</year>). <source>Global multi-resolution terrain elevation data 2010 (GMTED2010): U.S. geological survey open-file report 2011&#x2013;1073</source>. <publisher-loc>Reston, VA</publisher-loc>: <publisher-name>U.S. Department of the Interior, U.S. Geological Survey</publisher-name>, <fpage>26</fpage>.</citation>
				</ref>
				<ref id="B13">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Ding</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Li</surname>
								<given-names>F.</given-names>
							</name>
							<name>
								<surname>Yang</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Chen</surname>
								<given-names>L.</given-names>
							</name>
							<name>
								<surname>Zhang</surname>
								<given-names>B.</given-names>
							</name>
							<name>
								<surname>Liu</surname>
								<given-names>L.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2016</year>). <article-title>The permafrost carbon inventory on the Tibetan Plateau: a new evaluation using deep sediment cores</article-title>. <source>Glob. Change Biol.</source>
						<volume>22</volume>, <fpage>2688</fpage>&#x2013;<lpage>2701</lpage>. <pub-id pub-id-type="doi">10.1111/gcb.13257</pub-id>
					</citation>
				</ref>
				<ref id="B14">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Friedman</surname>
								<given-names>J. H.</given-names>
							</name>
						</person-group> (<year>1991</year>). <article-title>Multivariate adaptive regression splines</article-title>. <source>Ann. Stat.</source>
						<volume>19</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>67</lpage>.</citation>
				</ref>
				<ref id="B15">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Friedman</surname>
								<given-names>J. H.</given-names>
							</name>
						</person-group> (<year>2001</year>). <article-title>Greedy function approximation: a gradient boosting machine</article-title>. <source>Ann. Stat.</source>
						<volume>29</volume>, <fpage>1189</fpage>&#x2013;<lpage>1232</lpage>.</citation>
				</ref>
				<ref id="B16">
					<citation citation-type="book">
						<collab>Glob cover</collab> (<year>2009</year>). <source>Products description and validation report</source>. <publisher-loc>Louvain, Belgium</publisher-loc>: <publisher-name>Ecological Society of America and Universite Catholique de Louvain</publisher-name>.</citation>
				</ref>
				<ref id="B17">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Hartmann</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Moosdorf</surname>
								<given-names>N.</given-names>
							</name>
						</person-group> (<year>2012</year>). <article-title>The new global lithological map database GLiM: a representation of rock properties at the Earth surface</article-title>. <source>Geochem. Geophy. Geosy.</source>
						<volume>13</volume>, <fpage>Q12004</fpage>. <pub-id pub-id-type="doi">10.1029/2012GC004370</pub-id>
					</citation>
				</ref>
				<ref id="B18">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Hengl</surname>
								<given-names>T.</given-names>
							</name>
							<name>
								<surname>Heuvelink</surname>
								<given-names>G. B. M.</given-names>
							</name>
							<name>
								<surname>Rossiter</surname>
								<given-names>D. G.</given-names>
							</name>
						</person-group> (<year>2007</year>). <article-title>About regression&#x2010;kriging: from equations to case studies</article-title>. <source>Comput. Geosci.</source>
						<volume>33</volume> (<issue>10</issue>), <fpage>1301</fpage>&#x2013;<lpage>1315</lpage>. <pub-id pub-id-type="doi">10.1016/j.cageo.2007.05.001</pub-id>
					</citation>
				</ref>
				<ref id="B19">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Hijmans</surname>
								<given-names>R. J.</given-names>
							</name>
							<name>
								<surname>Cameron</surname>
								<given-names>S. E.</given-names>
							</name>
							<name>
								<surname>Parra</surname>
								<given-names>J. L.</given-names>
							</name>
							<name>
								<surname>Jones</surname>
								<given-names>P. G.</given-names>
							</name>
							<name>
								<surname>Jarvis</surname>
								<given-names>A.</given-names>
							</name>
						</person-group> (<year>2005</year>). <article-title>Very high resolution interpolated climate surfaces for global land areas</article-title>. <source>Int. J. Climatol.</source>
						<volume>25</volume>, <fpage>1965</fpage>&#x2013;<lpage>1978</lpage>. <pub-id pub-id-type="doi">10.1002/joc.1276</pub-id>link></citation>
				</ref>
				<ref id="B20">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Hugelius</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Strauss</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Zubrzycki</surname>
								<given-names>S.</given-names>
							</name>
							<name>
								<surname>Harden</surname>
								<given-names>J. W.</given-names>
							</name>
							<name>
								<surname>Schuur</surname>
								<given-names>E. A. G.</given-names>
							</name>
							<name>
								<surname>Ping</surname>
								<given-names>C. L.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2014</year>). <article-title>Estimated stocks of circumpolar permafrost carbon with quantified uncertainty ranges and identified data gaps</article-title>. <source>Biogeosciences</source>
						<volume>11</volume> (<issue>23</issue>), <fpage>6573</fpage>&#x2013;<lpage>6593</lpage>. <pub-id pub-id-type="doi">10.5194/bg-11-6573-2014</pub-id>
					</citation>
				</ref>
				<ref id="B21">
					<citation citation-type="book">
						<person-group person-group-type="author">
							<name>
								<surname>Jenny</surname>
								<given-names>H.</given-names>
							</name>
						</person-group> (<year>1941</year>). <source>Factors of soil formation: a system of quantitative pedology</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>McGraw Hill</publisher-name>
						<italic>,</italic>
						<fpage>1</fpage>&#x2013;<lpage>261</lpage>.</citation>
				</ref>
				<ref id="B22">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Johnson</surname>
								<given-names>K. D.</given-names>
							</name>
							<name>
								<surname>Harden</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>McGuire</surname>
								<given-names>A. D.</given-names>
							</name>
							<name>
								<surname>Bliss</surname>
								<given-names>N. B.</given-names>
							</name>
							<name>
								<surname>Bockheim</surname>
								<given-names>J. G.</given-names>
							</name>
							<name>
								<surname>Clark</surname>
								<given-names>M.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2011</year>). <article-title>Soil carbon distribution in Alaska in relation to soil-forming factors</article-title>. <source>Geoderma</source>
						<volume>167</volume>, <fpage>71</fpage>&#x2013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2011.10.006</pub-id>
					</citation>
				</ref>
				<ref id="B23">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Keskin</surname>
								<given-names>H.</given-names>
							</name>
							<name>
								<surname>Grunwald</surname>
								<given-names>S.</given-names>
							</name>
						</person-group> (<year>2018</year>). <article-title>Regression kriging as a workhorse in the digital soil mapper&#x2019;s toolbox</article-title>. <source>Geoderma</source>
						<volume>326</volume>, <fpage>22</fpage>&#x2013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2018.04.004</pub-id>
					</citation>
				</ref>
				<ref id="B24">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Kumar</surname>
								<given-names>S.</given-names>
							</name>
							<name>
								<surname>Lal</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Liu</surname>
								<given-names>D.</given-names>
							</name>
						</person-group> (<year>2012</year>). <article-title>A geographically weighted regression kriging approach for mapping soil organic carbon stock</article-title>. <source>Geoderma</source>
						<volume>189&#x2013;190</volume>, <fpage>627</fpage>&#x2013;<lpage>634</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2012.05.022</pub-id>
					</citation>
				</ref>
				<ref id="B25">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Lacoste</surname>
								<given-names>M.</given-names>
							</name>
							<name>
								<surname>Minasny</surname>
								<given-names>B.</given-names>
							</name>
							<name>
								<surname>McBratney</surname>
								<given-names>A.</given-names>
							</name>
							<name>
								<surname>Michot</surname>
								<given-names>D.</given-names>
							</name>
							<name>
								<surname>Viaud</surname>
								<given-names>V.</given-names>
							</name>
							<name>
								<surname>Walter</surname>
								<given-names>C.</given-names>
							</name>
						</person-group> (<year>2014</year>). <article-title>High resolution 3D mapping of soil organic carbon in a heterogeneous agricultural landscape</article-title>. <source>Geoderma</source>
						<volume>213</volume>, <fpage>296</fpage>&#x2013;<lpage>311</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2013.07.002</pub-id>
					</citation>
				</ref>
				<ref id="B26">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Li</surname>
								<given-names>Q. Q.</given-names>
							</name>
							<name>
								<surname>Yue</surname>
								<given-names>T. X.</given-names>
							</name>
							<name>
								<surname>Wang</surname>
								<given-names>C. Q.</given-names>
							</name>
							<name>
								<surname>Zhang</surname>
								<given-names>W. J.</given-names>
							</name>
							<name>
								<surname>Yu</surname>
								<given-names>Y.</given-names>
							</name>
							<name>
								<surname>Li</surname>
								<given-names>B.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2013</year>). <article-title>Spatially distributed modeling of soil organic matter across China: an application of artificial neural network approach</article-title>. <source>Catena</source>
						<volume>104</volume>, <fpage>210</fpage>&#x2013;<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1016/j.catena.2012.11.012</pub-id>
					</citation>
				</ref>
				<ref id="B27">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Martin</surname>
								<given-names>M. P.</given-names>
							</name>
							<name>
								<surname>Orton</surname>
								<given-names>T. G.</given-names>
							</name>
							<name>
								<surname>Lacarce</surname>
								<given-names>E.</given-names>
							</name>
							<name>
								<surname>Meersmans</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Saby</surname>
								<given-names>N. P. A.</given-names>
							</name>
							<name>
								<surname>Paroissien</surname>
								<given-names>J. B.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2014</year>). <article-title>Evaluation of modelling approaches for predicting the spatial distribution of soil organic carbon stocks at the national scale</article-title>. <source>Geoderma</source>
						<volume>223&#x2013;225</volume>, <fpage>97</fpage>&#x2013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2014.01.005</pub-id>
					</citation>
				</ref>
				<ref id="B28">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Martin</surname>
								<given-names>M. P.</given-names>
							</name>
							<name>
								<surname>Wattenbach</surname>
								<given-names>M.</given-names>
							</name>
							<name>
								<surname>Smith</surname>
								<given-names>P.</given-names>
							</name>
							<name>
								<surname>Meersmans</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Jolivet</surname>
								<given-names>C.</given-names>
							</name>
							<name>
								<surname>Boulonne</surname>
								<given-names>L.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2011</year>). <article-title>Spatial distribution of soil organic carbon stocks in France</article-title>. <source>Biogeosciences</source>
						<volume>8</volume>, <fpage>1053</fpage>&#x2013;<lpage>1065</lpage>. <pub-id pub-id-type="doi">10.5194/bg-8-1053-2011</pub-id>
					</citation>
				</ref>
				<ref id="B29">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>McGuire</surname>
								<given-names>A. D.</given-names>
							</name>
							<name>
								<surname>Koven</surname>
								<given-names>C.</given-names>
							</name>
							<name>
								<surname>Lawrence</surname>
								<given-names>D. M.</given-names>
							</name>
							<name>
								<surname>Clein</surname>
								<given-names>J. S.</given-names>
							</name>
							<name>
								<surname>Xia</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Beer</surname>
								<given-names>C.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2016</year>). <article-title>Variability in the sensitivity among model simulations of permafrost and carbon dynamics in the permafrost region between 1960 and 2009</article-title>. <source>Glob. Biogeochem. Cycles.</source>
						<volume>30</volume> (<issue>7</issue>), <fpage>1015</fpage>&#x2013;<lpage>1037</lpage>. <pub-id pub-id-type="doi">10.1002/2016GB005405</pub-id>
					</citation>
				</ref>
				<ref id="B30">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>McGuire</surname>
								<given-names>A. D.</given-names>
							</name>
							<name>
								<surname>Lawrence</surname>
								<given-names>D. M.</given-names>
							</name>
							<name>
								<surname>Koven</surname>
								<given-names>C.</given-names>
							</name>
							<name>
								<surname>Clein</surname>
								<given-names>J. S.</given-names>
							</name>
							<name>
								<surname>Burke</surname>
								<given-names>E.</given-names>
							</name>
							<name>
								<surname>Chen</surname>
								<given-names>G.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2017</year>). <article-title>Dependence of the evolution of carbon dynamics in the northern permafrost region on the trajectory of climate change</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source>
						<volume>115</volume>, <fpage>3882</fpage>&#x2013;<lpage>3887</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1719903115</pub-id>
					</citation>
				</ref>
				<ref id="B31">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>McBratney</surname>
								<given-names>A. B.</given-names>
							</name>
							<name>
								<surname>MendoncaSantos</surname>
								<given-names>M. L.</given-names>
							</name>
							<name>
								<surname>Minasny</surname>
								<given-names>B.</given-names>
							</name>
						</person-group> (<year>2003</year>). <article-title>On digital soil mapping</article-title>. <source>Geoderma</source>
						<volume>117</volume> (<issue>1&#x2013;2</issue>), <fpage>3</fpage>&#x2013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1016/S0016-7061(03)00223-4</pub-id>
					</citation>
				</ref>
				<ref id="B32">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Meng</surname>
								<given-names>Q.</given-names>
							</name>
						</person-group> (<year>2014</year>). <article-title>Regression kriging versus geographically weighted regression for spatial interpolation</article-title>. <source>Int. J. Adv. Remote Sens. GIS.</source>
						<volume>3</volume> (<issue>1</issue>), <fpage>606</fpage>.</citation>
				</ref>
				<ref id="B33">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Michaelson</surname>
								<given-names>G. J.</given-names>
							</name>
							<name>
								<surname>Ping</surname>
								<given-names>C.-L.</given-names>
							</name>
							<name>
								<surname>Clark</surname>
								<given-names>M.</given-names>
							</name>
						</person-group> (<year>2013</year>). <article-title>Soil pedon carbon and nitrogen data for Alaska: an analysis and update</article-title>. <source>Open J. Soil Sci.</source>
						<volume>3</volume> (<issue>2</issue>), <fpage>132</fpage>&#x2013;<lpage>142</lpage>. <pub-id pub-id-type="doi">10.4236/ojss.2013.32015</pub-id>
					</citation>
				</ref>
				<ref id="B34">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Minasny</surname>
								<given-names>B.</given-names>
							</name>
							<name>
								<surname>McBratney</surname>
								<given-names>A. B.</given-names>
							</name>
							<name>
								<surname>Malone</surname>
								<given-names>B. P.</given-names>
							</name>
							<name>
								<surname>Wheeler</surname>
								<given-names>I.</given-names>
							</name>
						</person-group> (<year>2013</year>). <article-title>Chapter One -Digital mapping of soil carbon</article-title>. <source>Adv. Agron.</source>
						<volume>118</volume>, <fpage>1</fpage>&#x2013;<lpage>47</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-12-405942-9.00001-3</pub-id>
					</citation>
				</ref>
				<ref id="B35">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Drewniak</surname>
								<given-names>B.</given-names>
							</name>
							<name>
								<surname>Jastrow</surname>
								<given-names>J. D.</given-names>
							</name>
							<name>
								<surname>Matamala</surname>
								<given-names>R. M.</given-names>
							</name>
							<name>
								<surname>Vitharana</surname>
								<given-names>U. W. A.</given-names>
							</name>
						</person-group> (<year>2017</year>). <article-title>Spatial representation of organic carbon and active-layer thickness of high latitude soils in CMIP5 earth system models</article-title>. <source>Geoderma</source>
						<volume>300</volume>, <fpage>55</fpage>&#x2013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2016.04.017</pub-id>
					</citation>
				</ref>
				<ref id="B36">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Jastrow</surname>
								<given-names>J. D.</given-names>
							</name>
							<name>
								<surname>Matamala</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Hugelius</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Koven</surname>
								<given-names>C. D.</given-names>
							</name>
							<name>
								<surname>Harden</surname>
								<given-names>J. W.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2013</year>). <article-title>Empirical estimates to reduce modeling uncertainties of soil organic carbon in permafrost regions: a review of recent progress and remaining challenges</article-title>. <source>Environ. Res. Lett.</source>
						<volume>8</volume>. <pub-id pub-id-type="doi">10.1088/1748-9326/8/3/035020</pub-id>
					</citation>
				</ref>
				<ref id="B37">
					<citation citation-type="book">
						<person-group person-group-type="author">
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Lal</surname>
								<given-names>R.</given-names>
							</name>
						</person-group> (<year>2010</year>). &#x201c;<article-title>Predictive mapping of soil organic carbon: a case study using geographic weighted regression approach</article-title>,&#x201d; in <source>GIS applications in agriculture&#x2013; nutrient management for improved Energy efficiency</source>. Editors <person-group person-group-type="editor">
							<name>
								<surname>Clay</surname>
								<given-names>D.</given-names>
							</name>
							<name>
								<surname>Shanahan</surname>
								<given-names>J.</given-names>
							</name>
						</person-group> (<publisher-loc>Boca Raton, FL</publisher-loc>: <publisher-name>CRC Press</publisher-name>), <fpage>209</fpage>&#x2013;<lpage>233</lpage>.</citation>
				</ref>
				<ref id="B38">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Lal</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Slater</surname>
								<given-names>B.</given-names>
							</name>
							<name>
								<surname>Calhoun</surname>
								<given-names>F.</given-names>
							</name>
							<name>
								<surname>Liu</surname>
								<given-names>D.</given-names>
							</name>
							<name>
								<surname>Van Meirvenne</surname>
								<given-names>M.</given-names>
							</name>
						</person-group> (<year>2009</year>). <article-title>Predicting soil organic carbon stock using profile depth distribution functions and ordinary kriging</article-title>. <source>Soil Sci. Soc. Am. J.</source>
						<volume>73</volume>, <fpage>614</fpage>&#x2013;<lpage>621</lpage>. <pub-id pub-id-type="doi">10.2136/sssaj2007.0410</pub-id>
					</citation>
				</ref>
				<ref id="B39">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Riley</surname>
								<given-names>W. J.</given-names>
							</name>
						</person-group> (<year>2012</year>). <article-title>Alaskan soil carbon stocks: spatial variability and dependence on environmental factors</article-title>. <source>Biogeosciences</source>
						<volume>9</volume>, <fpage>3637</fpage>&#x2013;<lpage>3645</lpage>. <pub-id pub-id-type="doi">10.5194/bg-9-3637-2012</pub-id>
					</citation>
				</ref>
				<ref id="B40">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Riley</surname>
								<given-names>W. J.</given-names>
							</name>
						</person-group> (<year>2015</year>). <article-title>Scaling impacts on environmental controls and spatial heterogeneity of soil organic carbon stocks</article-title>, <source>Biogeosciences</source>
						<volume>12</volume>
						<fpage>3993</fpage>&#x2013;<lpage>4004</lpage>. <pub-id pub-id-type="doi">10.5194/bg-12-3993-2015</pub-id>
					</citation>
				</ref>
				<ref id="B41">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Torn</surname>
								<given-names>M. S.</given-names>
							</name>
							<name>
								<surname>Ogle</surname>
								<given-names>S.</given-names>
							</name>
							<name>
								<surname>Masanet</surname>
								<given-names>E.</given-names>
							</name>
						</person-group> (<year>2012</year>). <article-title>Improving regional soil carbon inventories: combining IPCC carbon inventory method with regression kriging</article-title>. <source>Geoderma</source>, <volume>189&#x2013;190</volume>, <fpage>288</fpage>&#x2013;<lpage>295</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2012.06.022</pub-id>
					</citation>
				</ref>
				<ref id="B42">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Mitran</surname>
								<given-names>T.</given-names>
							</name>
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Lal</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Ravisankar</surname>
								<given-names>T.</given-names>
							</name>
							<name>
								<surname>Sreenivas</surname>
								<given-names>K.</given-names>
							</name>
						</person-group> (<year>2018</year>). <article-title>Spatial distribution of soil carbon stocks in a semi-arid region of India</article-title>. <source>Geoderma Reg.</source>
						<volume>15</volume>, <lpage>e00192</lpage>. <pub-id pub-id-type="doi">10.1016/j.geodrs.2018.e00192</pub-id>
					</citation>
				</ref>
				<ref id="B43">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Palmtag</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Hugelius</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Lashchinskiy</surname>
								<given-names>N.</given-names>
							</name>
							<name>
								<surname>Tamtorf</surname>
								<given-names>M. P.</given-names>
							</name>
							<name>
								<surname>Richter</surname>
								<given-names>A.</given-names>
							</name>
							<name>
								<surname>Elberling</surname>
								<given-names>B.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2015</year>). <article-title>Storage, landscape distribution, and burial history of soil organic matter in contrasting areas of continuous permafrost</article-title>. <source>Arc. Antarct. Alp. Res.</source>
						<volume>47</volume> (<issue>1</issue>), <fpage>71</fpage>&#x2013;<lpage>88</lpage>. <pub-id pub-id-type="doi">10.1657/AAAR0014-027</pub-id>
					</citation>
				</ref>
				<ref id="B44">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Pastick</surname>
								<given-names>N. J.</given-names>
							</name>
							<name>
								<surname>Rigge</surname>
								<given-names>M.</given-names>
							</name>
							<name>
								<surname>Wylie</surname>
								<given-names>B. K.</given-names>
							</name>
							<name>
								<surname>Jorgenson</surname>
								<given-names>M. T.</given-names>
							</name>
							<name>
								<surname>Rose</surname>
								<given-names>J. R.</given-names>
							</name>
							<name>
								<surname>Johnson</surname>
								<given-names>K. D.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2014</year>). <article-title>Distribution and landscape controls of organic layer thickness and carbon within the Alaskan Yukon River Basin</article-title>. <source>Geoderma</source>, <volume>230&#x2013;231</volume>, <fpage>79</fpage>&#x2013;<lpage>94</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2014.04.008</pub-id>
					</citation>
				</ref>
				<ref id="B45">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Ping</surname>
								<given-names>C. L.</given-names>
							</name>
							<name>
								<surname>Michaelson</surname>
								<given-names>G. J.</given-names>
							</name>
							<name>
								<surname>Kimble</surname>
								<given-names>J. M.</given-names>
							</name>
							<name>
								<surname>Romanovsky</surname>
								<given-names>V. E.</given-names>
							</name>
							<name>
								<surname>Shur</surname>
								<given-names>Y. L.</given-names>
							</name>
							<name>
								<surname>Swanson</surname>
								<given-names>D. K.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2008</year>). <article-title>Cryogenesis and soil formation along a bioclimate gradient in Arctic North America</article-title>. <source>J. Geophys. Res.</source>, <volume>113</volume>, <fpage>G03S12</fpage>. <pub-id pub-id-type="doi">10.1029/2008JG000744</pub-id>
					</citation>
				</ref>
				<ref id="B46">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Ping</surname>
								<given-names>C. L.</given-names>
							</name>
							<name>
								<surname>Clark</surname>
								<given-names>M. H.</given-names>
							</name>
							<name>
								<surname>Kimble</surname>
								<given-names>J. M.</given-names>
							</name>
							<name>
								<surname>Michaelson</surname>
								<given-names>G. J.</given-names>
							</name>
							<name>
								<surname>Shur</surname>
								<given-names>Y.</given-names>
							</name>
							<name>
								<surname>Stiles</surname>
								<given-names>C. A.</given-names>
							</name>
						</person-group> (<year>2013</year>). <article-title>Sampling protocols for Permafrost-affected soils</article-title>. <source>Soil Horizons.</source>
						<volume>54</volume>, <fpage>13</fpage>&#x2013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.2136/SH12-09-0027</pub-id>
					</citation>
				</ref>
				<ref id="B47">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Ping</surname>
								<given-names>C. L.</given-names>
							</name>
							<name>
								<surname>Jastrow</surname>
								<given-names>J. D.</given-names>
							</name>
							<name>
								<surname>Jorgenson</surname>
								<given-names>G. J.</given-names>
							</name>
							<name>
								<surname>Michaelson</surname>
								<given-names>M. T.</given-names>
							</name>
							<name>
								<surname>Shu</surname>
								<given-names>Y. L.</given-names>
							</name>
						</person-group> (<year>2015</year>). <article-title>Permafrost soils and carbon cycling</article-title>
						<source>SOIL</source>
						<volume>1</volume>, <fpage>147</fpage>&#x2013;<lpage>171</lpage>. <pub-id pub-id-type="doi">10.5194/soil-1-147-2015</pub-id>
					</citation>
				</ref>
				<ref id="B48">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Romanovsky</surname>
								<given-names>V. E.</given-names>
							</name>
							<name>
								<surname>Smith</surname>
								<given-names>S. L.</given-names>
							</name>
							<name>
								<surname>Christiansen</surname>
								<given-names>H. H.</given-names>
							</name>
						</person-group> (<year>2010</year>). <article-title>Permafrost thermal state in the polar Northern Hemisphere during the international polar year 2007&#x2013;2009: a synthesis</article-title>. <source>Permafrost Periglac.</source>
						<volume>21</volume>, <fpage>106</fpage>&#x2013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1002/ppp.689</pub-id>
					</citation>
				</ref>
				<ref id="B49">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Rowland</surname>
								<given-names>J. C.</given-names>
							</name>
							<name>
								<surname>Jones</surname>
								<given-names>C. E.</given-names>
							</name>
							<name>
								<surname>Altmann</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Bryan</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Crosby</surname>
								<given-names>B. T.</given-names>
							</name>
							<name>
								<surname>Geernaert</surname>
								<given-names>G. L.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2010</year>). <article-title>Arctic landscapes in transition: responses to thawing permafrost</article-title>. <source>
							<italic>EOS (Earth and Space Science News</italic>)</source>
						<volume>91</volume>, <fpage>229</fpage>&#x2013;<lpage>230</lpage>. <pub-id pub-id-type="doi">10.1029/2010EO260001</pub-id>
					</citation>
				</ref>
				<ref id="B50">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Schuur</surname>
								<given-names>E. A. G.</given-names>
							</name>
							<name>
								<surname>McGuire</surname>
								<given-names>A. D.</given-names>
							</name>
							<name>
								<surname>Sch&#xe4;del</surname>
								<given-names>C.</given-names>
							</name>
							<name>
								<surname>Grosse</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Harden</surname>
								<given-names>J. W.</given-names>
							</name>
							<name>
								<surname>Hayes</surname>
								<given-names>D. J.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2015</year>). <article-title>Climate change and the permafrost carbon feedback</article-title>. <source>Nature</source>
						<volume>520</volume>, <fpage>171</fpage>&#x2013;<lpage>179</lpage>. <pub-id pub-id-type="doi">10.1038/nature14338</pub-id>
					</citation>
				</ref>
				<ref id="B51">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Shelef</surname>
								<given-names>E.</given-names>
							</name>
							<name>
								<surname>Rowland</surname>
								<given-names>J. C.</given-names>
							</name>
							<name>
								<surname>Willson</surname>
								<given-names>C. J</given-names>
							</name>
							<name>
								<surname>Hilley</surname>
								<given-names>G. E.</given-names>
							</name>
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Alttman</surname>
								<given-names>G. L.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2017</year>). <article-title>Large uncertainty in permafrost carbon stocks due to hillslope soil deposits</article-title>. <source>Geophys. Res. Lett.</source>
						<volume>44</volume> (<issue>12</issue>), <fpage>6134</fpage>&#x2013;<lpage>6144</lpage>. <pub-id pub-id-type="doi">10.1002/2017GL073823</pub-id>
					</citation>
				</ref>
				<ref id="B52">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Shi</surname>
								<given-names>Z.</given-names>
							</name>
							<name>
								<surname>Crowell</surname>
								<given-names>S.</given-names>
							</name>
							<name>
								<surname>Luo</surname>
								<given-names>Y.</given-names>
							</name>
							<name>
								<surname>Moore</surname>
								<given-names>B.</given-names>
							</name>
						</person-group> (<year>2018</year>). <article-title>Model structures amplify uncertainty in predicted soil carbon responses to climate change</article-title>. <source>Nat. Commun.</source>
						<volume>9</volume>, <fpage>2171</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-018-04526-9</pub-id>
					</citation>
				</ref>
				<ref id="B53">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Siewert</surname>
								<given-names>M. B.</given-names>
							</name>
						</person-group> (<year>2018</year>). <article-title>High-resolution digital mapping of soil organic carbon in permafrost terrain using machine learning: a case study in a sub-Arctic peatland environment</article-title>. <source>Biogeosciences</source>
						<volume>15</volume>, <fpage>1663</fpage>&#x2013;<lpage>1682</lpage>. <pub-id pub-id-type="doi">10.5194/bg-15-1663-2018</pub-id>
					</citation>
				</ref>
				<ref id="B54">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Siewert</surname>
								<given-names>M. B.</given-names>
							</name>
							<name>
								<surname>Hanisch</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>Weiss</surname>
								<given-names>N.</given-names>
							</name>
							<name>
								<surname>Kuhry</surname>
								<given-names>P.</given-names>
							</name>
							<name>
								<surname>Maximov</surname>
								<given-names>T. C.</given-names>
							</name>
							<name>
								<surname>Hugelius</surname>
								<given-names>G.</given-names>
							</name>
						</person-group> (<year>2015</year>). <article-title>Comparing carbon storage of Siberian tundra and taiga permafrost ecosystems at very high spatial resolution</article-title>. <source>J. Geohys. Res-Biogeo.</source>
						<volume>120</volume>, <fpage>1973</fpage>&#x2013;<lpage>1994</lpage>. <pub-id pub-id-type="doi">10.1002/2015JG002999</pub-id>
					</citation>
				</ref>
				<ref id="B55">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Sreenivas</surname>
								<given-names>K.</given-names>
							</name>
							<name>
								<surname>Dadhwal</surname>
								<given-names>V. K.</given-names>
							</name>
							<name>
								<surname>Kumar</surname>
								<given-names>S.</given-names>
							</name>
							<name>
								<surname>Harsha</surname>
								<given-names>G. S.</given-names>
							</name>
							<name>
								<surname>Mitran</surname>
								<given-names>T.</given-names>
							</name>
							<name>
								<surname>Sujatha</surname>
								<given-names>G.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2016</year>). <article-title>Digital mapping of soil organic and inorganic carbon status in India</article-title>. <source>Geoderma</source>
						<volume>269</volume>, <fpage>160</fpage>&#x2013;<lpage>173</lpage>. <pub-id pub-id-type="doi">10.1016/j.geoderma.2016.02.002</pub-id>
					</citation>
				</ref>
				<ref id="B57">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Svetnik</surname>
								<given-names>V.</given-names>
							</name>
							<name>
								<surname>Liaw</surname>
								<given-names>A.</given-names>
							</name>
							<name>
								<surname>Tong</surname>
								<given-names>C.</given-names>
							</name>
							<name>
								<surname>Culberson</surname>
								<given-names>J. C.</given-names>
							</name>
							<name>
								<surname>Sheridan</surname>
								<given-names>R. P.</given-names>
							</name>
							<name>
								<surname>Feuston</surname>
								<given-names>B. P.</given-names>
							</name>
						</person-group> (<year>2003</year>). <article-title>Random forest: a classification and regression tool for compound classification and QSAR modeling</article-title>. <source>J. Chem. Inf. and Comput. Sci.</source>
						<volume>43</volume> (<issue>6</issue>), <fpage>1947</fpage>&#x2013;<lpage>1958</lpage>. <pub-id pub-id-type="doi">10.1021/ci034160g</pub-id>
					</citation>
				</ref>
				<ref id="B58">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Tarnocai</surname>
								<given-names>C.</given-names>
							</name>
							<name>
								<surname>Canadell</surname>
								<given-names>J. P.</given-names>
							</name>
							<name>
								<surname>Schuur</surname>
								<given-names>E. A. G.</given-names>
							</name>
							<name>
								<surname>Kuhry</surname>
								<given-names>P.</given-names>
							</name>
							<name>
								<surname>Mazhitova</surname>
								<given-names>G.</given-names>
							</name>
							<name>
								<surname>Zimov</surname>
								<given-names>S.</given-names>
							</name>
						</person-group> (<year>2009</year>). <article-title>Soil organic carbon pools in the north circumpolar permafrost region</article-title>. <source>Global Biogeochem. Cycle.</source>
						<volume>23</volume>, <fpage>GB203</fpage>. <pub-id pub-id-type="doi">10.1029/2008GB003327</pub-id>
					</citation>
				</ref>
				<ref id="B59">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Vasat</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Kodesova</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Burkova</surname>
								<given-names>L.</given-names>
							</name>
						</person-group> (<year>2017</year>). <article-title>Ensemble predictive model for more accurate soil organic carbon spectroscopic estimation</article-title>. <source>Comput. Geosci.</source>
						<volume>104</volume>, <fpage>75</fpage>&#x2013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1016/j.cageo.2017.04.008</pub-id>
					</citation>
				</ref>
				<ref id="B60">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Viscarra Rossel</surname>
								<given-names>R. A.</given-names>
							</name>
							<name>
								<surname>Webster</surname>
								<given-names>R.</given-names>
							</name>
						</person-group>. (<year>2012</year>). <article-title>Predicting soil properties from the Australian soil visible&#x2013;near infrared spectroscopic database</article-title>. <source>Eur. J. Soil Sci.</source>
						<volume>63</volume> (<issue>6</issue>), <fpage>848</fpage>&#x2013;<lpage>860</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2389.2012.01495.x</pub-id>
					</citation>
				</ref>
				<ref id="B61">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Vitharana</surname>
								<given-names>U. W. A.</given-names>
							</name>
							<name>
								<surname>Mishra</surname>
								<given-names>U.</given-names>
							</name>
							<name>
								<surname>Jastrow</surname>
								<given-names>J. D.</given-names>
							</name>
							<name>
								<surname>Matamala</surname>
								<given-names>R.</given-names>
							</name>
							<name>
								<surname>Fan</surname>
								<given-names>Z.</given-names>
							</name>
						</person-group> (<year>2017</year>). <article-title>Observational needs for estimating Alaskan soil carbon stocks under current and future climate</article-title>. <source>J. Geophys. Res. Biogeosci.</source>
						<volume>122</volume> (<issue>2</issue>), <fpage>415</fpage>&#x2013;<lpage>429</lpage>. <pub-id pub-id-type="doi">10.1002/2016JG003421</pub-id>
					</citation>
				</ref>
				<ref id="B62">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Wu</surname>
								<given-names>Z.</given-names>
							</name>
							<name>
								<surname>Wang</surname>
								<given-names>B.</given-names>
							</name>
							<name>
								<surname>Huang</surname>
								<given-names>J.</given-names>
							</name>
							<name>
								<surname>An</surname>
								<given-names>Z.</given-names>
							</name>
							<name>
								<surname>Jiang</surname>
								<given-names>P.</given-names>
							</name>
							<name>
								<surname>Chen</surname>
								<given-names>Y.</given-names>
							</name>
							<etal/>
						</person-group> (<year>2019</year>). <article-title>Estimating soil organic carbon density in plains using landscape metric-based regression Kriging model</article-title>. <source>Soil Till. Res.</source>
						<volume>195</volume>, <fpage>104381</fpage>. <pub-id pub-id-type="doi">10.1016/J.STILL.2019.104381</pub-id>
					</citation>
				</ref>
				<ref id="B63">
					<citation citation-type="journal">
						<person-group person-group-type="author">
							<name>
								<surname>Zhang</surname>
								<given-names>C.</given-names>
							</name>
							<name>
								<surname>Tang</surname>
								<given-names>Y.</given-names>
							</name>
							<name>
								<surname>Xu</surname>
								<given-names>X.</given-names>
							</name>
							<name>
								<surname>Kiely</surname>
								<given-names>G.</given-names>
							</name>
						</person-group> (<year>2011</year>). <article-title>Towards spatial geochemical modelling: use of geographically weighted regression for mapping soil organic carbon contents in Ireland</article-title>. <source>J. Appl. Geochem.</source>
						<volume>26</volume> (<issue>7</issue>), <fpage>1239</fpage>&#x2013;<lpage>1248</lpage>. <pub-id pub-id-type="doi">10.1016/j.apgeochem.2011.04.014</pub-id>
					</citation>
				</ref>
			</ref-list>
		</back>
	</article>
	