Surface ocean CO2 concentration and air-sea flux estimate by machine learning with modelled variable trends

Zeng, Jiye; Iida, Yosuke; Matsunaga, Tsuneo; Shirai, Tomoko

doi:10.3389/fmars.2022.989233

ORIGINAL RESEARCH article

Front. Mar. Sci., 14 September 2022

Sec. Global Change and the Future Ocean

Volume 9 - 2022 | https://doi.org/10.3389/fmars.2022.989233

Surface ocean CO₂ concentration and air-sea flux estimate by machine learning with modelled variable trends

Jiye Zeng^1*

Yosuke Iida²

Tsuneo Matsunaga¹

Tomoko Shirai¹

¹Earth Systems Division, National Institute for Environmental Studies, Tsukuba, Japan
²Atmosphere and Ocean Department, Japan Meteorological Agency, Tokyo, Japan

The global ocean is a major sink of anthropogenic carbon dioxide (CO₂) emitted into the atmosphere. Machine learning has been actively used in the past decades to estimate the oceanic sink, but it is still a challenge to obtain an accurate estimate due to scarcely available CO₂ measurements. One of the methods to deal with data scarcity was normalizing multiple years’ CO₂ values to a reference year to increase the spatial coverage. The practice assumed a constant CO₂ trend for the normalization. Here, we used three machine learning models to extract variable ocean CO₂ trends on a decadal scale and proposed a method to use the extracted ocean CO₂ trends to correct the decadal atmospheric CO₂ trends for data normalization. The method minimizes assumptions of using the extracted ocean CO₂ trends directly. Comparisons of our CO₂ flux estimate with machine learning products included in Global Carbon Budget 2021 indicates that using the variable trends improved the bias resulted from using a constant trend and that the trends are a critical factor for machine learning methods. Our dataset includes monthly distributions of surface ocean CO₂ concentration and air-sea flux in 1980-2020 with a spatial resolution of 1×1 degree.

Introduction

The oceans play a crucial role in mitigating the increase of atmospheric CO₂ emitted into the atmosphere by human activities (Sabine, 2004; Khatiwala et al., 2013; McKinley et al., 2016). Using machine learning to estimate the oceanic sink has been practiced in the past decades and the results have become an important part of the Global Carbon Budget (Friedlingstein et al., 2022). Nevertheless, it is still a challenge to obtain an accurate estimate due to scarcely available CO₂ measurements. Through internationally coordinated efforts, decades of in situ measurements have been combined to form high-quality databases, such as the Surface Ocean CO₂ Atlas Database (SOCAT) (Sabine et al., 2013; Pfeil et al., 2013; Bakker et al., 2016). The composite sampling map of SOCAT appears to cover most areas of the oceans. However, only a small portion of the oceans had samples in any single year and the samples were unevenly distributed in time and space. The dilemma of using multiple years’ data to train a machine learning model is that while ocean CO₂ tends to track the increase of atmospheric CO₂ closely (Fay and McKinley, 2013; Bates et al., 2014), the large seasonal and spatial variabilities up to a few hundred μatm make it difficult to detect the trends in the order of a few μatm per year. Current methods for solving the problem include normalizing ocean CO₂ to a reference year (Takahashi et al., 2009; Sasse et al., 2013a; Sasse et al., 2013b; Nakaoka et al., 2013; Zeng et al., 2014), including a linear time-dependent term in regression (Fay and McKinley, 2013; Iida et al., 2015; Jones et al., 2015; Watson et al., 2020; Iida et al., 2021), and including atmospheric CO₂ as a predictor to make models learn the trend implicitly (Landschützer et al., 2016; Denvil-Sommer et al., 2019; Gregor and Gruber, 2021; Chau et al., 2022). The former two methods assume a constant trend for the whole period. This can be a good approximation when the time span is short, but the error tends to become substantial in a long period as the trend could vary greatly with time. Landschützer et al. (2016) and Gloege et al. (2021) showed that such a problem could also exist in the third method.

There are two camps of using machine learning to reconstruct ocean CO₂ in terms of data pooling strategy. One camp treats the global oceans as one entity (Takahashi et al., 2009; Sasse et al., 2013b; Nakaoka et al., 2013; Zeng et al., 2014; Denvil-Sommer et al., 2019; Chau et al., 2022). The other camp divides the oceans into clusters with similar biogeochemical properties. Sasse et al. (2013a) and Landschützer et al. (2013) are early pioneers in this camp. They used a Self-Organization Map (SOM) for clustering in the first step and then used different regression methods in the second step for making predictions. This method, used by Landschützer et al. (2013) and named two-step method, was also applied by Laruelle et al. (2017); Watson et al. (2020), and Gloege et al. (2021). Other clustering methods include geographical blocking (Iida et al., 2015; Watson et al., 2020), K-mean clustering (Gregor et al., 2019; Gregor and Gruber, 2021), and CO₂ biome clustering (McKinley et al., 2011; Fay and McKinley, 2013; Gregor et al., 2019; Watson et al., 2020).

In this study, we used three machine learning models to extract the global time-dependent ocean CO₂ trends. They were used to correct the decadal atmospheric CO₂ trends to normalize ocean CO₂ measurements to a reference year for modelling the nonlinear dependence of CO₂ on biogeochemical predictors. Then, we reconstruct monthly CO₂ distributions between 1980 and 2020 with a spatial resolution of 1×1 degree. Our method is in the first camp discussed above. We compared the air-sea flux estimate with those included in the Global Carbon Budget 2021 (Friedlingstein et al., 2022). The results reveal that the ocean CO₂ trends are a critical factor for machine learning methods, which in turn implies the importance of having long-term observations to quantify the uptake, predict scenarios, and evolve adapting strategies.

Method

Model setup

Following Zeng et al. (2014) and considering the inconstancy that can be associated with long-term oceanic CO2 trends, we express the nonlinear dependence of ocean CO₂ on time and biogeochemical variables as:

\begin{array}{l} C O 2 W = f (S S T, d S S T, S S S, C H L, M L D, L A T, L O N) + f (y e a r), & (1) \end{array}

where SST stands for sea surface temperature, SSS for sea surface salinity, CHL for chlorophyll-a concentration, MLD for mixed layer depth, LAT for latitude, and LON for longitude. The sine and cosine converted values of LON were used to make the circular variable contingent. We replaced the month variable of Zeng et al. (2014) with the SST anomaly (dSST) against the annual mean to harmonize the seasons of the two hemispheres. The function of year represents the trends, which were a constant in Zeng et al. (2014).

We used machine learning to investigate the variable trends with varying lengths of data (Figure 1). A similar iteration method was also used by Zeng et al. (2014). For a given target year and data length, we fitted the dependence of CO2W on year by linear regression first. The first term in Eq.(1) was treated as an error in this step. Then we subtracted the trend from observations and used machine learning to model the nonlinear relationship between the residual and predictors. These two steps were repeated until the trend became stabilized. Initially, three years’ data were used: the target year plus and minus one year. The data length was increased to the longest available data length gradually. The longest data length was 41 years for the target year 2000, i.e., all data between 1980 and 2020 were included. The extracted trends were used as reference to model the decadal trends of atmospheric CO₂ by fitting its annual increase rates with the following harmonic function:

FIGURE 1

Figure 1 Flow chart of the iteration method for trend extraction.

\begin{array}{l} t r e n d = c_{0} + c_{1} y e a r + c_{2} cos (\frac{2 π y e a r}{T_{1}}) + c_{3} s i n (\frac{2 π y e a r}{T_{1}}) + c_{4} cos (\frac{2 π y e a r}{T_{2}}) + c_{5} s i n (\frac{2 π y e a r}{T_{2}}) . & (2) \end{array}

where T₁ and T₂ are time parameter in year. For training machine learning models, the atmospheric CO₂ trends obtained by Eq.(2) were used to normalize the observed CO₂ values to the reference year 2000 by the equation:

\begin{array}{l} C O 2 W^{n o r m} = C O 2 W^{r a w} (y e a r) \pm \sum_{i = 2000}^{y e a r} t r e n d (i), & (3) \end{array}

where ± is positive when i<2000 and negative when i>2000. At i=2000, the trend correction is zero. Global CO₂ concentrations were constructed by adding or subtracting the trend correction of Eq.(3) to the predicted CO₂ values. The process is the inverse of the normalization. Using atmospheric CO₂ trends for data normalization avoided problems in using oceanic CO₂ trends directly, e.g., insufficient data points in the early and later years and the difficulty of determining the best data length for trend extraction.

Models

We deployed three machine learning models: Random Forest (RF), Gradient Boost Machine (GBM), and Feedforward Neural Network (FNN). Using multiple models has the merit of mutual overfit checking and compensating model weakness with each other.

RF was proved to be a robust method for modelling carbon flux at the global scale (Zeng et al., 2020) and was applied to global ocean CO₂ mapping recently (Gregor et al., 2019). RF partitions a training dataset into subsets repeatedly by random sampling and uses the subsets to construct trees. We used the python library of Ranger (Wright and Ziegler, 2017) which implements the regression algorithm using a two-stage randomization procedure to partition trees. Given a subset, the root node in a tree is recursively split into binary nodes until the number of data points in the leaf nodes becomes no larger than a specified number. In each split, the RF randomly selects a subset of predictor variables and searches them for splitting points that minimize node impurity (Ishwaran, 2015). In making a prediction, a set of predictors are passed through branches of nodes according to the splitting rule until the journey ends up in a leaf node. The mean of the target variable in the leaf node is taken as an estimate. Then the mean estimate of all leaf nodes is used as the prediction. Sensitive configuration factors for the RF include the number of trees and the number of data points in the leaf nodes (Zeng et al., 2020). The default setting includes 500 trees and 5 data points. We raised the data points to 100 based on our experiments with the ocean CO₂ data discussed in the data section to prevent hot spots in predicted CO₂ in the southern oceans where vast empty areas exist in certain months. The configuration yielded good validation results.

A decision-tree-based GBM emerged in the ocean CO₂ mapping recently (Gregor et al., 2019; Gregor and Gruber, 2021). Like a RF, a GBM combines weak learners into a single strong learner (Natekin and Knoll, 2013), but in an iterative fashion. It adds trees one at a time, and existing trees are not changed. We used the python library of LightGBM (Ke et al., 2017). Instead of the level-wise strategy of the RF, the GBM grows a tree leaf-wise by splitting nodes that produce the highest loss change until the number of leaf nodes becomes no larger than a specified number. The observed values of the target variable are assigned to the leaf nodes of the first tree. Then, the residuals of the previous predictions minus observations are assigned to the leaf nodes of a subsequent tree. A gradient descent procedure is used to obtain parameters that improve the accuracy of predictions. By experimenting with our ocean CO₂ data and using the RF as a reference, we found that LightGBM performed well with 500 trees and a maximum number of 100 terminal nodes in a tree.

FNN has been used for ocean CO₂ mapping since the early 2010s (e.g., Landschützer et al., 2013; Zeng et al., 2014). FNN has a layered structure, including an input layer, one or more hidden layers, and an output layer. Neurons between adjacent layers are fully connected. A neuron in the hidden layer uses an activation function to transform the weighted sum of inputs to form an input for the neuron in the output layer, which in turn transforms the weighted sum of inputs to form a prediction. Details of the FNN method can be found in Svozil et al. (1997) and abundant other references. We used python’s MLPRegressor model with one hidden layer and 64 hidden neurons, which is the same as that used by Zeng et al. (2014). Their investigations show that the setting yielded uncertainties in the level of grid mean variation of measurements. We raised the default maximum training iterations of MLPRegressor from 200 to 500. Our tests indicate that when the iterations were larger than 300, doubling or tripling the number did not make a substantial change in the flux estimate. Training the FNN took a much longer time than training the RF and GBM. As the trend extraction and validation discussed later involve many rounds of training, we had to set a fixed number of training iterations so that the training could be completed within a reasonable amount of time. The settings also yielded results well harmonized with those of the RF and GBM.

We validated the model performance by a leave-one-year-out (LOYO) method. Given N years of data, N validations were done by setting aside one year’s data for validation and using the remaining N-1 years of data for training. A model’s performance was evaluated by the mean bias. The validation method has an advantage over the conventional n-fold method in that the validation data of LOYO are more likely to come from unsampled domains of the training data. Another advantage is that LOYO can also be used to detect trends. If the target variable has an increasing trend, a model trained with data in early years tends to make predictions smaller than the observations in later years and vice versa.

Data

We extracted monthly CO₂ fugacity (fCO₂) in 1×1-degree grids from the track-gridded database of SOCAT version 2021 (Sabine et al., 2013; Pfeil et al., 2013; Bakker et al., 2016). We relaxed the criteria of Zeng et al. (2014) to include data when fCO₂ values are between 50 μatm and 1000 μatm and salinity is larger than 15 g kg^-1. A total of 273,456 data points were extracted for 1980-2020. We confined the fCO₂ training data set to post-1980 due to large uncertainties in the early measuring techniques (Sasse et al., 2013). The sources of predictor variables are shown in Table 1. The monthly climatology of MODIS-AQUA and MODIS-TERRA of 2002-2019 in 0.083×0.083-degree grids (Hu et al., 2012.) were combined and re-binned into 1x1-degree grids. The values of CHL and MDL were scaled by log(1+CHL) and log(1+MDL) to reduce the skewness of sample distribution. Filling missing CHL data in high latitudes with a small constant is a common practice (Gregor and Gruber, 2021; Chau et al., 2022). We filled the missing CHL in a grid with the smallest observed value in that grid. The data of SST (Ishii et al., 2005) and SSS (Zweng et al., 2019) were used without pre-processing. For flux calculation, we used the wind speed (WIND) and surface pressure (Ps) of the fifth generation ECMWF atmospheric reanalysis of the global climate (ERA5) (Hersbach et al., 2020), and the mole fraction of air CO₂ (xCO2A) of NOAA’s Marine Boundary Layer Reference (Conway et al., 1994; Dlugokencky et al., 2021). The monthly WIND and Ps in 0.25×0.25 degrees were averaged to 1×1 degree. The surface xCO2A in sine latitude grids was interpolated to 1×1 degree.

TABLE 1

Table 1 Data sources.

Comparison

We compared our estimates with seven machine learning products included in GCB-2021 (Table 2). We recalculated their fluxes by the same procedure to eliminate the effect of using different flux dependent data and coefficients. As each product has a different spatial coverage, we adjusted their annual fluxes using the equation

TABLE 2

Table 2 Datasets for comparison.

\begin{array}{l} F_{a d j u s t e d} = F_{m o d e l} + (F_{M L 3} - {F'}_{M L 3}^{}}), & (4) \end{array}

where F_ML3 (PgC a^-1) is the mean annual flux of NIES-ML3 (the ensemble mean of the three models) in the available period of a pair of products under comparison and F’_ML3 (PgC a^-1) is the mean annual flux of NIES-ML3 in the grids where both products have data. The adjustment was intended to bring fluxes with different spatial coverages to the same coverage as the NIES-ML3.

Results and discussion

CO₂ trend

The ocean CO₂ trends obtained using the iteration method in Figure 1 with the longest available data are shown in Figure 2A along with the annual increase rates of global atmospheric CO₂ concentrations (ppm) (Friedlingstein et al., 2022). Because of data scarcity, the extracted trends fluctuate dramatically when the data length is short and converge gradually (Figure 2B). Sutton et al. (2019) pointed out that the number of years of observations needed (YON) to detect a statistically significant trend over variability ranges from 8 to 15 years at several open ocean sites. It is reasonable to assume the same YON range for open oceans. Ideally, the trend for a year should be extracted with the shortest data length possible. As it is difficult to determine the smallest stabilization length and the trend does not change much after 10 to 15 years, we presented the trend with the maximum data length. The extracted trends appear to track the decadal trends of the atmospheric CO₂ in 1990-2015 obtained by Eq.(2) with T_{1 =} 20 year and T_{2 =} 40 year.

FIGURE 2

Figure 2 Trends extraction. (A) The trend of ocean CO₂ for a target year (blue) was estimated by using the iteration method with the longest data length around the year. The final trends to be used for data normalization (magenta) are the corrected function fitting trends (orange) of the annual increase rate of air CO₂ (cyan). (B) Examples of trend variations with data length for the target years 1998-2002. (C) The trend of CO₂ biases (prediction – observation) detected by LOYO with CO₂ data normalized by the uncorrected decadal trends of air CO₂. The vertical lines show the standard residuals of the regression.

We applied the LOYO method to the data normalized by the trends of Eq.(2). A small trend of 0.1565 μatm a^-1 exists in the residual of model prediction minus observations (Figure 2C). The p-value of the trend is 9×10^-6, indicating that the trend is significant. We subtracted the residual trend from the decadal air CO₂ trends and yielded the trends shown in magenta line in Figure 2A. The numerical values of the corrected trends are listed in Table 3. They were used for final data normalization. The corrected trends agree well with the extracted ocean CO₂ trends in 1996-2013, during which the data length used to extract ocean CO₂ trends is longer than the YON of Sutton et al. (2019). The corrected trends in early 2000s are close to the those obtained by Sutton et al. (2019) for the time series station WHOTS in the subtropical North Pacific and Stratus in the South Pacific gyre in 2004-2013. Although the corrected trends before 1997 are smaller than those used by Takahashi et al. (2009) and Zeng et al.,(2014) for data normalization, they are within the range of trends summarized by Takahashi et al. (2009).

TABLE 3

Table 3 Trends for data normalization and LOYO validation results.

Validation and uncertainty

The performances of the three models were evaluated by the LOYO method with the normalized CO₂. The validation yields small biases (prediction minus observation) and good correlation coefficients. The annual mean bias ranges between -4.82 and 3.79 μatm for RF, between -4.34 and 3.92 μatm for GBM, and between -5.29 and 4.95 μatm for FNN (Table 3). Their mean biases are -0.36 μatm, -0.24 μatm, and -0.27 μatm, respectively. The correlation coefficient R² in individual years ranges between 0.50 and 0.90 for RF, between 0.49 and 0.88 for GBM, and between 0.43 and 0.89 for FNN. Their mean R² are 0.70, 0.69, and 0.62, respectively. Figure 3 shows the goodness of fit of the three models. The bias and R² in the figure were calculated directly using all validation data points and therefore are equivalent to the weighted mean bias and R² in Table 3.

FIGURE 3

Figure 3 Model predictions vs observations of ocean CO2 fugacity using normalized data with the trends in Table 3. Predictions come from 41 validations for each target year between 1980 and 2020. The colour indicates the density of data points. (A) Results of the RF model. (B) Results of the GBM model. (C) Results of the FNN model.

At the CO₂ level in year 2000, one unit CO₂ change results in a flux change of 0.19 PgC a^-1. We calculated the flux uncertainties approximately by multiplying this value with the biases in Table 3. For the RF model, the uncertainty ranges from -0.93 PgC a^-1 to 0.72 PgC a^-1 and the mean is -0.07 PgC a^-1. The GBM model has a smaller uncertainty range from -0.83 PgC a^-1 to 0.74 PgC a^-1 and a mean of -0.05 PgC a^-1. The FNN model has the largest uncertainty range from -1.01 PgC a^-1 to 0.94 PgC a^-1 and a mean of -0.05 PgC a^-1.

Comparison

The fluxes of the products in Table 2 were recalculated by equations in the Appendix and adjusted by Eq.(4) for comparison. The offset added to the products is 0.00 μatm for NIES-NN and JENA, 0.22 μatm for JMA-MLR, 0.01 μatm for MPI-SOMFFN, 0.34 μatm for CMEMS-FFNN, 0.07 μatm for CSIR-ML6, and 0.01 μatm for OceanSODA-ETHZ.

The difference between NIES-ML3 and NIES-NN is small in 1991-2006 but much larger in the early and late years (Figure 4A). NIES-NN used a constant trend of 1.54 μatm a^-1 to normalize data to the reference year 2000. The trend is larger than those in Table 3 before 1998 and smaller after that year. This resulted in larger reconstructed CO₂ in the periods. The bias caused larger flux estimates in the years further away from the reference year. In 1985 and 2019, NIES-NN flux is larger than that of NIES-ML3 by 0.536 PgC a^-1 and 1.057 PgC a^-1 respectively. The latter is close to half of the fluxes in recent years. Near the reference year of 2000, NIES-NN is smaller than NIES-ML3 in the order of 0.1 PgC a^-1. The JMA-MLR product (Iida et al., 2021) also shows an arch-shaped flux trend like NIES-NN does. Again, the differences are larger in the early and late years of the comparison period, especially in the 1990s. This is expected as the regression method of JMA-MLR includes a linear term of time for each geographic box, which is equivalent to using a constant trend for data normalization. The flux estimate of JMA-MLR is larger than that of NIES-ML in all years. In 1990 to 2020, JMA-MLR flux is larger than that of NIES-ML3 by 0.699 PgC a^-1 and 0.506 PgC a^-1 respectively. They are about a quarter of the flux in recent years.

FIGURE 4

Figure 4 Comparisons with machine learning products included in GSB-2021. Fluxes were recalculated by the same method and adjusted to have the same spatial coverage of NIES-ML3. (A) Variations of annual fluxes with time. (B) Mean differences of third-party products minus NIES-ML3 in the whole available period, the early half years, and later half years.

Instead of using explicit trends to normalize data or including a linear term of time in the regression, MPI-SOMFFN (Landschützer et al., 2016), CMEMS-FFNN (Chau et al., 2022), CSIR-ML6 (Gregor et al., 2019), and OceanSODA-ETHZ (Gregor and Gruber, 2021) used atmospheric CO₂ as a predictor so that their models could learn the trends implicitly. Their flux trend patterns indicate different implicit CO₂ trends. While the fluxes of MPI-SOMFFN and OceanSODA-ETHZ remain rather flat before 2000 and then increase with time, the fluxes of CMEMS-FFNN and CSIR-ML6 show a trend before the early 1990s and after 2000 and remain at a similar level in between. The JENA method (Rödenbeck et al., 2013) involves an inversion model for the ocean chemistry coupled with the atmospheric CO₂ of an atmospheric transport model. It has the largest inter-annual flux variations. Note that all products except for JENA are monthly with a 1×1-degree spatial resolution. We calculated the monthly mean CO₂ of JENA in 2×2.5-degree grids using its daily dataset and then filled the 1x1-degree grids with values in the nearest source grids. A different averaging and re-gridding method may yield a different result. The day of the year is also a predictor of CSIR-ML6.

Figure 4B reveals several patterns in the flux differences between NIES-ML3 and other products. The black, cyan, and orange bars represent the mean of a product minus NIES-ML3 in the whole available period, in the early half, and the latter-half years, respectively. NIES-ML3 agrees with CSIR-ML6 and OceanSODA-ETHZ the most in terms of the overall mean difference, which is -0.050 PgC a^-1 and -0.053 PgC a^-1, respectively. Their p-values of two tailed t-test with a significance level of 95%, 0.615 and 0.478 respectively, indicate that the differences are insignificant. While the flux of CSIR-ML6 is smaller than that of NIES-ML3 in the early-half years and larger in the latter-half years, the long-term change of NIES-ML3 is more consistent with that of OceanSODA-ETHZ. The differences between NIES-ML3 and MPI-SOMFFN (0.088 PgC a^-1, p-value=0.304), and between NIES-ML3 and CMEMS-FFNN (-0.121 PgC a^-1, p-value=0.151) are moderate but insignificant. The fluxes of NIES-NN and JMA-MLR are much larger than that of NIES-ML3, by 0.240 PgC a^-1 (p-value=0.030) and 0.267 PgC a^-1 (p-value=0.000) respectively, especially in the latter-half years of NIES-NN and the early-half years of JMA-MLR. The largest difference was between JENA and NIES-ML3 (-0.322 PgC a^-1, p-value=0.000). Overall, the difference between NIES-ML3 and other products is small, about 0.007 PgC a^-1.

Conclusion

Our results point out that the ocean CO₂ trends are an important factor affecting the global ocean CO₂ reconstruction and flux estimate by machine learning methods. So far, explicit trend methods assumed a constant trend. They yielded much larger flux estimates than most implicit methods in the early and later years of the modelled period. Because the ocean CO₂ trends tend to track the trends of air CO₂ and the later increased with time, using a constant ocean CO₂ trend tends to underestimate the concentration in those years. We proposed a new method to use variant trends for an explicit method that applies machine learning to trend removed data. On average, our flux estimates are significantly lower those of NIES-NN and JMA-MLR. Comparing to the implicit methods, they are smaller than that of MPI-SOMFFN but larger than those of CSIR-ML6, OceanSODA-ETHZ and CMEMS-FFNN. Even though the differences of implicit methods are less significant than those of the explicit methods, the fluxes among the implicit methods depart substantially in early and recent years. This reveals that the ocean CO₂ trends obtained by the implicit methods could be largely different. All the implicit methods regrouped data by clustering. While the merit point of clustering to regroup data by their biogeochemical properties have been stressed, its demerit point of worsening the data scarcity problem was rarely discussed. Therefore, our results are expected not only to enhance the accuracy of flux estimate by machine learning but also to provide a reference to investigate the trend differences of the implicit methods.

Data availability statement

The modelled results can be obtained from https://www.nies.go.jp/doi/10.17595/20220311.001-e.html. Other data and code used in the study may be provided upon request. Further inquiries can be directed to the corresponding author.

Author contributions

JZ: Model experiment design, data processing, and draft manuscript. YI: Flux comparisons. TM: Advice on satellite data. TS: Result checking and advice on carbon budget issues. All authors contributed to the article and approved the submitted version.

Acknowledgments

This study was partly supported by NIES GOSAT and GOSAT-2 projects. The Surface Ocean CO₂ Atlas (SOCAT) is an international effort, endorsed by the International Ocean Carbon Coordination Project (IOCCP), the Surface Ocean Lower Atmosphere Study (SOLAS), and the Integrated Marine Biosphere Research (IMBeR) program, to deliver a uniformly quality-controlled surface ocean CO₂ database. We thank the many researchers and funding agencies responsible for the collection of data and quality control for their contributions to SOCAT.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Bakker D. C. E., Pfeil B., Landa C. S., Metzl N., O’Brien K. M., Olsen A., et al. (2016). A multi-decade record of high-quality CO2 data in version 3 of the surface ocean CO2 atlas (SOCAT). Earth Syst. Sci. Data 8, 383–413. doi: 10.5194/essd-8-383-2016