<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mar. Sci.</journal-id>
<journal-title>Frontiers in Marine Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mar. Sci.</abbrev-journal-title>
<issn pub-type="epub">2296-7745</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmars.2023.1121334</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Marine Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Data driven three-dimensional temperature and salinity anomaly reconstruction of the northwest Pacific Ocean</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Chen</surname>
<given-names>Yuanhong</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2273454/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Liu</surname>
<given-names>Li</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2133963"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Chen</surname>
<given-names>Xueen</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/112877"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wei</surname>
<given-names>Zhiqiang</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sun</surname>
<given-names>Xiang</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2190985"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Yuan</surname>
<given-names>Chunxin</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2025026"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Gao</surname>
<given-names>Zhen</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2133384"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>School of Mathematical Sciences, Ocean University of China</institution>, <addr-line>Qingdao</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Frontier Science Center for Deep Ocean Multispheres and Earth System (FDOMES) and Physical Oceanography Laboratory, Ocean University of China</institution>, <addr-line>Qingdao</addr-line>, <country>China</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>College of Information Science and Engineering, Ocean University of China</institution>, <addr-line>Qingdao</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Francisco Mach&#xed;n, University of Las Palmas de Gran Canaria, Spain</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Lin Mu, Shenzhen University, China; Xianqing Lv, Ocean University of China, China; Kaijun Ren, National University of Defense Technology, China</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Zhen Gao, <email xlink:href="mailto:zhengao@ouc.edu.cn">zhengao@ouc.edu.cn</email>
</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Physical Oceanography, a section of the journal Frontiers in Marine Science</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>04</day>
<month>05</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>10</volume>
<elocation-id>1121334</elocation-id>
<history>
<date date-type="received">
<day>04</day>
<month>01</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>03</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Chen, Liu, Chen, Wei, Sun, Yuan and Gao</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Chen, Liu, Chen, Wei, Sun, Yuan and Gao</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>By virtue of the rapid development of ocean observation technologies, tens of petabytes of data archives have been recorded, among which, the largest portion are those derived from the orbital satellites, embodying the character of ocean surface. Nevertheless, the insufficiency of information below the subsurface restricts the utilization of these data and the understanding of ocean dynamics. To circumvent these difficulties, we present the spatially three-dimensional reconstruction of ocean hydrographic profiles at depth based on the satellites and in-situ measurement data. In this manuscript, long short-term memory network (LSTM) and Gaussian process regression (GPR) methods are invoked to predict the temperature and salinity profiles in the northwest Pacific Ocean, and to improve computational and storage efficiency, the proper orthogonal decomposition (POD) method is subtly incorporated into these two models. LSTM and GPR show satisfactory results,  with the root mean square error (RMSE) of temperature is less than 1.45, and the RMSE of salinity is less than 0.19. The incorporation of the POD method substantially accelerates efficiency, particularly in the LSTM model, which improves 7.5-fold without significant accuracy loss. The sensitivity of different sea surface parameters on the reconstructed profiles reveals that sea surface height anomaly and latitude significantly influence the reconstruction of  temperature anomaly (TA) and salinity anomaly (SA) profiles. Besides, sea surface salinity and sea surface temperature anomalies can improve the model's estimation ability for the upper TAs and SAs, respectively. The contribution of monthly climatology to temperature and salinity profile estimation is also explored in this paper. It is shown that adding monthly mean climatology to the input of the model can achieve more accurate estimates.</p>
</abstract>
<kwd-group>
<kwd>Argo</kwd>
<kwd>satellite observation</kwd>
<kwd>data reconstruction</kwd>
<kwd>reduced-order model</kwd>
<kwd>long short-term memory network</kwd>
<kwd>Gaussian process regression</kwd>
</kwd-group>
<contract-num rid="cn002">202265005</contract-num>
<contract-sponsor id="cn001">National Key Research and Development Program of China<named-content content-type="fundref-id">10.13039/501100012166</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">Fundamental Research Funds for Central Universities of the Central South University<named-content content-type="fundref-id">10.13039/501100012476</named-content>
</contract-sponsor>
<contract-sponsor id="cn003">Taishan Scholar Project of Shandong Province<named-content content-type="fundref-id">10.13039/501100010040</named-content>
</contract-sponsor>
<counts>
<fig-count count="10"/>
<table-count count="0"/>
<equation-count count="12"/>
<ref-count count="73"/>
<page-count count="14"/>
<word-count count="7384"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<label>1</label>
<title>Introduction</title>
<p>Ocean is an integral part of the global climate system and plays a crucial role in regulating climate change and balancing the Earth&#x2019;s energy (<xref ref-type="bibr" rid="B61">Su et&#xa0;al., 2018</xref>). Knowledge of the vertical distribution of ocean temperature and salinity is significant for exploring the complex dynamical processes and ecosystems within the ocean (<xref ref-type="bibr" rid="B55">Rao and Sivakumar, 2003</xref>; <xref ref-type="bibr" rid="B70">Wilson and Coles, 2005</xref>; <xref ref-type="bibr" rid="B13">de Boyer Mont&#xe9;gut et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B23">Helber et&#xa0;al., 2010</xref>; <xref ref-type="bibr" rid="B44">Meehl et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B52">Qin et&#xa0;al., 2015</xref>). However, the currently accumulated vertical temperature and salinity data is far from sufficient, and the problem of sparseness and discontinuity of the observed data due to the limitation of the number of observation points has severely limited the study of ocean processes and mechanisms (<xref ref-type="bibr" rid="B31">Klemas and Yan, 2014</xref>; <xref ref-type="bibr" rid="B32">Liu, 2016</xref>). Although the rapid development of satellite remote sensing technology has made it possible to provide more and more high-resolution, multi-scale and long-term continuous observation data, these data are limited to the ocean&#x2019;s surface. They cannot provide spatial and temporal continuous information on the subsurface structure of the ocean (<xref ref-type="bibr" rid="B1">Ali et&#xa0;al., 2004</xref>; <xref ref-type="bibr" rid="B71">Wu et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B3">Bao et&#xa0;al., 2018</xref>). One fact is that the sea surface state is closely related to the subsurface features. According to the laws of ocean dynamics, many deep dynamic processes still generate signals at the sea surface, which satellite sensors can capture (<xref ref-type="bibr" rid="B14">Fiedler, 1988</xref>). Therefore, various underwater temperature and salinity reconstruction methods combining <italic>in situ</italic> observation data and satellite remote sensing data have been developed over the years.</p>
<p>One typical approach to reconstructing the internal structure from sea surface information is based on dynamics. Assimilation of observational data in numerical simulations (<xref ref-type="bibr" rid="B19">Ghil and Malanotte-Rizzoli, 1991</xref>; <xref ref-type="bibr" rid="B65">Troccoli and Haines, 1999</xref>; <xref ref-type="bibr" rid="B66">Vossepoel and Behringer, 2000</xref>; <xref ref-type="bibr" rid="B9">Carrassi et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B48">Moore et&#xa0;al., 2019</xref>) is a typical dynamics approach to inverse ocean subsurface information. However, this often requires many computing resources, and uncertainties in the initial and forced fields make the estimation accuracy impossible to guarantee (<xref ref-type="bibr" rid="B57">Robinson and Lermusiaux, 2000</xref>). Using a simplified dynamical framework can improve computational efficiency to some extent. Held et&#xa0;al. (<xref ref-type="bibr" rid="B28">Isern-Fontanet et&#xa0;al., 2006</xref>) proposes a method based on surface quasi-geostrophic (SQG) (<xref ref-type="bibr" rid="B24">Held et&#xa0;al., 1995</xref>) that derives subsurface density from sea surface density, sea surface height, and historical buoyancy frequency profiles. Further, Lapeyre and Klein (<xref ref-type="bibr" rid="B33">Lapeyre and Klein, 2006</xref>) developed an effective SQG (eSQG), assuming that the ocean interior&#x2019;s potential vorticity (PV) is correlated with the surface density. eSQG was shown to be effective in improving subsurface flow field reconstructions (<xref ref-type="bibr" rid="B29">Isern-Fontanet et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B53">Qiu et&#xa0;al., 2016</xref>). Based on SQG, Wang et&#xa0;al. (<xref ref-type="bibr" rid="B37">Liu et&#xa0;al., 2017</xref>) proposed the internal + SQG (isQG) method, which superimposes the SQG mode with the positive and first oblique pressure modes to achieve subsurface density reconstruction by solving the surface quasi-geostrophic equation and the internal equation. The method&#x2019;s effectiveness has been verified in different studies (<xref ref-type="bibr" rid="B38">Liu et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B37">Liu et&#xa0;al., 2017</xref>). However, due to the simplified model, some complex dynamical processes in the ocean are neglected (<xref ref-type="bibr" rid="B39">Liu et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B47">Meng and Yan, 2022</xref>), and the methods based on SQG can directly invert the density field, which will introduce additional errors when reconstructing the temperature and salinity fields (<xref ref-type="bibr" rid="B11">Chen et&#xa0;al., 2020</xref>).</p>
<p>Statistical methods are also widely used to reconstruct the three-dimensional structural field of the ocean. In earlier studies, linear regression (<xref ref-type="bibr" rid="B69">Willis et&#xa0;al., 2003</xref>; <xref ref-type="bibr" rid="B49">Nardelli and Santoleri, 2004</xref>; <xref ref-type="bibr" rid="B21">Guinehut et&#xa0;al., 2012</xref>) and least squares regression (<xref ref-type="bibr" rid="B7">Carnes et&#xa0;al., 1990</xref>; <xref ref-type="bibr" rid="B8">Carnes et&#xa0;al., 1994</xref>) were used to estimate deep ocean information. Besides, methods based on empirical orthogonal functions (EOF) (<xref ref-type="bibr" rid="B42">Maes et&#xa0;al., 2000</xref>; <xref ref-type="bibr" rid="B45">Meinen and Watts, 2000</xref>; <xref ref-type="bibr" rid="B6">Buongiorno Nardelli and Santoleri, 2005</xref>; <xref ref-type="bibr" rid="B72">Yan et&#xa0;al., 2020</xref>) are widely used to reconstruct the subsurface vertical structure. These methods use EOF to decompose the ocean vertical state vector, retain a few major modes, and then use least-squares regression or variational method (<xref ref-type="bibr" rid="B16">Fujii and Kamachi, 2003a</xref>; <xref ref-type="bibr" rid="B17">Fujii and Kamachi, 2003b</xref>) to solve for the objective function controlled by the sea surface information and the major modes. With the development of artificial intelligence techniques and machine learning methods, an increasing number of studies are focusing on the potential of machine learning methods in three-dimensional temperature and salinity field reconstruction. These methods can effectively mine the intrinsic patterns between data and estimate the structure of physical quantities in the ocean interior from sea surface parameters. Currently, self-organization mapping (<xref ref-type="bibr" rid="B71">Wu et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B12">Chen et&#xa0;al., 2018</xref>), support vector machine regression (<xref ref-type="bibr" rid="B63">Su et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B34">Li et&#xa0;al., 2017</xref>), random forest (<xref ref-type="bibr" rid="B61">Su et&#xa0;al., 2018</xref>), and neural network-based methods (<xref ref-type="bibr" rid="B1">Ali et&#xa0;al., 2004</xref>; <xref ref-type="bibr" rid="B2">Ballabrera-Poy et&#xa0;al., 2009</xref>; <xref ref-type="bibr" rid="B3">Bao et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B40">Lu et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B5">Buongiorno Nardelli, 2020</xref>; <xref ref-type="bibr" rid="B64">Su et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B62">Su et&#xa0;al., 2021</xref>) have been applied to estimate three-dimensional thermohaline fields. The results show that the machine learning methods can achieve better reconstruction based on a large amount of observation data and have strong generalizability.</p>
<p>In this context, this paper applies several different regression methods to estimate the subsurface temperature anomaly (TA) and subsurface salinity anomaly (SA). The first method used is Gaussian process regression (GPR) (<xref ref-type="bibr" rid="B68">Williams and Rasmussen, 1995</xref>; <xref ref-type="bibr" rid="B56">Rasmussen and Williams, 2005</xref>), an effective tool widely used in complex real-world problems (<xref ref-type="bibr" rid="B60">Stein, 1999</xref>; <xref ref-type="bibr" rid="B15">Forrester et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B50">Nguyen and Peraire, 2016</xref>). GPR is flexible enough to obtain estimates of unknown quantities using simple matrix operations and often achieves reliable accuracy on small data sets. More critically, it can effectively measure the uncertainty in the prediction because it gives the distribution of the predicted values (<xref ref-type="bibr" rid="B56">Rasmussen and Williams, 2005</xref>). The second one is the long short-term memory network (LSTM) (<xref ref-type="bibr" rid="B27">Hochreiter and Schmidhuber, 1997</xref>), a deep learning algorithm that can learn long-time dependencies (<xref ref-type="bibr" rid="B58">Sak et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B67">Wan et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B73">Yeo and Melnyk, 2019</xref>). However, training LSTM is often computationally expensive (<xref ref-type="bibr" rid="B43">Masuko, 2017</xref>), which is more fully reflected in ocean applications with &#x201d;big data&#x201d; characteristics. Therefore, to reduce the training time and save storage costs, we further propose LSTM-POD and GPR-POD to predict the vertical distribution of TA and SA by introducing POD (<xref ref-type="bibr" rid="B35">Liang et&#xa0;al., 2002</xref>), a widely used tool for reduced order modeling (<xref ref-type="bibr" rid="B41">Lucia et&#xa0;al., 2004</xref>; <xref ref-type="bibr" rid="B54">Quarteroni et&#xa0;al., 2015</xref>). Specifically, POD can achieve simplification and dimensionality reduction of the dataset by identifying the few main modes that represent the three-dimensional temperature and salinity fields with relative precision. And then the TA and SA profiles can be approximated as linear combinations of these modes, we only need to learn the relationship between the coefficients and the input parameters by LSTM and GPR, which greatly simplifies the regression model. Therefore, the first goal of this paper is to explore the reconstruction accuracy of different models and the degree of improvement of the POD on computational efficiency. The reliability and computational efficiency of the proposed methods is verified by calculating the root mean square error (RMSE) between the estimated TA and SA profiles and the Argo thermohaline anomaly profiles, as well as comparing the CPU time of different methods. In addition, we point out that it is only necessary to interpolate the modes about the depth, and the linear combination of the interpolated modes according to the predicted coefficients can be used to obtain TA and SA estimates for any depth.</p>
<p>In particular, the effects of different parameters on TA and SA estimates are also investigated by comparing the prediction accuracy of models with various input parameters and calculating the correlation coefficients between input parameters and the temperature and salinity anomalies. In addition, this paper evaluates the potential application of climatology in temperature and salinity reconstruction using different combinations of temperature and salinity climatology data.</p>
<p>This paper is organized as follows. The study area and the data we used are presented in Section 2. Section 3 gives an overview of the methodology used in this paper. Section 4 is devoted to the description of the results, we draw conclusions in section 5.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Data</title>
<p>The goal of this paper is to reconstruct the three-dimensional thermohaline structure of the Northwest Pacific Ocean (95&#xb0; W-135&#xb0; W, 5&#xb0; S-45&#xb0; N) for the period 2011-2021. The sources of ocean observations used in this study are: Argo data, satellite sea surface temperature (SST), satellite sea surface salinity (SSS), and sea surface height anomaly (SSHA), which are described below. The climatology of the World Ocean Atlas 2018 (WOA18), used as the monthly mean of climatology is also presented.</p>
<sec id="s2_1">
<label>2.1</label>
<title>Argo data</title>
<p>The Argo profiles are obtained from Global Sea Ocean Argo Scatter Dataset (V3.0) (<xref ref-type="bibr" rid="B36">Liu et&#xa0;al., 2021</xref>) provided by the China Argo Real-time Data Center (<ext-link ext-link-type="uri" xlink:href="ftp://ftp.argo.org.cn/pub/ARGO/global/">ftp://ftp.argo.org.cn/pub/ARGO/global/</ext-link>). This dataset collects more than 2.3 million temperature and salinity depth profiles observed by over 15,000 automated profiling buoys deployed in the global ocean by international Argo member countries during July 2000 through June 2020. Data inside the region of 95&#xb0; W-135&#xb0; W and 5&#xb0; S-45&#xb0; N for the period 2011-2021 are considered here. Profiles with depths exceeding 700 m are selected, and the profiles are interpolated through a spline into the same 71 vertical levels extending from the surface (5 m) to 705 m (vertical step is 10 m).</p>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Satellite SST data</title>
<p>The SST data of this study are created by the OSTIA (Operational SST and Ice Analysis) system, using re-processed ESA SST CCI, C3S EUMETSAT and REMSS satellite data and <italic>in situ</italic> data from the HadIOD dataset, distributed through the Copernicus Marine Environment Monitoring Service (CMEMS, <ext-link ext-link-type="uri" xlink:href="http://marine.copernicus.eu/services-portfolio/access-to-products/">http://marine.copernicus.eu/services-portfolio/access-to-products/</ext-link>, product_id = SST_GLO_SST_L4_REP_OBSERVATIONS_010_011). This product provides daily maps of the SST and SST uncertainty on a global regular grid at 0.05&#xb0; resolution, which are stored using the netCDF format using the Group for High Resolution SST specification.</p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Satellite SSS data</title>
<p>The SSS data is a Level 4 products on a 0.25 degree spatial and 4-day temporal grid produced by the International Pacific Research Center (IPRC) at the University of Hawaii at Manoa in collaboration with the Santa Rosa Remote Sensing System (RSS) in California in conjunction with observations from NASA&#x2019;s Aquarius/SAC-D and Soil Moisture Active (SMAP) satellite missions. The product is a continuous, consistent multi-satellite SSS data obtained by optimal interpolation with a 7-day decorrelation time scale (<xref ref-type="bibr" rid="B46">Melnichenko et&#xa0;al., 2016</xref>). Their mean root mean squared difference from globally synchronized <italic>in situ</italic> data is about 0.19 psu and the product bias is about zero.</p>
</sec>
<sec id="s2_4">
<label>2.4</label>
<title>Satellite SSHA data</title>
<p>The altimeter sea level anomalies with daily and 0.25&#xb0;&#xd7;0.25&#xb0; resolutions are provided by Sea Level TAC (Thematic Assembly Centre, <ext-link ext-link-type="uri" xlink:href="https://resources.marine.copernicus.eu/product-detail/SEALEVEL_GLO_PHY_L4_MY_008_047/DATA-ACCESS">https://resources.marine.copernicus.eu/product-detail/SEALEVEL_GLO_PHY_L4_MY_008_047/DATA-ACCESS</ext-link>). The data produced in the frame of this TAC are generated by the processing system including data from all altimeter Copernicus missions (Sentinel-6A, Sentinel-3A/B) and other collaborative or opportunity missions (e.g.: Jason-3, Saral[-DP]/AltiKa, Cryosat-2, OSTM/Jason-2, Jason-1, Topex/Poseidon, Envisat, GFO, ERS-1/2, Haiyang-2A/B/C).</p>
</sec>
<sec id="s2_5">
<label>2.5</label>
<title>Climatology data</title>
<p>The climate data used in this study are from World Ocean Atlas 2018 (<xref ref-type="bibr" rid="B4">Boyer et&#xa0;al., 2018</xref>) (<ext-link ext-link-type="uri" xlink:href="https://www.ncei.noaa.gov/products/world-ocean-atlas">https://www.ncei.noaa.gov/products/world-ocean-atlas</ext-link>), which is provided by the National Oceanographic Data Center (now the National Centers for Environmental Information - NCEI)(<xref ref-type="bibr" rid="B4">Boyer et&#xa0;al., 2018</xref>) (<ext-link ext-link-type="uri" xlink:href="https://www.ncei.noaa.gov/products/world-ocean-atlas">https://www.ncei.noaa.gov/products/world-ocean-atlas</ext-link>). The atlas is an objectively analyzed, quality-controlled collection of temperature, salinity, oxygen, phosphate, silicate, and nitrate averages based on profile data from the World Ocean Database and distributed online by NCEI. Monthly climatology fields of temperature and salinity at standard depth levels at a spatial resolution of 0.25&#xb0;&#xd7;0.25&#xb0; were used in this study and interpolated by cubic spline onto regularly spaced vertical grids (10 m apart).</p>
<p>Note that all satellite data and climatology data are interpolated to the same spatial distribution as the Argo observations by bilinear interpolation in the present study. And anomalies are defined as the observation data (SST,SSS,Argo) subtracted by the monthly WOA18 data.</p>
</sec>
</sec>
<sec id="s3">
<label>3</label>
<title>Method</title>
<sec id="s3_1">
<label>3.1</label>
<title>Gaussian process regression</title>
<p>In this section, GPR is used to estimate the vertical profiles of TA and SA. The SAs and TAs in each level are considered as a collection of some random variables and obey a joint Gaussian distribution, defined as <inline-formula>
<mml:math display="inline" id="im1">
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi>&#x3f5;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula>
<mml:math display="inline" id="im2">
<mml:mi>x</mml:mi>
</mml:math>
</inline-formula> is input vector of predicted parameters, which are sea surface temperature anomaly (SSTA), sea surface salinity anomaly (SSSA), sea surface height anomaly(SSHA), longitude (LON), latitude(LAT) and the day of the year projected on a circle (JULD). The prior distribution of <inline-formula>
<mml:math display="inline" id="im3">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is assumed to be a GP given by <inline-formula>
<mml:math display="inline" id="im4">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&#x223c;</mml:mo>
<mml:mi>G</mml:mi>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>&#x3ba;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula>
<mml:math display="inline" id="im5">
<mml:mi>&#x3ba;</mml:mi>
</mml:math>
</inline-formula> is the semi-positive definite kernel. <inline-formula>
<mml:math display="inline" id="im6">
<mml:mrow>
<mml:mi>&#x3f5;</mml:mi>
<mml:mo>&#x223c;</mml:mo>
<mml:mi>G</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3c7;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> denotes the Gaussian noise term, here <inline-formula>
<mml:math display="inline" id="im7">
<mml:mi>&#x3c7;</mml:mi>
</mml:math>
</inline-formula> is the standard deviation. Based on the historical Argo profiles and remote sensing data, we can collect <inline-formula>
<mml:math display="inline" id="im8">
<mml:mi>n</mml:mi>
</mml:math>
</inline-formula> TA observations or SA observations at depth <inline-formula>
<mml:math display="inline" id="im9">
<mml:mi>z</mml:mi>
</mml:math>
</inline-formula> to form the observation set <inline-formula>
<mml:math display="inline" id="im10">
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. Corresponding to each observation, we have also collected a set of input parameters <inline-formula>
<mml:math display="inline" id="im11">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mtext mathvariant="bold">g</mml:mtext>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2223;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mo>&#x2026;</mml:mo>
</mml:mrow>
<mml:mo>|</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x211d;</mml:mi>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, here <inline-formula>
<mml:math display="inline" id="im12">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>(</mml:mo>
<mml:mi>S</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>T</mml:mi>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>S</mml:mi>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>S</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>H</mml:mi>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>L</mml:mi>
<mml:mi>O</mml:mi>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>L</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Then the prior distribution of <inline-formula>
<mml:math display="inline" id="im15">
<mml:mi>y</mml:mi>
</mml:math>
</inline-formula> can be given as:</p>
<disp-formula>
<label>(1)</label>
<mml:math display="block" id="M1">
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mtext mathvariant="bold">g</mml:mtext>
</mml:msub>
<mml:mo>&#x223c;</mml:mo>
<mml:mi mathvariant="script">N</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mi>y</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:msub>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mi>y</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mtext>Cov</mml:mtext>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>y</mml:mi>
<mml:mo>&#x2223;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mtext mathvariant="bold">g</mml:mtext>
</mml:msub>
<mml:mo stretchy="false">]</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>&#x3ba;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mtext mathvariant="bold">g</mml:mtext>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mtext mathvariant="bold">g</mml:mtext>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msup>
<mml:mi>&#x3c7;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:msub>
<mml:mtext mathvariant="bold">I</mml:mtext>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>GPR estimates the posterior distribution of an unknown quantity under the assumptions of a Gaussian process and the likelihood of a normal distribution. Specifically, from the above assumptions, it is known that the joint probability distribution of the estimated value <inline-formula>
<mml:math display="inline" id="im16">
<mml:mrow>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> at the new input parameter <inline-formula>
<mml:math display="inline" id="im17">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and the existing observation <inline-formula>
<mml:math display="inline" id="im18">
<mml:mi>y</mml:mi>
</mml:math>
</inline-formula> is a joint normal distribution of the following form:</p>
<disp-formula>
<label>(2)</label>
<mml:math display="block" id="M2">
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="bold-italic">y</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>&#x223c;</mml:mo>
<mml:mi mathvariant="script">N</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mi>y</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mo>*</mml:mo>
<mml:mtext>T</mml:mtext>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mrow>
<mml:mo>*</mml:mo>
<mml:mo>*</mml:mo>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math display="inline" id="im19">
<mml:mrow>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mtext mathvariant="bold">K</mml:mtext>
</mml:mstyle>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3ba;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mtext mathvariant="bold">g</mml:mtext>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula>
<mml:math display="inline" id="im20">
<mml:mrow>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mtext mathvariant="bold">K</mml:mtext>
</mml:mstyle>
<mml:mrow>
<mml:mo>*</mml:mo>
<mml:mo>*</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3ba;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. Combining the prior assumption and the likelihood, the posterior distribution of <inline-formula>
<mml:math display="inline" id="im21">
<mml:mrow>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> can be derived from the maximum likelihood method (<xref ref-type="bibr" rid="B68">Williams and Rasmussen, 1995</xref>),</p>
<disp-formula>
<label>(3)</label>
<mml:math display="block" id="M3">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo>&#x2223;</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mtext mathvariant="bold">g</mml:mtext>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>y</mml:mi>
<mml:mo>&#x223c;</mml:mo>
<mml:mi>G</mml:mi>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>m</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msup>
<mml:mi>m</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mo>*</mml:mo>
<mml:mtext>T</mml:mtext>
</mml:msubsup>
<mml:msubsup>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mi>y</mml:mi>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:msup>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mrow>
<mml:mo>*</mml:mo>
<mml:mo>*</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mo>*</mml:mo>
<mml:mtext>T</mml:mtext>
</mml:msubsup>
<mml:msubsup>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mtext mathvariant="bold">K</mml:mtext>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
</disp-formula>
<p>The reconstruction of TA and SA profiles using GPR is carried out in the python library (<xref ref-type="bibr" rid="B20">GPy, 2012</xref>), and the output is a vector consisting of the TA profile and SA profile in series at the same location. The kernel function chosen in this paper is the radial basis function. The minimum and maximum of the input/output can be utilized to scale the input/output to [0,1] to eliminate the magnitude effect before the model is trained.</p>
<p>In summary, the derivation of the vertical structure of TA and SA using GPR involves two processes: online and offline stages. In the offline stage, historical observations are collected to build the corresponding input/output training set, from which the GPR is trained to learn the mapping of the input parameters to the TA and SA profiles. In the online stage, the corresponding TA and SA profiles are estimated from the already trained GPR for the new input parameters. The flowchart of TA and SA estimation using the GPR is shown in <xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>.</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>Flowchart of salinity and temperature estimation using GPR.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g001.tif"/>
</fig>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Long short-term memory network</title>
<p>LSTM is an extension of the traditional recurrent neural network (RNN), which is mainly used to deal with the case of traditional RNN failure. It solves the problem of gradient disappearance and gradient explosion of traditional RNNs to a certain extent to learn long-term dependent information. Same as general RNN, it also consists of a series of repeating cells with sequential connections, but the structure of the cells is more complex. Crucially, the LSTM adds a state vector updated over time to the cells, which can selectively record the information of the system and preserve the long-term state of the system. Each cell has three inputs, the input <inline-formula>
<mml:math display="inline" id="im22">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> of the network at the current time, the output value <inline-formula>
<mml:math display="inline" id="im23">
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> at the previous time, and the cell state <inline-formula>
<mml:math display="inline" id="im24">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> at the last time. The input variables are passed through three gates with different functions, namely forget gate, input gate and output gate, to forget and add information to the cell state <italic>C<sub>i</sub>
</italic> at the current moment and update the output state <inline-formula>
<mml:math display="inline" id="im25">
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> at the current moment. The structure of each cell is shown in <xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref> (<xref ref-type="bibr" rid="B27">Hochreiter and Schmidhuber, 1997</xref>), and the tasks of the different gates are implemented using the following equations (<xref ref-type="bibr" rid="B26">Hochreiter and Schmidhuber, 1996</xref>; <xref ref-type="bibr" rid="B18">Gers et&#xa0;al., 2000</xref>):</p>
<fig id="f2" position="float">
<label>Figure&#xa0;2</label>
<caption>
<p>Structure of single cell.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g002.tif"/>
</fig>
<disp-formula>
<label>(4)</label>
<mml:math display="block" id="M4">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>I</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>I</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>C</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>C</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>O</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>O</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>*</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>*</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>*</mml:mo>
<mml:mi>tanh</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math display="inline" id="im26">
<mml:mi>&#x3c3;</mml:mi>
</mml:math>
</inline-formula> denotes the sigmoid activation function, <inline-formula>
<mml:math display="inline" id="im27">
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>I</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>C</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>O</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are weight matrices, <inline-formula>
<mml:math display="inline" id="im28">
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>f</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>I</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>C</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>O</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are model biases, and <inline-formula>
<mml:math display="inline" id="im29">
<mml:mo>*</mml:mo>
</mml:math>
</inline-formula> represents dot product operation.</p>
<p>Referring to (<xref ref-type="bibr" rid="B5">Buongiorno Nardelli, 2020</xref>), we also use LSTM to estimate the TA and SA profiles. TAs and SAs from different depths are considered the output states of different cells. The input to each cell is the same, a multivariate vector consisting of SSTA, SSSA, SSHA, LON, LAT, and JULD corresponding to the current position. The structure of the LSTM we used is the same as that in (<xref ref-type="bibr" rid="B5">Buongiorno Nardelli, 2020</xref>), i.e., a 2-layer stacked network. Each layer contains 35 hidden units, and the optimization algorithm is Adam (<xref ref-type="bibr" rid="B30">Kingma and Ba, 2014</xref>). Similarly, we also use <italic>max</italic>&#x2212;<italic>min</italic> normalization to preprocess the data before the LSTM training.</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Proper orthogonal decomposition</title>
<p>POD, which has a wide range of applications in various fields (<xref ref-type="bibr" rid="B35">Liang et&#xa0;al., 2002</xref>; <xref ref-type="bibr" rid="B51">Pinnau, 2008</xref>; <xref ref-type="bibr" rid="B10">Chapelle et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B59">Singler, 2014</xref>), provides a means to obtain a low-dimensional description of the system. This method extracts a small number of modes from historical TA and SA profiles that can represent the main features of the field, which simplifies and reduces the dimensionality of the data. The temperature-salinity anomaly profiles at the same location are placed in a multivariate matrix <bold>X</bold>
</p>
<disp-formula>
<mml:math display="block" id="M5">
<mml:mrow>
<mml:mtext mathvariant="bold">X</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>11</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>21</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>22</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mn>11</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mn>21</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mn>21</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mover>
<mml:mo>=</mml:mo>
<mml:mtext>&#x394;</mml:mtext>
</mml:mover>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>u</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>u</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mo>&#x2026;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>u</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x211d;</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>m</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math display="inline" id="im30">
<mml:mi>m</mml:mi>
</mml:math>
</inline-formula> is the number of vertical levels, <inline-formula>
<mml:math display="inline" id="im31">
<mml:mi>n</mml:mi>
</mml:math>
</inline-formula> is the number of vertical profiles. In order to calculate the <inline-formula>
<mml:math display="inline" id="im32">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula> modes, we only need to do the singular value decomposition of <inline-formula>
<mml:math display="inline" id="im33">
<mml:mtext mathvariant="bold">X</mml:mtext>
</mml:math>
</inline-formula> as follows:</p>
<disp-formula>
<mml:math display="block" id="M6">
<mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>X</mml:mi>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>U</mml:mi>
</mml:mstyle>
<mml:mtext>&#x3a3;</mml:mtext>
<mml:msup>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>Z</mml:mi>
</mml:mstyle>
<mml:mtext>T</mml:mtext>
</mml:msup>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math display="inline" id="im34">
<mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>U</mml:mi>
</mml:mstyle>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x211d;</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>m</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im35">
<mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>Z</mml:mi>
</mml:mstyle>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x211d;</mml:mi>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> are orthogonal matrices. The matrix <inline-formula>
<mml:math display="inline" id="im36">
<mml:mrow>
<mml:mtext>&#x3a3;</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> contains the singular value <inline-formula>
<mml:math display="inline" id="im37">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2265;</mml:mo>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>&#x2265;</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>&#x2265;</mml:mo>
<mml:msub>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula>
<mml:math display="inline" id="im38">
<mml:mi>r</mml:mi>
</mml:math>
</inline-formula> is the rank of <inline-formula>
<mml:math display="inline" id="im39">
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>X</mml:mi>
</mml:mstyle>
</mml:math>
</inline-formula>. The first <inline-formula>
<mml:math display="inline" id="im40">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula> columns of <inline-formula>
<mml:math display="inline" id="im41">
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>U</mml:mi>
</mml:mstyle>
</mml:math>
</inline-formula> are the <inline-formula>
<mml:math display="inline" id="im42">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula> modes to be computed, which we denote as</p>
<disp-formula>
<mml:math display="block" id="M7">
<mml:mrow>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>U</mml:mi>
</mml:mstyle>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>U</mml:mi>
</mml:mstyle>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mo>:</mml:mo>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>:</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x2026;</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x2026;</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>The dimensions of <inline-formula>
<mml:math display="inline" id="im43">
<mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>&#x22a4;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im44">
<mml:mrow>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>&#x22a4;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> is <inline-formula>
<mml:math display="inline" id="im45">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. Then the corresponding TA profile can be represented by a linear combination of <inline-formula>
<mml:math display="inline" id="im46">
<mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and the SA profile is represented by a linear combination of <inline-formula>
<mml:math display="inline" id="im47">
<mml:mrow>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, where the unknown coefficients are shared. A common method for solving the coefficients is to solve a linear system that is obtained by equating the corresponding sea surface elements inthe reconstructed anomaly profiles to sea surface observations. In this case, <inline-formula>
<mml:math display="inline" id="im48">
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> is required. To increase <inline-formula>
<mml:math display="inline" id="im49">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula>, other physical quantities of the ocean need to be added to <inline-formula>
<mml:math display="inline" id="im50">
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>X</mml:mi>
</mml:mstyle>
</mml:math>
</inline-formula> to impose constraints on the combined coefficients.</p>
<p>Different from the above methods, in this paper, two methods, GPR and LSTM, are utilized to learn the relationship from the sea surface parameters to the coefficients. In particular, <inline-formula>
<mml:math display="inline" id="im51">
<mml:mrow>
<mml:msub>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mi>U</mml:mi>
</mml:mstyle>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the solution of the following minimization problem</p>
<disp-formula>
<label>(5)</label>
<mml:math display="block" id="M8">
<mml:mrow>
<mml:munder>
<mml:mrow>
<mml:mi>min</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mtext mathvariant="bold">W</mml:mtext>
<mml:mo>&#x3f5;</mml:mo>
<mml:msub>
<mml:mtext mathvariant="bold">Y</mml:mtext>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:munder>
<mml:msubsup>
<mml:mrow>
<mml:mo>&#x2016;</mml:mo>
<mml:mrow>
<mml:mtext mathvariant="bold">X</mml:mtext>
<mml:mo>&#x2212;</mml:mo>
<mml:mtext mathvariant="bold">&#xa0;W</mml:mtext>
<mml:msup>
<mml:mtext mathvariant="bold">W</mml:mtext>
<mml:mtext>T</mml:mtext>
</mml:msup>
<mml:mtext mathvariant="bold">X</mml:mtext>
</mml:mrow>
<mml:mo>&#x2016;</mml:mo>
</mml:mrow>
<mml:mi>F</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math display="inline" id="im52">
<mml:mrow>
<mml:msub>
<mml:mtext mathvariant="bold">Y</mml:mtext>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">{</mml:mo>
<mml:mrow>
<mml:mstyle mathvariant="bold" mathsize="normal">
<mml:mtext mathvariant="bold">W</mml:mtext>
</mml:mstyle>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x211d;</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>m</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>:</mml:mo>
<mml:msup>
<mml:mtext mathvariant="bold">W</mml:mtext>
<mml:mtext>T</mml:mtext>
</mml:msup>
<mml:mtext mathvariant="bold">W</mml:mtext>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mtext mathvariant="bold">I</mml:mtext>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">}</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula>
<mml:math display="inline" id="im53">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mo>&#x2016;</mml:mo>
<mml:mo>&#xb7;</mml:mo>
<mml:mo>&#x2016;</mml:mo>
</mml:mrow>
<mml:mi>F</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the Frobenius norm, and the error is estimated as</p>
<disp-formula>
<label>(6)</label>
<mml:math display="block" id="M9">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x2016;</mml:mo>
<mml:mrow>
<mml:mtext mathvariant="bold">X</mml:mtext>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mtext mathvariant="bold">U</mml:mtext>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mtext mathvariant="bold">U</mml:mtext>
<mml:mi>k</mml:mi>
<mml:mtext>T</mml:mtext>
</mml:msubsup>
<mml:mtext mathvariant="bold">X</mml:mtext>
</mml:mrow>
<mml:mo>&#x2016;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mi>F</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>r</mml:mi>
</mml:munderover>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>i</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>From this, we can naturally compute the coefficient vector <inline-formula>
<mml:math display="inline" id="im54">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mtext mathvariant="bold">U</mml:mtext>
<mml:mi>k</mml:mi>
<mml:mtext>T</mml:mtext>
</mml:msubsup>
<mml:msub>
<mml:mi>u</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mover>
<mml:mo>=</mml:mo>
<mml:mtext>&#x394;</mml:mtext>
</mml:mover>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:msubsup>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
<mml:mn>1</mml:mn>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>k</mml:mi>
</mml:msubsup>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mo>&#x22a4;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> corresponding to the historical temperature and salinity anomaly profile <inline-formula>
<mml:math display="inline" id="im55">
<mml:mrow>
<mml:msub>
<mml:mi>u</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. And, we can determine <inline-formula>
<mml:math display="inline" id="im56">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula> by the following equation according to the required accuracy</p>
<disp-formula>
<label>(7)</label>
<mml:math display="block" id="M10">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>i</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>r</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mi>i</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x2265;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b4;</mml:mi>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math display="inline" id="im57">
<mml:mi>&#x3b4;</mml:mi>
</mml:math>
</inline-formula> is user defined cut-off threshold. To retrieve the vertical structure of TA and SA from the sea surface parameters, we collect the input parameters <inline-formula>
<mml:math display="inline" id="im58">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> corresponding to the obtained state vector</p>
<p>
<inline-formula>
<mml:math display="inline" id="im59">
<mml:mrow>
<mml:msub>
<mml:mi>u</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. Here <inline-formula>
<mml:math display="inline" id="im60">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is still a vector consisting of SSTA, SSSA, SSHA, LON, LAT and JULD. Then based on the collected dataset <inline-formula>
<mml:math display="inline" id="im61">
<mml:mrow>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, we can train GPR or LSTM to build the mapping of input parameters <inline-formula>
<mml:math display="inline" id="im62">
<mml:mi>x</mml:mi>
</mml:math>
</inline-formula> to coefficients <inline-formula>
<mml:math display="inline" id="im63">
<mml:mi>&#x3b1;</mml:mi>
</mml:math>
</inline-formula>, then the TA and SA are reconstructed as (<xref ref-type="bibr" rid="B25">Hesthaven et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B22">Guo and Hesthaven, 2019</xref>)</p>
<disp-formula>
<mml:math display="block" id="M11">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
</disp-formula>
<p>The corresponding flow chart is given in <xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>.</p>
<fig id="f3" position="float">
<label>Figure&#xa0;3</label>
<caption>
<p>Flowchart of salinity and temperature estimation using GPR-POD and LSTM-POD.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g003.tif"/>
</fig>
<p>Further, we point out that <inline-formula>
<mml:math display="inline" id="im64">
<mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im65">
<mml:mrow>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are actually functions of depth <inline-formula>
<mml:math display="inline" id="im66">
<mml:mi>z</mml:mi>
</mml:math>
</inline-formula>. To estimate the TA and SA at any depth <inline-formula>
<mml:math display="inline" id="im67">
<mml:mrow>
<mml:msup>
<mml:mi>z</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, we use cubic spline interpolation to interpolate <inline-formula>
<mml:math display="inline" id="im68">
<mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im69">
<mml:mrow>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> with respect to depth <inline-formula>
<mml:math display="inline" id="im70">
<mml:mi>z</mml:mi>
</mml:math>
</inline-formula> to obtain the corresponding interpolation functions <inline-formula>
<mml:math display="inline" id="im71">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>L</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im72">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>M</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> satisfying <inline-formula>
<mml:math display="inline" id="im73">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>L</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im74">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>M</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>m</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula>
<mml:math display="inline" id="im75">
<mml:mrow>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>m</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> denotes the vertical levels, so that the TA and SA at depth <inline-formula>
<mml:math display="inline" id="im76">
<mml:mrow>
<mml:msup>
<mml:mi>z</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> can be reconstructed as</p>
<disp-formula>
<mml:math display="block" id="M12">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr columnalign="left">
<mml:mtd columnalign="left">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>z</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>L</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>z</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>z</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>M</mml:mi>
<mml:mo>&#x2dc;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>z</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
</disp-formula>
</sec>
</sec>
<sec id="s4" sec-type="results">
<label>4</label>
<title>Results</title>
<sec id="s4_1">
<label>4.1</label>
<title>Comparison between different models</title>
<p>To evaluate the performance of different models, we randomly selected 20% of the 11,374 Argo profiles as the test set and the remaining 80% as the training set. Different models are then employed to retrieve TA and SA profiles with the same inputs, i.e., SSTA, SSSA, SSHA, LON, LAT, and JULD. The RMSE between TA and SA profiles obtained from these models and the Argo profiles is calculated to evaluate the performance of the different methods. The RMSE of the WOA is obtained by comparing the temperature and salinity profiles obtained after interpolating the monthly mean of the climatology of WOA18 with the Argo profiles. Let &#x3b4;=0.04% and we choose <italic>k</italic>=14 modes, in fact, before performing the POD, we similarly scaled the data between 0 and 1. <xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref> shows the vertical distribution of the RMSEs estimated by the different models. The RMSEs of all proposed methods are smaller than the RMSE of the WOA18, and a similar vertical structure can be seen in the different models. The RMSEs are small at the sea surface but increase rapidly with depth, decrease rapidly after reaching the maximum, and stabilized. This may be related to the complex dynamical processes in the ocean&#x2019;s upper layers and the perturbations in the mixed and thermocline layers, while the seawater is relatively stable in the deeper layers. The RMSEs of temperature reach their maximum at 105 - 115 m, and only near this depth (where the temperature variation is relatively large), LSTM shows superior performance in predicting TA profiles compared to other methods. This suggests that LSTM may have greater potential for approximating strongly nonlinear functions since, at depths where temperature changes rapidly, the relationship between temperature and depth is more complex and may have stronger nonlinearity.</p>
<fig id="f4" position="float">
<label>Figure&#xa0;4</label>
<caption>
<p>Estimated RMSEs of different models. <bold>(A)</bold> RMSEs of estimated temperature. <bold>(B)</bold> RMSEs of estimated salinity.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g004.tif"/>
</fig>
<p>However, the performance of the four methods is comparable from an overall perspective. The RMSEs of salinity reach their maximum between 55 - 65 m. Although the four reconstruction methods have comparable accuracy, GPR performs a little better than LSTM in the reconstruction of upper salinity, which may arise because the rate of change of the salinity profile is not as large as that of the temperature profile, GPR can produce sufficiently accurate approximations, while LSTM has a more complex structure and numerous parameters, making them reliant on large amounts of training data to obtain more accurate predictions. In addition, the POD combined with the regression methods does not cause a large loss in the accuracy of the estimated profiles, which is better reflected in the RMSEs of GPR and GPR-POD, as the RMSEs of both are almost the same. The selected modes are sufficient to characterize the vast majority of the temperature and salinity fields. To further measure the uncertainty of the estimates of GPR-based methods, <xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref> shows the standard deviation of the posterior distribution of the GPR and GPR-POD predictions. We can see that both methods display similar distribution patterns, with comparable uncertainty estimates at most points, but the uncertainty of the model prediction is higher at the western boundary of the region, which can be attributed to the limited training data available in this region. This also shows a drawback of the GPR-based approach, which is better suited for interpolated predictions rather than extrapolation. Notably, the predictions of GPR-POD exhibit higher levels of uncertainty, despite comparable RMSEs between GPR-POD and GPR. This discrepancy may be attributed to the fact that more information is available in the training data for GPR, making it more confident in its predictions.</p>
<fig id="f5" position="float">
<label>Figure&#xa0;5</label>
<caption>
<p>Uncertainty estimation of <bold>(A)</bold> GPR and <bold>(B)</bold> GPR-POD.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g005.tif"/>
</fig>
<p>Considering the inhomogeneity and sparsity of spatio-temporal distribution of thermohaline data, all the thermohaline relationships in the test set are drawn in the same graph (<xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref>), the purpose of which is to compare the predictions of different models, in which the different colors of points are to better distinguish the differences between points. The main distribution structure of T-S graph reconstructed by all methods is similar to that of Argo field, but the distribution range of points is more concentrated than that of Argo field. From the T-S graphs, it can be observed that the reconstructed results are generally weaker for the points with larger absolute values of salinity anomalies. To better visualize the predicted distribution of temperature and salinity anomalies, a histogram of the number of salinity anomalies in different intervals is displayed at the top of the T-S graph, and a histogram of temperature anomalies is displayed on the right side of the T-S graph. The histogram of salinity anomalies reveals that the distribution of LSTM-POD is more consistent with that of Argo, while LSTM underestimates the number of points of salinity anomalies in the interval [-0.5,0], which is overestimated by GPR and GPR-POD. The histograms of temperature anomalies from different models indicate that all four methods overestimate the number of temperature anomalies in the interval [0,2.5], but their overall distribution is very similar to that of Argo. In conclusion, the reconstructed methods can predict the main distribution structure of the T-S graph. Although there are slight differences in the distribution range and there may be problems of underestimating the thermohaline state, the performance of the proposed reconstructed methods is generally satisfactory. Furthermore, <xref ref-type="fig" rid="f7">
<bold>Figure&#xa0;7</bold>
</xref> displays the scatter plots of the LSTM estimated and Argo&#x2019;s temperature and salinity anomalies at a depth of 105 m. This depth is significantly impacted by the variation of the upper mixed layer, and the RMSE estimated for temperature and salinity at this depth is also large. The results indicate that the overall discrepancy between them is minimal, as the LSTM prediction preserves the fundamental characteristics of the temperature and salinity anomalies, and its pattern (positive or negative) is accurately reconstructed.</p>
<fig id="f6" position="float">
<label>Figure&#xa0;6</label>
<caption>
<p>T-S graph and histogram of temperature and salinity anomalies for different models. <bold>(A)</bold> T-S graph for Agro (left and bottom), histogram of the number of salinity anomalies in different intervals for Agro (top), histogram of the number of temperature anomalies in different intervals for Agro (right). Panels <bold>(B-E)</bold> are the same as panels <bold>(A)</bold>, respectively, except for <bold>(B)</bold> LSTM, for <bold>(C)</bold> GPR, for <bold>(D)</bold> LSTM-POD and for <bold>(E)</bold> GPR-POD.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g006.tif"/>
</fig>
<fig id="f7" position="float">
<label>Figure&#xa0;7</label>
<caption>
<p>TAs and SAs of different models at 105 m. <bold>(A)</bold> LSTM-estimated TAs. <bold>(B)</bold> Argo TAs. <bold>(C)</bold> LSTM-estimated SAs. <bold>(D)</bold> Argo SAs.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g007.tif"/>
</fig>
<p>To further compare the improvement of POD on the computational efficiency of different models, we compare the running times of different models. The CPU times of LSTM and LSTM-POD are 11216 <italic>s</italic> and 1488 <italic>s</italic>, respectively, computed using a laptop with 4 Intel(R) Core(TM) i7-6770 CPU @ 3.40GHz and 8G Memory. Since GPR requires higher memory for matrix operations, GPR and GPR-POD are run using a single node with a single core in Tianhe-2, which adopts an Intel Xeon E5-2692 v2 CPU @ 2.20GHz and 64GB Memory. Furthermore, the running times of GPR and GPR-POD are 891 <italic>s</italic> and 788 <italic>s</italic>, respectively. We can see that the training time of LSTM can be significantly reduced by combining POD, as LSTM takes 7.5 times longer than LSTM-POD. However, this reduction is not apparent in GPR, this is attributed to the fact that the computational complexity of GPR-POD is <inline-formula>
<mml:math display="inline" id="im77">
<mml:mrow>
<mml:mi>&#x39f;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>k</mml:mi>
<mml:msup>
<mml:mi>n</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, while the computational complexity of GPR is <inline-formula>
<mml:math display="inline" id="im78">
<mml:mrow>
<mml:mi>&#x39f;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>m</mml:mi>
<mml:msup>
<mml:mi>n</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. In this case, <italic>n</italic>=9099,<italic>k</italic>=14,<italic>m</italic>=71, the improvement in training time is insignificant when the number of training samples is much larger than the number of output dimensions.</p>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Sensitivity of different parameters in GPR-POD</title>
<p>To test the sensitivity of the estimated TA and SA to different input parameters, we compare GPR-POD estimates with different training inputs. <xref ref-type="fig" rid="f8">
<bold>Figure&#xa0;8</bold>
</xref>. shows the RMSEs of temperature with various inputs, and as well as the correlation between test input parameters, the correlation between test input parameters, and test TA profiles , and the correlation between test input parameters and test SA profiles. By observing the RMSEs of temperature, we can find that the RMSE of the model without SSHA in the input is large at all depths, which indicates that SSHA plays a vital role in ocean motion and processes. In fact, changes in the subsurface layer may give rise to changes in SSH resulting from the interaction of several factors, such as heat exchange, internal thermal expansion, and ocean circulation. A rise in sea temperature leads to an increase in SSH, while a decrease in sea temperature leads to a reduction in SSH. The correlation coefficients between SSHA and TA profiles also affirm the excellent association between SSHA and TA. Notably, the correlation coefficient between SSHA and TA is the highest among all input parameters except SSTA. In addition, SSH changes under the influence of wave and wind shear. Incorporating SSHA into the model input can provide more information for predicting TA. Latitude, longitude, and time all improve the TA profile reconstruction to different degrees, showing that geographic and temporal information is helpful in predicting TA. In particular, incorporating latitude data yields a more substantial reduction in RMSE than incorporating longitude data, suggesting that latitude has a greater impact on TA reconstruction. This is because the discrepancy in solar radiation across different latitudes results in significant variations in sea temperature in the north-south direction. Therefore, latitude information is essential to improve the performance of subsurface reconstruction. Furthermore, the correlation between latitude and TA profiles is higher compared to that between longitude and TA profiles, highlighting the importance of considering latitude when predicting TA. The model without SSTA in the input has the largest RMSE in the upper layer (&lt;100 m ), but when the depth is greater than 100 m, there is almost no difference between the model without SSTA and GPR-POD ( all parameters as input). This suggests that SSTA data mainly improve the reconstruction of upper layer TA, while deeper temperature variations are difficult to be interpreted from satellite measurements. In addition to the fact that SSSA has no significant relationship with ocean interior temperature, which can also be verified by observing the correlation between SSSA and TA profiles. Similar results can be observed in the RMSEs of salinity, where adding both latitude and SSHA data reduces the RMSEs of salinity at all depths. The relationship between ocean salinity and temperature on density is a fundamental aspect that affects changes in SSH, causing SSHA and SA profiles to have some correlation. Latitude also influences changes in ocean salinity as a result of a combination of factors, including precipitation, evaporation, and mixing of water masses. Consequently, incorporating latitude and SSHA into the prediction of subsurface salinity can significantly improve model accuracy. SSSA data play an important role in the reconstruction of upper ocean SA profiles, while they do not help much in the reconstruction of deep SA profiles, and the correlation between SSSA and SA gradually decreased with increasing depth. The RMSEs of several other models show that longitude, time, and SSTA have somewhat limited improvements on the models, where longitude and time can reduce the maximum of salinity RMSE, while SSTA has no significant improvement on the reconstruction of SA profiles.</p>
<fig id="f8" position="float">
<label>Figure&#xa0;8</label>
<caption>
<p>
<bold>(A)</bold> RMSEs of temperature from models with different sea surface inputs. <bold>(B)</bold> RMSEs of salinity from models with different sea surface inputs. <bold>(C)</bold> heat map of correlation coefficients.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g008.tif"/>
</fig>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>The performance of climatology on three-dimensional salinity and temperature fields reconstruction</title>
<p>To test the influence of climatology on the temperature and salinity reconstruction models, we considered three different inputs and outputs for LSTM and GPR. The inputs are 1) X: SSTA, SSSA, SSHA, LAT, LON, JULD; 2) X-Without-WOA: SST, SSS, SSHA, LAT, LON, JULD, and 3) X-WOA: SST, SSS, SSHA, LAT, LON, JULD, SS (climatology), ST (climatology) and the outputs are 1) X: STA, SSA; 2) X-Without-WOA: SS, ST, and 3) X-WOA: SS, ST. The RMSEs estimated by different models are shown in <xref ref-type="fig" rid="f9">
<bold>Figure&#xa0;9</bold>
</xref>. Observing the RMSEs of the two regression models with various inputs and outputs, we can find a similar behavior, especially when the temperature at depths of less than 100 m and the salinity at depths of less than 200 m, training the models with the temperature and salinity anomaly fields can significantly improve the prediction accuracy than directly using temperature and salinity as the input and output of the model. This is because the seasonal variation signal disappears after calculating the anomaly field using the monthly mean of climatology, thus making the SS and SA easier to predict. On the other hand, incorporating monthly temperature and salinity climatology fields into the model input gives the most accurate estimates. Climatological information added to the input can effectively reduce the impact of the sea surface parameter errors. In summary, seasonal variations in seawater temperature and salinity play a crucial role in oceanic processes. The information provided by seasonal variations in temperature and salinity can help us better estimate temperature and salinity structure within the ocean.</p>
<fig id="f9" position="float">
<label>Figure&#xa0;9</label>
<caption>
<p>The RMSEs of temperature and salinity from models with different inputs and outputs. <bold>(A)</bold> RMSEs of estimated temperature. <bold>(B)</bold> RMSEs of estimated salinity.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g009.tif"/>
</fig>
<p>Comparing LSTM and GPR with the same input and output, it can be found that the accuracy of GPR with temperature and salinity climatology as input is higher than that of LSTM. This is due to the inclusion of monthly climate fields of temperature and salinity in the input, which makes the model less complex. Similar to the findings in Section 4.1, GPR can provide more accurate predictions when the amount of training data is not abundant. In contrast, the performance of GPR without considering climatology is the worst, expressly, by up to 20% (&#x223c;245 m) over the GPR-WOA estimate. When only sea surface parameters are included in the input without climatology as an aid, it leads to a more complex relationship between the input and output in the model. The LSTM, on the other hand, can better approximate the strongly nonlinear function, so the GPR-Without-WOA does not perform as well as the LSTM-Without-WOA.</p>
</sec>
<sec id="s4_4">
<label>4.4</label>
<title>Estimation of temperature and salinity using continuous remote sensing data from SSHA and SSTA</title>
<p>Since the SSS data is the 4-day temporal grid, in order to make full use of the other daily satellite and <italic>in situ</italic> observations, we use the SSTA, SSHA, LAT, LON, and JULD as model inputs in this section to train the LSTM model (named LSTM-Daily) to estimate the TA and SA profiles. Thus, there are 45,031 Argo observation profiles available, and again, 80% are randomly selected as the training set, and the remaining 20% are used to test the model&#x2019;s performance. The RMSEs of the temperature and salinity estimated by LSTM-Daily are shown in <xref ref-type="fig" rid="f10">
<bold>Figure&#xa0;10</bold>
</xref>, respectively, besides the RMSE reduction rate of LSTM relative to climatological estimates also shown in <xref ref-type="fig" rid="f10">
<bold>Figure&#xa0;10</bold>
</xref>. It is clear that the RMSE of the estimated TA profiles is reduced by 20%&#x2212;70% compared to climatology. By increasing the training data size, a more accurate reconstruction of the temperature profiles can be obtained. On the other hand, the prediction accuracy decreases for salinity at depths less than 100 due to the lack of SSS information, which again validates the importance of SSSA for SA reconstruction at a depth of less than 100 m. However, when the depth is greater than 100 m, as the effect of SSSA data disappears, higher accuracy SA estimation can be achieved again due to increased training data size.</p>
<fig id="f10" position="float">
<label>Figure&#xa0;10</label>
<caption>
<p>The RMSEs of estimated temperature and salinity, and the reduction of RMSE for LSTM reconstructed profiles relative to climatological profiles. <bold>(A)</bold> RMSEs of estimated temperature. <bold>(B)</bold> RMSEs of estimated salinity. <bold>(C)</bold> The reduction of temperature&#x2019;s RMSE. <bold>(D)</bold> The reduction of salinity&#x2019;s RMSE.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-10-1121334-g010.tif"/>
</fig>
</sec>
</sec>
<sec id="s5" sec-type="discussion">
<label>5</label>
<title>Discussion</title>
<p>This paper applies several methods to estimate the TA and SA profiles from sea surface parameters, namely LSTM, GPR, LSTM-POD, and GPR-POD. LSTM and GPR directly train the model with TAs and SAs at different depths as the output. At the same time, LSTM-POD and GPR-POD combine LSTM, GPR respectively, with POD, which can downscale the data, and it assumes that only a small number of modes are needed to represent the main features of the temperature and salinity anomaly fields. Thus, only the model needs to be trained to estimate the coefficients, i.e., the output of the model is the reduced coefficients corresponding to the temperature-salinity anomaly profiles, which greatly simplifies the regression model. In addition, using the predicted coefficients, a linear combination of the interpolated modes can estimate TA and SA at any depth. We selected Argo observations and satellite remote sensing data for the Northwest Pacific Ocean in 2011-2021 for training and testing. The accuracy and reliability of the proposed methods are evaluated by calculating the RMSEs of the estimated profiles, and the results show that these methods can accurately derive the TA and SA profiles, and the introduction of POD greatly saves time and storage costs without additional loss of accuracy, especially for LSTM.</p>
<p>In order to determine the relative importance of different input parameters to the temperature and salinity reconstruction, we evaluate the results of several models with different inputs, in addition to which correlations between different parameters and TA and SA profiles are calculated. The results show that the most significant improvements can be obtained by including SSHA and latitude. On the other hand, SSTA and SSSA play an important role in the TA and SA reconstruction in the upper layers (&gt;100 m), but this role decreases with increasing depth. In addition, by using the temperature and salinity monthly climatological fields, especially when added to the input of the regression model, fairly good profile predictions can be obtained, both for LSTM and GPR. This suggests that the introduction of climatology can effectively reduce the effect of sea surface errors and provide more information to the regression model, thus effectively improving the accuracy and robustness of the model. This also inspires us to explore multiple satellite measurements further to improve the reliability of the estimates in the future.</p>
<p>In general, the proposed methods can be effectively used for reconstructing temperature and salinity profiles, particularly when monthly climatology of temperature and salinity is included as an input to the GPR. The techniques presented in this article for estimating subsurface temperature and salinity do not require any prior knowledge or assumptions and are highly versatile and generalized. These models can accurately predict new subsurface temperature and salinity values as long as it is well-trained. It is expected that the proposed methods can be beneficial for the detection of the thermal structure of the ocean interior in marine science and climate change research, as well as for more precise analysis of temperature and salinity changes. However, it is important to note that, unlike dynamic-based reconstruction methods, the proposed methods are solely based on data and lack of physical interpretation, which may be a limitation in the development of artificial intelligence approaches to oceanography. With the rapid advancement of ocean modeling capability, observation technology and artificial intelligence, it is a promising direction to effectively combine the advantages of a dynamics-based approach and data-driven approach in future work, enhance the ability of model prediction and physical interpretation, and establish a deep learning model based on physical information.</p>
</sec>
<sec id="s6" sec-type="data-availability">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7" sec-type="author-contributions">
<title>Author contributions</title>
<p>YC: Methodology, Investigation, Performer Experiment, Writing Original Draft. LL: Data Collection and Curation, Writing-Review. XC: Formal Analysis, Writing-Review and Editing. ZW: Supervision, Writing-Reviewing. XS: Writing-Review and Editing. CY: Writing-Review and Editing. ZG: Writing-Review, Project Administration, Funding Acquisition. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<sec id="s8" sec-type="funding-information">
<title>Funding</title>
<p>This study was supported by the National Key Research and Development Program of China (2021YFF0704000), the Taishan Scholars Progam (tsqn202211059) and the Fundamental Research Funds for the Central Universities (202265005, 202264007).</p>
</sec>
<ack>
<title>Acknowledgments</title>
<p>The authors would like to thank the reviewers for their valuable suggestions on the manuscript.</p>
</ack>
<sec id="s9" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
<p>The reviewer XL declared a shared affiliation with the authors to the handling editor at the time of review.</p>
</sec>
<sec id="s10" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ali</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Swain</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Weller</surname> <given-names>R. A.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Estimation of ocean subsurface thermal structure from surface parameters: a neural network approach</article-title>. <source>Geophysical Res. Lett.</source> <volume>31</volume>, <fpage>L20308</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2004GL021192</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ballabrera-Poy</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Mourre</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Garcia-Ladona</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Turiel</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Font</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Linear and non-linear t&#x2013;s models for the eastern north atlantic from argo data: role of surface salinity observations</article-title>. <source>Deep Sea Res. Part I: Oceanographic Res. Papers</source> <volume>56</volume>, <fpage>1605</fpage>&#x2013;<lpage>1614</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.dsr.2009.05.017</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bao</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Ren</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Yu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Salinity profile estimation in the pacific ocean from satellite surface salinity observations</article-title>. <source>J. Atmospheric Oceanic Technol.</source> <volume>36</volume>, <page-range>53&#x2013;68</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/JTECH-D-17-0226.1</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Boyer</surname> <given-names>T. P.</given-names>
</name>
<name>
<surname>Garcia</surname> <given-names>H. E.</given-names>
</name>
<name>
<surname>Locarnini</surname> <given-names>R. A.</given-names>
</name>
<name>
<surname>Zweng</surname> <given-names>M. M.</given-names>
</name>
<name>
<surname>Mishonov</surname> <given-names>A. V.</given-names>
</name>
<name>
<surname>Reagan</surname> <given-names>J. R.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <source>World ocean atlas 2018</source>. (<publisher-loc>United States</publisher-loc>: <publisher-name>NOAA National Centers for Environmental Information</publisher-name>). Dataset. Available at: <uri xlink:href="https://www.ncei.noaa.gov/archive/accession/NCEI-WOA18">https://www.ncei.noaa.gov/archive/accession/NCEI-WOA18</uri>.</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Buongiorno Nardelli</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A deep learning network to retrieve ocean hydrographic profiles from combined satellite and <italic>in situ</italic> measurements</article-title>. <source>Remote Sens.</source> <volume>12</volume>, <elocation-id>3151</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/rs12193151</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Buongiorno Nardelli</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Santoleri</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Methods for the reconstruction of vertical profiles from surface data: multivariate analyses, residual gem, and variable temporal signals in the north pacific ocean</article-title>. <source>J. Atmospheric Oceanic Technol.</source> <volume>22</volume>, <fpage>1763</fpage>&#x2013;<lpage>1781</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/JTECH1792.1</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carnes</surname> <given-names>M. R.</given-names>
</name>
<name>
<surname>Mitchell</surname> <given-names>J. L.</given-names>
</name>
<name>
<surname>de Witt</surname> <given-names>P. W.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Synthetic temperature profiles derived from geosat altimetry: comparison with air-dropped expendable bathythermograph profiles</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>95</volume>, <fpage>17979</fpage>&#x2013;<lpage>17992</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/JC095iC10p17979</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carnes</surname> <given-names>M. R.</given-names>
</name>
<name>
<surname>Teague</surname> <given-names>W. J.</given-names>
</name>
<name>
<surname>Mitchell</surname> <given-names>J. L.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Inference of subsurface thermohaline structure from fields measurable by satellite</article-title>. <source>J. Atmospheric Oceanic Technol.</source> <volume>11</volume>
<fpage>551</fpage>&#x2013;<lpage>566</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/1520-0426(1994)011&lt;0551:IOSTSF&gt;2.0.CO;2</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carrassi</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Bocquet</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Bertino</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Evensen</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Data assimilation in the geosciences: an overview of methods, issues, and perspectives</article-title>. <source>Wiley Interdiscip. Reviews: Climate Change</source> <volume>9</volume>, <elocation-id>e535</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/wcc.535</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chapelle</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Gariah</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Sainte-Marie</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Galerkin approximation with proper orthogonal decomposition : new error estimates and illustrative examples</article-title>. <source>ESAIM: Math. Model. Numerical Anal.</source> <volume>46</volume>, <fpage>731</fpage>&#x2013;<lpage>757</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1051/m2an/2011053</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Reconstruction of three-dimensional ocean structure from Sea surface data: an application of isQG method in the southwest Indian ocean</article-title>. <source>J. Geophysical Res. (Oceans)</source> <volume>125</volume>, <elocation-id>e2020JC016351</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2020JC016351</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Reconstructing the subsurface temperature field by using sea surface data through self-organizing map method</article-title>. <source>IEEE Geosci. Remote Sens. Lett.</source> <volume>15</volume>, <fpage>1812</fpage>&#x2013;<lpage>1816</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/LGRS.2018.2866237</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>de Boyer Mont&#xe9;gut</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Mignot</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Lazar</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Cravatte</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Control of salinity on the mixed layer depth in the world ocean: 1. general description</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>112</volume>, <fpage>C06011</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2006JC003953</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fiedler</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>1988</year>). <article-title>Surface manifestations of subsurface thermal structure in the california current</article-title>. <source>J. Geophysical Res.</source> <volume>93</volume>, <fpage>4975</fpage>&#x2013;<lpage>4983</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/JC093iC05p04975</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Forrester</surname> <given-names>A. I.</given-names>
</name>
<name>
<surname>S&#xf3;bester</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Keane</surname> <given-names>A. J.</given-names>
</name>
</person-group> (<year>2008</year>). <source>Engineering Design via Surrogate Modelling: A Practical Guide</source>. (<publisher-loc>Chichester, UK</publisher-loc>: <publisher-name>John Wiley &amp; Sons Ltd.</publisher-name>) doi:&#xa0;<pub-id pub-id-type="doi">10.1002/9780470770801</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fujii</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Kamachi</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2003</year>a). <article-title>A reconstruction of observed profiles in the sea east of japan using vertical coupled temperature-salinity eof modes</article-title>. <source>J. Oceanography</source> <volume>59</volume>, <fpage>173</fpage>&#x2013;<lpage>186</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1023/A:1025539104750</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fujii</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Kamachi</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2003</year>b). <article-title>Three-dimensional analysis of temperature and salinity in the equatorial pacific using a variational method with vertical coupled temperature-salinity empirical orthogonal function modes</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>108</volume>, <fpage>3297</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2002JC001745</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gers</surname> <given-names>F. A.</given-names>
</name>
<name>
<surname>Schmidhuber</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Cummins</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Learning to forget: continual prediction with LSTM</article-title>. <source>Neural Comput.</source> <volume>12</volume>, <fpage>2451</fpage>&#x2013;<lpage>2471</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1162/089976600300015015</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ghil</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Malanotte-Rizzoli</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>1991</year>). <article-title>Data Assimilation in Meteorology and Oceanography</article-title>. (<publisher-name>Elsevier</publisher-name>) <source>Adv. Geophysics.</source> <volume>33</volume>, <fpage>141</fpage>&#x2013;<lpage>266</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/S0065-2687(08)60442-2</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="book">
<person-group person-group-type="author">
<collab>GPy</collab>
</person-group> (<year>2012</year>). <source>Gpy: a gaussian process framework in python</source>. (<publisher-loc>Sheffield, UK</publisher-loc>: <publisher-name>The University of Sheffield</publisher-name>)</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guinehut</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Dhomps</surname> <given-names>A.-L.</given-names>
</name>
<name>
<surname>Gilles</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Traon</surname> <given-names>P.-Y.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>High resolution 3-d temperature and salinity fields derived from <italic>in situ</italic> and satellite observations</article-title>. <source>Ocean Sci. Discussions</source> <volume>9</volume>, <fpage>1313</fpage>&#x2013;<lpage>1347</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.5194/osd-9-1313-2012</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Hesthaven</surname> <given-names>J. S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Data-driven reduced order modeling for time-dependent problems</article-title>. <source>Comput. Methods Appl. Mechanics Eng.</source> <volume>345</volume>, <fpage>75</fpage>&#x2013;<lpage>99</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.cma.2018.10.029</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Helber</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Richman</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Barron</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>The influence of temperature and salinity variability on the upper ocean density and mixed layer</article-title>. <source>Ocean Sci. Discussions (OSD)</source> <volume>7</volume>, <fpage>1469</fpage>&#x2013;<lpage>1495</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.5194/osd-7-1469-2010</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Held</surname> <given-names>I. M.</given-names>
</name>
<name>
<surname>Pierrehumbert</surname> <given-names>R. T.</given-names>
</name>
<name>
<surname>Garner</surname> <given-names>S. T.</given-names>
</name>
<name>
<surname>Swanson</surname> <given-names>K. L.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Surface quasi-geostrophic dynamics</article-title>. <source>J. Fluid Mechanics</source> <volume>282</volume>, <fpage>1</fpage>&#x2013;<lpage>20</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1017/S0022112095000012</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Hesthaven</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Rozza</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Stamm</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2016</year>). <source>Certified reduced basis methods for parametrized partial differential equations</source>. (<publisher-loc>Switzerland</publisher-loc>: <publisher-name>Springer</publisher-name>) doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-3-319-22470-1</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Hochreiter</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Schmidhuber</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>1996</year>). &#x201c;<article-title>LSTM can solve hard long time lag problems</article-title>,&#x201d; in <source>Proceedings of the 9th international conference on neural information processing systems</source>, vol. <volume>NIPS&#x2019;96</volume>. (<publisher-loc>Cambridge, MA, USA</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>473</fpage>&#x2013;<lpage>479</lpage>.</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hochreiter</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Schmidhuber</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Long short-term memory</article-title>. <source>Neural Comput.</source> <volume>9</volume>, <fpage>1735</fpage>&#x2013;<lpage>1780</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1162/neco.1997.9.8.1735</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Isern-Fontanet</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Chapron</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Lapeyre</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Klein</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Potential use of microwave sea surface temperatures for the estimation of ocean currents</article-title>. <source>Geophysical Res. Lett.</source> <volume>33</volume>, <fpage>OS14B</fpage>&#x2013;<lpage>OS101</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2006GL027801</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Isern-Fontanet</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Lapeyre</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Klein</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Chapron</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Hecht</surname> <given-names>M. W.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Three-dimensional reconstruction of oceanic mesoscale currents from surface information</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>113</volume>, <fpage>C09005</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2007JC004692</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kingma</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Ba</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Adam: A method for stochastic optimization</article-title>. (<publisher-loc>San Diego</publisher-loc>: <publisher-name>3rd International Conference for Learning Representations</publisher-name>). <source>arXiv preprint</source> <volume>arXiv</volume>:<fpage>1412.6980</fpage>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Klemas</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Subsurface and deeper ocean remote sensing from satellites: an overview and new results</article-title>. <source>Prog. Oceanography</source> <volume>122</volume>, <fpage>1</fpage>&#x2013;<lpage>9</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.pocean.2013.11.010</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Klemas</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Subsurface and deeper ocean remote sensing from satellites: An overview and new results</article-title>. <source>Progress in Oceanography</source> <volume>122</volume>, <fpage>1</fpage>&#x2013;<lpage>9</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.pocean.2013.11.010</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lapeyre</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Klein</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Dynamics of the upper oceanic layers in terms of surface quasigeostrophy theory</article-title>. <source>J. Phys. Oceanography</source> <volume>36</volume>, <fpage>165</fpage>&#x2013;<lpage>176</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/JPO2840.1</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Su</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Estimation of global subsurface temperature anomaly based on multisource satellite observations</article-title>. <source>J. Remote Sens.</source> <volume>21</volume>, <fpage>881</fpage>&#x2013;<lpage>891</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.11834/jrs.20177026</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Lim</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Proper orthogonal decomposition and its applications - part I: theory</article-title>. <source>J. Sound Vibration</source> <volume>252</volume>, <fpage>527</fpage>&#x2013;<lpage>544</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1006/jsvi.2001.4041</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Scattered dataset of global ocean temperature and salinity profiles from the international argo program</article-title>. <source>J. Global Change Data Discovery</source> <volume>5</volume>, <fpage>312</fpage>&#x2013;<lpage>321</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3974/geodp.2021.03.09</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Peng</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>R. X.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Reconstruction of ocean&#x2019;s interior from observed sea surface information</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>122</volume>, <fpage>1042</fpage>&#x2013;<lpage>1056</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/2016JC011927</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Peng</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>R. X.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Retrieving density and velocity fields of the ocean&#x2019;s interior from surface data</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>119</volume>, <fpage>8512</fpage>&#x2013;<lpage>8529</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/2014JC010221</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Xue</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Sasaki</surname> <given-names>H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Reconstructing the ocean interior from high-resolution Sea surface information</article-title>. <source>J. Phys. Oceanography</source> <volume>49</volume>, <fpage>3245</fpage>&#x2013;<lpage>3262</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/JPO-D-19-0118.1</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Su</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Subsurface temperature estimation from remote sensing data using a clustering-neural network method</article-title>. <source>Remote Sens. Environ.</source> <volume>422</volume>, <fpage>213</fpage>&#x2013;<lpage>222</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2019.04.009</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lucia</surname> <given-names>D. J.</given-names>
</name>
<name>
<surname>Beran</surname> <given-names>P. S.</given-names>
</name>
<name>
<surname>Silva</surname> <given-names>W. A.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Reduced-order modeling: new approaches for computational physics</article-title>. <source>Prog. Aerospace Sci.</source> <volume>40</volume>, <fpage>51</fpage>&#x2013;<lpage>117</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.paerosci.2003.12.001</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maes</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Behringer</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Reynolds</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Ji</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Retrospective analysis of the salinity variability in the western tropical pacific ocean using an indirect minimization approach</article-title>. <source>J. Atmospheric Oceanic Technol.</source> <volume>17</volume>, <fpage>512</fpage>&#x2013;<lpage>524</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/1520-0426(2000)017&lt;0512:RAOTSV&gt;2.0.CO;2</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Masuko</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Computational cost reduction of long short-term memory based on simultaneous compression of input and hidden state</article-title>,&#x201d; in <source>2017 IEEE automatic speech recognition and understanding workshop (ASRU)</source> (<publisher-loc>Piscataway, New Jersey, USA</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>126</fpage>&#x2013;<lpage>133</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/ASRU.2017.8268926</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meehl</surname> <given-names>G. A.</given-names>
</name>
<name>
<surname>Arblaster</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Fasullo</surname> <given-names>J. T.</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Trenberth</surname> <given-names>K. E.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Model-based evidence of deep-ocean heat uptake during surface-temperature hiatus periods</article-title>. <source>Nat. Climate Change</source> <volume>1</volume>, <fpage>360</fpage>&#x2013;<lpage>364</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nclimate1229</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meinen</surname> <given-names>C. S.</given-names>
</name>
<name>
<surname>Watts</surname> <given-names>D. R.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Vertical structure and transport on a transect across the north atlantic current near 42&#xb0; n: time series and mean</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>105</volume>, <fpage>21869</fpage>&#x2013;<lpage>21891</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2000JC900097</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Melnichenko</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Hacker</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Maximenko</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Lagerloef</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Potemra</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Optimum interpolation analysis of aquarius sea surface salinity</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>121</volume>, <fpage>602</fpage>&#x2013;<lpage>616</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/2015JC011343</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meng</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Remote sensing for subsurface and deeper oceans: an overview and a future outlook</article-title>. <source>IEEE Geosci. Remote Sens. Magazine</source> <volume>10</volume>, <fpage>72</fpage>&#x2013;<lpage>92</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/MGRS.2022.3184951</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Moore</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Martin</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Akella</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Arango</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Balmaseda</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Bertino</surname> <given-names>L.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>Synthesis of ocean observations using data assimilation for operational, real-time and reanalysis systems: a more complete picture of the state of the ocean</article-title>. <source>Front. Mar. Sci.</source> <volume>6</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fmars.2019.00090</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nardelli</surname> <given-names>B. B.</given-names>
</name>
<name>
<surname>Santoleri</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Reconstructing synthetic profiles from surface data</article-title>. <source>J. Atmospheric Oceanic Technol.</source> <volume>21</volume>, <fpage>693</fpage>&#x2013;<lpage>703</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/1520-0426(2004)021&lt;0693:RSPFSD&gt;2.0.CO;2</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nguyen</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Peraire</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Gaussian Functional regression for output prediction: model assimilation and experimental design</article-title>. <source>J. Comput. Phys.</source> <volume>309</volume>, <fpage>52</fpage>&#x2013;<lpage>68</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jcp.2015.12.035</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Pinnau</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title><italic>Model reduction via proper orthogonal decomposition</italic></article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Schilders</surname> <given-names>W. H. A.</given-names>
</name>
<name>
<surname>van der Vorst</surname> <given-names>H. A.</given-names>
</name>
<name>
<surname>Rommes</surname> <given-names>J.</given-names>
</name>
</person-group> (eds) <source>Model Order Reduction: Theory, Research Aspects and Applications</source>. Mathematics in Industry. (<publisher-loc>Springer, Berlin, Heidelberg).</publisher-loc> vol <volume>13</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-3-540-78841-6_5</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qin</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Yin</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Seasonal variability in the thermohaline structure of the western pacific warm pool</article-title>. <source>Acta Oceanologica Sin.</source> <volume>34</volume>, <fpage>44</fpage>&#x2013;<lpage>53</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s13131-015-0696-6</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qiu</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Klein</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Ubelmann</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Fu</surname> <given-names>L.-L.</given-names>
</name>
<name>
<surname>Sasaki</surname> <given-names>H.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Reconstructability of three-dimensional upper-ocean circulation from swot sea surface height measurements</article-title>. <source>J. Phys. Oceanography</source> <volume>46</volume>, <fpage>947</fpage>&#x2013;<lpage>963</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/JPO-D-15-0188.1</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Quarteroni</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Manzoni</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Negri</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2015</year>). <source>Reduced basis methods for partial differential equations: an introduction</source>. (<publisher-loc>Cham, Switzerland</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>). doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-3-319-15431-2</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rao</surname> <given-names>R. R.</given-names>
</name>
<name>
<surname>Sivakumar</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Seasonal variability of sea surface salinity and salt budget of the mixed layer of the north indian ocean</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>108</volume>, <fpage>3009</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2001JC000907</pub-id>
</citation>
</ref>
<ref id="B56">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Rasmussen</surname> <given-names>C. E.</given-names>
</name>
<name>
<surname>Williams</surname> <given-names>C. K. I.</given-names>
</name>
</person-group> (<year>2005</year>). <source>Gaussian Processes for machine learning</source> (<publisher-loc>Cambridge, Massachusetts, USA</publisher-loc>: <publisher-name>The MIT Press</publisher-name>). doi:&#xa0;<pub-id pub-id-type="doi">10.7551/mitpress/3206.001.0001</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Robinson</surname> <given-names>A. R.</given-names>
</name>
<name>
<surname>Lermusiaux</surname> <given-names>P. F. J.</given-names>
</name>
</person-group> (<year>2000</year>). &#x201c;<article-title>Overview of data assimilation</article-title>,&#x201d; in <source>Harvard Reports in Physical/Interdisciplinary, ocean science</source> (<publisher-loc>Cambridge, Massachusetts, USA</publisher-loc>: <publisher-name>Harvard University</publisher-name>), vol. <volume>62</volume>.</citation>
</ref>
<ref id="B58">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Sak</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Senior</surname> <given-names>A. W.</given-names>
</name>
<name>
<surname>Beaufays</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Long short-term memory recurrent neural network architectures for large scale acoustic modeling</article-title>. <conf-name>Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH</conf-name>. <fpage>338</fpage>&#x2013;<lpage>342</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.48550/arXiv.1402.1128</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singler</surname> <given-names>J. R.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>New POD error expressions, error bounds, and asymptotic results for reduced order models of parabolic PDEs</article-title>. <source>SIAM J. Numerical Anal.</source> <volume>52</volume>, <fpage>852</fpage>&#x2013;<lpage>876</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1137/120886947</pub-id>
</citation>
</ref>
<ref id="B60">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Stein</surname> <given-names>M. L.</given-names>
</name>
</person-group> (<year>1999</year>). <source>Interpolation of spatial data: some theory for kriging</source>. (<publisher-loc>New York</publisher-loc>: <publisher-name>Springer Science+Business Media</publisher-name>). doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-1-4612-1494-6</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Su</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Retrieving temperature anomaly in the global subsurface and deeper ocean from satellite observations</article-title>. <source>J. Geophysical Res. (Oceans)</source> <volume>123</volume>, <fpage>399</fpage>&#x2013;<lpage>410</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/2017JC013631</pub-id>
</citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Su</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Qin</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Du</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Super-resolution of subsurface temperature field from remote sensing observations based on machine learning</article-title>. <source>Int. J. Appl. Earth Observation Geoinformation</source> <volume>102</volume>, <elocation-id>102440</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jag.2021.102440</pub-id>
</citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Su</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
<name>
<surname>Kidwell</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Estimation of subsurface temperature anomaly in the indian ocean during recent global surface warming hiatus from satellite measurements: a support vector machine approach</article-title>. <source>Remote Sens. Environ.</source> <volume>160</volume>, <fpage>63</fpage>&#x2013;<lpage>71</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.rse.2015.01.001</pub-id>
</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Su</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Geng</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Qin</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Open: a new estimation of global ocean heat content for upper 2000 meters from remote sensing data</article-title>. <source>Remote Sens.</source> <volume>12</volume>, <fpage>2294</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/rs12142294</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Troccoli</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Haines</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Use of the temperature&#xe2;&#x20ac;&#x201d;salinity relation in a data assimilation context</article-title>. <source>J. Atmospheric Oceanic Technol.</source> <volume>16</volume>, <fpage>2011</fpage>&#x2013;<lpage>2025</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/1520-0426(1999)016&lt;2011:UOTTSR&gt;2.0.CO;2</pub-id>
</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vossepoel</surname> <given-names>F. C.</given-names>
</name>
<name>
<surname>Behringer</surname> <given-names>D. W.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Impact of sea level assimilation on salinity variability in the western equatorial pacific</article-title>. <source>J. Phys. Oceanography</source> <volume>30</volume>, <fpage>1706</fpage>&#x2013;<lpage>1721</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/1520-0485(2000)030&lt;1706:IOSLAO&gt;2.0.CO;2</pub-id>
</citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wan</surname> <given-names>Z. Y.</given-names>
</name>
<name>
<surname>Vlachas</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Koumoutsakos</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Sapsis</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Data-assisted reduced-order modeling of extreme events in complex dynamical systems</article-title>. <source>PloS One</source> <volume>13</volume>, <elocation-id>e0197704</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jcp.2015.12.035</pub-id>
</citation>
</ref>
<ref id="B68">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Williams</surname> <given-names>C. K. I.</given-names>
</name>
<name>
<surname>Rasmussen</surname> <given-names>C. E.</given-names>
</name>
</person-group> (<year>1995</year>). &#x201c;<article-title>Gaussian Processes for regression</article-title>,&#x201d; in <source>Proceedings of the 8th international conference on neural information processing systems</source> (<publisher-loc>Cambridge, MA, USA</publisher-loc>: <publisher-name>MIT Press</publisher-name>).</citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Willis</surname> <given-names>J. K.</given-names>
</name>
<name>
<surname>Roemmich</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Cornuelle</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Combining altimetric height with broadscale profile data to estimate steric height, heat storage, subsurface temperature and sea-surface temperature variability</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>108</volume>, <fpage>3292</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2002JC001755</pub-id>
</citation>
</ref>
<ref id="B70">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wilson</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Coles</surname> <given-names>V. J.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Global climatological relationships between satellite biological and physical observations and upper ocean properties</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>110</volume>, <fpage>C10001</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2004JC002724</pub-id>
</citation>
</ref>
<ref id="B71">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>X.-H.</given-names>
</name>
<name>
<surname>Jo</surname> <given-names>Y.-H.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>W. T. L.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Estimation of subsurface temperature anomaly in the north atlantic using a self-organizing map neural network</article-title>. <source>J. Atmospheric Oceanic Technol.</source> <volume>29</volume>, <fpage>1675</fpage>&#x2013;<lpage>1688</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1175/JTECH-D-12-00013.1</pub-id>
</citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yan</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Bao</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A dynamical-statistical approach to retrieve the ocean interior structure from surface data: sqg-meof-r</article-title>. <source>J. Geophysical Research: Oceans</source> <volume>125</volume>, <elocation-id>e2019JC015840</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1029/2019JC015840</pub-id>
</citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yeo</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Melnyk</surname> <given-names>I.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Deep learning algorithm for data-driven simulation of noisy dynamical system</article-title>. <source>J. Comput. Phys.</source> <volume>376</volume>, <fpage>1212</fpage>&#x2013;<lpage>1231</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jcp.2018.10.024</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>