<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Appl. Math. Stat.</journal-id>
<journal-title>Frontiers in Applied Mathematics and Statistics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Appl. Math. Stat.</abbrev-journal-title>
<issn pub-type="epub">2297-4687</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fams.2022.830116</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Applied Mathematics and Statistics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A Knowledge-Aided Robust Ensemble Kalman Filter Algorithm for Non-Linear and Non-Gaussian Large Systems</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Lopez-Restrepo</surname> <given-names>Santiago</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Yarce</surname> <given-names>Andres</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1587508/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Pinel</surname> <given-names>Nicol&#x000E1;s</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Quintero</surname> <given-names>O. L.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1093779/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Segers</surname> <given-names>Arjo</given-names></name>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Heemink</surname> <given-names>A. W.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Mathematical Modelling Research Group, Universidad EAFIT</institution>, <addr-line>Medell&#x000ED;n</addr-line>, <country>Colombia</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Applied Mathematics, TU Delft</institution>, <addr-line>Delft</addr-line>, <country>Netherlands</country></aff>
<aff id="aff3"><sup>3</sup><institution>SimpleSpace</institution>, <addr-line>Medell&#x000ED;n</addr-line>, <country>Colombia</country></aff>
<aff id="aff4"><sup>4</sup><institution>Grupo de Investigaci&#x000F3;n en Biodiversidad Evoluci&#x000F3;n y Conservaci&#x000F3;n (BEC), Departamento de Ciencias Biolgicas, Universidad EAFIT</institution>, <addr-line>Medell&#x000ED;n</addr-line>, <country>Colombia</country></aff>
<aff id="aff5"><sup>5</sup><institution>TNO Department of Climate, Air and Sustainability</institution>, <addr-line>Utrecht</addr-line>, <country>Netherlands</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Antonio Linero Bas, University of Murcia, Spain</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Zheqi Shen, Hohai University, China; Jian Xu, National Space Science Center (CAS), China</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Santiago Lopez-Restrepo <email>slopezr2&#x00040;eafit.edu.co</email></corresp>
<corresp id="c002">Andres Yarce <email>ayarceb&#x00040;eafit.edu.co</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Dynamical Systems, a section of the journal Frontiers in Applied Mathematics and Statistics</p></fn></author-notes>
<pub-date pub-type="epub">
<day>09</day>
<month>03</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>8</volume>
<elocation-id>830116</elocation-id>
<history>
<date date-type="received">
<day>06</day>
<month>12</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>01</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Lopez-Restrepo, Yarce, Pinel, Quintero, Segers and Heemink.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Lopez-Restrepo, Yarce, Pinel, Quintero, Segers and Heemink</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>This work proposes a robust and non-Gaussian version of the shrinkage-based knowledge-aided EnKF implementation called Ensemble Time Local H<sub>&#x0221E;</sub> Filter Knowledge-Aided (EnTLHF-KA). The EnTLHF-KA requires a target covariance matrix to integrate previously obtained information and knowledge directly into the data assimilation (DA). The proposed method is based on the robust H<sub>&#x0221E;</sub> filter and on its ensemble time-local version the EnTLHF, using an adaptive inflation factor depending on the shrinkage covariance estimated matrix. This implies a theoretical and solid background to construct robust filters from the well-known covariance inflation technique. The proposed technique is implemented in a synthetic assimilation experiment, and in an air quality application using the LOTOS-EUROS model over the Aburr&#x000E1; Valley to evaluate its potential for non-linear and non-Gaussian large systems. In the spatial distribution of the PM<sub>2.5</sub> concentrations along the valley, the method outperforms the well-known Local Ensemble Transform Kalman Filter (LETKF), and the non-robust knowledge-aided Ensemble Kalman filter (EnKF-KA). In contrast to the other simulations, the ability to issue warnings for high concentration events is also increased. Finally, the simulation using EnTLHF-KA has lower error values than using EnKF-KA, indicating the advantages of robust approaches in high uncertainty systems.</p>
</abstract>
<kwd-group>
<kwd>data assimilation</kwd>
<kwd>air quality modeling</kwd>
<kwd>robust estimation</kwd>
<kwd>Ensemble Kalman filter</kwd>
<kwd>covariance estimation</kwd>
</kwd-group>
<counts>
<fig-count count="12"/>
<table-count count="4"/>
<equation-count count="40"/>
<ref-count count="71"/>
<page-count count="19"/>
<word-count count="11412"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>Data assimilation (DA) is a mathematical family of methods that allows the combination of observations and models. The model is used to fill observational gaps, and the observations constrain the model dynamics [<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B2">2</xref>]. In most DA methods, the aim is to minimize the estimated error variance. For instance, Kalman filter (KF) is an optimal method that minimizes the mean-squared-error in the estimation. The KF is optimal when the dynamic system is linear [<xref ref-type="bibr" rid="B3">3</xref>]. The Ensemble Kalman filter (EnKF) is a KF-based Monte Carlo approximation of the KF when the state space is large, and the model is non-linear [<xref ref-type="bibr" rid="B4">4</xref>]. The EnKF uses an ensemble of model realizations to approximate the first and second background error moments, making it efficient for large-scale models and suitable in the presence of non-linearities. However, in real DA applications, the assumptions required to obtain the optimal solution may not be accurate, degrading the filter performance [<xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B5">5</xref>]. Additionally, small ensemble sizes may produce a poor approximation of the model uncertainty, causing a reduction in the filter accuracy or even filter divergence. When the system conditions do not satisfy the requirements of the KF-based method, the robust filters are a powerful and practical alternative to solve the estimation problem. Motivated by robust control ideas that have been established over many years in the field of control engineering [<xref ref-type="bibr" rid="B6">6</xref>], the robust filters emphasize the robustness of the estimation to have better tolerances to high uncertainty sources. Since their purpose is not the optimality in the estimation, the robust estimators do not require a strictly statistical representation of the system and the observations [<xref ref-type="bibr" rid="B7">7</xref>], showing a better performance than the KF-based methods in scenarios with a poor statistical uncertainty representation [<xref ref-type="bibr" rid="B8">8</xref>, <xref ref-type="bibr" rid="B9">9</xref>]. There are several robust ensemble-based DA schemes based on different principles such as <italic>H</italic><sub>&#x0221E;</sub> formulation [<xref ref-type="bibr" rid="B8">8</xref>], replacing the traditional L<sub>2</sub> norm [<xref ref-type="bibr" rid="B10">10</xref>&#x02013;<xref ref-type="bibr" rid="B12">12</xref>], robust covariance estimation [<xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B14">14</xref>], and covariance inflation [<xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B7">7</xref>]. The approach that we propose uses a shrinkage-based covariance estimator that improves the model robustness and performance when the ensemble size is small [<xref ref-type="bibr" rid="B15">15</xref>]. Additionally, our method incorporates adaptive covariance inflation closely related to the <italic>H</italic><sub>&#x0221E;</sub> formulation.</p>
<p>The uncertainty in chemical transport models (CTM) simulations could be reduced by the improvement of the emission inventory and the upgrade of meteorological data. Alternatively one could incorporate ground data, satellite information, or vertical in the simulations using DA techniques to reduce the uncertainty [<xref ref-type="bibr" rid="B16">16</xref>&#x02013;<xref ref-type="bibr" rid="B19">19</xref>]. In Lopez et al.&#x00027;s [<xref ref-type="bibr" rid="B19">19</xref>] study, DA over the Aburr&#x000E1; Valley has been applied using the LOTOS-EUROS CTM, building on earlier applications [<xref ref-type="bibr" rid="B16">16</xref>&#x02013;<xref ref-type="bibr" rid="B18">18</xref>]. Aburr&#x000E1; Valley&#x00027;s pollution-related air quality issues have become worse over the last 10 years. Due to the Valley&#x00027;s meteorological dynamics transitioning between dry and rainy seasons, the air quality deteriorates two times a year dramatically, around the arrival of the Intertropical Convergence Zone (March-April, and with lower intensity in October-November) [<xref ref-type="bibr" rid="B20">20</xref>, <xref ref-type="bibr" rid="B21">21</xref>]. During these times, the atmospheric boundary layer remains below the canyon&#x00027;s rim throughout the day, trapping all of the pollutants from the city in the lower atmosphere. The resulting concentrations of particulate matter smaller than 10 &#x003BC;m (PM<sub>10</sub>) and 2.5 &#x003BC;m (PM<sub>2.5</sub>) remain at levels considered hazardous for the general population, leading to bi-annual periods of worsened air quality known locally as &#x0201C;environmental contingencies,&#x0201D; during which special measures are taken. In this study, the application of the LOTOS-EUROS CTM to reproduce the PM<sub>2.5</sub> over the valley integrating ground based observations is taken as a real-life study case.</p>
<p>The study is organized as follows. section 2 describes the basic concepts of DA used and introduces the derivation of the proposed method. In section 3 using numerical experiments with a low-scale model, we compare the proposed method&#x00027;s robustness and performance against its related DA algorithms. In section 4, we show the evaluation of the proposed method in a real-life and complex application and discuss the results in terms of investigating the ability to reproduce particulate matter concentrations and forecasting capability of the proposed method. Finally, section 5 offers some concluding remarks and outlines the needed future work. The CTM implementation description is presented in the <xref ref-type="app" rid="A1">Appendix</xref>.</p>
</sec>
<sec id="s2">
<title>2. Robust Ensemble-based DA Using Prior Knowledge</title>
<p>In ensemble-based DA, an ensemble of model realizations</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M1"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>is employed to estimate the first (<bold>x</bold><sup><italic>b</italic></sup>) and second moments (<bold>B</bold>) of the background error distributions, where <bold>x</bold><sup><italic>b</italic>[<italic>i</italic>]</sup> &#x02208; &#x0211D;<sup><italic>n</italic>&#x000D7;1</sup> is the <italic>i</italic>-th ensemble member, and <italic>N</italic> is the total number of ensemble members. Hence</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M2"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02248;</mml:mo><mml:msup><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfrac><mml:mo>&#x000B7;</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>e</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>and</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M3"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle><mml:mo>&#x02248;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:mfrac><mml:mo>&#x000B7;</mml:mo><mml:mstyle mathvariant="bold"><mml:mo>&#x00394;</mml:mo></mml:mstyle><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mo>&#x00394;</mml:mo></mml:mstyle><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M4"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle mathvariant="bold"><mml:mo>&#x00394;</mml:mo></mml:mstyle><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mn>1</mml:mn></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>is the anomalies matrix, <inline-formula><mml:math id="M5"><mml:msup><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> is the ensemble mean, <bold>P</bold><sup><italic>b</italic></sup> is the sample covariance matrix, and <bold>1</bold> is a vector with components all ones. Once an observation is available, the posterior state can be computed <italic>via</italic> an ensemble-based method as EnKF [<xref ref-type="bibr" rid="B4">4</xref>] or its variants, EnKS [<xref ref-type="bibr" rid="B4">4</xref>], EnHF [<xref ref-type="bibr" rid="B22">22</xref>], or 4DEnVAR [<xref ref-type="bibr" rid="B22">22</xref>] for instance. The widely-used stochastic EnKF computed the analysis state as a combination of the prior state and the differences between the observations and model outputs is the following [<xref ref-type="bibr" rid="B4">4</xref>]:</p>
<disp-formula id="E5"><label>(5)</label><mml:math id="M6"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x0002B;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>R</mml:mtext></mml:mstyle><mml:mo>&#x0002B;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>D</mml:mtext></mml:mstyle><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <bold>X</bold><sup><italic>a</italic></sup> is the analysis ensemble, <bold>H</bold> is the linear (or linearized) output operator, and the <italic>e</italic>-th column of the innovation matrix on the synthetic observations <bold>D</bold> &#x02208; &#x0211D;<sup><italic>n</italic>&#x000D7;<italic>N</italic></sup> reads <inline-formula><mml:math id="M7"><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>d</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>y</mml:mtext></mml:mstyle><mml:mo>&#x0002B;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>&#x003F5;</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:mrow><mml:mi mathvariant="-tex-caligraphic">H</mml:mi></mml:mrow><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, with <inline-formula><mml:math id="M8"><mml:msup><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>&#x003F5;</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>&#x0007E;</mml:mo><mml:mrow><mml:mi mathvariant="-tex-caligraphic">N</mml:mi></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mstyle mathvariant="bold"><mml:mn>0</mml:mn></mml:mstyle><mml:mo>,</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>R</mml:mtext></mml:mstyle></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>. The quality of analysis corrections is directly impacted by the accuracy in the estimation of <bold>B</bold> throw <bold>P</bold><sup><italic>b</italic></sup>, which is highly susceptible to the limited number of ensemble members, the state distribution, and the system uncertainty quantification.</p>
<sec>
<title>2.1. LETKF</title>
<p>One of the most commonly used implementations of the EnKF method is the local ensemble transform Kalman filter (LETKF) [<xref ref-type="bibr" rid="B23">23</xref>], where the assimilation process is performed independently for each model variable. Around each model variable (grid point), a sub-domain of radius <italic>r</italic> is constructed, and the assimilation process is carried out within the local domain. Each local analysis is mapped onto the global domain to obtain the global analysis, and the assimilation is completed. In the assimilation process, all the information found within the sub-domain (i.e., observed components and error correlations) is used. LETKF&#x00027;s local approach has made it an interesting alternative for application in large-scale systems, so we use this method as a baseline to compare our proposed algorithm. The analysis state could be obtained following the implementation by Shin et al. [<xref ref-type="bibr" rid="B24">24</xref>] :</p>
<disp-formula id="E6"><label>(6a)</label><mml:math id="M9"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle mathvariant="bold"><mml:mo>&#x00394;</mml:mo></mml:mstyle><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mn>1</mml:mn></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E7"><label>(6b)</label><mml:math id="M10"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mo>&#x00394;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>Y</mml:mtext></mml:mstyle><mml:mo>=</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle><mml:mo>&#x000B7;</mml:mo><mml:mstyle mathvariant="bold"><mml:mo>&#x00394;</mml:mo></mml:mstyle><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E8"><label>(6c)</label><mml:math id="M11"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mtext>&#x000A0;</mml:mtext><mml:mo>&#x00394;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>Y</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>R</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:mo>&#x00394;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>Y</mml:mtext></mml:mstyle><mml:mtext>&#x000A0;</mml:mtext><mml:mo>&#x0002B;</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>I</mml:mtext></mml:mstyle></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E9"><label>(6d)</label><mml:math id="M12"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle mathvariant="bold"><mml:mtext>D</mml:mtext></mml:mstyle><mml:mo>=</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>y</mml:mtext></mml:mstyle><mml:mo>-</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E10"><label>(6e)</label><mml:math id="M13"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>w</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>Y</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>R</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>D</mml:mtext></mml:mstyle><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E11"><label>(6f)</label><mml:math id="M14"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x0002B;</mml:mo><mml:mstyle mathvariant="bold"><mml:mo>&#x00394;</mml:mo></mml:mstyle><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>w</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E12"><label>(6g)</label><mml:math id="M15"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>n</italic>, <italic>m</italic>, and <italic>N</italic> are the model resolution, the number of observations, and the number of ensemble members, respectively, <bold>P</bold><sup><italic>a</italic></sup> &#x02208; &#x0211D;<sup><italic>n</italic>&#x000D7;<italic>n</italic></sup> is the analysis ensemble covariance matrix, and <bold>1</bold> is a vector of the consistent dimension whose components are all ones. In the LETKF algorithm, the above analysis is applied per grid cell. The algorithm uses the following steps:</p>
<list list-type="order">
<list-item><p>Compute in each domain simulated observations for all ensemble members.</p></list-item>
<list-item><p>Collect per domain also the observations from neighboring domains that are within <italic>r</italic> distance</p></list-item>
<list-item><p>Loop over grid cells.
<list list-type="alpha-lower">
<list-item><p>Select observations and simulations that are within range <italic>r</italic>.</p></list-item>
<list-item><p>Compute analysis weights <bold>w</bold><sup><italic>a</italic></sup>.</p></list-item>
<list-item><p>Apply the analysis with the ensemble elements for the selected grid cell.</p></list-item>
</list></p></list-item>
<list-item><p>Once all the local analyses are performed, map those to the global domain.</p></list-item>
</list>
<p>Note that the background error covariance matrix approximation in the LETKF is the sample covariance matrix (3), therefore for large radii of influence, the quality of the LETKF results could be influenced by spurious correlations.</p>
</sec>
<sec>
<title>2.2. Shrinkage-Based ENKF</title>
<p>A more robust family of covariance estimators for the case <italic>n</italic> &#x0226B; <italic>N</italic> are the shrinkage based estimators [<xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B26">26</xref>]. These kinds of estimators have the form [<xref ref-type="bibr" rid="B27">27</xref>]:</p>
<disp-formula id="E13"><label>(7)</label><mml:math id="M16"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle><mml:mo>&#x02248;</mml:mo><mml:mover accent="false"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>&#x003B1;</mml:mi><mml:mo>&#x000B7;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle><mml:mo>&#x0002B;</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x003B1; &#x02208; [0, 1], and <bold>T</bold> &#x02208; &#x0211D;<sup><italic>n</italic>&#x000D7;<italic>n</italic></sup> is a user-defined matrix. The value of &#x003B1; is chosen to minimize</p>
<disp-formula id="E14"><label>(8)</label><mml:math id="M17"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mo class="qopname">arg</mml:mo><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mo class="qopname">min</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mi>&#x1D53C;</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle><mml:mo>-</mml:mo><mml:mover accent="false"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo class="qopname">^</mml:mo></mml:mover><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msubsup><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where ||&#x02022;||<sub><italic>F</italic></sub> represents the Frobenius norm. A close formulation to calculate the weight value &#x003B1; using a general target matrix <bold>T</bold><sub><italic>KA</italic></sub> is proposed in [<xref ref-type="bibr" rid="B28">28</xref>, <xref ref-type="bibr" rid="B29">29</xref>] (hereafter KA estimator),</p>
<disp-formula id="E15"><label>(9a)</label><mml:math id="M18"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>with</p>
<disp-formula id="E16"><label>(9b)</label><mml:math id="M19"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo class="qopname">min</mml:mo><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac><mml:mo>&#x000B7;</mml:mo><mml:mstyle displaystyle="true"><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup></mml:mstyle><mml:mo stretchy="true">&#x02016;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mo>&#x00394;</mml:mo></mml:mstyle><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:mfrac><mml:mo>&#x000B7;</mml:mo><mml:mo stretchy="true">&#x02016;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>b</mml:mtext></mml:mstyle></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>This general target matrix enables the incorporation of <italic>prior</italic> information about the system into the error covariance matrix. Although <bold>T</bold><sub><italic>KA</italic></sub> must meet all requirements of a covariance matrix, <bold>T</bold><sub><italic>KA</italic></sub> must not fulfill any requirement about its structure and also can change dynamically, allowing a complete degree of freedom in the matrix computation. Sections 3, 4, and Lopez-Restrepo et al. [<xref ref-type="bibr" rid="B15">15</xref>] show some examples of how to compute <bold>T</bold><sub><italic>KA</italic></sub>. Additionally, the KA estimator does not make any distributional assumptions, thus can also be used for non-Gaussian covariance matrix estimation [<xref ref-type="bibr" rid="B29">29</xref>]. An implementation of the EnKF can be obtained using the KA estimator, known as EnKF-KA [<xref ref-type="bibr" rid="B15">15</xref>]:</p>
<disp-formula id="E17"><mml:math id="M20"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>R</mml:mtext></mml:mstyle><mml:mo>&#x0002B;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>D</mml:mtext></mml:mstyle><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>In Lopez-Restrepo et al. [<xref ref-type="bibr" rid="B19">19</xref>], it is shown that incorporating <italic>prior</italic> information of the system in the data assimilation process can outperform the EnKF when <italic>n</italic> &#x0226B; <italic>N</italic>, and when, there are errors in the model specifications.</p>
</sec>
<sec>
<title>2.3. Ensemble Time-Local H<sub>&#x0221E;</sub> Filter</title>
<p>One of the most widely used robust filter is the <italic>H</italic><sub>&#x0221E;</sub> Filter (HF) [<xref ref-type="bibr" rid="B30">30</xref>]. The HF is based on the criterion of minimizing the supremum of the <italic>L</italic><sub>2</sub> norm of the uncertainty sources [<xref ref-type="bibr" rid="B8">8</xref>]. The ideas beyond the HF filters come from the robust control theory and applications in linear and low-scale systems [<xref ref-type="bibr" rid="B31">31</xref>]. In recent years, several works have been started to develop implementations of the HF in DA due to its potential to solve some limitations of the EnKF [<xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B31">31</xref>]. The HF ensures that the total energy of the estimation errors, is not larger than the uncertainty energy times a factor 1/&#x003B3;:</p>
<disp-formula id="E19"><label>(10)</label><mml:math id="M22"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mo stretchy="false">&#x02016;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>S</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x02264;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow></mml:mfrac><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mo>&#x00394;</mml:mo></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mo stretchy="false">&#x02016;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>u</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>Q</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mrow><mml:mtext>&#x000A0;</mml:mtext><mml:mo>&#x0002B;</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mo stretchy="false">&#x02016;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>v</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>R</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <bold>x</bold><sup><italic>t</italic></sup> is the true state, <bold>x</bold><sup><italic>a</italic></sup> is the analysis state, <bold>S</bold> is a user-chosen matrix of weights, <bold>u</bold> and <bold>v</bold> are the model and observation uncertainty, respectively, &#x00394;<sub>0</sub>, <bold>Q</bold>, and <bold>R</bold> are the uncertainty weighting matrices with respect to the initial conditions, model error, and observations error, and <italic>M</italic> is the DA windows length [<xref ref-type="bibr" rid="B7">7</xref>]. To solve (10), the cost function <inline-formula><mml:math id="M23"><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">J</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>H</mml:mi><mml:mi>F</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> is defined as follows:</p>
<disp-formula id="E20"><label>(11)</label><mml:math id="M24"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">J</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>H</mml:mi><mml:mi>F</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle displaystyle="true"><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:msubsup></mml:mstyle><mml:mo stretchy="false">&#x02016;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>S</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mo>&#x00394;</mml:mo></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:mstyle displaystyle="true"><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:msubsup></mml:mstyle><mml:mo stretchy="false">&#x02016;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>u</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>Q</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:mstyle displaystyle="true"><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:msubsup></mml:mstyle><mml:mo stretchy="false">&#x02016;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>v</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo stretchy="false">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>R</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Then inequality (10) is equivalent to <inline-formula><mml:math id="M25"><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">J</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>H</mml:mi><mml:mi>F</mml:mi></mml:mrow></mml:msup><mml:mo>&#x02264;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow></mml:mfrac></mml:math></inline-formula>. Let &#x003B3;<sup>&#x0002A;</sup> be the value such that</p>
<disp-formula id="E21"><label>(12)</label><mml:math id="M26"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>f</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">}</mml:mo></mml:mrow></mml:mrow></mml:munder></mml:mstyle><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mi>s</mml:mi><mml:mi>u</mml:mi><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mo stretchy="false">{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>u</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mo stretchy="false">{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>v</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">}</mml:mo></mml:mrow></mml:mrow></mml:munder></mml:mstyle><mml:mtext>&#x000A0;</mml:mtext><mml:msup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">J</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>H</mml:mi><mml:mi>F</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>M</mml:mi><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>the optimal HF is then achieved when &#x003B3; &#x0003D; &#x003B3;<sup>&#x0002A;</sup>. In this formulation, the evaluation of &#x003B3;<sup>&#x0002A;</sup> is an application of the minimax rule [<xref ref-type="bibr" rid="B32">32</xref>], a strategy that aims to provide robust estimates and is different from its Bayesian counterpart [<xref ref-type="bibr" rid="B7">7</xref>]. An Ensemble-based HF implementation for a nonlinear DA problem is the Ensemble time-local H<sub>&#x0221E;</sub> filter (EnLTHF) proposed by Luo et al. [<xref ref-type="bibr" rid="B7">7</xref>]. In the EnLTHF, a local cost function is proposed:</p>
<disp-formula id="E22"><label>(13)</label><mml:math id="M27"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">J</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>H</mml:mi><mml:mi>F</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>S</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mo>&#x00394;</mml:mo></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:mo stretchy="true">&#x02016;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>u</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>Q</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:mo stretchy="true">&#x02016;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>v</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mo stretchy="true">&#x02016;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>R</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>The local performance level &#x003B3;<sub><italic>k</italic></sub> satisfies:</p>
<disp-formula id="E23"><label>(14)</label><mml:math id="M28"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>&#x02265;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>f</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">}</mml:mo></mml:mrow></mml:mrow></mml:munder></mml:mstyle><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle="true"><mml:munder><mml:mrow><mml:mi>s</mml:mi><mml:mi>u</mml:mi><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mo stretchy="false">{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>u</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mo stretchy="false">{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>v</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">}</mml:mo></mml:mrow></mml:mrow></mml:munder></mml:mstyle><mml:mtext>&#x000A0;</mml:mtext><mml:msubsup><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">J</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>H</mml:mi><mml:mi>F</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>The EnLTHF can be expressed in terms of the EnKF algorithm using the notation of Luo et al. [<xref ref-type="bibr" rid="B7">7</xref>]:</p>
<disp-formula id="E24"><label>(15a)</label><mml:math id="M29"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>K</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mi>n</mml:mi><mml:mi>K</mml:mi><mml:mi>F</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>Q</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E25"><label>(15b)</label><mml:math id="M30"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>G</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>I</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>S</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>K</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E26"><label>(15c)</label><mml:math id="M31"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mtext>x</mml:mtext></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mtext>x</mml:mtext></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>G</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mstyle mathvariant="bold"><mml:mtext>y</mml:mtext></mml:mstyle></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msubsup><mml:mrow><mml:mtext>x</mml:mtext></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>b</mml:mtext></mml:mstyle><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>i</mml:mtext></mml:mstyle></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>v</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>i</mml:mtext></mml:mstyle></mml:mrow></mml:msubsup></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E27"><label>(15d)</label><mml:math id="M32"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msubsup><mml:mrow><mml:mtext>x</mml:mtext></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow><mml:mo>/</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E28"><label>(15e)</label><mml:math id="M33"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mo>&#x00394;</mml:mo></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>a</mml:mtext></mml:mstyle></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>S</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>subject to the constraint</p>
<disp-formula id="E29"><label>(15f)</label><mml:math id="M34"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mo>&#x00394;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>S</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02265;</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where the operator <italic>EnKF</italic>(&#x000B7;, &#x000B7;, &#x000B7;) means that <inline-formula><mml:math id="M35"><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> and <bold>K</bold><sub><italic>k</italic></sub> are obtained through the EnKF.</p>
</sec>
<sec>
<title>2.4. Adaptive Inflation</title>
<p>A particular issue with ensemble-based DA algorithms is the covariance undersampling. Undersampling leads to further problems such as the ensemble collapse to an overconfident, but incorrect state, or even filter divergence [<xref ref-type="bibr" rid="B33">33</xref>]. The covariance inflation artificially increases uncertainties in the background covariance avoiding the underestimation of uncertainties and undersampling [<xref ref-type="bibr" rid="B34">34</xref>]. The magnitude of the inflation depends to a large degree on each system and application [<xref ref-type="bibr" rid="B35">35</xref>].</p>
<p>In (15e), the presence of the extra term &#x02212;&#x003B3;<sub><italic>k</italic></sub> &#x000B7; <bold>S</bold><sub><italic>k</italic></sub> inflates the EnKF covariance matrix. In this way, it is possible to interpret the EnTLHF as an EnKF formulation with a specific value of inflation. This implies a theoretical and solid background to construct robust filters. Consider the case where <bold>S</bold> &#x0003D; <bold>I</bold><sub><italic>n</italic></sub>, which corresponds with an inflation of the analysis covariance matrix eigenvalues. To satisfy the constraint (15f), or what is equivalent, to make <inline-formula><mml:math id="M36"><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mo>&#x00394;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> semi-definite positive, consider the SVD decomposition of <inline-formula><mml:math id="M37"><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula></p>
<disp-formula id="E30"><label>(16)</label><mml:math id="M38"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>V</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mo>&#x003A3;</mml:mo></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>U</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x003A3;<sub><bold>k</bold></sub> &#x0003D; <bold>diag</bold>(&#x003C3;<sub><bold>t</bold>,1</sub>, ..., &#x003C3;<sub><bold>t,n</bold></sub>) is a diagonal matrix with all the eigenvalues of <inline-formula><mml:math id="M39"><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> in descending order, that is, &#x003C3;<sub><italic>t</italic>,1</sub> &#x02265; &#x003C3;<sub><italic>t</italic>,2</sub> &#x02265; .... &#x02265; &#x003C3;<sub><italic>t,n</italic></sub> and &#x003B3;<sub><italic>k</italic></sub> is a variable that satisfies</p>
<disp-formula id="E31"><mml:math id="M40"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msup><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003C3;</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02265;</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>that corresponds with</p>
<disp-formula id="E32"><mml:math id="M41"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02264;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003C3;</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>guaranteeing that <inline-formula><mml:math id="M42"><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mo>&#x00394;</mml:mo></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>a</mml:mtext></mml:mstyle></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mstyle mathvariant="bold"><mml:mn>1</mml:mn></mml:mstyle></mml:mrow></mml:msup></mml:math></inline-formula> is semi-definite positive. It is convenient to introduce a performance level coefficient (PLC) <italic>c</italic> by defining</p>
<disp-formula id="E33"><label>(17)</label><mml:math id="M43"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02264;</mml:mo><mml:mfrac><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003C3;</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>In contrast to conventional inflation schemes, &#x003B3;<sub><italic>k</italic></sub> is adaptive in time even for a fixed <italic>c</italic> value, and it is directly related with the analysis covariance matrix.</p>
</sec>
<sec>
<title>2.5. Ensemble Time Local H<sub>&#x0221E;</sub> Filter Knowledge Aided (EnTLHF-KA)</title>
<p>According to sections 2.3 and 2.4, with a specific structure and inflation value, it is possible to obtain a robust version of the EnKF. Although the EnTLHF has shown to have a better performance than the EnKF in scenarios with high uncertainty [<xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B36">36</xref>, <xref ref-type="bibr" rid="B37">37</xref>], the limitations of the EnKF with respect to the ensemble size and the ensemble normality distribution are inherited in its robust version. When the ensemble size is small <italic>N</italic> &#x0003C;&#x0003C; <italic>n</italic>, sampling errors can have an impact on the quality of covariances matrix estimation, causing problems such as filter divergence and spurious correlations [<xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B35">35</xref>]. Even though many localization techniques have been developed to mitigate those problems, it usually prohibits its implementation in high dimensional applications [<xref ref-type="bibr" rid="B38">38</xref>]. The shrinkage-covariance estimator methods have shown a better performance than the classical sampling covariance matrix in scenarios with small ensemble sizes and non-Gaussianities [<xref ref-type="bibr" rid="B27">27</xref>, <xref ref-type="bibr" rid="B39">39</xref>&#x02013;<xref ref-type="bibr" rid="B41">41</xref>].</p>
<p>We propose a robust implementation of the EnKF-KA shrinkage-based method following the principles of the EnTLHF and the adaptive inflation denoted EnTLHF-KA. The EnTLHF-KA can be obtained similarly to the EnLTHF by taking as base the EnKF-KA:</p>
<disp-formula id="E34"><label>(18a)</label><mml:math id="M44"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>K</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mtext class="textrm" mathvariant="normal">EnKF-KA</mml:mtext><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E35"><label>(18b)</label><mml:math id="M45"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>G</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>I</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B3;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msubsup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>S</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>K</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E36"><label>(18c)</label><mml:math id="M46"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mtext>x</mml:mtext></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mtext>x</mml:mtext></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>G</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:mrow><mml:mo stretchy="false">[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mstyle mathvariant="bold"><mml:mtext>y</mml:mtext></mml:mstyle></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:msubsup><mml:mrow><mml:mtext>x</mml:mtext></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>b</mml:mtext></mml:mstyle><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>i</mml:mtext></mml:mstyle></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>&#x0002B;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>v</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>k</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>i</mml:mtext></mml:mstyle></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E37"><label>(18d)</label><mml:math id="M47"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msubsup><mml:mrow><mml:mtext>x</mml:mtext></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow><mml:mo>/</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where the operator EnKF-KA(&#x000B7;, &#x000B7;, &#x000B7;) represents the EnKF-KA shrinkage-based method (see section 2.2). For a specific PLC, the inflation value is obtained using (17).</p>
</sec>
</sec>
<sec id="s3">
<title>3. Results in Low-Scale System</title>
<p>A series of synthetic DA experiments allow us to expose the robust filter benefits over the former methods and evaluate the robustness with controlled scenarios. The Lorenz-96 is one of the most used benchmarks for testing DA algorithms. The model is highly non-linear and with a strong relationship between the states. The Lorenz-96 dynamics are described by [<xref ref-type="bibr" rid="B42">42</xref>, <xref ref-type="bibr" rid="B43">43</xref>]:</p>
<disp-formula id="E38"><label>(19)</label><mml:math id="M48"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mi>d</mml:mi><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:mi>F</mml:mi></mml:mtd><mml:mtd><mml:mtext class="textrm" mathvariant="normal">for&#x000A0;</mml:mtext><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:mi>F</mml:mi></mml:mtd><mml:mtd><mml:mtext class="textrm" mathvariant="normal">for&#x000A0;</mml:mtext><mml:mn>2</mml:mn><mml:mo>&#x02264;</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>&#x000A0;</mml:mtext><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x000B7;</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:mi>F</mml:mi></mml:mtd><mml:mtd><mml:mtext class="textrm" mathvariant="normal">for&#x000A0;</mml:mtext><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>n</italic> is the state number chosen as 40 and <italic>F</italic> is the external force. For consistency, periodic boundary conditions are assumed. We take the next considerations for the numerical experiments:</p>
<list list-type="bullet">
<list-item><p>The assimilation window consists of <italic>M</italic> &#x0003D; 500 observations.</p></list-item>
<list-item><p>The number of observed components is <italic>m</italic> &#x0003D; 20, representing 50% of the model components.</p></list-item>
<list-item><p>The observation statistics are associated with the Gaussian distribution,
<disp-formula id="E39"><label>(20)</label><mml:math id="M49"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>y</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0007E;</mml:mo><mml:mrow><mml:mi mathvariant="-tex-caligraphic">N</mml:mi></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>H</mml:mtext></mml:mstyle><mml:mo>&#x000B7;</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003C1;</mml:mi></mml:mrow><mml:mrow><mml:mi>o</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x000B7;</mml:mo><mml:mstyle mathvariant="bold"><mml:mtext>I</mml:mtext></mml:mstyle></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mtext class="textrm" mathvariant="normal">for&#x000A0;</mml:mtext><mml:mn>1</mml:mn><mml:mo>&#x02264;</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>M</mml:mi><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
where &#x003C1;<sub><italic>o</italic></sub> &#x0003D; 0.001, and <bold>H</bold> is a linear operator that randomly chooses the <italic>m</italic> observed components.</p></list-item>
<list-item><p>To avoid random fluctuations, each experiment is repeated 20 times (<italic>L</italic> &#x0003D; 20).</p></list-item>
<list-item><p>We compare the performance and robustness of the EnTLHF-KA against the non-robust methods EnKF and EnKF-KA, and the robust method EnTLHF.</p></list-item>
<list-item><p>We use a Gaspari-Cohn [<xref ref-type="bibr" rid="B44">44</xref>] matrix with an influence radius of 2 as target matrix <bold>T</bold><sub><italic>KA</italic></sub> for the EnKF-KA and the EnTLHF-KA. Following [<xref ref-type="bibr" rid="B7">7</xref>], we do not use covariance localization to avoid complicating the analysis of our experiment results.</p></list-item>
<list-item><p>We take the Root-Mean-Square-Error (RMSE) of <italic>L</italic> experiments as a measure of performance,
<disp-formula id="E40"><label>(21)</label><mml:math id="M50"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mtext class="textrm" mathvariant="normal">RMSE</mml:mtext><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>L</mml:mi></mml:mrow></mml:mfrac><mml:mo>&#x000B7;</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>L</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:msqrt><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:mfrac><mml:mo>&#x000B7;</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="true">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000B7;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:msqrt></mml:mrow><mml:mo stretchy="true">)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p></list-item>
<list-item><p>We chose a PLC value <italic>c</italic> &#x0003D; 0.5 for all the experiments, following Luo and Hoteit [<xref ref-type="bibr" rid="B7">7</xref>]. Other <italic>c</italic> values have been tested (not reported here), but no performance improvements were obtained.</p></list-item>
</list>
<sec>
<title>3.1. Robustness Against Ensemble Members</title>
<p>When the state dimension is large, it is important to test the performance with relative small ensemble sizes. We evaluate both the accuracy and the robustness of the EnTLHF-KA with respect to the ensemble size. For this case, we set the observation error &#x003B4;= 1 &#x000D7; 10<sup>&#x02212;3</sup>, the observation frequency <italic>f</italic> &#x0003D; 1, and the external force <italic>F</italic> &#x0003D; 8. The ensemble size <italic>N</italic> &#x02208; [10, 20, 50, 100, 1, 000]. <xref ref-type="fig" rid="F1">Figure 1</xref> presents the RMSE value for those values of <italic>N</italic>.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Error evaluation of the robust and non-robust methods with respect to the ensemble member number.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0001.tif"/>
</fig>
<p>The EnTLHF-KA has more constant RMSE values for different <italic>N</italic>. The other methods present variation in its performance when the ensemble size changes. In general, the RMSE values decrease for larger <italic>N</italic> values for all the methods. For <italic>N</italic> &#x0003D; 10, the EnTLHF-KA presents a superior performance compared to the others, followed by the EnKF-KA. This behavior is attributed to the shrinkage-based estimator used in both methods, that have shown a better covariance estimation when <italic>N</italic> &#x0003C;&#x0003C; <italic>n</italic> [<xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B41">41</xref>]. However, the adaptive inflation factor of the EnTLHF and the ENTLHF-KA improves these methods&#x00027; performance against their non-robust counterpart. For larger ensemble size, both EnTLHF-KA and EnKF-KA tend to converge to the EnTLHF and EnKF, respectively, since the sampling ensemble matrix represents a good estimator for the covariance matrix and <inline-formula><mml:math id="M51"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> converge to <bold>P</bold><sup><italic>b</italic></sup>. Due to the good estimation of <bold>B</bold> by <bold>P</bold><sup><italic>b</italic></sup>, and all the EnKF assumptions are satisfied, the non-robust methods present lower RMSE value for large ensemble size. This example clarifies the different advantages and disadvantages of the robust approach compared to the optimal approach. Although the EnTLHF-KA performance is not the best in all the scenarios, its robustness allows it to have low RMSE values in all the scenarios.</p>
</sec>
<sec>
<title>3.2. Robustness Against Observation Error</title>
<p><xref ref-type="fig" rid="F2">Figure 2</xref> shows the RMSE value when &#x003B4; &#x02208; [1 &#x000D7; 10<sup>&#x02212;4</sup>, 1 &#x000D7; 10<sup>&#x02212;3</sup>, 1 &#x000D7; 10<sup>&#x02212;2</sup>, 1 &#x000D7; 10<sup>&#x02212;1</sup>]. The other model parameters are <italic>N</italic> &#x0003D; 20, <italic>f</italic> &#x0003D; 1, and <italic>F</italic> &#x0003D; 8. The idea now is to evaluate the impact of the observation error in the new robust EnTLHF-KA. It can be seen that the performance of the non-robust methods is affected by the increase of the observation error, causing divergence of the EnKF-KA. This kind of behavior is one of the main reasons for the development of new robust techniques [<xref ref-type="bibr" rid="B12">12</xref>]. The observation error&#x00027;s impact is much lower in the robust methods, and the performance is almost constant, especially in the EnTLHF-KA. When &#x003B4; &#x0003D; 1 &#x000D7; 10<sup>&#x02212;4</sup>, the EnKF and the EnKF-KA perform better than their robust counterpart, but the robust filters hold a good performance even for large observation errors.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Error evaluation of the robust and non-robust methods with respect to the observation error.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0002.tif"/>
</fig>
</sec>
<sec>
<title>3.3. Robustness Against Model Errors</title>
<p>To evaluate the EnTLHF-KA robustness with respect to model errors, we compare the method&#x00027;s performance when <italic>F</italic> &#x02208; [6, 7, 8, 9, 10]. <italic>F</italic> &#x0003D; 8 corresponds with the assumption of a perfect model. <xref ref-type="fig" rid="F3">Figure 3</xref> presents the RMSE value for each <italic>F</italic> value and the comparison among the four filters. The RMSE values remain almost constant for both robust filters, with smaller values for the EnTLHF-KA. The adaptive inflation makes the analysis covariance matrix larger in the robust filters than in its non-robust counterpart, given the same background covariance. Consequently, the EnTLHF and the EnTLHF-KA put more weight in the observations, convenient when there are larger model errors.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Error evaluation of the robust and non-robust methods with respect to errors in the model.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0003.tif"/>
</fig>
</sec>
<sec>
<title>3.4. Robustness Against Ensemble Distribution</title>
<p>The standard EnKF assumes that the ensemble state has a Gaussian distribution. This assumption is especially essential because the state covariance <bold>B</bold> is approximated by the ensemble sample covariance <bold>P</bold><sup><italic>b</italic></sup>. Although the ensemble at <italic>t</italic><sub>0</sub> is Gaussian, non-linearities in the model dynamics can modify the ensemble distribution, causing the approximation of <bold>B</bold> by <bold>P</bold><sup><italic>b</italic></sup> to lose accuracy. <xref ref-type="fig" rid="F4">Figure 4</xref> presents an evaluation of the ensemble distribution for different times steps using the Lorenz-96 model. We use the Shapiro-Wilk to evaluate the Gaussianity of each state variable [<xref ref-type="bibr" rid="B45">45</xref>]. We take an initial Gaussian ensemble of 100 members as a reference. After 15-time steps, some variables begin to change their initial distribution, and after 30-time steps, the Gaussian assumption is not valid anymore for the ensemble.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Shapiro-Wilk test for each Lorenz component at a different time step. The ensemble size is 100. The white color represents that the null-hypothesis is not rejected (the ensemble for that specific variable is Gaussian). The gray color represents that the null-hypothesis is rejected (the ensemble for that specific variable is non-Gaussian).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0004.tif"/>
</fig>
<p>We perform different experiments varying the observation frequency or the number of time steps between two available observations. <xref ref-type="fig" rid="F5">Figure 5</xref> shows the time averaged RMSE for the EnKF, EnKF-KA, EnTLHF, and the EnTLHF-KA using an observation frequency <italic>f</italic> &#x02208; [1, 5, 10, 20, 30, 50] times steps. We set an ensemble size of <italic>N</italic> &#x0003D; 20, an observation error of &#x003B4; &#x0003D; 1 &#x000D7; 10<sup>&#x02212;3</sup>, and the external force <italic>F</italic> &#x0003D; 8. The EnKF performance decreases considerably when <italic>f</italic> increases, and after the value of <italic>f</italic> &#x0003D; 30 the method diverges. This result illustrates the importance of the Gaussian distribution for obtaining a good representation of <bold>B</bold> throw <bold>P</bold><sup><italic>b</italic></sup>. The adaptive inflation increases EnTLHF robustness and performance, even when both EnKF and EnTLHF are using the same approximation of <bold>B</bold>. Nevertheless, the EnTLHF performance decreases considerably when <italic>f</italic> &#x0003D; 50. In contrast, EnKF-KA and EnTLHF-KA use a shrinkage-based estimator for <bold>B</bold>. The KA estimator does not assume a Gaussian distribution, as other shrinkage-based estimators do [<xref ref-type="bibr" rid="B27">27</xref>, <xref ref-type="bibr" rid="B46">46</xref>]. Thus, the EnKF-KA presents better performance than EnKF for large <italic>f</italic> values and similar error levels than EnTLH without incorporating adaptive inflation. In the case of the EnTLHF-KA, the combination of both the shrinkage-based estimator and the adaptive inflation produces high robustness and performance even when the ensemble distribution is non-Gaussian.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Error evaluation of the robust and non-robust methods with respect to the observation frequency.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0005.tif"/>
</fig>
</sec>
</sec>
<sec id="s4">
<title>4. Application to a Non-linear Non-Gaussian Large Scale System</title>
<p>The implementation of the LOTOS-EUROS CTM over the Aburr&#x000E1; Valley is used as a real study case. This application consists of a non-linear and non-Gaussian large system, so it is a good opportunity to test the proposed method potential. The complete implementation and observations description is presented in the <xref ref-type="app" rid="A1">Appendix</xref>. The period of interest for all data evaluations, simulations, and DA experiments spans from February 25 to March 15, 2019. During these days, the PM concentrations are higher due to the Northbound transit of the Inter-Tropical Convergence Zone over the study domain. The data to be assimilated is located at the surface but the proposed method also applies to satellite data at different scales and resolutions.</p>
<p>In order to test the proposed method, we performed a total of four different LOTOS-EUROS simulations:</p>
<list list-type="order">
<list-item><p>a LOTOS-EUROS model simulation without DA (henceforth <italic>LE</italic>) for having a free run model under regular initial and boundary conditions looking for further comparison;</p></list-item>
<list-item><p>a DA simulation using the LETKF introduced in section 2.1 (henceforth <italic>LE-LETKF</italic>);</p></list-item>
<list-item><p>a DA simulation using the shrinkage-based EnKF-KA developed in Lopez-Restrepo et al. [<xref ref-type="bibr" rid="B15">15</xref>] (henceforth <italic>LE-KA</italic>);</p></list-item>
<list-item><p>a DA simulation using the robust and shrinkage-based EnTLHF-KA proposed in 2.5 (henceforth <italic>LE-Robust</italic>).</p></list-item>
</list>
<p>The set of validation sites is split into two groups: the stations located in the bottom part of the valley (BS, represented by circles in <xref ref-type="fig" rid="F12">Figure 12</xref>), and the stations located in the city&#x00027;s outskirts or hills (OS, represented by stars in <xref ref-type="fig" rid="F12">Figure 12</xref>). The objective of this division is to evaluate the simulation performance in regions where the PM<sub>2.5</sub> concentration regimes are different. All the simulations were evaluated using both validation station&#x00027;s sets, and the performance metrics Mean Fractional Bias (MFB) [<xref ref-type="bibr" rid="B47">47</xref>], Root Mean Square Error (RMSE) [<xref ref-type="bibr" rid="B48">48</xref>], and Pearson Correlation Factor [<xref ref-type="bibr" rid="B49">49</xref>]. The three ensemble-based algorithms estimate both concentrations and emissions, following the stochastic representation presented in Lopez-Restrepo et al. [<xref ref-type="bibr" rid="B19">19</xref>]. For all the methods, an ensemble size <italic>N</italic> of 25 members and a localization radius <italic>r</italic> of 5 km were used.</p>
<p>The DA methods are evaluated with forecast experiments, in which a model simulation over a limited number of days is performed using information from the assimilation. Forecasting experiments were performed to test the model&#x00027;s capability to predict the PM concentrations in the valley up to three days ahead. We applied the methodology proposed by Lopez-restrepo et al. [<xref ref-type="bibr" rid="B50">50</xref>], with all days from March 9 to 13 having predictions as the first, second, and third day of a forecast. We are especially interested in evaluating the ability of the model to predict warning-triggering episodes (AQI in orange, red, or purple levels, as shown in <xref ref-type="table" rid="T1">Table 1</xref>). All forecast simulations used the estimated emission correction factors from the last assimilation day, in each of the three forecast days. This inheritance scheme has shown the best option for the LE implementation over the Aburr&#x000E1; Valley [<xref ref-type="bibr" rid="B19">19</xref>].</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Air Quality Index (AQI) as defined for the Aburr&#x000E1; Valley with respect to PM<sub>2.5</sub> concentrations according to the ranges established by the Metropolitan Area.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th/>
<th valign="top" align="center" colspan="5" style="border-bottom: thin solid #000000;"><bold>Average concentration (&#x003BC;g/<italic>m</italic><sup>3</sup>)</bold></th>
</tr>
<tr>
<th/>
<th/>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>No warning</bold></th>
<th valign="top" align="center" colspan="3" style="border-bottom: thin solid #000000;"><bold>Warning</bold></th>
</tr>
<tr>
<th valign="top" align="left"><bold>Pollutant</bold></th>
<th valign="top" align="center"><bold>Average time</bold></th>
<th valign="top" align="center" style="background-color:#1bb753"><bold>Green</bold></th>
<th valign="top" align="center" style="background-color:#f1e744"><bold>Yellow</bold></th>
<th valign="top" align="center" style="background-color:#fc9d39"><bold>Orange</bold></th>
<th valign="top" align="center" style="background-color:#ff6267"><bold>Red</bold></th>
<th valign="top" align="center" style="background-color:#5f67aa"><bold>Purple</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">PM<sub>2.5</sub></td>
<td valign="top" align="center">24 h</td>
<td valign="top" align="center">0&#x02013;12</td>
<td valign="top" align="center">13&#x02013;37</td>
<td valign="top" align="center">38&#x02013;55</td>
<td valign="top" align="center">56&#x02013;150</td>
<td valign="top" align="center">&#x02265; 151</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The colors correspond with the waning level</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>This is specially relevant in the sense that the robust method is evaluated in the forecast, enhancing the capability of reducing uncertainty in an operational fashion and direct implementation for decision making within our applied research programs in air pollution.</p>
<sec>
<title>4.1. Target Matrix</title>
<p>The shrinkage-based algorithm EnKF-KA and the robust EnTLHF-KA were implemented to be used with the LOTOS-EUROS model. This was mainly aimed by the fact that there are great opportunities for DA applied to CTM models and air pollution scenarios for decision making. The challenging the problem, the creative solutions arise. The aim of EnKF-KA and the robust EnTLHF-KA algorithms is to improve the model representation in the complex orography conditions of the Aburr&#x000E1; Valley. Both shrinkage-based algorithms required a target matrix <bold>T</bold><sub><italic>KA</italic></sub> to compute the covariance matrix <bold>B</bold> according to Equation (10). The matrix <bold>T</bold><sub><italic>KA</italic></sub> should guide the covariance structure in <bold>B</bold> by limiting the spurious correlations between elements at a large distance [<xref ref-type="bibr" rid="B40">40</xref>], or in the case of the EnKF-KA and the EnTLHF-KA, to incorporate previously obtained knowledge directly in the DA process [<xref ref-type="bibr" rid="B15">15</xref>]. For this application, we are interested in using the target matrix to represent the valley&#x00027;s complex orography in the covariance estimation. Previous works have shown issues reproducing the pollutant dynamics into the Aburr&#x000E1; valley due to the limited representation of the valley in the simulation model [<xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B21">21</xref>]. Even with high-resolution meteorological simulations, it is still challenging to capture the transport of pollutants in the narrow valleys [<xref ref-type="bibr" rid="B51">51</xref>].</p>
<p>The main purpose of the <bold>T</bold><sub><italic>KA</italic></sub> matrix is to reduce the covariance between elements in the state that are distant in the vertical direction but close in the horizontal direction. Thus, observations located in the bottom part of the valley (where the pollutant concentration are higher) should not have a high impact in the city&#x00027;s outskirts (where the concentrations are lower) and vice versa. A first version of the target matrix <inline-formula><mml:math id="M53"><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula> was built using a fourth-order-polynomial covariance function as described in Gaspari and Cohn [<xref ref-type="bibr" rid="B44">44</xref>]. To incorporate the previous knowledge and improve the valley representation into the model, we reduced the correlation as a function of vertical distance, with zero correlation for vertical distances exceeding 600 m. Other distances were tested too, without significant changes in the result. The chosen formulation preserves the dependency on the horizontal distance that is necessary to remove the spurious correlations and incorporates the physical restriction of the valley. To ensure that <bold>T</bold><sub><italic>KA</italic></sub> is positive semidefinite, we applied the method presented in Higham [<xref ref-type="bibr" rid="B52">52</xref>] to obtain the positive semidefinite matrix that is closest to <inline-formula><mml:math id="M54"><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula> in the Frobenius norm. <xref ref-type="fig" rid="F6">Figure 6</xref> illustrates the influence area of the Gaspari-Cohn based covariance matrix, the <inline-formula><mml:math id="M55"><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula> covariance matrix, and the <bold>T</bold><sub><italic>KA</italic></sub> covariance matrix for two locations. The influence area corresponds with a row (or column) of the covariance matrix. It is possible to see how the proposed <inline-formula><mml:math id="M56"><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula> matrix (<xref ref-type="fig" rid="F6">Figure 6C</xref>) follows the valley shape according to the orography shown in <xref ref-type="fig" rid="F6">Figure 6B</xref> unlike the Gaspari-Cohn covariance matrix (<xref ref-type="fig" rid="F6">Figure 6A</xref>). The generalization applies to very complex boundary conditions in large scale systems not only for the solution of the differential equations but also for the estimation tasks of the robust filters. Additionally, there are no significant modifications between the <bold>T</bold><sub><italic>KA</italic></sub> (<xref ref-type="fig" rid="F6">Figure 6D</xref>) and the <inline-formula><mml:math id="M57"><mml:msubsup><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula> matrix. Finally, the <bold>T</bold><sub><italic>KA</italic></sub> matrix is used as the target matrix for both EnKF-KA and EnTLHF-KA methods. Note that the final covariance between the state inside and outside the valley will not be necessary zero because the final covariance matrix <bold>B</bold><sub><italic>KA</italic></sub> is a convex combination of <bold>T</bold><sub><italic>KA</italic></sub> and <bold>P</bold><sup><italic>b</italic></sup>.</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>Comparison of the influence area of two selected states (blue dots) between a distance depending localization, and the target covariance matrix based on the distance and the orography. <bold>(A)</bold> Influence area Gaspari-Cohn matrix. <bold>(B)</bold> Aburra Valley orography. <bold>(C)</bold> Influence area <inline-formula><mml:math id="M52"><mml:msubsup><mml:mrow><mml:mtext>T</mml:mtext></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mo>*</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula>. <bold>(D)</bold> Influence area T<sub><italic>KA</italic></sub>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0006.tif"/>
</fig>
</sec>
<sec>
<title>4.2. Evaluation of LE simulations</title>
<p>The concentration fields produced by model simulations with or without DA were compared with the observations from official monitoring stations (<xref ref-type="fig" rid="F12">Figure 12</xref>), dividing the study into stations at the bottom of the valley (BS stations) and stations at the outskirts of the city (OS stations). The averaged assessment statistics over the validation station are shown in <xref ref-type="table" rid="T2">Table 2</xref>. In all validation stations, the simulation results without DA (LE) underestimated the observed concentrations. This is for example reflected in a high RMSE value. The correlation coefficient was low, which means that the model could not fully capture the temporal variations at hourly and daily scales. The three simulations using DA had MFB values similar to 0 for the BS stations (bottom of the valley), without a noticeable difference. DA was thus successful in reducing the discrepancy between the model and observations. The RMSE also decreased by 45.03% in the LE-LETKF, 41.57% in the LE-KA, and 41.91% in the LE-Robust simulations compared to the RMSE of the LE simulation. According to Mogoll&#x000F3;n-sotelo et al. [<xref ref-type="bibr" rid="B53">53</xref>], <xref ref-type="table" rid="T2">Table 2</xref> based on EPA [<xref ref-type="bibr" rid="B54">54</xref>] and Boylan and Russell [<xref ref-type="bibr" rid="B47">47</xref>], the <italic>R</italic> values were all above the criterion for good results. In contrast, over the OS stations (outskirts of the city), the simulations using the shrinkage-based methods presented better statistics compared to the LE-LETKF. For instance, the RMSE&#x00027;s improvements in OS stations using shrinkage-based methods are 15.02% for the LE-KA and 22.22% for the LE-Robust compared with the LE-LETKF.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Statistical evaluation of different simulations.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left" rowspan="2"><bold>Simulation</bold></th>
<th valign="top" align="center" colspan="3" style="border-bottom: thin solid #000000;"><bold>MFB</bold></th>
<th valign="top" align="center" colspan="3" style="border-bottom: thin solid #000000;"><bold>RMSE</bold></th>
<th valign="top" align="center" colspan="3" style="border-bottom: thin solid #000000;"><bold><bold>R</bold></bold></th>
</tr>
<tr>
<th valign="top" align="center"><bold>BS</bold></th>
<th valign="top" align="center"><bold>OS</bold></th>
<th valign="top" align="center"><bold>Total</bold></th>
<th valign="top" align="center"><bold>BS</bold></th>
<th valign="top" align="center"><bold>OS</bold></th>
<th valign="top" align="center"><bold>Total</bold></th>
<th valign="top" align="center"><bold>BS</bold></th>
<th valign="top" align="center"><bold>OS</bold></th>
<th valign="top" align="center"><bold>Total</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">LE</td>
<td valign="top" align="center">&#x02013;0.42</td>
<td valign="top" align="center">&#x02013;1.2</td>
<td valign="top" align="center">&#x02013;0.55</td>
<td valign="top" align="center">20.23</td>
<td valign="top" align="center">21.12</td>
<td valign="top" align="center">21.11</td>
<td valign="top" align="center">0.61</td>
<td valign="top" align="center">0.41</td>
<td valign="top" align="center">0.57</td>
</tr>
<tr>
<td valign="top" align="left">LE-LETKF</td>
<td valign="top" align="center">0.03</td>
<td valign="top" align="center">0.26</td>
<td valign="top" align="center">0.08</td>
<td valign="top" align="center">11.12</td>
<td valign="top" align="center">17.50</td>
<td valign="top" align="center">13.93</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">0.81</td>
</tr>
<tr>
<td valign="top" align="left">LE-KA</td>
<td valign="top" align="center">&#x02013;0.02</td>
<td valign="top" align="center">&#x02013;0.09</td>
<td valign="top" align="center">&#x02013;0.02</td>
<td valign="top" align="center">11.82</td>
<td valign="top" align="center">14.87</td>
<td valign="top" align="center">12.88</td>
<td valign="top" align="center">0.84</td>
<td valign="top" align="center">0.71</td>
<td valign="top" align="center">0.82</td>
</tr>
<tr>
<td valign="top" align="left">LE-Robust</td>
<td valign="top" align="center">0.02</td>
<td valign="top" align="center">&#x02013;0.03</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">11.75</td>
<td valign="top" align="center">13.61</td>
<td valign="top" align="center">12.22</td>
<td valign="top" align="center">0.84</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="center">0.83</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>BS corresponds with stations located at the bottom of the valley. OS corresponds with stations located in the outskirts of the city. The total value is calculated over all the validation stations</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Weather research forecast model (WRF) model domains description.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Domain</bold></th>
<th valign="top" align="center"><bold>Latitude</bold></th>
<th valign="top" align="center"><bold>Longitude</bold></th>
<th valign="top" align="center"><bold>Resolution</bold></th>
<th valign="top" align="center"><bold>Number of cells</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">D1</td>
<td valign="top" align="center">&#x02013;8.864, 19.091</td>
<td valign="top" align="center">&#x02013;86.694, &#x02013;59.275</td>
<td valign="top" align="center">0.3&#x000B0;</td>
<td valign="top" align="center">90 x 93</td>
</tr>
<tr>
<td valign="top" align="left">D2</td>
<td valign="top" align="center">&#x02013;4.946, 14.719</td>
<td valign="top" align="center">&#x02013;84.929, &#x02013;65.091</td>
<td valign="top" align="center">0.1&#x000B0;</td>
<td valign="top" align="center">193 x 193</td>
</tr>
<tr>
<td valign="top" align="left">D3</td>
<td valign="top" align="center">3.734, 9.064</td>
<td valign="top" align="center">&#x02013;78.108, &#x02013;73.677</td>
<td valign="top" align="center">0.033&#x000B0;</td>
<td valign="top" align="center">130 x 157</td>
</tr>
<tr>
<td valign="top" align="left">D4</td>
<td valign="top" align="center">5.379, 7.294</td>
<td valign="top" align="center">&#x02013;76.458, &#x02013;74.981</td>
<td valign="top" align="center">0.011&#x000B0;</td>
<td valign="top" align="center">130 x 169</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>WRF model set up.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Category</bold></th>
<th valign="top" align="left"><bold>Parameter</bold></th>
<th valign="top" align="left"><bold>Selection in WRF</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Domain settings</td>
<td valign="top" align="left">Coordinate system</td>
<td valign="top" align="left">Mercator</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">True latitude 1: 36&#x000B0;.</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">True latitude 2: 60&#x000B0;.</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Standard.</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">longitude:&#x02013;98&#x000B0;.</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Vertical setting</td>
<td valign="top" align="left">35.</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Nesting</td>
<td valign="top" align="left">Two way.</td>
</tr>
<tr>
<td valign="top" align="left">Input data</td>
<td valign="top" align="left">Land use</td>
<td valign="top" align="left">MODIS.</td>
</tr>
<tr>
<td valign="top" align="left">Initial-boundary conditions</td>
<td valign="top" align="left">Name of model</td>
<td valign="top" align="left">NCAR-GFS.</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Grid resolution</td>
<td valign="top" align="left">32 levels &#x0002B; 5 soil levels.</td>
</tr>
<tr>
<td valign="top" align="left">Physic Settings</td>
<td valign="top" align="left">Radiation scheme</td>
<td valign="top" align="left">CAM scheme.</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Microphysics</td>
<td valign="top" align="left">Single moment 6-class.</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Surface layer options</td>
<td valign="top" align="left">Layer: Monin-Obukhov.</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Physics: Thermal Diffusion.</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Scheme: soil temperature.</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">only, using five layers.</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">PBL Scheme</td>
<td valign="top" align="left">MYJ.</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Cumulus option</td>
<td valign="top" align="left">KF.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In general, all DA simulations showed lower scores in the OS stations than in the BS stations, mainly because of the poor representation in these areas by the background simulation (LE simulation) and the lack of close observations. Even so, the LE-Robust looks more robust among all the stations.</p>
<p><xref ref-type="fig" rid="F7">Figure 7</xref> shows diurnal cycles in the four chosen validation stations during the simulation phase. Those stations illustrate the differences between BS and OS, and are representative of all validation stations. The LE diurnal cycle differs from the observations in magnitude in the BS stations, and in the OS stations in both magnitude and temporal behavior. The highest peak of concentration in the BS stations around 09:00 is primarily due to traffic dynamics and is partially captured by the LE simulation. For example, the LE morning peak emerged faster in the simulations at station 44 than in the observations. This time lag could be due to a poor spatial representation of mobile sources in the emission inventory, or a failure by the meteorology or the model to reproduce the dynamics of the valley, indicating premature transport of particulate matter to these regions. In comparison, at 22:00 h, the LE simulation presents the highest point at station 44 (<xref ref-type="fig" rid="F7">Figure 7C</xref>), which does not correspond with the observations. The LE simulation in the other OS station 85 (<xref ref-type="fig" rid="F7">Figure 7D</xref>), cannot fit the observation interval, indicating a late morning peak and a minimum around 21:00 that does not appear in the measurements. The LE simulation shows a general underestimation of concentrations, with a better replication of the PM2.5 dynamics at the bottom of the valley.</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p>Daily cycle at different stations. The upper panel corresponds with stations located at the bottom of the valley. The bottom panel corresponds with stations located on the outskirts of the city. <bold>(A)</bold> Daily cycle at station 25. <bold>(B)</bold> Daily cycle at station 28. <bold>(C)</bold> Daily cycle at station 44. <bold>(D)</bold> Daily cycle at station 85.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0007.tif"/>
</fig>
<p>The simulations using DA presented diurnal cycles closer to the observations, with a marked difference in performance between BS stations and OS stations. In the BS stations (<xref ref-type="fig" rid="F7">Figures 7A,B</xref>), the three methods showed very similar daily cycles capturing the magnitude and the variability of the observations with high accuracy. These simulations corrected the concentration underestimation presented in the LE simulation and improved the temporal profile. Unlike in the BS stations, in the OS stations, the three DA methods showed different results.</p>
<p>The LE-LETKF tends to overestimate the concentrations and has different diurnal variability concerning the observations. In station 44, the LE-LETKF persistently displayed higher values than the observed, and a low variability around the day, with small peaks and valleys. In station 85, the LE-LETKF showed higher concentration values than the observations, and the morning peak appears later (similar to the LE simulation). The discrepancy in the magnitude and the lack of representation of the temporal variability suggest that the LE-LETKF simulation assimilates observations located in regions where the PM presents a different temporal behavior than those grid cells located in the outskirts.</p>
<p>On the other hand, the two simulations using the shrinkage-based covariance estimator and the target matrix <bold>T</bold><sub><italic>KA</italic></sub> (LE-KA and LE-Robust) improve the performance in the OS stations. The LE-KA simulation showed a similar temporal variability in both OS stations, although a concentration underestimation.</p>
<p>The LE-Robust displayed a high agreement between the simulated daily cycle and the observations. The difference in magnitude between the LE-Robust and LE-KA simulations can be explained by the fact that the robust methods tend to put more weight in the observations when there is high uncertainty in the background [<xref ref-type="bibr" rid="B7">7</xref>], such as the case in this application. Finally, the shrinkage-based simulations tend to follow the diurnal variability, which suggests that the <bold>T</bold><sub><italic>KA</italic></sub> matrix could limit the influence of observations from areas with a different temporal profile.</p>
</sec>
<sec>
<title>4.3. Spatial Distribution</title>
<p>To better understand the influence of the target matrix <bold>T</bold><sub><italic>KA</italic></sub> on shrinkage-based methods, it is important to analyze the spatial distribution of the concentrations over the valley. <xref ref-type="fig" rid="F8">Figure 8</xref> shows a three-dimensional representation of the average value of PM<sub>2.5</sub> over March 9. In these graphs, values less than 5 &#x003BC;g/m<sup>3</sup> are omitted. The averaged observed values are shown using the same color bar for all the validation stations by a circle and a star for the BS and OS stations, respectively.</p>
<fig id="F8" position="float">
<label>Figure 8</label>
<caption><p>3D maps of concentrations averaged over March 9 for different simulations. The values less than 5 &#x003BC;g/m<sup>3</sup> are omitted. The circles correspond with BS stations, and the stars correspond with OS stations. <bold>(A)</bold> LE, <bold>(B)</bold> LE-LETKF, <bold>(C)</bold> LE-KA, and <bold>(D)</bold> LE-Robust.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0008.tif"/>
</fig>
<p>The LE simulation has a spatial pattern similar to the observations, with the highest concentrations in the center and south part of the Medell&#x000ED;n city (refer to <xref ref-type="fig" rid="F12">Figure 12</xref> for reference). In general, the concentrations are higher in the bottom part of the valley, where most of the population and industry facilities are located. This characteristic is well captured by the LE simulation. Nevertheless, the LE simulation tends to underestimate the concentration along the valley and the hills.</p>
<p>The three DA simulations are able to correct the concentration bias in the bottom part of the valley. The LE-LETKF assimilation increases the concentrations in the hills to values higher than the observations. In station 85, located on the west slope of the valley (see <xref ref-type="fig" rid="F12">Figure 12</xref> for reference), the concentrations simulated by LE-LETKF are almost everywhere higher than the observed. This is because the concentrations in the west hill are influenced by observations located in the lower part of the valley, characterized by high concentrations. Those observations influence the grid cells located on the hill, generating values that do not correspond to the validation station. Both shrinkage-based simulations match better with the observations on the hills. In the case of station 85, both methods have the same range of values as the observed concentrations.</p>
<p>The use of the <bold>T</bold><sub><italic>KA</italic></sub> matrix limits the influence of the observations located at the bottom of the valley on the grid cells at the slopes. As shown in <xref ref-type="fig" rid="F6">Figure 6D</xref>, the influence of the observations is limited by horizontal and vertical distance, representing better the dynamics in the valley. A particular situation is observed at station 94 (see <xref ref-type="fig" rid="F12">Figure 12</xref> for reference), located on the top of the east slope. Although the observed values are in the range of 5&#x02013;10 &#x003BC;g/m<sup>3</sup>, all the simulations, even the DA simulations, show values under 5 &#x003BC;g/m<sup>3</sup> (not plotted in <xref ref-type="fig" rid="F8">Figure 8</xref>). The underestimation can be explained by an absence of emissions in the emission inventory (emission uncertainties), and the limited number of observations in that part of the domain.</p>
</sec>
<sec>
<title>4.4. Forecast Results</title>
<p>A fundamental prerequisite for a simulation and assimilation method of air quality to be valuable for a decision-making process is that it can predict the concentrations a few days in advance. <xref ref-type="fig" rid="F9">Figure 9</xref> shows examples of forecasts from March 12, 16:00 to March 15, 16:00. As was mentioned previously, the forecast runs are using the emission correction factors estimated between March 10, 16:00 and March 11, 16:00. The LE simulation persistently underestimates the concentrations, as observed in the assimilation window&#x00027;s results. In the BS stations, the three assimilation methods initiate a forecast that is quite close to the observations on the first day and remains with an acceptable similarity in the following two forecast days. As shown in the previous evaluations, the concentrations in the assimilation window are very similar for the three methods in the lower part of the valley. Thus, also the estimated emission correction factors are similar, leading to rather small differences between the forecasts. However, in the OS stations, the LE-LETKF forecasts show magnitudes and a temporal behavior that is different from the observations. This discrepancy in the values suggests an incorrect estimation of the emission correction factors on the slopes of the valley by LE-LETKF. The forecasts generated by the shrinkage-based methods are more similar to the observations. The LE-KA and LE-Robust show a good forecasting skills for the OS stations, with temporal behavior and magnitudes close to those observed for the first and second forecast days.</p>
<fig id="F9" position="float">
<label>Figure 9</label>
<caption><p>Forecast from March 12, 16:00 to March 15, 16:00 at different stations. The gray vertical dashed line represents the end of the assimilation window and the beginning of the forecast window. Bottom station <bold>(A)</bold> Forecast cycle at station 25. <bold>(B)</bold> Forecast cycle at station 28. Outskirt stations <bold>(C)</bold> Forecast cycle at station 44. <bold>(D)</bold> Forecast cycle at station 85.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0009.tif"/>
</fig>
<p>To be valuable for the public, a forecast should correctly warn for elevated air pollution events. The portion of true negatives, true positives, false negatives, and false positives regarding the prediction of warning-triggering episodes (AQI in orange, red, or purple levels, see <xref ref-type="table" rid="T1">Table 1</xref>) is summarized by the confusion matrix [<xref ref-type="bibr" rid="B55">55</xref>].</p>
<p><xref ref-type="fig" rid="F10">Figure 10</xref> shows the confusion matrices for LE-LETKF, LE-KA, and LE-Robust assimilations and forecasts. In the assimilation or forecast window, the LE simulation did not give an alert at any station; for that reason, we do not provide its confusion matrix. DA simulations have a ratio between true negatives and true positives equal to or greater than 90% of the 20 alarms registered in the assimilation window, 18 correspond to BS stations.</p>
<fig id="F10" position="float">
<label>Figure 10</label>
<caption><p>Comparison of confusion matrices for the data assimilation (DA) and forecast window depending on warning or no warning per station. The values are calculated across all the days of the corresponding window. The value of 0 corresponds with no warning, the value of 1 corresponds with a warning. For the LE simulation, there are neither warnings in the DA window nor forecast windows <bold>(A,B)</bold>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0010.tif"/>
</fig>
<fig id="F11" position="float">
<label>Figure 11</label>
<caption><p>WRF and LOTOS-EUROS model nested domain configuration. The red squares correspond with the LOTOS-EUROS domains, the black squares correspond with the WRF domains.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0011.tif"/>
</fig>
<fig id="F12" position="float">
<label>Figure 12</label>
<caption><p><bold>(A)</bold> Validation network. The circles and stars represent the bottom part stations (BS), and the outskirt stations (OS), respectively. <bold>(B)</bold> Assimilation network. The gray raster corresponds with the LOTOS-EUROS model grid, and the black lines are the municipalities, borders.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-830116-g0012.tif"/>
</fig>
<p>In the forecast window, the forecast skill of the three models was lower than in the assimilation window. From the 10 actually observed alerts in the forecast period, the DA simulations could replicate 8. A higher proportion of false-positive alerts was reported by the LE-LETKF, documenting nine false alerts more than the shrinkage-based approaches. The high amount of false-positive alerts is due to the overestimation of the LE-LETKF concentration in the OS stations, where the additional alerts were recorded incorrectly. In general, the LE-KA and LE-Robust simulations had better alert forecast performance than the LE-LETKF simulation.</p>
</sec>
<sec>
<title>4.5. Discussion and Comments</title>
<p>In a free run scenario for a CTM model, the LOTOS-EUROS model has served as an example for some contributions. Previous studies already suggested the need for meteorological fields at a higher resolution to correctly represent the dynamics and transport of pollutants in the Aburr&#x000E1; Valley [<xref ref-type="bibr" rid="B19">19</xref>]. Simulation without DA and using weather research forecast Model (WRF) meteorology (LE simulation) shows an improvement compared to implementations using the lower resolution ECMWF meteorology. This procedure improves the model performance. An underestimation of PM<sub>2.5</sub> concentrations is strongly reduced (although still present) and an increment in the correlation is observed. It is important to continue evaluating the model&#x00027;s performance with different configurations of the WRF model, specifically to reproduce the dominant dynamics of pollutant transport in inhabited valleys [<xref ref-type="bibr" rid="B21">21</xref>, <xref ref-type="bibr" rid="B51">51</xref>]. Additionally, it is necessary to carry out a more exhaustive evaluation of the model&#x00027;s vertical resolution, given the new possibilities offered by the coupling with the WRF model. Finally, a reduction in meteorology&#x00027;s uncertainty will improve the estimation of the emissions using DA and could help to create more accurate emission inventories. Data assimilation for uncertainty reduction of the WRF model is under research.</p>
<p>The DA considerably improves the simulations by the model. With each of the three assimilation methods, smaller differences and higher similarities between the simulated and observed concentrations were found, as shown in <xref ref-type="table" rid="T2">Table 2</xref>. The standard metrics that are used to compare the various algorithms showed an improvement compared to previous EnKF implementations, assimilating the same observations [<xref ref-type="bibr" rid="B50">50</xref>]. This improvement is due to the better background obtained using WRF meteorology and the impact of the localization schemes present in the DA algorithms. Using the new assimilation schemes, the spatial distribution of concentrations within the valley is better resolved.</p>
<p>Under the assumption the WRF meteorological fields are on a basis improving the model representation of reality, we will focus on the main differences between the model in a free run and the assimilation. Using a target covariance matrix to adapt the covariances computed from the ensemble results in better representation of the actual covariance structure. The target covariance matrix limits the influence of observations located in the lower part of the valley on the grid cells located in the hills of the valley and vice versa. This makes it possible to separate the different regimes and avoids incorrect corrections in concentrations, as could occur with the standard LEKTF method. The forecast experiments also suggest a better estimate of the emission correction factors when shrinkage methods are employed. As a result, the forecasts of dangerous pollution levels is improved in all the stations (shown in <xref ref-type="fig" rid="F10">Figure 10</xref>). These results encourage further improvement of these types of methods and to incorporate more and more prior knowledge in the covariance estimation. Possible new directions include dynamic target matrices dependent on the weather or on patterns in public behavior.</p>
<p>Both shrinkage-based methods, EnKF-KA and EnTLHF-KA, showed lower error statistics than the standard LETKF. The use of the shrinkage estimator and the incorporation of orography information through the <bold>T</bold><sub><italic>KA</italic></sub> matrix allows both methods to achieve satisfactory results with a relatively low number of ensemble members (25). Previous experiments in toy models (Lorenz96 and 2D advection-diffusion model) and real pseudo applications (SPEEDY model) have shown that the shrinkage-based family of methods can improve DA when the size of the ensemble is small [<xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B40">40</xref>], supported by our results in a real high-dimensional application. This capability is important given the computational difficulty involved in generating many simulations of highly complex models. Although the overall performance of both methods is similar, the robust method achieves better results, especially in stations on the slopes of the valley. This is very important for this family of models because it seems to improve estimation results even if the solution of the differential equation may not be deeply accurate.</p>
<p>The EnTLHF-KA algorithm tends to put more weight on the observations than the EnKF-KA in the analysis step due to the adaptive inflation term that is present. Additionally, the robust methods do not require a completely correct characterization of the observation representation errors or the uncertainties of the model [<xref ref-type="bibr" rid="B7">7</xref>]. This characteristic benefits the EnTLHF-KA in our application, given the lack of precise information on the modeling system&#x00027;s uncertainties, e.g., emissions inventory, meteorology, composition, and reaction schemes.</p>
<p>Although the methods presented in this work were tested in a specific setting, their formulation is quite general and could be used in other applications [<xref ref-type="bibr" rid="B15">15</xref>]. The basic concept of both EnKF-KA and EnTLHF-KA is to incorporate information or prior system knowledge that is not captured by the model directly in the DA.</p>
<p>In our case, for example, this principle works as a modification to the well-known concept of distance-based location. Several works have followed this line, mainly in history matching applications [<xref ref-type="bibr" rid="B56">56</xref>, <xref ref-type="bibr" rid="B57">57</xref>] but with a different approach. We believe that EnKF-KA and EnTLHF-KA possess sufficiently interesting characteristics to be applied and tested in areas other than that shown in this work.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s5">
<title>5. Conclusion</title>
<p>This study introduces the concept of robustness from control and systems to a family of DA techniques. We aimed to the natural development of a filter&#x00027;s family that not only avoids spurious correlation but also can be generalized, computationally efficient, and very robust inspired in real life complex systems [<xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B19">19</xref>]. We developed the intuition for adding the H<sub>&#x0221E;</sub> robustness to a shrinkage-based estimator finding a simple and very understandable solution. Using a low-scale model implementation, easily extendable for example to biological systems [<xref ref-type="bibr" rid="B58">58</xref>&#x02013;<xref ref-type="bibr" rid="B60">60</xref>] or closed loop estimators for biotechnological process [<xref ref-type="bibr" rid="B61">61</xref>, <xref ref-type="bibr" rid="B62">62</xref>], we compared the proposed method&#x00027;s robustness and performance against the standard EnKF, the shrinkage-based EnKF-KA, and the robust filter EnTLHF. The EnTLHF-KA has lower RMSE values in conditions with high observation error and model errors than the other methods. When the number of ensembles is small, the shrinkage estimator gives a better approximation of the background covariance matrix than the sample covariance matrix, generating lower errors in both shrinkage-based algorithm, especially in the EnTLHF-KA. The combination of the non-Gaussian shrinkage estimator and the adaptive inflation grant a higher robustness to the EnTLHF-KA when the ensemble distribution is non-Gaussian.</p>
<p>Additionally , we presented an application using the chemical transport model LOTOS-EUROS over a densely populated valley. The proposed method outperform the standard LETKF, especially in places with complex orography. Incorporating the orography characteristics in the DA through a target matrix, limits the influence of observations in grid cells that are far away in vertical distance. The final result can be understood as a localization scheme that does not depend only on the horizontal distance, but also on the change in orography. The robustness of the EnTLHF-KA allows having a high similarity between the simulated and observed PM<sub>2.5</sub> concentrations, even with a small ensemble size and an incomplete representation of the system uncertainties. The model&#x00027;s forecasting capabilities are also improved, achieving a good representation of the concentrations on the first forecast day, being acceptable until the third day. After assimilation, the model is an accurate tool for forecasting alerts for high levels of air pollution.</p>
</sec>
<sec sec-type="data-availability" id="s6">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>SL-R: conceptualization, methodology, software, and writing&#x02014;original draft. AY: methodology and software. NP: conceptualization, methodology, writing&#x02014;review, and editing. OQ: conceptualization, methodology, writing&#x02014;original draft, editing, and supervision. AS: methodology, software, writing&#x02014;review, and editing. AH: writing&#x02014;review, editing, and supervision. All authors have read and agreed to the published version of the manuscript.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>SL-R and AY were employed by the company SimpleSpace. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s8">
<title>Publisher&#x00027;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack><p>The authors acknowledge the supercomputing resources made available by the Centro de Computaci&#x000F3;n Cient&#x000ED;fica Apolo at Universidad EAFIT (<ext-link ext-link-type="uri" xlink:href="http://www.eafit.edu.co/apolo">http://www.eafit.edu.co/apolo</ext-link>) to conduct this work.</p></ack>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lahoz</surname> <given-names>WA</given-names></name> <name><surname>Schneider</surname> <given-names>P</given-names></name></person-group>. <article-title>Data assimilation: making sense of earth observation</article-title>. <source>Front Environ Sci</source>. (<year>2014</year>) <volume>2</volume>:<fpage>16</fpage>. <pub-id pub-id-type="doi">10.3389/fenvs.2014.00016</pub-id></citation>
</ref>
<ref id="B2">
<label>2.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bocquet</surname> <given-names>M</given-names></name> <name><surname>Elbern</surname> <given-names>H</given-names></name> <name><surname>Eskes</surname> <given-names>H</given-names></name> <name><surname>Hirtl</surname> <given-names>M</given-names></name> <name><surname>Aabkar</surname> <given-names>R</given-names></name> <name><surname>Carmichael</surname> <given-names>GR</given-names></name> <etal/></person-group>. <article-title>Data assimilation in atmospheric chemistry models: current status and future prospects for coupled chemistry meteorology models</article-title>. <source>Atmosphere Chem Phys</source>. (<year>2015</year>) <volume>15</volume>:<fpage>5325</fpage>&#x02013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.5194/acp-15-5325-2015</pub-id></citation>
</ref>
<ref id="B3">
<label>3.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kalman</surname> <given-names>RE</given-names></name></person-group>. <article-title>A new approach to linear filtering and prediction problems</article-title>. <source>J Basic Eng</source>. (<year>1960</year>) <volume>82</volume>:<fpage>35</fpage>&#x02013;<lpage>45</lpage>. <pub-id pub-id-type="doi">10.1115/1.3662552</pub-id><pub-id pub-id-type="pmid">30253628</pub-id></citation></ref>
<ref id="B4">
<label>4.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Evensen</surname> <given-names>G</given-names></name></person-group>. <article-title>The ensemble kalman filter: theoretical formulation and practical implementation</article-title>. <source>Ocean Dyn</source>. (<year>2003</year>) <volume>53</volume>:<fpage>343</fpage>&#x02013;<lpage>67</lpage>. <pub-id pub-id-type="doi">10.1007/s10236-003-0036-9</pub-id></citation>
</ref>
<ref id="B5">
<label>5.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Houtekamer</surname> <given-names>PL</given-names></name> <name><surname>Mitchell</surname> <given-names>HL</given-names></name> <name><surname>Pellerin</surname> <given-names>G</given-names></name> <name><surname>Buehner</surname> <given-names>M</given-names></name> <name><surname>Charron</surname> <given-names>M</given-names></name> <name><surname>Spacek</surname> <given-names>L</given-names></name> <etal/></person-group>. <article-title>Atmospheric data assimilation with an ensemble kalman filter: results with real observations</article-title>. <source>Mon Weather Rev</source>. (<year>2005</year>) <volume>133</volume>:<fpage>604</fpage>&#x02013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1175/MWR-2864.1</pub-id></citation>
</ref>
<ref id="B6">
<label>6.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bai</surname> <given-names>Y</given-names></name> <name><surname>Zhang</surname> <given-names>Z</given-names></name> <name><surname>Zhang</surname> <given-names>Y</given-names></name> <name><surname>Wang</surname> <given-names>L</given-names></name></person-group>. <article-title>Inflating transform matrices to mitigate assimilation errors with robust filtering based ensemble Kalman filters</article-title>. <source>Atmosphere Sci Lett</source>. (<year>2016</year>) <volume>17</volume>:<fpage>470</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1002/asl.681</pub-id></citation>
</ref>
<ref id="B7">
<label>7.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>X</given-names></name> <name><surname>Hoteit</surname> <given-names>I</given-names></name></person-group>. <article-title>Robust ensemble filtering and its relation to covariance inflation in the ensemble kalman filter</article-title>. <source>Mon Weather Rev</source>. (<year>2011</year>) <volume>139</volume>:<fpage>3938</fpage>&#x02013;<lpage>53</lpage>. <pub-id pub-id-type="doi">10.1175/MWR-D-10-05068.1</pub-id></citation>
</ref>
<ref id="B8">
<label>8.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname> <given-names>Y</given-names></name> <name><surname>Zhang</surname> <given-names>Y</given-names></name> <name><surname>Wang</surname> <given-names>Y</given-names></name> <name><surname>Ye</surname> <given-names>S</given-names></name> <name><surname>Fang</surname> <given-names>H</given-names></name></person-group>. <article-title>A new sequential data assimilation method</article-title>. <source>Sci China E Technol Sci</source>. (<year>2009</year>) <volume>52</volume>:<fpage>1027</fpage>&#x02013;<lpage>38</lpage>. <pub-id pub-id-type="doi">10.1007/s11431-008-0189-3</pub-id></citation>
</ref>
<ref id="B9">
<label>9.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nan</surname> <given-names>TC</given-names></name> <name><surname>Wu</surname> <given-names>JC</given-names></name></person-group>. <article-title>Application of ensemble H-infinity filter in aquifer characterization and comparison to ensemble Kalman filter</article-title>. <source>Water Sci Eng</source>. (<year>2017</year>) <volume>10</volume>:<fpage>25</fpage>&#x02013;<lpage>35</lpage>. <pub-id pub-id-type="doi">10.1016/j.wse.2017.03.009</pub-id></citation>
</ref>
<ref id="B10">
<label>10.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roh</surname> <given-names>S</given-names></name> <name><surname>Genton</surname> <given-names>MG</given-names></name> <name><surname>Jun</surname> <given-names>M</given-names></name> <name><surname>Szunyogh</surname> <given-names>I</given-names></name> <name><surname>Hoteit</surname> <given-names>I</given-names></name></person-group>. <article-title>Observation quality control with a robust ensemble kalman filter</article-title>. <source>Mon Weather Rev</source>. (<year>2013</year>) <volume>141</volume>:<fpage>4414</fpage>&#x02013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1175/MWR-D-13-00091.1</pub-id></citation>
</ref>
<ref id="B11">
<label>11.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freitag</surname> <given-names>MA</given-names></name> <name><surname>Nichols</surname> <given-names>NK</given-names></name> <name><surname>Budd</surname> <given-names>CJ</given-names></name></person-group>. <article-title>Resolution of sharp fronts in the presence of model error in variational data assimilation</article-title>. <source>Q J R Meteorol Soc</source>. (<year>2013</year>) <volume>139</volume>:<fpage>742</fpage>&#x02013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1002/qj.2002</pub-id></citation>
</ref>
<ref id="B12">
<label>12.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rao</surname> <given-names>V</given-names></name> <name><surname>Sandu</surname> <given-names>A</given-names></name> <name><surname>Ng</surname> <given-names>M</given-names></name> <name><surname>Nino-Ruiz</surname> <given-names>ED</given-names></name></person-group>. <article-title>Robust data assimilation using l1 and huber norms</article-title>. <source>SIAM J Sci Comput</source>. (<year>2017</year>) <volume>39</volume>:<fpage>B548</fpage>&#x02013;<lpage>70</lpage>. <pub-id pub-id-type="doi">10.1137/15M1045910</pub-id></citation>
</ref>
<ref id="B13">
<label>13.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>Y</given-names></name> <name><surname>He</surname> <given-names>H</given-names></name> <name><surname>Xu</surname> <given-names>G</given-names></name></person-group>. <article-title>Adaptively robust fitering for kinematic geodetic positioning</article-title>. <source>J Geodesy</source>. (<year>2001</year>) <volume>75</volume>:<fpage>109</fpage>&#x02013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.1007/s001900000157</pub-id></citation>
</ref>
<ref id="B14">
<label>14.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nino-Ruiz</surname> <given-names>E</given-names></name> <name><surname>Cheng</surname> <given-names>H</given-names></name> <name><surname>Beltran</surname> <given-names>R</given-names></name> <name><surname>Nino-Ruiz</surname> <given-names>ED</given-names></name> <name><surname>Cheng</surname> <given-names>H</given-names></name> <name><surname>Beltran</surname> <given-names>R</given-names></name></person-group>. <article-title>A robust non-gaussian data assimilation method for highly non-linear models</article-title>. <source>Atmosphere</source>. (<year>2018</year>) <volume>9</volume>:<fpage>126</fpage>. <pub-id pub-id-type="doi">10.3390/atmos9040126</pub-id></citation>
</ref>
<ref id="B15">
<label>15.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lopez-Restrepo</surname> <given-names>S</given-names></name> <name><surname>Nino-Ruis</surname> <given-names>ED</given-names></name> <name><surname>Yarce</surname> <given-names>A</given-names></name> <name><surname>Quintero</surname> <given-names>OL</given-names></name> <name><surname>Pinel</surname> <given-names>N</given-names></name> <name><surname>Segers</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>An efficient ensemble kalman filter implementation via shrinkage covariance matrix estimation: exploiting prior knowledge</article-title>. <source>Comput Geosci</source>. (<year>2021</year>) <volume>25</volume>:<fpage>985</fpage>&#x02013;<lpage>1003</lpage>. <pub-id pub-id-type="doi">10.1007/s10596-021-10035-4</pub-id></citation>
</ref>
<ref id="B16">
<label>16.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fu</surname> <given-names>G</given-names></name> <name><surname>Prata</surname> <given-names>F</given-names></name> <name><surname>Xiang Lin</surname> <given-names>H</given-names></name> <name><surname>Heemink</surname> <given-names>A</given-names></name> <name><surname>Segers</surname> <given-names>A</given-names></name> <name><surname>Lu</surname> <given-names>S</given-names></name></person-group>. <article-title>Data assimilation for volcanic ash plumes using a satellite observational operator: a case study on the 2010 Eyjafjallaj&#x000F6;kull volcanic eruption</article-title>. <source>Atmosphere Chem Phys</source>. (<year>2017</year>) <volume>17</volume>:<fpage>1187</fpage>&#x02013;<lpage>205</lpage>. <pub-id pub-id-type="doi">10.5194/acp-17-1187-2017</pub-id></citation>
</ref>
<ref id="B17">
<label>17.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>S</given-names></name> <name><surname>Lin</surname> <given-names>HX</given-names></name> <name><surname>Heemink</surname> <given-names>A</given-names></name> <name><surname>Segers</surname> <given-names>A</given-names></name> <name><surname>Fu</surname> <given-names>G</given-names></name></person-group>. <article-title>Estimation of volcanic ash emissions through assimilating satellite data and ground-based observations</article-title>. <source>J Geophys Res</source>. (<year>2016</year>) <volume>121</volume>:<fpage>971</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1002/2016JD025131</pub-id></citation>
</ref>
<ref id="B18">
<label>18.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>J</given-names></name> <name><surname>Lin</surname> <given-names>HX</given-names></name> <name><surname>Heemink</surname> <given-names>A</given-names></name> <name><surname>Segers</surname> <given-names>A</given-names></name></person-group>. <article-title>Spatially varying parameter estimation for dust emissions using reduced-tangent-linearization 4DVar</article-title>. <source>Atmos Environ</source>. (<year>2018</year>) <volume>187</volume>:<fpage>358</fpage>&#x02013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2018.05.060</pub-id></citation>
</ref>
<ref id="B19">
<label>19.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lopez-Restrepo</surname> <given-names>S</given-names></name> <name><surname>Yarce</surname> <given-names>A</given-names></name> <name><surname>Pinel</surname> <given-names>N</given-names></name> <name><surname>Quintero</surname> <given-names>OL</given-names></name> <name><surname>Segers</surname> <given-names>A</given-names></name> <name><surname>Heemink</surname> <given-names>AW</given-names></name></person-group>. <article-title>Forecasting PM10 and PM2</article-title>.5 in the Aburr&#x000E1; valley (Medell&#x000ED;n, Colombia) via EnKF based data assimilation. <source>Atmos Environ</source>. (<year>2020</year>) <volume>232</volume>:<fpage>117507</fpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2020.117507</pub-id></citation>
</ref>
<ref id="B20">
<label>20.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hoyos</surname> <given-names>CD</given-names></name> <name><surname>Herrera-Mej&#x000ED;a</surname> <given-names>L</given-names></name> <name><surname>Rold&#x000E1;n-Henao</surname> <given-names>N</given-names></name> <name><surname>Isaza</surname> <given-names>A</given-names></name></person-group>. <article-title>Effects of fireworks on particulate matter concentration in a narrow valley: the case of the Medell&#x000ED;n metropolitan area</article-title>. <source>Environ Monit Assess</source>. (<year>2019</year>) <volume>192</volume>:<fpage>6</fpage>. <pub-id pub-id-type="doi">10.1007/s10661-019-7838-9</pub-id><pub-id pub-id-type="pmid">31797169</pub-id></citation></ref>
<ref id="B21">
<label>21.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Henao</surname> <given-names>JJ</given-names></name> <name><surname>Mej&#x000ED;a</surname> <given-names>JF</given-names></name> <name><surname>Rend&#x000F3;n</surname> <given-names>AM</given-names></name> <name><surname>Salazar</surname> <given-names>JF</given-names></name></person-group>. <article-title>Sub-kilometer dispersion simulation of a CO tracer for an inter-Andean urban valley</article-title>. <source>Atmos Pollut Res</source>. (<year>2020</year>) <volume>11</volume>:<fpage>928</fpage>&#x02013;<lpage>945</lpage>. <pub-id pub-id-type="doi">10.1016/j.apr.2020.02.005</pub-id></citation>
</ref>
<ref id="B22">
<label>22.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>C</given-names></name> <name><surname>Xiao</surname> <given-names>Q</given-names></name> <name><surname>Wang</surname> <given-names>B</given-names></name></person-group>. <article-title>An ensemble-based four-dimensional variational data assimilation scheme. Part I: technical formulation and preliminary test</article-title>. <source>Mon Weather Rev</source>. (<year>2008</year>) <volume>136</volume>:<fpage>3363</fpage>&#x02013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1175/2008MWR2312.1</pub-id></citation>
</ref>
<ref id="B23">
<label>23.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ott</surname> <given-names>E</given-names></name> <name><surname>Hunt</surname> <given-names>BR</given-names></name> <name><surname>Szunyogh</surname> <given-names>I</given-names></name> <name><surname>Zimin</surname> <given-names>AV</given-names></name> <name><surname>Kostelich</surname> <given-names>E</given-names></name> <name><surname>Corazza</surname> <given-names>M</given-names></name> <etal/></person-group>. <article-title>A local ensemble Kalman filter for atmospheric data assimilation</article-title>. <source>Tellus</source>. (<year>2004</year>) <volume>56</volume>:<fpage>415</fpage>&#x02013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.3402/tellusa.v56i5.14462</pub-id><pub-id pub-id-type="pmid">28839201</pub-id></citation></ref>
<ref id="B24">
<label>24.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shin</surname> <given-names>S</given-names></name> <name><surname>Kang</surname> <given-names>JS</given-names></name> <name><surname>Jo</surname> <given-names>Y</given-names></name></person-group>. <article-title>The local ensemble transform kalman filter (LETKF) with a global NWP model on the cubed sphere</article-title>. <source>Pure Appl Geophys</source>. (<year>2016</year>) <volume>173</volume>:<fpage>2555</fpage>&#x02013;<lpage>70</lpage>. <pub-id pub-id-type="doi">10.1007/s00024-016-1269-0</pub-id></citation>
</ref>
<ref id="B25">
<label>25.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Touloumis</surname> <given-names>A</given-names></name></person-group>. <article-title>Nonparametric Stein-type shrinkage covariance matrix estimators in high-dimensional settings</article-title>. <source>Comput Stat Data Anal</source>. (<year>2015</year>) <volume>83</volume>:<fpage>251</fpage>&#x02013;<lpage>61</lpage>. <pub-id pub-id-type="doi">10.1016/j.csda.2014.10.018</pub-id></citation>
</ref>
<ref id="B26">
<label>26.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Couillet</surname> <given-names>R</given-names></name> <name><surname>McKay</surname> <given-names>M</given-names></name></person-group>. <article-title>Large dimensional analysis and optimization of robust shrinkage covariance matrix estimators</article-title>. <source>J Multivar Anal</source>. (<year>2014</year>) <volume>131</volume>:<fpage>99</fpage>&#x02013;<lpage>120</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmva.2014.06.018</pub-id></citation>
</ref>
<ref id="B27">
<label>27.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ledoit</surname> <given-names>O</given-names></name> <name><surname>Wolf</surname> <given-names>M</given-names></name></person-group>. <article-title>Optimal estimation of a large-dimensional covariance matrix under Stein&#x00027;s loss</article-title>. <source>Bernoulli</source>. (<year>2018</year>) <volume>24</volume>:<fpage>3791</fpage>&#x02013;<lpage>832</lpage>. <pub-id pub-id-type="doi">10.3150/17-BEJ979</pub-id></citation>
</ref>
<ref id="B28">
<label>28.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stoica</surname> <given-names>P</given-names></name> <name><surname>Li</surname> <given-names>J</given-names></name> <name><surname>Zhu</surname> <given-names>X</given-names></name> <name><surname>Guerci</surname> <given-names>JR</given-names></name></person-group>. <article-title>On using a priori knowledge in space-time adaptive processing</article-title>. <source>IEEE Trans Signal Process</source>. (<year>2008</year>) <volume>56</volume>:<fpage>2598</fpage>&#x02013;<lpage>602</lpage>. <pub-id pub-id-type="doi">10.1109/TSP.2007.914347</pub-id><pub-id pub-id-type="pmid">22595087</pub-id></citation></ref>
<ref id="B29">
<label>29.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>X</given-names></name> <name><surname>Li</surname> <given-names>J</given-names></name> <name><surname>Stoica</surname> <given-names>P</given-names></name></person-group>. <article-title>Knowledge-aided space-time adaptive processing</article-title>. <source>IEEE Trans Aerospace Electron Syst</source>. (<year>2011</year>) <volume>47</volume>:<fpage>1325</fpage>&#x02013;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.1109/TAES.2011.5751261</pub-id><pub-id pub-id-type="pmid">25222035</pub-id></citation></ref>
<ref id="B30">
<label>30.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hassibi</surname> <given-names>B</given-names></name> <name><surname>Kailath</surname> <given-names>T</given-names></name> <name><surname>Sayed</surname> <given-names>A</given-names></name></person-group>. <article-title>Array algorithms for H estimation</article-title>. <source>Automatic Control IEEE ldots</source>. (<year>2000</year>) <volume>45</volume>:<fpage>702</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1109/9.847105</pub-id></citation>
</ref>
<ref id="B31">
<label>31.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>D</given-names></name> <name><surname>Cai</surname> <given-names>X</given-names></name></person-group>. <article-title>Robust data assimilation in hydrological modeling &#x02013; A comparison of Kalman and H -infinity filters</article-title>. <source>Adv Water Resour</source>. (<year>2008</year>) <volume>31</volume>:<fpage>455</fpage>&#x02013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/j.advwatres.2007.10.001</pub-id></citation>
</ref>
<ref id="B32">
<label>32.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Berger</surname> <given-names>JO</given-names></name></person-group>. <source>Statistical Decision Theory and Bayesian Analysis. Springer Series in Statistics</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name> (<year>1985</year>). <pub-id pub-id-type="pmid">16364661</pub-id></citation></ref>
<ref id="B33">
<label>33.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anderson</surname> <given-names>JL</given-names></name></person-group>. <article-title>An ensemble adjustment kalman filter for data assimilation</article-title>. <source>Mon Weather Rev</source>. (<year>2001</year>) <volume>129</volume>:<fpage>2884</fpage>&#x02013;<lpage>2903</lpage>. <pub-id pub-id-type="doi">10.1175/1520-0493(2001)129&#x0003C;2884:AEAKFF&#x0003E;2.0.CO;2</pub-id></citation>
</ref>
<ref id="B34">
<label>34.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bellsky</surname> <given-names>T</given-names></name> <name><surname>Mitchell</surname> <given-names>L</given-names></name></person-group>. <article-title>A shadowing-based inflation scheme for ensemble data assimilation</article-title>. <source>Physica D</source>. (<year>2018</year>) 380-<volume>381</volume>:<fpage>1</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1016/j.physd.2018.05.002</pub-id></citation>
</ref>
<ref id="B35">
<label>35.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Houtekamer</surname> <given-names>PL</given-names></name> <name><surname>Zhang</surname> <given-names>F</given-names></name></person-group>. <article-title>Review of the ensemble kalman filter for atmospheric data assimilation</article-title>. <source>Mon Weather Rev</source>. (<year>2016</year>) <volume>144</volume>, <fpage>4489</fpage>&#x02013;<lpage>4532</lpage>. <pub-id pub-id-type="doi">10.1175/MWR-D-15-0440.1</pub-id></citation>
</ref>
<ref id="B36">
<label>36.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altaf</surname> <given-names>MU</given-names></name> <name><surname>Butler</surname> <given-names>T</given-names></name> <name><surname>Luo</surname> <given-names>X</given-names></name> <name><surname>Dawson</surname> <given-names>C</given-names></name> <name><surname>Mayo</surname> <given-names>T</given-names></name> <name><surname>Hoteit</surname> <given-names>I</given-names></name></person-group>. <article-title>Improving short-range ensemble kalman storm surge forecasting using robust adaptive inflation</article-title>. <source>Mon Weather Rev</source>. (<year>2013</year>) <volume>141</volume>:<fpage>2705</fpage>&#x02013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1175/MWR-D-12-00310.1</pub-id></citation>
</ref>
<ref id="B37">
<label>37.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Triantafyllou</surname> <given-names>G</given-names></name> <name><surname>Hoteit</surname> <given-names>I</given-names></name> <name><surname>Luo</surname> <given-names>X</given-names></name> <name><surname>Tsiaras</surname> <given-names>K</given-names></name> <name><surname>Petihakis</surname> <given-names>G</given-names></name></person-group>. <article-title>Assessing a robust ensemble-based Kalman filter for efficient ecosystem data assimilation of the Cretan Sea</article-title>. <source>J Mar Syst</source>. (<year>2013</year>) <volume>125</volume>:<fpage>90</fpage>&#x02013;<lpage>100</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmarsys.2012.12.006</pub-id></citation>
</ref>
<ref id="B38">
<label>38.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sakov</surname> <given-names>P</given-names></name> <name><surname>Bertino</surname> <given-names>L</given-names></name></person-group>. <article-title>Relation between two common localisation methods for the EnKF</article-title>. <source>Comput Geosci</source>. (<year>2011</year>) <volume>15</volume>:<fpage>225</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1007/s10596-010-9202-6</pub-id></citation>
</ref>
<ref id="B39">
<label>39.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Y</given-names></name> <name><surname>Wiesel</surname> <given-names>A</given-names></name> <name><surname>Hero</surname> <given-names>AO</given-names></name></person-group>. <article-title>Shrinkage estimation of high dimensional covariance matrices</article-title>. In: <source>2009 IEEE International Conference on Acoustics, Speech and Signal Processing</source>. <publisher-loc>Taipei</publisher-loc>: <publisher-name>IEEE</publisher-name> (<year>2009</year>). p. <fpage>2937</fpage>&#x02013;<lpage>40</lpage>.</citation>
</ref>
<ref id="B40">
<label>40.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nino-ruiz</surname> <given-names>ED</given-names></name> <name><surname>Sandu</surname> <given-names>A</given-names></name></person-group>. <article-title>Ensemble Kalman filter implementations based on shrinkage covariance matrix estimation</article-title>. <source>Ocean Dyn</source>. (<year>2015</year>) <volume>65</volume>:<fpage>1423</fpage>&#x02013;<lpage>39</lpage>. <pub-id pub-id-type="doi">10.1007/s10236-015-0888-9</pub-id></citation>
</ref>
<ref id="B41">
<label>41.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nino-Ruiz</surname> <given-names>ED</given-names></name> <name><surname>Sandu</surname> <given-names>A</given-names></name></person-group>. <article-title>Efficient parallel implementation of DDDAS inference using an ensemble Kalman filter with shrinkage covariance matrix estimation</article-title>. <source>Cluster Comput</source>. (<year>2017</year>) <volume>22</volume>:<fpage>1</fpage>&#x02013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1007/s10586-017-1407-1</pub-id></citation>
</ref>
<ref id="B42">
<label>42.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lorenz</surname> <given-names>EN</given-names></name> <name><surname>Emanuel</surname> <given-names>KA</given-names></name></person-group>. <article-title>optimal sites for supplementary weather observations: simulation with a small model</article-title>. <source>J Atmosphere Sci</source>. (<year>1998</year>) <volume>55</volume>:<fpage>399</fpage>&#x02013;<lpage>414</lpage>. <pub-id pub-id-type="doi">10.1175/1520-0469(1998)055&#x0003C;0399:OSFSWO&#x0003E;2.0.CO;2</pub-id></citation>
</ref>
<ref id="B43">
<label>43.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gottwald</surname> <given-names>GA</given-names></name> <name><surname>Melbourne</surname> <given-names>I</given-names></name></person-group>. <article-title>Testing for chaos in deterministic systems with noise</article-title>. <source>Physica D</source>. (<year>2005</year>) <volume>212</volume>:<fpage>100</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1016/j.physd.2005.09.011</pub-id></citation>
</ref>
<ref id="B44">
<label>44.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gaspari</surname> <given-names>G</given-names></name> <name><surname>Cohn</surname> <given-names>SE</given-names></name></person-group>. <article-title>Construction of correlation functions in two and three dimensions</article-title>. <source>Q J R Meteorol Soc</source>. (<year>1999</year>) <volume>125</volume>:<fpage>723</fpage>&#x02013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1002/qj.49712555417</pub-id></citation>
</ref>
<ref id="B45">
<label>45.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shapiro</surname> <given-names>S</given-names></name> <name><surname>Wilk</surname> <given-names>M</given-names></name></person-group>. <article-title>An analysis of variance test for normality (complete samples)&#x02020;</article-title>. <source>Biometrika</source>. (<year>1965</year>) <volume>52</volume>:<fpage>591</fpage>&#x02013;<lpage>611</lpage>. <pub-id pub-id-type="doi">10.1093/biomet/52.3-4.591</pub-id></citation>
</ref>
<ref id="B46">
<label>46.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nino-Ruiz</surname> <given-names>ED</given-names></name> <name><surname>Guzman</surname> <given-names>L</given-names></name> <name><surname>Jabba</surname> <given-names>D</given-names></name></person-group>. <article-title>An ensemble Kalman filter implementation based on the Ledoit and Wolf covariance matrix estimator</article-title>. <source>J Comput Appl Math</source>. (<year>2021</year>) <volume>384</volume>:<fpage>113163</fpage>. <pub-id pub-id-type="doi">10.1016/j.cam.2020.113163</pub-id></citation>
</ref>
<ref id="B47">
<label>47.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boylan</surname> <given-names>JW</given-names></name> <name><surname>Russell</surname> <given-names>AG</given-names></name></person-group>. <article-title>PM and light extinction model performance metrics, goals, and criteria for three-dimensional air quality models</article-title>. <source>Atmos Environ</source>. (<year>2006</year>) <volume>40</volume>: <fpage>4946</fpage>&#x02013;<lpage>59</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2005.09.087</pub-id></citation>
</ref>
<ref id="B48">
<label>48.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chai</surname> <given-names>T</given-names></name> <name><surname>Draxler</surname> <given-names>RR</given-names></name></person-group>. <article-title>Root mean square error (RMSE) or mean absolute error (MAE): arguments against avoiding RMSE in the literature</article-title>. <source>Geosci Model Dev</source>. (<year>2014</year>) <volume>7</volume>:<fpage>1247</fpage>&#x02013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.5194/gmd-7-1247-2014</pub-id></citation>
</ref>
<ref id="B49">
<label>49.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>S</given-names></name> <name><surname>Eder</surname> <given-names>B</given-names></name> <name><surname>Dennis</surname> <given-names>R</given-names></name> <name><surname>Chu</surname> <given-names>SH</given-names></name> <name><surname>Schwartz</surname> <given-names>SE</given-names></name></person-group>. <article-title>New unbiased symmetric metrics for evaluation of air quality models</article-title>. <source>Atmosphere Sci Lett</source>. (<year>2006</year>) <volume>7</volume>:<fpage>26</fpage>&#x02013;<lpage>34</lpage>. <pub-id pub-id-type="doi">10.1002/asl.125</pub-id></citation>
</ref>
<ref id="B50">
<label>50.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lopez-restrepo</surname> <given-names>S</given-names></name> <name><surname>Yarce</surname> <given-names>A</given-names></name> <name><surname>Pinel</surname> <given-names>N</given-names></name> <name><surname>Heemink</surname> <given-names>AW</given-names></name></person-group>. <article-title>Urban Air quality modeling using low-cost sensor network and data assimilation in the aburr&#x000E1; valley, colombia</article-title>. <source>Atmosphere</source>. (<year>2021</year>) <volume>12</volume>:<fpage>1</fpage>&#x02013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.3390/atmos12010091</pub-id></citation>
</ref>
<ref id="B51">
<label>51.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rend&#x000F3;n</surname> <given-names>AM</given-names></name> <name><surname>Salazar</surname> <given-names>JF</given-names></name> <name><surname>Wirth</surname> <given-names>V</given-names></name></person-group>. <article-title>Daytime air pollution transport mechanisms in stable atmospheres of narrow versus wide urban valleys</article-title>. <source>Environ Fluid Mech</source>. (<year>2020</year>) <volume>20</volume>:<fpage>1101</fpage>&#x02013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1007/s10652-020-09743-9</pub-id></citation>
</ref>
<ref id="B52">
<label>52.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Higham</surname> <given-names>NJ</given-names></name></person-group>. <article-title>Computing a nearest symmetric positive semidefinite matrix</article-title>. <source>Linear Algebra Appl</source>. (<year>1988</year>) <volume>103</volume>:<fpage>103</fpage>&#x02013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1016/0024-3795(88)90223-6</pub-id></citation>
</ref>
<ref id="B53">
<label>53.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mogoll&#x000F3;n-sotelo</surname> <given-names>C</given-names></name> <name><surname>Belalcazar</surname> <given-names>L</given-names></name> <name><surname>Vidal</surname> <given-names>S</given-names></name></person-group>. <article-title>A support vector machine model to forecast ground-level PM2</article-title>.5 in a highly populated city with a complex terrain. <source>Air Qual Atmosphere Health</source>. (<year>2020</year>) <volume>14</volume>:<fpage>399</fpage>&#x02013;<lpage>409</lpage>. <pub-id pub-id-type="doi">10.1007/s11869-020-00945-0</pub-id></citation>
</ref>
<ref id="B54">
<label>54.</label>
<citation citation-type="book"><person-group person-group-type="author"><collab>EPA</collab></person-group>. <source>Meteorological Monitoring Guidance for Regulatory Modeling Applications</source>. Research Triangle Park, <publisher-loc>NC</publisher-loc>: <publisher-name>U.S. Environmental Protection Agency</publisher-name> (<year>2000</year>).</citation>
</ref>
<ref id="B55">
<label>55.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kohavi</surname> <given-names>R</given-names></name> <name><surname>Provost</surname> <given-names>F</given-names></name></person-group>. <article-title>Applications of machine learning and the knowledge</article-title>. <source>Appl Mach Learn Knowl Mach Learn</source>. (<year>1998</year>) <volume>30</volume>:<fpage>349</fpage>&#x02013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1023/A:1007442505281</pub-id></citation>
</ref>
<ref id="B56">
<label>56.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Soares</surname> <given-names>RV</given-names></name> <name><surname>Maschio</surname> <given-names>C</given-names></name> <name><surname>Schiozer</surname> <given-names>DJ</given-names></name></person-group>. <article-title>Applying a localization technique to Kalman Gain and assessing the influence on the variability of models in history matching</article-title>. <source>J Petrol Sci Eng</source>. (<year>2018</year>) <volume>169</volume>:<fpage>110</fpage>&#x02013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1016/j.petrol.2018.05.059</pub-id></citation>
</ref>
<ref id="B57">
<label>57.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lacerda</surname> <given-names>JM</given-names></name> <name><surname>Emerick</surname> <given-names>AA</given-names></name> <name><surname>Pires</surname> <given-names>AP</given-names></name></person-group>. <article-title>Using a machine learning proxy for localization in ensemble data assimilation</article-title>. <source>Comput Geosci</source>. (<year>2021</year>) <volume>25</volume>:<fpage>11</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1007/s10596-020-10031-0</pub-id></citation>
</ref>
<ref id="B58">
<label>58.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parra-Amaya</surname> <given-names>ME</given-names></name> <name><surname>Puerta-Yepes</surname> <given-names>ME</given-names></name> <name><surname>Lizarralde-Bejarano</surname> <given-names>DP</given-names></name> <name><surname>Arboleda-S&#x000E1;nchez</surname> <given-names>S</given-names></name></person-group>. <article-title>Early detection for dengue using local indicator of spatial association (LISA) analysis</article-title>. <source>Diseases</source>. (<year>2016</year>) <volume>4</volume>:<fpage>16</fpage>. <pub-id pub-id-type="doi">10.3390/diseases4020016</pub-id><pub-id pub-id-type="pmid">28933396</pub-id></citation></ref>
<ref id="B59">
<label>59.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lizarralde-Bejarano</surname> <given-names>DP</given-names></name> <name><surname>Arboleda-S&#x000E1;nchez</surname> <given-names>S</given-names></name> <name><surname>Puerta-Yepes</surname> <given-names>ME</given-names></name></person-group>. <article-title>Understanding epidemics from mathematical models: details of the 2010 dengue epidemic in Bello (Antioquia, Colombia)</article-title>. <source>Appl Math Model</source>. (<year>2017</year>) <volume>43</volume>:<fpage>566</fpage>&#x02013;<lpage>78</lpage>. <pub-id pub-id-type="doi">10.1016/j.apm.2016.11.022</pub-id></citation>
</ref>
<ref id="B60">
<label>60.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Catano-Lopez</surname> <given-names>A</given-names></name> <name><surname>Rojas-Diaz</surname> <given-names>D</given-names></name> <name><surname>Laniado</surname> <given-names>H</given-names></name> <name><surname>Arboleda-S&#x000E1;nchez</surname> <given-names>S</given-names></name> <name><surname>Puerta-Yepes</surname> <given-names>ME</given-names></name> <name><surname>Lizarralde-Bejarano</surname> <given-names>DP</given-names></name></person-group>. <article-title>An alternative model to explain the vectorial capacity using as example Aedes aegypti case in dengue transmission</article-title>. <source>Heliyon</source>. (<year>2019</year>) <volume>5</volume>:<fpage>e02577</fpage>. <pub-id pub-id-type="doi">10.1016/j.heliyon.2019.e02577</pub-id><pub-id pub-id-type="pmid">31687486</pub-id></citation></ref>
<ref id="B61">
<label>61.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quintero</surname> <given-names>OL</given-names></name> <name><surname>Amicarelli</surname> <given-names>AA</given-names></name> <name><surname>Di Sciascio</surname> <given-names>F</given-names></name> <name><surname>Scaglia</surname> <given-names>G</given-names></name></person-group>. <article-title>State estimation in alcoholic continuous fermentation of Zymomonas mobilis using recursive bayesian filtering: a simulation approach</article-title>. <source>BioResources</source>. (<year>2008</year>) <volume>3</volume>:<fpage>316</fpage>&#x02013;<lpage>34</lpage>.</citation>
</ref>
<ref id="B62">
<label>62.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quintero</surname> <given-names>OL</given-names></name> <name><surname>Amicarelli</surname> <given-names>AA</given-names></name> <name><surname>Scaglia</surname> <given-names>G</given-names></name> <name><surname>di Sciascio</surname> <given-names>F</given-names></name></person-group>. <article-title>Control based on numerical methods and recursive Bayesian estimation in a continuous alcoholic fermentation process</article-title>. <source>BioResources</source>. (<year>2009</year>) <volume>4</volume>:<fpage>1372</fpage>&#x02013;<lpage>95</lpage>.</citation>
</ref>
<ref id="B63">
<label>63.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manders</surname> <given-names>AMM</given-names></name> <name><surname>Builtjes</surname> <given-names>PJH</given-names></name> <name><surname>Curier</surname> <given-names>L</given-names></name> <name><surname>Denier Van Der Gon</surname> <given-names>HAC</given-names></name> <name><surname>Hendriks</surname> <given-names>C</given-names></name> <name><surname>Jonkers</surname> <given-names>S</given-names></name> <etal/></person-group>. <article-title>Curriculum vitae of the LOTOS-EUROS (v2</article-title>.0) chemistry transport model. <source>Geosci Model Dev</source>. (<year>2017</year>) <volume>10</volume>:<fpage>4145</fpage>&#x02013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.5194/gmd-10-4145-2017</pub-id></citation>
</ref>
<ref id="B64">
<label>64.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Skamarock</surname> <given-names>WC</given-names></name> <name><surname>Klemp</surname> <given-names>JB</given-names></name> <name><surname>Dudhi</surname> <given-names>J</given-names></name> <name><surname>Gill</surname> <given-names>DO</given-names></name> <name><surname>Barker</surname> <given-names>DM</given-names></name> <name><surname>Duda</surname> <given-names>MG</given-names></name> <etal/></person-group>. <source>A Description of the Advanced Research WRF Version 3</source>. <publisher-loc>Boulder, CO</publisher-loc>: <publisher-name>University Corporation for Atmospheric Research</publisher-name> (<year>2008</year>).</citation>
</ref>
<ref id="B65">
<label>65.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Petrescu</surname> <given-names>AMR</given-names></name> <name><surname>Abad-Vi nas</surname> <given-names>R</given-names></name> <name><surname>Janssens-Maenhout</surname> <given-names>G</given-names></name> <name><surname>Blujdea</surname> <given-names>VNB</given-names></name> <name><surname>Grassi</surname> <given-names>G</given-names></name></person-group>. <article-title>Global estimates of carbon stock changes in living forest biomass: EDGARv4</article-title>.3 - time series from 1990 to 2010. <source>Biogeosciences</source>. (<year>2012</year>) <volume>9</volume>:<fpage>3437</fpage>&#x02013;<lpage>47</lpage>. <pub-id pub-id-type="doi">10.5194/bg-9-3437-2012</pub-id></citation>
</ref>
<ref id="B66">
<label>66.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Misenis</surname> <given-names>C</given-names></name> <name><surname>Zhang</surname> <given-names>Y</given-names></name></person-group>. <article-title>An examination of sensitivity of WRF/Chem predictions to physical parameterizations, horizontal grid spacing, and nesting options</article-title>. <source>Atmosphere Res</source>. (<year>2010</year>) <volume>97</volume>:<fpage>315</fpage>&#x02013;<lpage>34</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosres.2010.04.005</pub-id></citation>
</ref>
<ref id="B67">
<label>67.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carvalho</surname> <given-names>D</given-names></name> <name><surname>Rocha</surname> <given-names>A</given-names></name> <name><surname>G&#x000F3;mez-Gesteira</surname> <given-names>M</given-names></name> <name><surname>Santos</surname> <given-names>C</given-names></name></person-group>. <article-title>A sensitivity study of the WRF model in wind simulation for an area of high wind energy</article-title>. <source>Environ Model Softw</source>. (<year>2012</year>) <volume>33</volume>:<fpage>23</fpage>&#x02013;<lpage>34</lpage>. <pub-id pub-id-type="doi">10.1016/j.envsoft.2012.01.019</pub-id></citation>
</ref>
<ref id="B68">
<label>68.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tuccella</surname> <given-names>P</given-names></name> <name><surname>Curci</surname> <given-names>G</given-names></name> <name><surname>Visconti</surname> <given-names>G</given-names></name> <name><surname>Bessagnet</surname> <given-names>B</given-names></name> <name><surname>Menut</surname> <given-names>L</given-names></name> <name><surname>Park</surname> <given-names>RJ</given-names></name></person-group>. <article-title>Modeling of gas and aerosol with WRF/Chem over Europe: evaluation and sensitivity study</article-title>. <source>J Geophys Res Atmospheres</source>. (<year>2012</year>) <volume>117</volume>:<fpage>1</fpage>&#x02013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1029/2011JD016302</pub-id></citation>
</ref>
<ref id="B69">
<label>69.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>XM</given-names></name> <name><surname>Klein</surname> <given-names>PM</given-names></name> <name><surname>Xue</surname> <given-names>M</given-names></name></person-group>. <article-title>Evaluation of the updated YSU planetary boundary layer scheme within WRF for wind resource and air quality assessments</article-title>. <source>J Geophys Res Atmospheres</source>. (<year>2013</year>) <volume>118</volume>:<fpage>10490</fpage>&#x02013;<lpage>505</lpage>. <pub-id pub-id-type="doi">10.1002/jgrd.50823</pub-id></citation>
</ref>
<ref id="B70">
<label>70.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dillon</surname> <given-names>ME</given-names></name> <name><surname>Skabar</surname> <given-names>YG</given-names></name> <name><surname>Ruiz</surname> <given-names>J</given-names></name> <name><surname>Kalnay</surname> <given-names>E</given-names></name> <name><surname>Collini</surname> <given-names>EA</given-names></name> <name><surname>Echevarr&#x000ED;a</surname> <given-names>P</given-names></name> <etal/></person-group>. <article-title>Application of the WRF-LETKF data assimilation system over southern South America: sensitivity to model physics</article-title>. <source>Weather Forecast</source>. (<year>2016</year>) <volume>31</volume>:<fpage>217</fpage>&#x02013;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.1175/WAF-D-14-00157.1</pub-id></citation>
</ref>
<ref id="B71">
<label>71.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>A</given-names></name> <name><surname>Jim&#x000E9;nez</surname> <given-names>R</given-names></name> <name><surname>Belalc&#x000E1;zar</surname> <given-names>LC</given-names></name> <name><surname>Rojas</surname> <given-names>NY</given-names></name></person-group>. <article-title>Application of WRF-Chem model to simulate PM10 concentration over Bogot&#x000E1;</article-title>. <source>Aerosol Air Qual Res</source>. (<year>2016</year>) <volume>16</volume>:<fpage>1206</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.4209/aaqr.2015.05.0318</pub-id></citation>
</ref>
</ref-list>
<app-group>
<app id="A1">
<title>Appendix</title>
<sec>
<title>The Chemical Transport Model LOTOS-EUROS Setup</title>
<p>The LOTOS-EUROS (LOng Term Ozone Simulation - EURopean Operational Smog) model is a 3D Chemical Transport Model that simulates trace gas and aerosol concentrations in the lower troposphere [<xref ref-type="bibr" rid="B63">63</xref>]. The physical processes in the model include emission, advection, diffusion, chemical reactions, and dry and wet deposition. The input to the LOTOS-EUROS model mainly consists of meteorological data, emission inventories, and surface data such as land-use and vegetation type. For a full description of the physical processes and input data could be found [<xref ref-type="bibr" rid="B63">63</xref>]. Simulations were conducted with the LE model, adopting a nested domain configuration shown in <xref ref-type="fig" rid="F11">Figure 11</xref> and following previous implementations [<xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B50">50</xref>]. The first Domain (D1) has a model resolution of 0.27&#x000B0; &#x000D7; 0.27&#x000B0;. For this domain, meteorological data from ECMWF was used at a resolution of 0.14&#x000B0; &#x000D7; 0.14&#x000B0;. The inner domain D2 is centered over the valley, encompassing most of the Colombian Andes; the model resolution was set to 0.09&#x000B0; &#x000D7; 0.09&#x000B0;. For this and the following inner domain, meteorological data were obtained from ECMWF at 0.07&#x000B0; &#x000D7; 0.07&#x000B0; resolution. The third inner domain, D3 includes the department of Antioquia, at a model resolution of 0.03&#x000B0; &#x000D7; 0.003&#x000B0;. The innermost domain D4 includes primarily the region of the Aburr&#x000E1; Valley, using the model resolution of 0.01&#x000B0; &#x000D7; 0.01&#x000B0;. The simulations in the domain of interest (D4) were performed using the meteorological fields coming from the Weather Research and Forecasting (WRF) model [<xref ref-type="bibr" rid="B64">64</xref>]. The description of the WRF meteorology is presented in Section 25. The anthropogenic emissions for the domains D4, D3, and D2 were obtained from the global EDGAR emission inventory V4.3 [<xref ref-type="bibr" rid="B65">65</xref>]. In domain D4, the local emission inventory for particulate matter presented in Lopez-restrepo et al. [<xref ref-type="bibr" rid="B50">50</xref>] was used as anthropogenic emissions. For all the domains, the biogenic emissions were obtained from the MEGAN emission inventory and the biomass burning and fires from MACC/CAMS GFAS inventory.</p>
</sec>
<sec>
<title>The WRF Meteorology</title>
<p>The WRF model is a numerical weather prediction and atmospheric simulation system designed for research and operational applications [<xref ref-type="bibr" rid="B64">64</xref>]. The WRF simulations are suitable to understand the behavior of meteorological variables in a domain like the Aburr&#x000E1; Valley. The WRF model has been used over Colombia in previous studies [<xref ref-type="bibr" rid="B21">21</xref>, <xref ref-type="bibr" rid="B66">66</xref>&#x02013;<xref ref-type="bibr" rid="B71">71</xref>]. The configuration of the nested domains used in this study is shown in the <xref ref-type="fig" rid="F11">Figure 11</xref> and described in <xref ref-type="table" rid="T3">Table 3</xref>. The settings used for the WRF simulations are summarized in <xref ref-type="table" rid="T4">Table 4</xref>.</p>
</sec>
<sec>
<title>The Data Used for Assimilation and Validation</title>
<p>We used the hyper-dense low-cost network deployed and operated by the <italic>Sistema de Alerta Temprana del Valle de Aburr&#x000E1;</italic> (SIATA) as observations for the DA methods. The low-cost network consists of 255 real-time PM<sub>2.5</sub> sensors across the Aburr&#x000E1; Valley and its hills. Hoyos et al. [<xref ref-type="bibr" rid="B20">20</xref>] presents the description and calibration process of the low-cost sensor. In Lopez et al.&#x00027;s [<xref ref-type="bibr" rid="B50">50</xref>] study, the low-cost sensor networks are evaluated and used as observations for the standard DA method, EnKF, outperforming the simulation where the standard network was used as observations for the same DA method. For validation, we used the independent official monitoring network of the metropolitan area. The official network has 21 measurement sites that observer particulate matter at hourly frequency [<xref ref-type="bibr" rid="B20">20</xref>]. The distribution of both observations network is shown in <xref ref-type="fig" rid="F12">Figure 12</xref>.</p>
</sec>
</app>
</app-group>
<sec id="s10">
<title>Nomenclature</title>
</sec>
<sec id="s11">
<title>List of Symbols</title>
<table-wrap position="float">
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td valign="top" align="left"><bold>x</bold><sup><italic>b</italic></sup></td>
<td valign="top" align="left">Background state vector</td>
</tr>
<tr>
<td valign="top" align="left"><bold>x</bold><sup><italic>a</italic></sup></td>
<td valign="top" align="left">Analyzed state vector</td>
</tr>
<tr>
<td valign="top" align="left"><bold>X</bold><sup><italic>b</italic></sup></td>
<td valign="top" align="left">Background ensemble of model realizations</td>
</tr>
<tr>
<td valign="top" align="left"><bold>X</bold><sup><italic>a</italic></sup></td>
<td valign="top" align="left">Analyzed ensemble</td>
</tr>
<tr>
<td valign="top" align="left"><inline-formula><mml:math id="M58"><mml:msup><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>x</mml:mtext></mml:mstyle></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula></td>
<td valign="top" align="left">Background ensemble mean</td>
</tr>
<tr>
<td valign="top" align="left"><bold>B</bold></td>
<td valign="top" align="left">Background error covariance</td>
</tr>
<tr>
<td valign="top" align="left"><bold>&#x00394;X</bold></td>
<td valign="top" align="left">Anomalies matrix</td>
</tr>
<tr>
<td valign="top" align="left"><bold>P</bold><sup><italic>b</italic></sup></td>
<td valign="top" align="left">Background ensemble sample covariance matrix</td>
</tr>
<tr>
<td valign="top" align="left"><bold>P</bold><sup><italic>a</italic></sup></td>
<td valign="top" align="left">Analyzed covariance matrix</td>
</tr>
<tr>
<td valign="top" align="left"><bold>y</bold></td>
<td valign="top" align="left">Observations</td>
</tr>
<tr>
<td valign="top" align="left"><italic><bold>&#x003F5;</bold></italic></td>
<td valign="top" align="left">Observation error</td>
</tr>
<tr>
<td valign="top" align="left"><inline-formula><mml:math id="M59"><mml:mrow><mml:mi mathvariant="-tex-caligraphic">H</mml:mi></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td valign="top" align="left">Output operator</td>
</tr>
<tr>
<td valign="top" align="left"><bold>H</bold></td>
<td valign="top" align="left">Linear output operator</td>
</tr>
<tr>
<td valign="top" align="left"><bold>R</bold></td>
<td valign="top" align="left">Observation error covariance matrix</td>
</tr>
<tr>
<td valign="top" align="left"><bold>D</bold></td>
<td valign="top" align="left">Innovation matrix</td>
</tr>
<tr>
<td valign="top" align="left"><bold>K</bold></td>
<td valign="top" align="left">Kalman gain</td>
</tr>
<tr>
<td valign="top" align="left">&#x00394;<bold>Y</bold></td>
<td valign="top" align="left">Matrix of observations anomalies</td>
</tr>
<tr>
<td valign="top" align="left"><inline-formula><mml:math id="M60"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>B</mml:mtext></mml:mstyle></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></td>
<td valign="top" align="left">Knowled-aided estimator of the background<break/> covariance matrix</td>
</tr>
<tr>
<td valign="top" align="left"><bold>T</bold><sub><italic>KA</italic></sub></td>
<td valign="top" align="left">Target matrix</td>
</tr>
<tr>
<td valign="top" align="left">&#x003B1;</td>
<td valign="top" align="left">Wight value</td>
</tr>
<tr>
<td valign="top" align="left">&#x003B3;</td>
<td valign="top" align="left">Performance level</td>
</tr>
<tr>
<td valign="top" align="left"><bold>G</bold></td>
<td valign="top" align="left">H<sub>&#x0221E;</sub> gain</td>
</tr>
<tr>
<td valign="top" align="left">&#x00394;</td>
<td valign="top" align="left">H<sub>&#x0221E;</sub> covariance matrix</td>
</tr>
<tr>
<td valign="top" align="left"><bold>S</bold></td>
<td valign="top" align="left">H<sub>&#x0221E;</sub> user-chosen matrix</td>
</tr>
<tr>
<td valign="top" align="left"><italic>c</italic></td>
<td valign="top" align="left">Performance level coefficient</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>DA</term>
<def><p>Data Assimilation</p></def></def-item>
<def-item><term>KF</term>
<def><p>Kalman Filter</p></def></def-item>
<def-item><term>EnKF</term>
<def><p>ENsemble Kalman Filter</p></def></def-item>
<def-item><term>LETKF</term>
<def><p>Local Ensemble Transform Kalman Filter</p></def></def-item>
<def-item><term>KA</term>
<def><p>Knowledge-Aided</p></def></def-item>
<def-item><term>EnKF-KA</term>
<def><p>Ensemble Kalman Filter Knowledge-Aided</p></def></def-item>
<def-item><term>HF</term>
<def><p>H<sub>&#x0221E;</sub> Filter</p></def></def-item>
<def-item><term>EnTLHF</term>
<def><p>ENsemble Time Local H<sub>&#x0221E;</sub> Filter</p></def></def-item>
<def-item><term>EnTLHF-KA</term>
<def><p>ENsemble Time Local H<sub>&#x0221E;</sub> Filter Knowledge-Aided</p></def></def-item>
<def-item><term>RMSE</term>
<def><p>Root Mean Square Error</p></def></def-item>
<def-item><term>CTM</term>
<def><p>Chemical Transport Model</p></def></def-item>
<def-item><term>LE</term>
<def><p>LOTOS-EUROS simulation without data assimilation</p></def></def-item>
<def-item><term>LE-LETKF</term>
<def><p>LOTOS-EUROS simulation using the LETKF</p></def></def-item>
<def-item><term>LE-KA</term>
<def><p>LOTOS-EUROS simulation using the EnKF-KA</p></def></def-item>
<def-item><term>LE-Robust</term>
<def><p>LOTOS-EUROS simulation using the EnTLHF-KA</p></def></def-item>
<def-item><term>BS</term>
<def><p>Bottom Stations</p></def></def-item>
<def-item><term>OS</term>
<def><p>Outskirts Stations.</p></def></def-item>
</def-list>
</glossary>
</back>
</article>