Commentary: On the Efficiency of Covariance Localisation of the Ensemble Kalman Filter Using Augmented Ensembles

Bishop, Craig H.; Whitaker, Jeffrey S.; Lei, Lili

doi:10.3389/fams.2020.00002

GENERAL COMMENTARY article

Front. Appl. Math. Stat., 17 March 2020

Sec. Dynamical Systems

Volume 6 - 2020 | https://doi.org/10.3389/fams.2020.00002

This article is part of the Research TopicData Assimilation of Nonlocal Observations in Complex SystemsView all 8 articles

Commentary: On the Efficiency of Covariance Localisation of the Ensemble Kalman Filter Using Augmented Ensembles

This article is a commentary on:

On the Efficiency of Covariance Localisation of the Ensemble Kalman Filter Using Augmented Ensembles
1. Read original article

Craig H. Bishop¹^*

Jeffrey S. Whitaker²

Lili Lei³

¹School of Earth Sciences and Centre of Excellence for Climate Extremes, University of Melbourne, Parkville, VIC, Australia
²NOAA Earth System Research Lab, Boulder, CO, United States
³School of Atmospheric Sciences, Nanjing University, Nanjing, China

A Commentary on
On the Efficiency of Covariance Localisation of the Ensemble Kalman Filter Using Augmented Ensembles

by Farchi, A., and Bocquet, M. (2019). On the Efficiency of Covariance Localisation of the Ensemble Kalman Filter Using Augmented Ensembles. Front. Appl. Math. Stat. 5:3. doi: 10.3389/fams.2019.00003

Introduction

In discussing Equation (39) (Equation (25) of Bocquet [1]), Farchi and Bouquet [2] state that “This perturbation update has been rediscovered by Bishop et al. [3] and included in their gain ETKF (GETKF) algorithm. However, the update formula used in the GETKF is prone to numerical cancellation errors as opposed to Equation (39)”. Here, we note:

(i) The predecessor of the GETKF eigenvalue form of the modified gain matrix equation appeared in Posselt and Bishop [4, 5]—before Bocquet [1].

(ii) The spectral shift theorem reduces the differences in the numerical cancellation errors referred to by Farchi and Bouquet.

(iii) The eigenvalue form enables Wang et al.'s [6] corrections for ensemble rank deficiency.

(iv) A proof of the equivalence of the eigenvalue form and Bouquet's form.

On page 12, Farchi and Bouquet [2] also state that “Such an extension had been discussed by Bishop et al. [3] but without numerical illustration.” This is incorrect. Lei et al. [7] used the GETKF to show that model space ensemble covariance localization provided satellite data assimilation (DA) performance comparable to 3DEnsVar.

To be specific about the forms of the modified gain matrix, let

\begin{array}{l} Z^{f} = \frac{[((x_{1}^{f}) - \bar{(x_{}^{f})}), ((x_{2}^{f}) - \bar{(x_{}^{f})}), \dots, ((x_{K}^{f}) - \bar{(x_{}^{f})})]}{\sqrt{K - 1}}, and \\ \tilde{H} Z^{f} = \frac{R^{- 1 / 2} [(H (x_{1}^{f}) - \bar{H (x_{}^{f})}), (H (x_{2}^{f}) - \bar{H (x_{}^{f})}), \dots, (H (x_{K}^{f}) - \bar{H (x_{}^{f})})]}{\sqrt{K - 1}} & (1) \end{array}

where K is the total number of ensemble members in the ensemble forecast and where the n-vector $x_{i}^{f}$ is the ith member of the prior ensemble forecast and where the p-vector $H (x_{i}^{f})$ is the ith member of the prior ensemble forecast of the p-vector y of p observations. When p<K, the numerical cost of the pxp eigen decomposition

\begin{array}{l} \tilde{H} Z^{f} {(\tilde{H} Z^{f})}^{T} = E Γ_{p x p} E^{T} & (2) \end{array}

is less than the K × K eigen decomposition

\begin{array}{l} {(\tilde{H} Z^{f})}^{T} \tilde{H} Z^{f} = C Γ_{K \times K} C^{T} . & (3) \end{array}

In (2), E is a pxp eigenvector matrix for which $E E^{T} = E^{T} E = I_{p x p}$ , Γ_pxp is a pxp diagonal matrix of eigenvalues. In (3), C is a K × K orthonormal matrix of eigenvectors $(C C^{T} = C^{T} C = I_{K \times K})$ and Γ_KxK is a K × K diagonal matrix of eigenvalues. At least K-p of the eigenvalues in Γ_{K × K} will be equal to zero in the case of K>p. Equation's (2) and (3) are directly connected to the verbose singular value decomposition $\tilde{H} Z^{f} = E_{p x p} Γ_{p x K}^{1 / 2} C_{K \times K}^{T}$ where $Γ_{p x K}^{1 / 2} = [Γ_{p x p}^{1 / 2} 0_{p x (K - p)}]$ where 0_px(K−p) is a px(K−p) matrix of zeros. However, since the columns of C associated with zero eigenvalues cannot contribute to products of the matrix $\tilde{H} Z^{f}$ with other vectors, it is more efficient to work with the concise svd given by $\tilde{H} Z^{f} = E_{p x p} Γ_{p x p}^{1 / 2} {(L_{K x p}^{})}^{T}$ where $L_{K x p}^{}$ lists the p columns of $C_{K \times K}^{}$ having non-zero eigenvalues. Posselt and Bishop [4, 5] note that $L_{K x p}^{}$ is given by

\begin{array}{l} {(L_{K x p}^{})}^{T} = Γ_{p x p}^{- 1 / 2} E^{T} \tilde{H} Z^{f} or L_{K x p}^{} = {(\tilde{H} Z^{f})}^{T} E Γ_{p x p}^{- 1 / 2} & (4) \end{array}

and hence can be computed without performing an eigen decomposition of the larger K × K matrix in (3). Posselt and Bishop [4, 5] prove that for a linear observation operator $\tilde{H}$ , if

\begin{array}{l} Z^{a} = {I - Z^{f} L_{K x p}^{} [I_{p x p} - {(Γ_{p x p} + I)}^{- 1 / 2}] Γ_{p x p}^{- 1 / 2} E^{T} \tilde{H}} Z^{f} & (5) \end{array}

(see [5], Equation A10) then

\begin{array}{l} P^{a} = Z^{a} Z^{a T} \\ = Z^{f} Z^{f T} - Z^{f} {(\tilde{H} Z^{f})}^{T} {((\tilde{H} Z^{f}) {(\tilde{H} Z^{f})}^{T} + I_{p x p})}^{- 1} (\tilde{H} Z^{f}) Z^{f T} \\ = P^{f} - P^{f} {\tilde{H}}^{T} {(\tilde{H} P^{f} {\tilde{H}}^{T} + I_{p x p})}^{- 1} \tilde{H} P^{f} . & (6) \end{array}

The analysis perturbations are given by $X^{a} = Z^{a} \sqrt{K - 1}$ , hence,

\begin{array}{l} X^{a} = {I_{K x K} - Z^{f} L_{K x p} [I_{p x p} - {(Γ_{p x p} + I)}^{- 1 / 2}] Γ_{p x p}^{- 1 / 2} E^{T} \tilde{H}} X^{f}, \\ where X^{f} = Z^{f} \sqrt{K - 1} & (7) \end{array}

is the perturbation update equation implied by Posselt and Bishop [4, 5].

In the above notation, and when propagation of small amplitude ensemble perturbations by the non-linear model is replaced by the propagation of raw ensemble perturbations by the non-linear model (i.e., no tangent linear model approximation is made), Bocquet's Equation (25) [1] for the ensemble perturbation update takes the form,

\begin{array}{l} X_{B o u q u e t}^{a} = {I_{K \times K} - Z^{f} [I_{K \times K} + {(\tilde{H} Z^{f})}^{T} (\tilde{H} Z^{f}) \\ + {(I_{K \times K} + {(\tilde{H} Z^{f})}^{T} (\tilde{H} Z^{f}))}^{1 / 2}]^{- 1} {(\tilde{H} Z^{f})}^{T} \tilde{H}} X^{f} . & (8) \end{array}

A fundamental difference between (7) and (8) is that while Bouquet multiplies Z^f by the KxK matrix ${[I_{K \times K} + {(\tilde{H} Z^{f})}^{T} (\tilde{H} Z^{f}) + {(I_{K \times K} + {(\tilde{H} Z^{f})}^{T} (\tilde{H} Z^{f}))}^{1 / 2}]}^{- 1}$ Posselt and Bishop multiply it by the Kxp matrix $L_{K x p}^{} [I_{p x p} - {(Γ_{p x p} + I)}^{- 1 / 2}]$ . When K>p, Posselt and Bishop's form only requires the eigenvector decomposition of a pxp matrix, whereas Bouquet's form requires the inversion of a larger K × K matrix. However, when p>K, the eigen decomposition (3) is cheaper than (2), $L_{K x p}^{}$ becomes identical to the K × K matrix $C_{K \times K}^{}$ and $\tilde{H} Z^{f} = E_{p x K} Γ_{K \times K}^{1 / 2} C_{K \times K}^{T}$ becomes the concise svd of $\tilde{H} Z^{f}$ . In this case, E_pxK is efficiently given by $\tilde{H} Z^{f} C_{K \times K}^{} Γ_{K \times K}^{- 1 / 2} = E_{p x K}$ and (7) becomes

\begin{array}{l} X^{a} = {I - Z^{f} C_{K \times K} [I_{K \times K} - {(Γ_{K \times K} + I)}^{- 1 / 2}] \\ Γ_{K \times K}^{- 1} C_{K \times K}^{T} {(\tilde{H} Z^{f})}^{T} \tilde{H}} X^{f} & (9) \end{array}

Dividing Equation (9) by $\sqrt{K - 1}$ recovers Equation (24) of Bishop et al. [3].

The above shows that Bishop et al.'s Equation (24) [3] was not “rediscovered” from Bocquet's [1] form as implied by Farchi and Bocquet [2]. It is an extension of Posselt and Bishop's [4, 5] eigenvalue form to the case of K>p. Equation (9) is just an eigenvalue form of the modified gain matrix of Whitaker and Hamill's [8] Ensemble Square Root Filter.

Equivalence of (9) and (8)

Bocquet's Equation (25) [1] can be derived from (9) with the following steps:

(i) Drop the dimension subscripts and manipulate [I − (Γ + I)^−1/2]Γ⁻¹ as follows

\begin{array}{l} \begin{array}{l} Γ^{- 1} [I - {(Γ + I)}^{- 1 / 2}] \\ = Γ^{- 1} [I - {(Γ + I)}^{- 1 / 2}] [I + {(Γ + I)}^{- 1 / 2}] \\ {[I + {(Γ + I)}^{- 1 / 2}]}^{- 1} \\ = Γ^{- 1} [I - {(Γ + I)}^{- 1}] {[I + {(Γ + I)}^{- 1 / 2}]}^{- 1} \\ = Γ^{- 1} [(Γ + I) {(Γ + I)}^{- 1} - {(Γ + I)}^{- 1}] \\ {[I + {(Γ + I)}^{- 1 / 2}]}^{- 1} \end{array} \\ \begin{array}{l} = Γ^{- 1} [Γ {(Γ + I)}^{- 1}] {[I + {(Γ + I)}^{- 1 / 2}]}^{- 1} \\ = Γ^{- 1} Γ {[(Γ + I) [I + {(Γ + I)}^{- 1 / 2}]]}^{- 1} \\ = {[(Γ + I) + {(Γ + I)}^{1 / 2}]}^{- 1} \end{array} & (10) \end{array}

(ii) Use (10) in (9) to give

\begin{array}{l} X^{a} = {I - Z^{f} C {[(Γ + I) + {(Γ + I)}^{1 / 2}]}^{- 1} C_{}^{T} {(\tilde{H} Z^{f})}^{T} \tilde{H}} X^{f} \\ = {I - Z^{f} {[C_{} (Γ + I) C_{}^{T} + C_{} {(Γ + I)}^{1 / 2} C_{}^{T}]}^{- 1} {(\tilde{H} Z^{f})}^{T} \tilde{H}} X^{f} \\ = {I - Z^{f} {[(C Γ C_{}^{T} + I) + C_{} {(Γ + I)}^{1 / 2} C_{}^{T}]}^{- 1} {(\tilde{H} Z^{f})}^{T} \tilde{H}} X^{f} & (11) \end{array}

(iii) But

\begin{array}{l} {[(C_{} Γ C_{}^{T} + I)]}^{1 / 2} = {[C_{} (Γ + I) C_{}^{T}]}^{1 / 2} = C {(Γ + I)}^{1 / 2} C_{}^{T} & (12) \end{array}

(iv) Using (12) and (3) in (11) gives

\begin{array}{l} \begin{array}{l} X^{a} = {I - Z^{f} {[(C Γ C_{}^{T} + I) + {(C Γ C_{}^{T} + I)}^{1 / 2}]}^{- 1} {(\tilde{H} Z^{f})}^{T} \tilde{H}} X^{f} \\ = {I - Z^{f} [({(\tilde{H} Z^{f})}^{T} \tilde{H} Z^{f} + I) \\ {+ {({(\tilde{H} Z^{f})}^{T} \tilde{H} Z^{f} + I)}^{1 / 2}]}^{- 1} {(\tilde{H} Z^{f})}^{T} \tilde{H}} X^{f} \end{array} & (13) \end{array}

Equation (13) is equivalent to (8) and Bocquet's Equation (25) [1].

Numerical Issues, Condition Numbers, and Understanding

Numerical cancellation errors increase when the condition number of the matrix increases. Let us define the scalars $γ_{i}^{max}$ and $γ_{i}^{min}$ to, respectively, denote the maximum and minimum of the eigenvalues listed in the eigenvalue matrix Γ_pxp. The condition number of ${(\tilde{H} Z)}^{T} \tilde{H} Z$ is $κ [{(\tilde{H} Z)}^{T} \tilde{H} Z] = \frac{γ_{i}^{max}}{γ_{i}^{min}}$ . Because $γ_{i}^{min}$ can be zero, $κ [{(\tilde{H} Z)}^{T} \tilde{H} Z]$ can be infinite. In contrast, $κ [({(\tilde{H} Z)}^{T} \tilde{H} Z + I) + {({(\tilde{H} Z)}^{T} \tilde{H} Z + I)}^{1 / 2}] = \frac{(γ_{i}^{max} + 1) + {(γ_{i}^{max} + 1)}^{1 / 2}}{(γ_{i}^{min} + 1) + {(γ_{i}^{min} + 1)}^{1 / 2}}$ is bounded above by $[(γ_{i}^{max} + 1) + {(γ_{i}^{max} + 1)}^{1 / 2}] / 2$ . However, note that the matrix $[{(\tilde{H} Z)}^{T} \tilde{H} Z + α I]$ has the eigenvalue decomposition

\begin{array}{l} [{(\tilde{H} Z)}^{T} \tilde{H} Z + α I] = C Λ C^{T} = C (Γ + α I) C^{T} & (14) \end{array}

and hence has $κ [{(\tilde{H} Z)}^{T} \tilde{H} Z + α I] = \frac{γ_{i}^{max} + α}{γ_{i}^{min} + α}$ which is bounded above by $\frac{γ_{i}^{max}}{α} + 1$ where α is a positive scalar. Hence, α can be chosen to create a matrix that is better conditioned than $[({(\tilde{H} Z)}^{T} \tilde{H} Z + I) + {({(\tilde{H} Z)}^{T} \tilde{H} Z + I)}^{1 / 2}]$ . Once the eigen decomposition CΛC^T of (14) has been obtained, one obtains the eigenvalues required by the GETKF or ETKF using Γ = Λ − αI. Thus, condition number differences between the Bouquet and eigenvalue form are easily eliminated.

The eigenvalue form lends understanding to the performance of DA schemes in much the same way that Empirical Orthogonal Functions lend understanding to climate variability. Wang et al. [6] used this understanding to correct gross aspects of the eigenvalue overestimation that occurs when the size of the ensemble is much smaller than the rank of the true observation space forecast error covariance matrix.

Discussion

Bocquet [1] and Farchi and Bocquet [2] may have overlooked Posselt and Bishop's [4, 5] work because:

(i) It is difficult to find all relevant literature to one's own work.

(ii) Bishop et al. [3] did not cite Posselt and Bishop [4, 5].

(iii) The equivalence of Posselt and Bishop's [4, 5] form and Bocquet's [1] form is not obvious.

Similarly, Bishop et al. [3] overlooked Bocquet's [1] work because of (i) and (iii). This note serves to clarify the origins and uses of modified gain matrices used in ensemble DA.

Author Contributions

CB led the study and wrote the text. JW and LL carefully reviewed the text.

Funding

CB acknowledges support from the Australian Research Council's Centre of Excellence in Climate Extremes (CE170100023). LL acknowledges joint sponsorship by the National Key R&D Program of China through grant 2017YFC1501603 and the National Natural Science Foundation of China through grant 41675052. JW acknowledges support through the NOAA High-Impact Weather Prediction Project (HIWPP) under award NA14OAR4830123, and the NOAA/NWS Next-Generation Global Prediction System (NGGPS) project.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1. Bocquet M. Localization and the iterative ensemble Kalman smoother. Q J R Meteorol Soc. (2016) 142:1075–89. doi: 10.1002/qj.2711

CrossRef Full Text | Google Scholar

2. Farchi A, Bocquet M. On the efficiency of covariance localisation of the ensemble Kalman filter using augmented ensembles. Front Appl Math Stat. (2019) 5:3. doi: 10.3389/fams.2019.00003

CrossRef Full Text | Google Scholar

3. Bishop CH, Whitaker JS, Lei L. Gain form of the ensemble transform Kalman filter and its relevance to satellite data assimilation with model space ensemble covariance localization. Mon Wea Rev. (2017) 145:4575–92. doi: 10.1175/MWR-D-17-0102.1

CrossRef Full Text | Google Scholar

4. Posselt DJ, Bishop CH. Nonlinear parameter estimation: comparison of an ensemble Kalman smoother with a Markov chain Monte Carlo algorithm. Mon Wea Rev. (2012) 140:1957–74. doi: 10.1175/MWR-D-11-00242.1

CrossRef Full Text | Google Scholar

5. Posselt DJ. Bishop CH. Corrigendum. Mon Wea Rev. (2014) 142:1382. doi: 10.1175/MWR-D-13-00342.1

CrossRef Full Text

6. Wang X, Hamill TM, Whitaker JS, Bishop CH. A comparison of hybrid ensemble transform Kalman filter–optimum interpolation and ensemble square root filter analysis schemes. Mon Wea Rev. (2007) 135:1055–76. doi: 10.1175/MWR3307.1

CrossRef Full Text | Google Scholar

7. Lei L, Whitaker JS, Bishop C. Improving assimilation of radiance observations by implementing model space localization in an ensemble Kalman filter. J Adv Model Earth Syst. (2018) 10:3221–32. doi: 10.1029/2018MS001468

CrossRef Full Text | Google Scholar

8. Whitaker JS, Hamill TM. Ensemble data assimilation without perturbed observations. Mon Wea Rev. (2002) 130:1913–24. doi: 10.1175/1520-0493(2002)130<1913:EDAWPO>2.0.CO;2

CrossRef Full Text | Google Scholar

Keywords: ensemble Kalman filter, modified Kalman gain, eigenvalue form of Kalman gain, GETKF, ETKF

Citation: Bishop CH, Whitaker JS and Lei L (2020) Commentary: On the Efficiency of Covariance Localisation of the Ensemble Kalman Filter Using Augmented Ensembles. Front. Appl. Math. Stat. 6:2. doi: 10.3389/fams.2020.00002

Received: 26 November 2019; Accepted: 29 January 2020;
Published: 17 March 2020.

Edited by:

Raluca Eftimie, University of Dundee, United Kingdom

Reviewed by:

Xin Tong, National University of Singapore, Singapore

Copyright © 2020 Bishop, Whitaker and Lei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Craig H. Bishop, Y3JhaWcuYmlzaG9wQHVuaW1lbGIuZWR1LmF1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.