Full article: Some quantitative characteristics of error covariance for Kalman filters

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Some quantitative characteristics of error covariance are studied for linear Kalman filters. These quantitative characteristics include the peak value and location in the matrix, the decay rate from peak to bottom, and some algebraic constraints of the elements in the covariance matrix. We mathematically prove a matrix upper bound and its quantitative characteristics for the error covariance of Kalman filters. Computational methods are developed to numerically estimate the elements in a matrix upper bound and its decay rate. The quantitative characteristics and the computational methods are illustrated using three examples, two linear systems and one nonlinear system of shallow water equations.

Keywords:

1. Introduction

A fundamental challenge in data assimilation lies in the computationally intractable process of estimating and updating the error covariance for high dimensional problems. Various methodologies, from ensemble-based methods to approaches of hybrid error covariance, have been extensively studied in the literature (Houtekamer and Zhang, Citation2016). Due to the limitation of computational power, the error covariance has to be estimated based upon a relatively small number of ensemble members. Rank deficiency, underestimation, and sampling errors are some notable drawbacks of these approaches. As a result, additional numerical treatments, such as covariance localisation and variance inflation, are required to improve the stability of algorithm and estimation accuracy. Some key parameters are critical for the effectiveness of these numerical treatments, for instance the radius of localisation and the magnitude of inflation. These parameters are estimated based on physical and statistical grounds and are validated through numerical experimentations (Bannister, Citation2008a, Citation2008b; Ménétrier et al., Citation2015).

In Gaspari and Cohn (Citation1999), the magnitude and decay of error covariances are quantitatively outlined using correlation functions. The result has been widely used for the localisation of ensemble Kalman filters (EnKFs). Based on Schur product theorem, the element-wise multiplication of two positive definite matrices guarantees that the resulting symmetric matrix is a positive definite matrix itself. However, the underlying relationship between the parameter in a correlation function and the system model to be estimated is not clear. In this paper, we study some closely related subjects, the magnitude and decay of elements in error covariance, but taking a different approach. We derive and prove some quantitative characteristics of error covariance that are determined by various parts in a system model, including its dynamic model, the observation model and the covariances of random variables representing model uncertainties and observation error. The quantitative characteristics studied in this paper include the peak value and its location in the matrix, the decay rate from peak to bottom, the approximate sparsity pattern, etc.

The contributions of this paper consist of three parts. (i) We mathematically prove a theorem and several propositions on the quantitative characteristics of error covariance for linear Kalman filters. The results unveil some interconnections between system models and error covariance. (ii) We develop computational methods so that quantitative characteristics can be numerically computed. (iii) We demonstrate some potential applications of the discoveries through three examples. In the first example, we sketch the outline shape of error covariance using its peak and bottom value as well as the curve of a decay function. The method’s goal is not about replacing the need of estimating the error covariance in a process, such as EnKF or 4D-Var. Instead, the outline shape provides information that helps in the validation of an estimated error covariance if the full matrix is intractable or if its estimation is based on a relatively small ensemble size. In the second example, the quantitative characteristics of error covariance are used as an indicator of the upper bound for the localisation radius of an EnKF. In the third example, the results proved for linear systems are tested using a nonlinear model, the shallow water equations.

The theoretical approach in this paper is based on the use of the controllability Gramian of dynamical systems (see, for instance, Kailath (Citation1980)). The concept has been widely used in control theory to measure the impact of a system’s inputs, such as the model uncertainty in this study, on the system’s trajectories. We prove that the error covariance of the Kalman-Bucy filter equals the difference between two controllability Gramians, in addition to the propagation of the initial background error covariance. This relationship leads to a matrix upper bound of the error covariance. Based upon the concept of duality in the control theory, each element in the upper bound matrix can be computed individually without the need of estimation of the entire error covariance matrix of a Kalman filter. This property makes the computational algorithms in this paper being component-based, i.e. the quantitative characteristics of error covariance can be computed over any given component in the matrix, such as a submatrix, a block of any shape, a row/column, etc. It is also proved that, if the observation measures a state variable, some elements in error covariance satisfy an inequality constraint.

In Section 2, the concept of controllability Gramian is introduced. The relationship between error covariance and controllability Gramian is proved. Then a matrix upper bound of error covariance is added. In Section 3.1, some quantitative characteristics of controllability Gramian, such as the location of peak covariance and the decay rate from peak to bottom, are proved. The tedious algebraic derivations in the proof are included in the Appendix at the end of the paper. In Section 3.2, an algebraic inequality of error covariance deduced from the observation model is proved. In Section 4, we introduce two computational algorithms that are component-based. Section 5 contains three examples demonstrating how to collectively apply the theorems and computational algorithms from previous sections to outline the features of the error covariance.

2. Error covariance and controllability Gramian

Shown in are two error covariance matrices of Kalman filters, one for a linear system of ordinary differential equations and the other for a Discretised shallow water equation. The x- and y- axes represent row and column indices, i and j; the z-axis represents the absolute value of the covariance, $| P_{i j} | .$ In this way, a matrix is visualised as a 3D graph. In this paper, the geometric outline of the 3D graph is referred as the shape of error covariance. Both graphs in have similar geometric patterns in which a large number of elements has relatively small value and the covariance has its peaks along one or multiple diagonal lines. Relative to their peak value, both matrices are approximately sparse. In this paper, the shape of error covariance is studied using a set of quantitative characteristics that include, but not limited to, the following list of quantifiable geometric properties:

Fig. 1. Examples of Kalman filter error covariance.

the peak value and its location,
the decay rate from peak to bottom,
inequality constraints of covariance elements.

For high dimensional problems, the true error covariance of a Kalman filter is computationally intractable. In mathematical analysis, a widely used idea in the estimation of an unknown variable is to find a computationally feasible upper/lower bound. In this section, we prove a relationship between the error covariance of Kalman-Bucy filters and two controllability Gramians. As a corollary of the relationship, a matrix upper bound of the error covariance is found. An advantage of this matrix upper bound is that the quantitative characteristics of its shape are computationally tractable. Let’s consider a dynamical system model defined by ordinary differential equations (ODEs) (1a) $\dot{x} (t) = A x (t) + q w (t),$ (1a) (1b) $y (t) = H x (t) + r v (t)$ (1b) where $x \in R^{n}$ is the state variable, $\dot{x}$ represents its time derivative, $y \in R^{m}$ is the observation variable, $w \in R^{n}$ and $v \in R^{m}$ are zero-mean Gaussian white noises with an identity covariance. The matrices $q \in R^{n \times n}$ and $r \in R^{m \times m}$ determine the covariances of the model uncertainty and the observation error, respectively. Denote $Q = q q^{T}$ and $R = r r^{T} .$ Its Kalman-Bucy filter has the following form (2a) $\dot{\hat{x}} (t) = A \hat{x} (t) + K (t) (y (t) - H \hat{x} (t)),$ (2a) (2b) $K (t) = P (t) H^{T} R^{- 1}$ (2b) (2c) $\dot{P} (t) = A P (t) + P (t) A^{T} + Q - K (t) R K {(t)}^{T} .$ (2c) where K(t) is the Kalman gain, P(t) is the estimation error covariance and y(t) represents observed data. Multiplying $e^{- A t}$ and $e^{- A^{T} t}$ to (Equation2c(2c) $\dot{P} (t) = A P (t) + P (t) A^{T} + Q - K (t) R K {(t)}^{T} .$ (2c) ), we have $e^{- A t} \dot{P} (t) e^{- A^{T} t} - e^{- A t} A P (t) e^{- A^{T} t} - e^{- A t} P (t) A^{T} e^{- A^{T} t} = e^{- A t} (Q - K (t) R K {(t)}^{T}) e^{- A^{T} t}$ This is equivalent to $\frac{d}{d t} (e^{- A t} P (t) e^{- A^{T} t}) = e^{- A t} (Q - K (t) R K {(t)}^{T}) e^{- A^{T} t}$ The integration of this equation yields an integral equation of P(t) (3) $P (t) = e^{A t} P (0) e^{A^{T} t} + \int_{0}^{t} e^{A (t - τ)} (Q - K (τ) R K {(τ)}^{T}) e^{A^{T} (t - τ)} d τ$ (3) The integration on the right-hand side requires the history of K(t). If it is integrated in a subinterval $[t_{1}, t_{2}] \subseteq [0, t],$ we have a matrix upper bound of the error covariance (4) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + \int_{0}^{t} e^{A (t - τ)} Q e^{A^{T} (t - τ)} d τ - \int_{t_{1}}^{t_{2}} e^{A (t - τ)} K (τ) R K {(τ)}^{T} e^{A^{T} (t - τ)} d τ$ (4) If t₁ = t₂, the second integration equals zero. The matrix upper bound, in this case, represents the error covariance without corrections using observation information. The integrals on the right-hand side of (Equation3(3) $P (t) = e^{A t} P (0) e^{A^{T} t} + \int_{0}^{t} e^{A (t - τ)} (Q - K (τ) R K {(τ)}^{T}) e^{A^{T} (t - τ)} d τ$ (3) ) and (Equation4(4) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + \int_{0}^{t} e^{A (t - τ)} Q e^{A^{T} (t - τ)} d τ - \int_{t_{1}}^{t_{2}} e^{A (t - τ)} K (τ) R K {(τ)}^{T} e^{A^{T} (t - τ)} d τ$ (4) ) are closely related to the concept of controllability Gramian in control theory.

Definition 1.

The controllability Gramian of the system (5) $\dot{x} (t) = A x (t) + q (t) u, x \in R^{n}, A \in R^{n \times n}, u \in R^{p}, q \in R^{n \times p}$ (5) in the time interval $[t_{1}, t_{2}]$ is (6) $G^{C} (t_{1}, t_{2}) = \int_{t_{1}}^{t_{2}} e^{A (t_{2} - τ)} q (τ) q^{T} (τ) e^{A^{T} (t_{2} - τ)} d τ$ (6) For the special case $[t_{1}, t_{2}] = [0, t],$ we use a simpler notation, $G^{C} (t) .$

The Gramian, a symmetric positive semidefinite matrix, is a quantitative measure of the sensitivity of system trajectories to the variation of the system’s input. If the Gramian has a set of large eigenvalues, then x(t) can reach any given state using a control input that has a relatively small L₂-norm. The expressions in (Equation3(3) $P (t) = e^{A t} P (0) e^{A^{T} t} + \int_{0}^{t} e^{A (t - τ)} (Q - K (τ) R K {(τ)}^{T}) e^{A^{T} (t - τ)} d τ$ (3) ) and (Equation4(4) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + \int_{0}^{t} e^{A (t - τ)} Q e^{A^{T} (t - τ)} d τ - \int_{t_{1}}^{t_{2}} e^{A (t - τ)} K (τ) R K {(τ)}^{T} e^{A^{T} (t - τ)} d τ$ (4) ) are summarised in the following propositions.

Proposition 1.

The error covariance, P(t), of the Kalman-Busy filter (Equation2(2a) $\dot{\hat{x}} (t) = A \hat{x} (t) + K (t) (y (t) - H \hat{x} (t)),$ (2a) ) satisfies, (7) $P (t) = e^{A t} P (0) e^{A^{T} t} + G^{C} (t) - {\hat{G}}^{C} (t)$ (7) where $G^{C} (t)$ is the controllability Gramian of (Equation1a(1a) $\dot{x} (t) = A x (t) + q w (t),$ (1a) ) in the time interval $[0, t], {\hat{G}}^{C} (t)$ is the controllability Gramian of the control system (8) $\dot{\hat{x}} (t) = A \hat{x} (t) + K (t) r v$ (8) which is the Kalman-Bucy filter (Equation2a(2a) $\dot{\hat{x}} (t) = A \hat{x} (t) + K (t) (y (t) - H \hat{x} (t)),$ (2a) ) in which $y - H \hat{x}$ is treated as a control input rv.

Proposition 2.

The error covariance, P(t), of the Kalman-Busy filter (2) has an upper bound matrix (9) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t) - {\hat{G}}^{C} (t_{1}, t_{2})$ (9) where $[t_{1}, t_{2}] \subseteq [0, t]$ . As a special case, (10) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t)$ (10)

Propositions 1 and 2 can be interpreted as follows. In addition to the propagation of P(0), P(t) is determined by the controllability Gramian of the system model subtracting the controllability Gramian of the Kalman filter. If we consider the model uncertainty, w, as a control variable, then its controllability Gramian represents the worst estimation error without any error correction. On the other hand, the controllability Gramian of the Kalman-Bucy filter represents the reduction of error covariance in the filtering process. A more controllable filter results in more aggressive error correction, thus smaller estimation error. Among the three terms in (Equation9(9) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t) - {\hat{G}}^{C} (t_{1}, t_{2})$ (9) ), we focus on the the second term, the controllability Gramian of the original system (Equation1(1a) $\dot{x} (t) = A x (t) + q w (t),$ (1a) ). However, the same computational algorithm that we use for the estimation of G^C is also applicable to the propagation of initial error covariance, the first term in (Equation9(9) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t) - {\hat{G}}^{C} (t_{1}, t_{2})$ (9) ). It is summarised in Proposition 6. In this paper, we do not address the computation of the 3rd term in (Equation9(9) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t) - {\hat{G}}^{C} (t_{1}, t_{2})$ (9) ). Although this is a very important term that reduces the matrix upper bound using observation model, an efficient computational algorithm for this term in high dimensional spaces has yet to be found.

The inequality (Equation10(10) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t)$ (10) ) has a discrete-time version. Consider the system model, (11a) $x_{k} = A_{k - 1} x_{k - 1} + q_{k - 1} w_{k - 1}, x_{k} \in R^{n}, A_{k} \in R^{n \times n}$ (11a) (11b) $y_{k} = H_{k} x_{k} + r_{k} v_{k}, y_{k} \in R^{m}, H_{k} \in R^{m \times n}$ (11b) where w_k and v_k are serially uncorrelated random variables with a Gaussian distribution that has zero mean and identity covariance. Let $Q_{k} = q_{k} q_{k}^{T}$ and $R_{k} = r_{k} r_{k}^{T} .$ Its Kalman filter is a discrete-time system in the following form, (12a) $x_{k}^{-} = A_{k - 1} x_{k - 1}^{+},$ (12a) (12b) $P_{k}^{-} = A_{k - 1} P_{k - 1}^{+} A_{k - 1}^{T} + Q_{k - 1}$ (12b) (12c) $K_{k} = P_{k}^{-} H_{k}^{T} {(H_{k} P_{k}^{-} H_{k}^{T} + R_{k})}^{- 1}$ (12c) (12d) $x_{k}^{+} = x_{k}^{-} + K_{k} (y_{k} - H_{k} x_{k}^{-})$ (12d) (12e) $P_{k}^{+} = P_{k}^{-} - K_{k} (H_{k} P_{k}^{-} H_{k}^{T} + R_{k}) K_{k}^{T}$ (12e) where $x_{k}^{+}$ and $P_{k}^{+}$ are the analysis state and analysis error covariance, $x_{k}^{-}$ and $P_{k}^{-}$ are the background state and background error covariance. Using mathematical induction, it is straightforward to prove the following upper bound of $P_{k}^{+} .$

Proposition 3.

(13) $P_{k}^{+} \leq A_{k - 1} \dots A_{0} P_{0}^{+} A_{0}^{T} \dots A_{k - 1}^{T} + G_{k}^{C}$ (13)

where $G_{k}^{C}$ is the (discrete-time) controllability Gramian (14) $G_{k}^{C} = \sum_{j = 0}^{k - 2} A_{k - 1} A_{k - 2} \dots A_{j + 1} Q_{j} A_{j + 1}^{T} \dots A_{k - 2}^{T} A_{k - 1}^{T} + Q_{k - 1}$ (14)

Remark.

If the space dimension is high, the controllability Gramian is computationally intractable. However, there exist numerical methods that can compute individual elements in the Gramian, i.e. component-based algorithms. They are introduced in Section 4, including Proposition 5 for linear systems, Proposition 6 for the propagation of the initial error covariance, and (Equation34(34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) ) for nonlinear systems.

3. Some quantitative characteristics of the shape of error covariance

In this section, we prove a decay rate for $G^{C} (t)$ and a set of algebraic constraints for the error covariance of Kalman filters.

3.1. The decay rate of $G^{C} (t)$

Some quantitative characteristics of the shape of $G^{C} (t)$ can be mathematically proved and numerically computed. The main result in this section is inspired by the approximate sparsity pattern of exponential matrices proved in Iserles (Citation2000). The following function, $c (α, x),$ is used repeatedly in the analysis, (15) $c (α, x) = \frac{α^{| x |}}{| x |^{| x |}}, α > 0, x \neq 0.$ (15) Because $\lim_{| x | \to \infty} | x |^{1 / | x |} = 1, \lim_{| x | \to \infty} α^{1 / | x |} = 1$ and $\frac{1}{e} < 1,$ we know $\begin{matrix} c (α, x) = {(\frac{α^{1 / | x |}}{| x |^{1 / | x |}})}^{x^{2}} > {(\frac{1}{e})}^{x^{2}} \\ c (α, x) = {(\frac{α}{| x |})}^{| x |} < {(\frac{1}{e})}^{| x |} \end{matrix}$ provided that $| x |$ is large enough. This inequality and some properties of $c (α, x)$ are summarised as follows.

Lemma 1.

Suppose that α and x are positive numbers. Then $c (α, x)$ is an increasing function of $α > 0$ and a decreasing function of $x \in [α, \infty]$ . Furthermore $\lim_{x \to \infty} c (α, x) = 0$ for any constant α. When $| x | \to \infty,$ (16) $e^{- x^{2}} < c (α, x) < e^{- | x |} .$ (16) i.e. $c (α, x)$ decreases faster than exponential but slower than the rate of $e^{- x^{2}} .$

In the following, a matrix A is said to be banded with bandwidth $s \geq 0$ if A_ij = 0 whenever $| i - j | > s .$ If s = 0, then the matrix is diagonal.

Theorem 1.

Suppose that A and $Q = q q^{T}$ in (1) are banded. For any t > 0, there exist positive constants $\bar{G}$ , α, β, γ, and L so that (17) ${| (G^{C} (t))}_{i j} | \leq \bar{G} c (α, (| j - i | + β) / γ), if | j - i | > L .$ (17)

Note that the inequality (Equation17(17) ${| (G^{C} (t))}_{i j} | \leq \bar{G} c (α, (| j - i | + β) / γ), if | j - i | > L .$ (17) ) is an upper bound that approximates the decay of $G_{i j}^{C}$ as $| i - j |$ increases. It implies that $G^{C} (t)$ peaks around the diagonal in the matrix; and the value of $| G_{i j}^{C} |$ decreases faster than exponential if moving away from the diagonal. In this theorem, it is assumed that A is banded along the main diagonal. If A consists of multiple banded submatrices, simulations show that the error covariance peaks along corresponding sub-diagonals. In the following, the right-hand side of the equation Equation((17) ${| (G^{C} (t))}_{i j} | \leq \bar{G} c (α, (| j - i | + β) / γ), if | j - i | > L .$ (17) Equation17(17) ${| (G^{C} (t))}_{i j} | \leq \bar{G} c (α, (| j - i | + β) / γ), if | j - i | > L .$ (17) Equation)(17) ${| (G^{C} (t))}_{i j} | \leq \bar{G} c (α, (| j - i | + β) / γ), if | j - i | > L .$ (17) is called the decay function.

The proof of this theorem involves lengthy and tedious algebraic derivations, which is given in Appendix at the end of the paper. In (Equation17(17) ${| (G^{C} (t))}_{i j} | \leq \bar{G} c (α, (| j - i | + β) / γ), if | j - i | > L .$ (17) ), $| i - j | \to \infty$ implies that the result is meaningful only if $n ≫ L,$ where n is the dimension of the state space. In fact, the results in Iserles (Citation2000), the theoretical foundation of Theorem 1, are based upon the assumption that the bandwidth of A satisfies $s ≪ n .$ Then L is much smaller than n. In Appendix, an analytic estimation of the parameters in (Equation41(41) $| G_{i j}^{C} (t) | \leq \bar{G} c (ρ e^{2} t, (| j - i | - L) / s), if | j - i | > L_{1}$ (41) ), i.e. $\bar{G},$ α, β, and γ is proved. However, the estimated value is conservative, which leads to a decay rate much slower than that of the true matrix. In Section 4, a numerical optimisation problem is formulated in (Equation38(38) $(\bar{G}, α, γ) = \arg \min_{_{\bar{G}, α, γ}} \sum_{(i, j) \in I} {(\bar{G} c (α, α / e + (| j - i |) / γ) - | G_{i j}^{C} |)}^{2}$ (38) ) that allows to fit the optimal value of the parameters to the component-based estimate of the upper bound matrix (Equation34(34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) ). Showing in an example, the estimated decay function closely follows the true decay function of $G^{C} (t)$ as $| i - j |$ increases.

3.2. Some covariance constraints deduced from observation model

An error covariance, as a matrix, satisfies some constraints such as symmetry and positive definiteness. In the following, we prove some covariance constraints resulting from the observation model. In this section, the study is focussed on the discrete-time system (Equation11(11a) $x_{k} = A_{k - 1} x_{k - 1} + q_{k - 1} w_{k - 1}, x_{k} \in R^{n}, A_{k} \in R^{n \times n}$ (11a) ) and its Kalman Filter (Equation12(12a) $x_{k}^{-} = A_{k - 1} x_{k - 1}^{+},$ (12a) ).

Proposition 4.

Suppose the diagonal elements of $P_{k}^{+}$ are bounded by a positive number P_max, i.e. (18) $| {(P_{k}^{+})}_{i i} | \leq P_{\max}, 1 \leq i \leq n$ (18) Then $P_{k}^{+}$ satsifies (19a) $0 < {(H_{k} P_{k}^{+} H_{k}^{T})}_{i i} \leq {(R_{k})}_{i i}, 1 \leq i \leq m,$ (19a) (19b) $| {(H_{k} P_{k}^{+})}_{i j} | < \sqrt{{(R_{k})}_{i i} P_{\max}}, 1 \leq i \leq m, 1 \leq j \leq n, j \neq i$ (19b) As a special case if the observation measures ${(x_{k})}_{i}$ for some $1 \leq i \leq n$ , then (20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) (20b) $| {(P_{k}^{+})}_{i j} | < \sqrt{{(R_{k})}_{i i} P_{\max}}, 1 \leq j \leq n, j \neq i,$ (20b)

Different from the matrix upper bound in Theorem 1, the algebraic inequalities in Proposition 4 provide upper bounds for the individual elements in an error covariance. For example, if the sensor that measures the ith state variable, ${(x_{k})}_{i},$ is very accurate, then R_ii is very small. In this case, (Equation20(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ) is a tight upper bound for ${(P_{k}^{+})}_{i j}, 1 \leq j \leq n .$

Proof.

We prove (Equation20(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ) first. Without loss of generality, let i = 1. In this case, the first state variable, ${(x_{k})}_{1},$ is measured, i.e. $H_{k} = [\begin{matrix} 1 & 0_{1 \times (n - 1)} \\ {(H_{k})}_{2 : n, 1} & {(H_{k})}_{2 : n, 2 : n} \end{matrix}]$ where ${(H_{k})}_{2 : n, 1}$ is the first column of H_k starting from the second row, i.e. $i \geq 2 .$ The submatrix ${(H_{k})}_{2 : n, 2 : n}$ is similarly defined. Let K_k be the Kalman gain (Equation12c(12c) $K_{k} = P_{k}^{-} H_{k}^{T} {(H_{k} P_{k}^{-} H_{k}^{T} + R_{k})}^{- 1}$ (12c) ). Define a new gain ${\tilde{K}}_{k} .$ All rows of ${\tilde{K}}_{k}$ equal that of K_k except for the first row, which is $[\begin{matrix} 1 & 0 & 0 & \dots & 0 \end{matrix}],$ i.e. (21a) ${({\tilde{K}}_{k})}_{i j} = {(K_{k})}_{i j}, 1 < i \leq n, 1 \leq j \leq m$ (21a) (21b) ${({\tilde{K}}_{k})}_{1 j} = δ_{1 j}, 1 \leq j \leq m$ (21b) The estimation using ${\tilde{K}}_{k}$ is ${\tilde{x}}_{k}^{+} = x_{k}^{-} + {\tilde{K}}_{k} (y_{k} - H_{k} x_{k}^{-})$ Then ${\tilde{x}}_{k}^{+}$ equals the estimation using the Kalman filter except for the first state, ${({\tilde{x}}_{k}^{+})}_{1} = {(y_{k})}_{1}$ Its error covariance is ${(R_{k})}_{11} .$ Because the Kalman gain minimises the trace of $P_{k}^{+}$ (see, for instance, Gelb and The Technical Staff (1974)), we have ${(P_{k}^{+})}_{11} = E ({({(x_{k}^{+})}_{1} - {(x_{k})}_{1})}^{2}) \leq E ({({({\tilde{x}}_{k}^{+})}_{1} - {(x_{k})}_{1})}^{2}) = {(R_{k})}_{11}$ i.e. (Equation20a(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ) holds true. Because P_k is positive definite, the following 2 × 2 matrix must be positive definite $[\begin{matrix} {(P_{k}^{+})}_{11} & {(P_{k}^{+})}_{1 j} \\ {(P_{k}^{+})}_{j 1} & {(P_{k}^{+})}_{j j} \end{matrix}] .$ for any $1 \leq j \leq n .$ This implies that its determinant is greater than zero, ${(P_{k}^{+})}_{11} {(P_{k}^{+})}_{j j} - {({(P_{k}^{+})}_{1 j})}^{2} > 0$ Therefore, $| {(P_{k}^{+})}_{1 j} | < \sqrt{{(P_{k}^{+})}_{11} {(P_{k}^{+})}_{j j}}$ This inequality implies (Equation20b(20b) $| {(P_{k}^{+})}_{i j} | < \sqrt{{(R_{k})}_{i i} P_{\max}}, 1 \leq j \leq n, j \neq i,$ (20b) ) because $0 < {(P_{k}^{+})}_{11} \leq {(R_{k})}_{11}$ and $0 < {(P_{k}^{+})}_{j j} \leq P_{\max} .$

To prove (Equation19(19a) $0 < {(H_{k} P_{k}^{+} H_{k}^{T})}_{i i} \leq {(R_{k})}_{i i}, 1 \leq i \leq m,$ (19a) ), let’s focus on ${(H_{k})}_{1, 1 : n},$ the first row of H_k. The proof for other rows is similar. Because ${(H_{k})}_{1, 1 : n}$ is not a zero vector, we assume ${(H_{k})}_{11} \neq 0$ (if it is zero, we can re-arrange the index to move a nonzero element to the first position). Define an invertible matrix (22) $T = [\begin{matrix} {(H_{k})}_{11} & {(H_{k})}_{1, 2 : n} \\ 0_{(n - 1) \times 1} & I_{(n - 1) \times (n - 1)} \end{matrix}], T^{- 1} = [\begin{matrix} {(H_{k})}_{11}^{- 1} & - {(H_{k})}_{11}^{- 1} {(H_{k})}_{1, 2 : n} \\ 0_{(n - 1) \times 1} & I_{(n - 1) \times (n - 1)} \end{matrix}] .$ (22) Obviously (23) ${(H_{k})}_{1, 1 : n} T^{- 1} = [\begin{matrix} 1 & 0 & \dots & 0 \end{matrix}] .$ (23)

Using T as a transformation we define a new state variable $\tilde{x} = T x$ Its dynamical model is (24a) ${\tilde{x}}_{k} = {\tilde{A}}_{k - 1} {\tilde{x}}_{k - 1} + {\tilde{q}}_{k - 1} w_{k}, {\tilde{A}}_{k - 1} = T A_{k - 1} T^{- 1}, {\tilde{q}}_{k - 1} = T q_{k - 1}$ (24a) (24b) $y_{k} = {\tilde{H}}_{k} {\tilde{x}}_{k} + r_{k} v_{k}, {\tilde{H}}_{k} = H_{k} T^{- 1}$ (24b) In ${\tilde{H}}_{k},$ the first row is the unit vector (Equation23(23) ${(H_{k})}_{1, 1 : n} T^{- 1} = [\begin{matrix} 1 & 0 & \dots & 0 \end{matrix}] .$ (23) ). Therefore, the associated error covariance satisfies (Equation20(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ). Under the conditional probability distribution $P ({\tilde{x}}_{k} | y_{1 : k}),$ the covariance is (25a) ${\tilde{P}}_{k}^{+} = E (({\tilde{x}}_{k} - mean ({\tilde{x}}_{k})) {({\tilde{x}}_{k} - mean ({\tilde{x}}_{k}))}^{T})$ (25a) (25b) $= E (T (x_{k} - mean (x_{k})) (x_{k} - mean {(x_{k})}^{T}) T^{T})$ (25b) (25c) $= T P_{k}^{+} T^{T}$ (25c) From (Equation20(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ), (Equation22(22) $T = [\begin{matrix} {(H_{k})}_{11} & {(H_{k})}_{1, 2 : n} \\ 0_{(n - 1) \times 1} & I_{(n - 1) \times (n - 1)} \end{matrix}], T^{- 1} = [\begin{matrix} {(H_{k})}_{11}^{- 1} & - {(H_{k})}_{11}^{- 1} {(H_{k})}_{1, 2 : n} \\ 0_{(n - 1) \times 1} & I_{(n - 1) \times (n - 1)} \end{matrix}] .$ (22) ) and (Equation25(25a) ${\tilde{P}}_{k}^{+} = E (({\tilde{x}}_{k} - mean ({\tilde{x}}_{k})) {({\tilde{x}}_{k} - mean ({\tilde{x}}_{k}))}^{T})$ (25a) ), we have $0 < {(H_{k} P_{k}^{+} H_{k}^{T})}_{11} = {({\tilde{P}}_{k}^{+})}_{11} \leq {(R_{k})}_{11},$ i.e. (Equation19a(19a) $0 < {(H_{k} P_{k}^{+} H_{k}^{T})}_{i i} \leq {(R_{k})}_{i i}, 1 \leq i \leq m,$ (19a) ) holds true. If $j \neq 1,$ then $| {(H_{k} P_{k})}_{1 j} | = {({\tilde{P}}_{k}^{+})}_{1 j} < \sqrt{{(R_{k})}_{11} P_{\max}}$ Here, we use the same upper bound P_max for both $P_{k}^{+}$ and ${\tilde{P}}_{k}^{+}$ because (Equation22(22) $T = [\begin{matrix} {(H_{k})}_{11} & {(H_{k})}_{1, 2 : n} \\ 0_{(n - 1) \times 1} & I_{(n - 1) \times (n - 1)} \end{matrix}], T^{- 1} = [\begin{matrix} {(H_{k})}_{11}^{- 1} & - {(H_{k})}_{11}^{- 1} {(H_{k})}_{1, 2 : n} \\ 0_{(n - 1) \times 1} & I_{(n - 1) \times (n - 1)} \end{matrix}] .$ (22) ) and (Equation25(25a) ${\tilde{P}}_{k}^{+} = E (({\tilde{x}}_{k} - mean ({\tilde{x}}_{k})) {({\tilde{x}}_{k} - mean ({\tilde{x}}_{k}))}^{T})$ (25a) ) imply ${(P_{k}^{+})}_{2 : n, 2 : n} = {({\tilde{P}}_{k}^{+})}_{2 : n, 2 : n} .$

□

4. Computational issues

The results in previous sections describe various aspects of error covariance. When applied collectively these results together give a global picture of the shape of error covariance. To summarise, we have proved the following quantitative characteristics of the shape of P(t):

Upper bound (26a) $P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t), continuous ‐ time$ (26a) (26b) $P_{k}^{+} \leq A_{k - 1} \dots A_{0} P_{0}^{+} A_{0}^{T} \dots A_{k - 1}^{T} + G_{k}^{C}, discrete ‐ time$ (26b)
Decay rate from peak (27) ${| (G^{C} (t))}_{i j} | \leq \bar{G} c (α, (| j - i | + β) / γ), if | i - j | > L (\bar{G}, α, β, γ, L are parameters)$ (27)
Covariance constraints deduced from observation

If $y_{k} = {(x_{k})}_{i}$ (28a) ${(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (28a) (28b) $| {(P_{k}^{+})}_{i j} | < \sqrt{{(R_{k})}_{i i} P_{\max}}, 1 \leq j \leq n, j \neq i .$ (28b)

Or in general (28c) $0 < {(H_{k} P_{k}^{+} H_{k}^{T})}_{i i} \leq {(R_{k})}_{i i}, 1 \leq i \leq m,$ (28c) (28d) $| {(H_{k} P_{k}^{+})}_{i j} | < \sqrt{{(R_{k})}_{i i} P_{\max}}, 1 \leq i \leq m, 1 \leq j ɤ n, j \neq i,$ (28d)

4.1. Computing the elements in $G^{C} (t)$

Definition 2.

The observability Gramian of the system (29a) $\dot{x} (t) = A x (t), x \in R^{n},$ (29a) (29b) $y = H (t) x, y \in R^{m}, H \in R^{m \times n}$ (29b) at t = t₁ in the time interval $[0, t]$ is (30) $G^{O} (t) = \int_{0}^{t} e^{A^{T} (τ - t_{1})} H^{T} (τ) H (τ) e^{A (τ - t_{1})} d τ$ (30)

The controllability Gramian has the same dimension as the error covariance. Therefore propagating the entire matrix along a trajectory is intractable for problems that have high dimensions. However, the Gramian matrix can be computed element-by-element. In control theory, given a system with state variable x and control input u $\dot{x} (t) = A x (t) + q u, x \in R^{n}, A \in R^{n \times n}, q \in R^{n \times n_{u}}, u \in R^{n_{u}}$ its dual system is a linear system (31a) $\dot{z} (τ) = - A^{T} z (τ), z \in R^{n}$ (31a) (31b) $y^{z} (τ) = q^{T} z (τ), y^{z} \in R^{n_{u}}$ (31b) where z is the dual state, and y^z is the output of the dual system (Kailath, Citation1980).

Proposition 5.

Consider (Equation1a(1a) $\dot{x} (t) = A x (t) + q w (t),$ (1a) ) and its dual system (31).

(i) The controllability Gramian of (Equation1a(1a) $\dot{x} (t) = A x (t) + q w (t),$ (1a) ) in $[0, t]$ equals the observability Gramian of (31) at t, which is (32) $G^{C} (t) = \int_{0}^{t} e^{- A (τ - t)} Q e^{- A^{T} (τ - t)} d τ$ (32)

(ii) Let e_i represent the ith basis vector of $R^{n}$ . Let $z_{i} (τ)$ and $z_{j} (τ)$ be the solutions of (31) with final states $z_{i} (t) = e_{i}$ and $z_{j} (t) = e_{j}$ . Then (33) ${(G^{C} (t))}_{i j} = \int_{0}^{t} z_{i}^{T} (τ) Q z_{j} (τ) d τ,$ (33)

The formula (Equation33(33) ${(G^{C} (t))}_{i j} = \int_{0}^{t} z_{i}^{T} (τ) Q z_{j} (τ) d τ,$ (33) ) implies that individual elements in G^C can be computed without the need to evaluate the entire matrix, provided that the dual system, or the adjoint model, can be numerically propagated reversely in time. Integrating the adjoint model twice, $z_{i} (τ)$ and $z_{j} (τ),$ one can compute three elements in the matrix, ${(G^{C} (t))}_{i i}, {(G^{C} (t))}_{j j}$ and ${(G^{C} (t))}_{i j} = {(G^{C} (t))}_{j i} .$

Proof.

Based on Definitions 1 - 2, Part (i) is obvious. To prove (ii), Note that $z_{i} (τ) = (e^{- A^{T} (τ - t)}) e_{i},$ where e_i denotes the ith basis vector. Therefore, the ith row and jth column of $G^{C} (t)$ is the integration of the inner product of $q^{T} z_{i} (τ)$ and $q^{T} z_{j} (τ) .$ Therefore, (Equation33(33) ${(G^{C} (t))}_{i j} = \int_{0}^{t} z_{i}^{T} (τ) Q z_{j} (τ) d τ,$ (33) ) holds, where $Q (τ) = q (τ) q^{T} (τ) .$ □

The idea used in Proposition 5 is also applicable to the propagation of initial error covariance.

Proposition 6.

Suppose B₀ is a matrix satisfying $P (0) = B_{0} B_{0}^{T}$ Consider the following dual system $\begin{matrix} \dot{z} (τ) = A^{T} z (τ), z \in R^{n} \\ y^{z} (τ) = B_{0}^{T} z (τ), y^{z} \in R^{n} \end{matrix}$ Let $z_{i} (τ)$ and $z_{j} (τ)$ be the solutions satisfying $z_{i} (0) = e_{i}$ and $z_{j} (0) = e_{j}$ . Then ${(e^{A t} P (0) e^{A^{T} t})}_{i j} = \int_{0}^{t} z_{i}^{T} (τ) B_{0} B_{0}^{T} z_{j} (τ) d τ,$ $= \int_{0}^{t} z_{i}^{T} (τ) P (0) z_{j} (τ) d τ$

Propositions 5 and 6 are proved for continuous-time linear systems. To derive its discrete-time version, let $Δ t$ be the step size, $x_{k} = x (k Δ t), t = K Δ t$ for some positive integer K. A Discretisation of (Equation1a(1a) $\dot{x} (t) = A x (t) + q w (t),$ (1a) ) is $x_{k} = e^{A Δ t} x_{k - 1} + \bar{q} v_{k}, \bar{q} = \sqrt{Δ t} q$ Define $z_{s} (k) = e^{(K - k) A Δ t} e_{s}, s = i, j$ Denote ${\bar{q}}^{T} \bar{q}$ by $\bar{Q} .$ The Euler Discretisation of (Equation33(33) ${(G^{C} (t))}_{i j} = \int_{0}^{t} z_{i}^{T} (τ) Q z_{j} (τ) d τ,$ (33) ) is (34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) For a nonlinear system $x_{k} = M (x_{k - 1}) + \bar{q} v_{k}$ Given a trajectory ${x_{k}}_{k = 0}^{K} .$ Let $M_{k}^{*}$ represent the adjoint model at x_k. Then (Equation34(34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) ) is an approximation of controllability Gramian in which (35) $z_{s} (k) = M_{k}^{*} ° M_{k + 1}^{*} \dots ° M_{K - 1}^{*} e_{s}, s = i, j$ (35) This nonlinear version is a form of empirical Gramians. It is used in various applications of nonlinear systems, from ocean dynamics, power systems, to WRF data assimilation (Kang amd Xu, Citation2009; Krener and Ide, Citation2009; Kang and Xu, Citation2012; Yoshimura et al., Citation2020).

4.2. Decay rate approximation

In the next, we introduce a method of approximating the parameters, $(\bar{G}, α, β, γ),$ in the decay function (Equation27(27) ${| (G^{C} (t))}_{i j} | \leq \bar{G} c (α, (| j - i | + β) / γ), if | i - j | > L (\bar{G}, α, β, γ, L are parameters)$ (27) ). Their value given in the proof of Theorem 1 (Appendix) is conservative, which results in a decay rate that is much slower than that of G^C. If one can compute the value of a subset of elements in $G^{C} (t),$ which is possible as proved in Proposition 5, then we can find $(\bar{G}, α, β, γ)$ by using curve fitting. More specifically, assume that the set of elements (36) ${G_{i j}^{C} | (i, j) \in I}$ (36) is known, were $I$ is an index subset for $i, j \in {1, 2, \dots, n} .$ It is easy to check that $c (α, x)$ reaches its maximum value at $x = α / e .$ Set (37) $β = \frac{α γ}{e}$ (37) Then we compute the following minimisation (38) $(\bar{G}, α, γ) = \arg \min_{_{\bar{G}, α, γ}} \sum_{(i, j) \in I} {(\bar{G} c (α, α / e + (| j - i |) / γ) - | G_{i j}^{C} |)}^{2}$ (38) In practical applications, the location of $(i, j) \in I$ is important. We recommend selecting the set $I$ so that the points are distributed with some close to the peak and some away from the peak. In the examples in Section 5, $I$ contains five diagonals () for each block. It covers the range from the peak (along main diagonal in the examples) to the area where the elements of error covariance are very small (the tail of the decay function).

Fig. 2. A partition of error covariance.

4.3. Component-based computation

Because the elements in G^C can be computed individually using (Equation33(33) ${(G^{C} (t))}_{i j} = \int_{0}^{t} z_{i}^{T} (τ) Q z_{j} (τ) d τ,$ (33) ) or its Discretisation (Equation34(34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) ), one can focus on a given area or a block in a matrix to find its shape without the need to know the covariance value outside the block, thus the term ‘component-based.’ Three examples are given in Section 5. The main computational steps are summarised as follows.

Partition a large matrix into blocks shown in . The elements in a matrix are represented by ‘*’ in the plot. A partition of the matrix consists of blocks of elements, which are represented by shaded areas with different colours. The partition is shown in the figure for the upper half of the matrix. The lower half has the same value due to the symmetry of error covariance.
The peak value in each block is the average of its diagonal. The computation of diagonal elements is based on (Equation34(34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) ), the Discretisation of (Equation33(33) ${(G^{C} (t))}_{i j} = \int_{0}^{t} z_{i}^{T} (τ) Q z_{j} (τ) d τ,$ (33) ).
Decay function is estimated using (Equation38(38) $(\bar{G}, α, γ) = \arg \min_{_{\bar{G}, α, γ}} \sum_{(i, j) \in I} {(\bar{G} c (α, α / e + (| j - i |) / γ) - | G_{i j}^{C} |)}^{2}$ (38) ). To apply this formula of curve fitting, a set of elements in G^C (represented by circles in ) is computed using (Equation34(34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) ).

Note that the results in Section 3.2, i.e. covariance constraints (Equation28(28a) ${(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (28a) ) deduced from observation model, are not included in these steps. The constraints can be applied to individual rows/columns separately in addition to the computed decay rate. It tends to be a more accurate upper bound but applicable to only special rows determined by the observation model.

5. Examples

In the following, we give three examples in which the quantitative characteristics in Section 3 are computed to approximate the upper bound matrix of the error covariance. The outline shapes of the upper bound matrix and the true error covariance are shown in plots. In the first example we demonstrate the computational algorithm. In the second example, the approximated outline shape of an upper bound matrix is used as an indicator of upper bound in the determination of localisation radius. The third example is a case study in which we explore the application of the results proved in previous sections to a nonlinear system that models a tsunami wave using the shallow water equations.

5.1. Example 1

The illustrative example has n = 150 state variables. This dimension is chosen because it is small enough so that the Kalman filter is computationally tractable; and it is large enough so that one can demonstrate the sparsity of error covariance. It is defined in the form of differential equation (Equation1(1a) $\dot{x} (t) = A x (t) + q w (t),$ (1a) ). The matrices are partially shown in (Equation39(39) $\begin{matrix} A = [\begin{matrix} - 2.4689 & - 0.7236 & 0.1301 & - 0.2498 & - 0.2450 & \dots & \dots \\ 0.7227 & - 2.2126 & 0.7261 & 0.6476 & 0.6578 & \dots & \dots \\ 0.4485 & - 0.0722 & - 2.7618 & - 0.1565 & - 0.7478 & \dots & \dots \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ \end{matrix}], (\begin{matrix} random value \\ banded matrix \\ bandwidth = 8 \end{matrix}) \\ q = σ_{Q} [\begin{matrix} 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & 0 & 0 & \dots & \dots \\ 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & 0 & \dots & \dots \\ 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & \dots & \dots \\ 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & \dots & \dots \\ 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & \dots & \dots \\ 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & \dots & \dots \\ 0 & 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & \dots & \dots \\ 0 & 0 & 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & \dots & \dots \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ \end{matrix}], σ_{Q} = 0.2121 \\ y = [\begin{matrix} x_{1} & x_{50} & x_{100} & x_{150} \end{matrix}] + r v (t), \\ r = [\begin{matrix} 0.0102 & 0.0098 & 0.0044 & 0.0110 \\ - 0.0083 & - 0.0088 & 0.0028 & - 0.0014 \\ - 0.0097 & 0.0034 & - 0.0103 & - 0.0035 \\ 0.0047 & - 0.0080 & 0.0067 & - 0.0102 \end{matrix}] \end{matrix}$ (39) ). The A matrix is banded whose bandwidth is s = 8. For many methods of PDE Discretisation, the resulting ODE system has a bandwidth less than s = 8. To generate the elements in A, we first form a matrix using random numbers with the uniform distribution in $[- 1, 1] .$ Then we add a diagonal matrix $a I$ to it, where a is chosen to make sure the system is stable, i.e. placing the real parts of all eigenvalues less than zero. Assuming that model uncertainty has its impact in a local area only, the matrix q is also banded in which s = 4; thus the bandwidth of $Q = q q^{T}$ is 8. The matrix associated with observation error is r. We assume a sensor sampling rate dt = 0.125. The elements of r in the Discretised model are random numbers uniformly distributed in $[- 0.0113, 0.0113] .$ This range is about 2% of the standard deviation of the initial state. The observation measures x₁, x₅₀, x₁₀₀, and x₁₅₀. The initial error covariance is $P (0) = Q,$ i.e. the only known information about the background error is the model uncertainty. In the simulations, the final time is t_f = 10. (39) $\begin{matrix} A = [\begin{matrix} - 2.4689 & - 0.7236 & 0.1301 & - 0.2498 & - 0.2450 & \dots & \dots \\ 0.7227 & - 2.2126 & 0.7261 & 0.6476 & 0.6578 & \dots & \dots \\ 0.4485 & - 0.0722 & - 2.7618 & - 0.1565 & - 0.7478 & \dots & \dots \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ \end{matrix}], (\begin{matrix} random value \\ banded matrix \\ bandwidth = 8 \end{matrix}) \\ q = σ_{Q} [\begin{matrix} 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & 0 & 0 & \dots & \dots \\ 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & 0 & \dots & \dots \\ 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & \dots & \dots \\ 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & \dots & \dots \\ 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & \dots & \dots \\ 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & \dots & \dots \\ 0 & 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & \dots & \dots \\ 0 & 0 & 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & \dots & \dots \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ \end{matrix}], σ_{Q} = 0.2121 \\ y = [\begin{matrix} x_{1} & x_{50} & x_{100} & x_{150} \end{matrix}] + r v (t), \\ r = [\begin{matrix} 0.0102 & 0.0098 & 0.0044 & 0.0110 \\ - 0.0083 & - 0.0088 & 0.0028 & - 0.0014 \\ - 0.0097 & 0.0034 & - 0.0103 & - 0.0035 \\ 0.0047 & - 0.0080 & 0.0067 & - 0.0102 \end{matrix}] \end{matrix}$ (39)

The σ_Q value above is equivalent to continuous-time white noise of model uncertainty with a standard deviation $σ = 0.6 .$ The component-based computational method makes it possible to focus on any given area in the covariance matrix. In this example we divide the matrix into ten blocks. Each block is a strip of rows above the diagonal of the matrix (). The goal is to approximate the shape of the error covariance in each block. The number of blocks in the partition depends on various factors. Basically, increasing the number of blocks refines the approximation with the cost of higher computational load. As the first step, we apply the inequalities (Equation28(28a) ${(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (28a) ) deduced from the observation model to this problem. More specifically, we have $\begin{matrix} {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i}, i = 1, 50, 100, 150 \\ | {(P_{k}^{+})}_{i j} | < \sqrt{{(R_{k})}_{i i} P_{\max}}, i = 1, 50, 100, 150, j \neq i, 1 \leq j \leq n \end{matrix}$ A guaranteed upper bound, P_max, is not available. Using component-based computation based on the dual system, we compute the diagonal elements of G^C to find that most of them satisfy $G_{i, i}^{C} < 1 .$ It indicates that $P_{\max} \approx 1 .$ So, the upper bound of error covariance along these four rows and columns can be determined, shown in .

Fig. 3. Rows and columns of error covariance constrained by the observation model.

In the next step, we approximate the peak value of each block. We use the peak value of G^C as an approximation because the impact of the initial error decays rapidly for the stable system. The computation is based on (Equation34(34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) ), the Discretisation of (Equation33(33) ${(G^{C} (t))}_{i j} = \int_{0}^{t} z_{i}^{T} (τ) Q z_{j} (τ) d τ,$ (33) ). Then the average value is used to represent the peak value over each block. This is shown in .

Fig. 4. The approximate peak value of error covariance computed using an upper bound matrix.

Finally, we estimate the decay rate by finding the decay function $c (α, x)$ of the matrix upper bound. For this purpose, it is necessary to evaluate a subset of elements for each component. As shown in , five diagonals are evaluated for each block. Then, the solution of (Equation38(38) $(\bar{G}, α, γ) = \arg \min_{_{\bar{G}, α, γ}} \sum_{(i, j) \in I} {(\bar{G} c (α, α / e + (| j - i |) / γ) - | G_{i j}^{C} |)}^{2}$ (38) ) based on the computed $| G_{i j}^{C} |$ determines the parameter value in the decay function $c (α, x) .$ The computations are based on the absolute value of entries in G^C. The approximated shape of error covariance is shown in (left). For comparison, the plot of the matrix upper bound is also shown in (right). The approximate shape captures the the key feature, such as peak and decay, very well.

Fig. 5. The approximated shape (left) and the matrix upper bound (right) of the error covariance.

For a more detailed comparison, the average value along the main diagonal as well as all other diagonals are shown in for every block. All computations are based on the absolute value of entries in covariance matrices. There are a total of ten blocks. The gap between the true error covariance and its matrix upper bound varies. Shown in are two enlarged plots from . The plot on the left shows that the matrix upper bound (red) is quite close to the true error covariance (blue); and the approximated decay curve (green) is very close to both. In this block, the approximated shape is very close to the true shape of the error covariance. However, this is not always the case. The plot on the right shows a block in which the Kalman filter is very effective in making error corrections. As a result, the true error covariance is much smaller than the matrix upper bound. In this block, the matrix upper bound over estimates the true error covariance by a large amount. For improvement, one needs to compute the third term in (Equation9(9) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t) - {\hat{G}}^{C} (t_{1}, t_{2})$ (9) ), a problem that cannot be solved using the results in this paper. To summarise, the matrices of error covariance and its upper bound are computationally intractable for high dimensional problems. However, using the component-based algorithm, the decay function of the error covariance upper bound can be numerically computed over a given region.

Fig. 6. The averaged error covariance along diagonals of blocks. The horizontal axis represents the indices of diagonals, where the middle point is the main diagonal. The vertical axis represents the average value of the elements in the error covariance along diagonals. The blue curve represents the true error covariance of a linear Kalman filter. The red curve is the matrix upper bound of error covariance. The green curve represents the approximated decay function.

Fig. 7. The averaged error covariance along diagonals of two representative blocks. See the caption of for details.

Fig. 7. The averaged error covariance along diagonals of two representative blocks. See the caption of Figure 6 for details.

5.2. Example 2

In applications of EnKFs, finding a localisation radius is essential to the filter’s accuracy. Because an upper bound matrix decays exponentially when $| i - j |$ increases (Theorem 1), this upper bound provides valuable information about the localisation radius. The matrices, A, q, and r in this example are generated in the same way as in Example 1 with some parameter value changes. More specifically, σ_Q and r in (Equation39(39) $\begin{matrix} A = [\begin{matrix} - 2.4689 & - 0.7236 & 0.1301 & - 0.2498 & - 0.2450 & \dots & \dots \\ 0.7227 & - 2.2126 & 0.7261 & 0.6476 & 0.6578 & \dots & \dots \\ 0.4485 & - 0.0722 & - 2.7618 & - 0.1565 & - 0.7478 & \dots & \dots \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ \end{matrix}], (\begin{matrix} random value \\ banded matrix \\ bandwidth = 8 \end{matrix}) \\ q = σ_{Q} [\begin{matrix} 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & 0 & 0 & \dots & \dots \\ 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & 0 & \dots & \dots \\ 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & 0 & \dots & \dots \\ 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & 0.25 & \dots & \dots \\ 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & 0.33 & \dots & \dots \\ 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & 0.50 & \dots & \dots \\ 0 & 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & 1.00 & \dots & \dots \\ 0 & 0 & 0 & 0.25 & 0.33 & 0.50 & 1.00 & 1.00 & \dots & \dots \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ \end{matrix}], σ_{Q} = 0.2121 \\ y = [\begin{matrix} x_{1} & x_{50} & x_{100} & x_{150} \end{matrix}] + r v (t), \\ r = [\begin{matrix} 0.0102 & 0.0098 & 0.0044 & 0.0110 \\ - 0.0083 & - 0.0088 & 0.0028 & - 0.0014 \\ - 0.0097 & 0.0034 & - 0.0103 & - 0.0035 \\ 0.0047 & - 0.0080 & 0.0067 & - 0.0102 \end{matrix}] \end{matrix}$ (39) ) are assigned the following value $\begin{matrix} σ_{Q} = 0.0707 \\ r = [\begin{matrix} 0.0917 & - 0.1071 & - 0.0122 & - 0.0239 \\ - 0.0992 & 0.0014 & 0.0082 & - 0.1032 \\ - 0.0341 & - 0.0700 & - 0.0441 & - 0.0586 \\ 0.0330 & - 0.0524 & 0.0995 & 0.0297 \end{matrix}] \end{matrix}$ Similar to Example 1, the observation measures $x_{1}, x_{50}, x_{100}$ and x₁₅₀. For the estimation of x(t), we use an EnKF in which the ensemble size is n_ens = 15. The localisation is based on the correlation function as defined in Gaspari and Cohn (Citation1999) $ρ = {\begin{array}{l} - \frac{1}{4} {(| z | / c)}^{5} + \frac{1}{2} {(| z | / c)}^{4} + \frac{5}{8} {(| z | / c)}^{3} - \frac{5}{3} {(| z | / c)}^{2} + 1, & 0 \leq | z | \leq c, \\ \frac{1}{12} {(| z | / c)}^{5} - \frac{1}{2} {(| z | / c)}^{4} + \frac{5}{8} {(| z | / c)}^{3} + \frac{5}{3} {(| z | / c)}^{2}, \\ - 5 (| z | / c) + 4 - \frac{2}{3} (c / | z |), & c \leq | z | \leq 2 c, \\ 0, & 2 c \leq | z | . \end{array}$

The value of c determines the localisation radius. The curves in are three correlation functions for c = 7, 15, 30.

Fig. 8. Correlation functions.

In the simulation, we first apply the localisation with the largest radius, c = 30. Following the same approach in Example 1, we divide the covariance matrix into ten blocks. Each block is a strip of rows above the diagonal of the matrix (). The matrix upper bound over each block at t_f = 10 is computed and then compared to the average value of the EnKF error covariance along diagonals shown in (taking absolute value before averaging). In , the horizontal axis in each plot represents the indices of diagonals, where the middle point is the main diagonal. The vertical axis of each plot represents the average value of the elements of error covariance along diagonals. The blue curve is the error covariance of an EnKF. The red curve is the matrix upper bound of error covariance. For problems with high dimensions, both matrices are computationally intractable. In this case, what can be computed is the approximated decay function. It is shown as the green curve in the plots. In , Plot(1,1) (the plot located in the first row first column), Plot(1,2), Plot(2,3) and Plot(2,4) show that the average value of error covariance of the EnKF is much larger than the decay function, implying that the localisation radius is too large. shows the state and its EnKF estimation at t = t_f, where $t_{f} = 3.75 .$ The estimation missed all peaks located in the middle part of the plot. The relative error is greater than 0.51.

Fig. 9. The averaged error covariance along diagonals of blocks for localisation c = 30. The horizontal axis represents the indices of diagonals, where the middle point is the main diagonal. The vertical axis represents the average value of the elements of error covariance along diagonals. The blue curve represents the error covariance of an EnKF. The red curve is the matrix upper bound of error covariance. The green curve represents the approximated decay function.

Fig. 10. The state $x_{1} (t_{f}), x_{2} (t_{f}), \dots, x_{n} (t_{f})$ where n = 150. Solid: truth. Dotted: estimated value.

Fig. 10. The state x1(tf),x2(tf),…,xn(tf) where n = 150. Solid: truth. Dotted: estimated value.

Using the correlation function with c = 15, the radius of localisation is much smaller. The computation shows that the average value of error covariance is below or close to the decay function in eight of the ten blocks. However, there are two blocks where the average value of error covariance is still much larger when it is compared to the decay function. Comparing to the previous case of c = 30, the estimated value of $x (t_{f}),$ the final state, follows the peaks of the truth, although the filter overestimates the value at some points. The overall relative error is reduced to 0.47.

If c = 7, the localisation has the smallest radius among the three correlation functions in . In , all blue curves are below or close to the green line, implying that the radius of localisation is bounded by the decay function. The final state $x (t_{f})$ and its estimation is shown in . It is obviously improved relative to the previous cases when the location radius is too large. In fact, the relative estimation error is less than 0.25.

Fig. 11. The averaged error covariance along diagonals of blocks for localisation c = 7. See the caption of for details.

Fig. 11. The averaged error covariance along diagonals of blocks for localisation c = 7. See the caption of Figure 9 for details.

Fig. 12. The state $x_{1} (t_{f}), x_{2} (t_{f}), \dots, x_{n} (t_{f})$ where n = 150. Solid: truth. Dotted: estimated value.

Fig. 12. The state x1(tf),x2(tf),…,xn(tf) where n = 150. Solid: truth. Dotted: estimated value.

Remark.

In this example, the decay function provides an indicator of upper bound in the determination of localisation radius. From (Equation9(9) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t) - {\hat{G}}^{C} (t_{1}, t_{2})$ (9) ), the radius of localisation should be the radius of G^C with a correction due to the term ${\hat{G}}^{C} (t_{1}, t_{2})$ (for the simplicity in argument we assume that the impact of the initial error covariance is small). This example shows that such correction reduces the radius. Elements of the error covariance matrix resulting from a sound localisation radius are expected to lie under the estimate of the matrix upper bound.

5.3. Example 3 - the shallow water equations

The theorems proved in this paper are based on a common assumption: the system is linear. Generalising the theory to nonlinear systems is beyond the scope of this paper. Instead, the following is a case study in which we explore the application of the results proved in previous sections to a nonlinear system. It is an example adopted from Deleanu and Dumitrache (Citation2019) in which a tsunami wave is simulated using the shallow water equations. Illustrated in , the water wave travels a horizontal distance of $L = 1, 296 \times 10^{3}$ m. It approaches the shore on a variable seabed depth that has a constant slope $s = 40 / L .$ The wave propagation is modelled using the shallow water equations (40a) $\frac{\partial h}{\partial t} + \frac{\partial}{\partial x} (u h) = 0$ (40a) (40b) $\frac{\partial}{\partial t} (u h) + \frac{\partial}{\partial x} (v^{2} h + \frac{1}{2} g h^{2}) = - g h \frac{d B}{d x} .$ (40b) where $h = h (t, x)$ is the flow depth, $u = u (t, x)$ is the flow velocity, B(x) is the seabed height, g is the gravitational acceleration and $(t, x) \in [0, T] \times [0, L] .$ In this example, the wave propagation is simulated for the time interval T = 60,000 s. The result is used as the truth in the study of error covariance. A total of m = 80 sensors are placed along the seabed with an equal distance of 16,200 m apart. Each sensor measures the full state ${[h, u h]}^{T} .$ The value of parameters is summarised in (also see for illustration).

Fig. 13. The variables in the model of a tsunami wave.

Table 1. The parameters in Example 3.

Display Table

The boundary conditions are $\begin{matrix} B (x) = \frac{40 x}{L} \\ h (t, 0) = 64.5 + 3 sin (π (\frac{4 t}{86400} - \frac{1}{2})), h (t, L) = H_{0}, \\ u (t, 0) = \sqrt{g} (\sqrt{h (t, 0)} - \sqrt{H_{0}}), u (t, L) = 0 \\ h (0, x) = H_{0}, u (0, x) = 0 \end{matrix}$ Coded in Matlab, the shallow water equations are numerically solved using the first order Lax-Friedrichs algorithm (LeVeque, Citation1992) over equally spaced grid points $x_{0} = 0 < x_{1} < x_{2} < \dots < x_{N_{x}} = L, t_{0} = 0 < t_{1} < t_{2} < \dots < t_{N_{t}}$ In the Discretisation, $Δ x = 324$ m, $N_{x} = 4, 001, Δ t = 10$ s and $N_{t} = 6, 000 .$ The associated tangent linear model and adjoint model are also programmed in Matlab. Shown in is the nominal trajectory around which the error covariance is analysed.

Fig. 14. Water wave at t = 20,000, 40,000, 60,000 s.

An error covariance in full dimension, 8002 × 8002, is computed using unscented Kalman Filter (UKF) (Julier et al., Citation2000; Julier and Uhlmann, Citation2004). The model uncertainty and the observation noise are introduced in the same way as that in (Equation1(1a) $\dot{x} (t) = A x (t) + q w (t),$ (1a) ), $\begin{matrix} z_{k} = M (z_{k - 1}) + q w_{k - 1} \\ y_{k} = H (x_{k}) + r v_{k} \end{matrix}$ where M(x) represents the Discretised shallow water equations and $q = [\begin{matrix} 0.04 & 0.04 & 0.03 & 0.02 & 0.01 & 0 & \dots & 0 \\ 0.04 & 0.04 & 0.04 & 0.03 & 0.02 & 0.01 & \dots & 0 \\ 0.03 & 0.04 & 0.04 & 0.04 & 0.03 & 0.02 & \dots & 0 \\ 0.02 & 0.03 & 0.04 & 0.04 & 0.04 & 0.03 & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \end{matrix}], r = [\begin{matrix} 0.03 & 0 & 0 & 0 & \dots 0 \\ 0 & 0.03 & 0 & 0 & \dots 0 \\ 0 & 0 & 0.03 & 0 & \dots 0 \\ 0 & 0 & 0 & 0.03 & \dots 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \end{matrix}]$ Obviously, q is a banded matrix of bandwidth s = 4 and r is diagonal. Shown in is the error covariance of the UKF. Different from linear systems, the estimation from a UKF is not guaranteed to be optimal. However, UKF has been a widely used nonlinear filter known for its good accuracy and robust stability. In this example, it serves as a reference for comparison. The water wave and its estimation at t = 60,000 s is shown in . The relative error of u(t, x) is 0.050 and the relative error of h(t, x) is 0.005. For the component-based computation of G^C, the adjoint model is applied in (Equation34(34) $G_{i j}^{C} = \sum_{k = 1}^{K - 1} z_{i}^{T} (k) \bar{Q} z_{j} (k) + {\bar{Q}}_{i j}$ (34) )–(Equation35(35) $z_{s} (k) = M_{k}^{*} ° M_{k + 1}^{*} \dots ° M_{K - 1}^{*} e_{s}, s = i, j$ (35) ). The length of time window is K. We choose a value of K different from N_t. Its value should be large enough so that the initial error is diminished. On the other hand, we do not want to apply a K that is too large because the adjoint model is a linearisation that introduces additional error in each iteration. In this example K = 150, a time window during which the UKF reduces about 75% of the initial error. Following the same approach in the previous examples, we divide the covariance into blocks as shown in . For illustrative purpose, let us focus on the block between the rows $2001 \leq i \leq 2100 .$ The average value along diagonals in this block is shown in . The figure contains three plots because the error covariance as a matrix has three submatrices, odd row vs odd column (covariance of h), even row vs even column (covariance of uh), and odd row vs even column (covariance between h and uh).

Fig. 15. UKF error covariance (the portion $1 \leq i, j \leq 500$ ).

Fig. 15. UKF error covariance (the portion 1≤i,j≤500).

Fig. 16. UKF estimation at t = 60,000 s. Solid: truth. Dotted: estimated value.

Fig. 17. The averaged error covariance along diagonals of the block between the rows $2001 \leq i \leq 2100 .$ See the caption of for details.

Fig. 17. The averaged error covariance along diagonals of the block between the rows 2001≤i≤2100. See the caption of Figure 6 for details.

The goal of this example is to explore if the results proved for linear systems are applicable to a nonlinear system such as the shallow water equations. We first check the results in Proposition 2 and Theorem 1. In other words, we want to check if the controllability Gramin G^C provides a matrix upper bound; and if it decays at the rate that is claimed in Theorem 1. From , the average value of $G_{i j}^{C}$ (red) is located on top of the average P_ij (blue). It implies that G^C is an upper bound of the error covariance. A decay function (green curve) is plotted as an approximation of G^C. It is computed based on four diagonals of G^C in the block. For this nonlinear example, it shows that the decay curve agrees with G^C (red) around the diagonal when the covariance is high. However, different from the linear examples, G^C approaches zero slowly when the location is moving away from the diagonal. Nevertheless, the decay curve approaches zero anyway. As a result, the decay curve serves as a better estimate of the error covariance than G^C. It is unclear if this is a general phenomenon or just a special case, a topic for further research.

In the next, we verify the results in Proposition 4, more specifically the inequalities in (Equation20(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ). It is a set of upper bounds determined by the observation model. As an example, we focus on the row i = 2001 and i = 2002 in $P_{k}^{+} .$ The corresponding state variables, x_i, are directly measured by a sensor. In , the green line on top is the constant on the right side of the inequality (Equation20b(20b) $| {(P_{k}^{+})}_{i j} | < \sqrt{{(R_{k})}_{i i} P_{\max}}, 1 \leq j \leq n, j \neq i,$ (20b) ). The blue curve below it is the value of elements in the ith row of $P_{k}^{+}$ (only the columns $1850 \leq j \leq 2150$ are shown here). Clearly, the inequality (Equation20b(20b) $| {(P_{k}^{+})}_{i j} | < \sqrt{{(R_{k})}_{i i} P_{\max}}, 1 \leq j \leq n, j \neq i,$ (20b) ) holds for this nonlinear example. The red mark at j = 2001 in the left plot and j = 2002 in the right plot in is R_ii, the right side of the inequality (Equation20a(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ). It is below the blue curve, which implies that R_ii is not an upper bound of ${(P_{k}^{+})}_{i i} .$ Therefore, (Equation20a(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ) does not hold for this nonlinear example. The difference is ${(P_{k}^{+})}_{i i} - R_{i i} = 0.0075$ for i = 2001 and 0.0076 for i = 2002. In fact, this difference is relatively insignificant because the covariance at grid points without a sensor has much larger value. For instance, the row adjacent to i = 2002 in $P_{k}^{+}$ is shown in (green curve). Relative to it, the ith row (blue curve) is so small the different between R_ii and ${(P_{k}^{+})}_{i i}$ is negligible. In other words, even though the inequality (Equation20a(20a) $0 < {(P_{k}^{+})}_{i i} \leq {(R_{k})}_{i i},$ (20a) ) does not hold, R_ii still provides a good approximation of ${(P_{k}^{+})}_{i i}$ with a small relative error. The observation model in this example is linear. We speculate that the reason why the inequality does not hold is because of the nonlinear dynamics of the system. The deep investigation of this behaviour is the subject of an on-going study.

Fig. 18. The upper bound of error covariance deduced from observation model.

Fig. 19. Error covariance along two adjacent rows, i = 2002 and i = 2004.

6. Conclusions

Several quantitative characteristics are introduced and studied that can be used to outline the shape of error covariance as a 3D graph. The proved quantitative characteristics include matrix upper bounds of error covariance, the decay rate from peak to bottom of an upper bound matrix, and some inequality constraints of error covariance deduced from observation models. The concept of controllability Gramian and its computational methods play an important role in several parts of the study. These unveiled interconnections between error covariance and system models are new discoveries in the literature of Kalman filters. Although the computation of error covariance is generally intractable for high dimensional systems, computational methods developed in this paper can numerically compute the quantitative characteristics with a limited computational load. Some ideas of using the results to improve Kalman filter applications are explored in three examples, in which more questions are raised for future research than that are solved. For instance, the quantitative characteristics are collectively, but not interactively, applied. Specifically, the computational algorithm of the decay function does not use the constraints deduced from the observation model. The constraints are applied to the approximated shape of error covariance after the decay function is computed. In addition, the controllability Gramian of the Kalman-Bucy filter in (Equation9(9) $0 \leq P (t) \leq e^{A t} P (0) e^{A^{T} t} + G^{C} (t) - {\hat{G}}^{C} (t_{1}, t_{2})$ (9) ) is a correction term. Based on the observation model, this term reduces the error of the matrix upper bound. As a problem for future research, an integration of these algorithms and constraints may lead us to computational methods of approximating error covariance with improved accuracy.

Acknowledgment

We thank Professor Qing Zhang, University of Georgia, and Professor Jiangang Ying, Fudan University, for their comments and suggestions on several questions related to error covariance and stochastic processes.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported in part by U.S. Naval Research Laboratory - Monterey, CA.

References

Bannister, R. N. 2008a. A review of forecast error covariance statistics in atmospheric variational data assimilation. I: Characteristics and measurements of forecast error covariances. Q. J. R. Meteorol. Soc. 134, 1951–1970. doi:https://doi.org/10.1002/qj.339
Web of Science ®Google Scholar
Bannister, R. N. 2008b. A review of forecast error covariance statistics in atmospheric variational data assimilation. II: Modelling the forecast error covariance statistics. Q. J. R. Meteorol. Soc. 134, 1971–1996. doi:https://doi.org/10.1002/qj.340
Web of Science ®Google Scholar
Deleanu, D. and Dumitrache, C. L. 2019. Explicit finite difference schemes for solving the one-dimensional tsunami wave propagation equations. J. Phys. Conf. Ser. 1297, 012005. doi:https://doi.org/10.1088/1742-6596/1297/1/012005
Google Scholar
Gaspari, G. and Cohn, S. E. 1999. Construction of correlation functions in two and three dimensions. Q. J. R. Meteorol. Soc. 125, 723–757. doi:https://doi.org/10.1002/qj.49712555417
Web of Science ®Google Scholar
Gelb, A. and The Technical Staff. 1974. Applied Optimal Estimation. MIT Press, Cambridge, Massachusetts, USA.
Google Scholar
Houtekamer, P. L. and Zhang, F. 2016. Review of the ensemble Kalman filter for atmospheric data assimilation. Month. Wea. Rev. 144, 4489–4532. doi:https://doi.org/10.1175/MWR-D-15-0440.1
Web of Science ®Google Scholar
Iserles, A. 2000. How large is the exponential of a banded matrix. New Zealand J. Math. 29, 177–192.
Google Scholar
Julier, S. and Uhlmann, J. 2004. Unscented filtering and nonlinear estimation. Proc. IEEE 92, 401–422. doi:https://doi.org/10.1109/JPROC.2003.823141
Web of Science ®Google Scholar
Julier, S., Uhlmann, J. and Durrant-Whyte, H. F. 2000. A new method for the nonlinear transformation of means and covariances in filters and estimators. IEEE Trans. Automat. Contr. 45, 477–482. doi:https://doi.org/10.1109/9.847726
Web of Science ®Google Scholar
Kailath, T. 1980. Linear Systems. Englewood Cliffs, NJ: Prentice-Hall.
Google Scholar
Kang, W. and Xu, L. 2009. Computational analysis of control systems using dynamic optimization. arXiv: 0906.0215v2, https://arxiv.org/abs/0906.0215.
Google Scholar
Kang, W. and Xu, L. 2012. Optimal placement of mobile sensors for data assimilations. Tellus A 64, 17133. doi:https://doi.org/10.3402/tellusa.v64i0.17133
Google Scholar
Krener, A. J. and Ide, K. 2009. Measures of unobservability. In 48th IEEE Conference on Decision and Control, Shanghai, China, 6401–6406.
Google Scholar
LeVeque, R. J. 1992. Numerical Methods for Conservation Laws. Birkhäuser Verlag, Basel, Boston, Berlin.
Google Scholar
Ménétrier, B., Montmerle, T., Michel, Y. and Berre, L. 2015. Linear filtering of sample covariances for ensemble–based data assimilation. Part I: optimality criteria and application to variance filtering and covariance localization. Month. Wea. Rev. 143, 1622–1643. doi:https://doi.org/10.1175/MWR-D-14-00157.1
Web of Science ®Google Scholar
Yoshimura, R., Yakeno, A., Misaka, T. and Obayashi, S. 2020. Application of observability Gramian to targeted observation in wrf data assimilation. Tellus A 72, 1–11. doi:https://doi.org/10.1080/16000870.2019.1697602
Google Scholar

Appendix:

the proof of Theorem 1

In the following, we prove the following statement that implies the result in Theorem 1. Suppose that A, the matrix in (Equation1), is banded with a bandwidth

s \geq 1 .

Let ρ be an upper bound of the entries,

ρ = \max_{_{0 \leq i, j \leq n}} {| A_{i, j} |} .

Suppose Q is a banded matrix. Let P(t) be the error covariance of the Kalman-Bucy filter (Equation2) in the time interval

[0, t] .

Then

(41)

| G_{i j}^{C} (t) | \leq \bar{G} c (ρ e^{2} t, (| j - i | - L) / s), if | j - i | > L_{1}

(41) for some constant

\bar{G} > 0

and some integers

L, L_{1} \leq n .

The following theorem from Iserles (Citation2000) is fundamental to the proof of Theorem 1.

Theorem 2.

(Iserles, Citation2000) Let $E = e^{A}$ , where $A \in R^{n \times n}$ is a banded matrix of bandwidth $s \geq 1$ . Let $ρ = \max_{_{0 \leq i, j \leq n}} {| A_{i, j} |},$ then (42) $| E_{i, j} | \leq c (ρ, | i - j | / s) (e^{| i - j | / s} - \sum_{m = 0}^{| i - j | - 1} \frac{{(| i - j | / s)}^{m}}{m!}), | i - j | > L,$ (42) for some integer L > 0.

Because $\sum_{m = 0}^{N} \frac{{(| i - j | / s)}^{m}}{m!}$ is a sequence that is increasing and approaching $e^{| i - j | / s}$ when $N \to \infty,$ (Equation42(42) $| E_{i, j} | \leq c (ρ, | i - j | / s) (e^{| i - j | / s} - \sum_{m = 0}^{| i - j | - 1} \frac{{(| i - j | / s)}^{m}}{m!}), | i - j | > L,$ (42) ) implies the following inequality (43) $| E_{i, j} | \leq c (ρ e, | i - j | / s), | i - j | > L .$ (43)

Remark 1.

For problems with large dimensions, the proof in Iserles (Citation2000) assumes that $ρ ≪ n .$ This assumption leads to an estimation in which $L ≪ n .$ However, there is no computational formula of estimating L.

Lemma 2.

Let E and Z be n × n matrixes satisfying (Equation43(43) $| E_{i, j} | \leq c (ρ e, | i - j | / s), | i - j | > L .$ (43) ) for some integers L_E and L_Z, respectively. Let (44a) $L = \max {L_{E}, L_{Z}, s ρ e}$ (44a) (44b) $β = \max {E_{i j}, Z_{i j} | 1 \leq i, j \leq n} .$ (44b) For all $1 \leq i, j \leq n$ satisfying $| i - j | > 2 L$ , we have (45a) $| {(E Z)}_{i j} | \leq \frac{2}{1 - {(s ρ e / L)}^{1 / s}} c (ρ e, | j - i | / s)$ (45a) (45b) $+ 2 β (L + 1) c (ρ e, | j - i | / s)$ (45b) (45c) $+ 2 β L c (ρ e, (| j - i | - L) / s)$ (45c) (45d) $+ (| j - i | - 2 L - 1) c (ρ e^{2}, | j - i | / s)$ (45d)

Proof.

Without loss of generality, we assume i < j. More specifically, the index $1 \leq k \leq n$ has the following partition $1 < i - L < i < i + L \leq j - L < j < j + L < n .$ Using this partition, we group the terms in ${(E Z)}_{i j}$ (46a) ${(E Z)}_{i j} = \sum_{k = 1}^{i - L - 1} E_{i k} Z_{k j} + \sum_{k = i - L}^{i} E_{i k} Z_{k j} + \sum_{k = i + 1}^{i + L} E_{i k} Z_{k j} + \sum_{k = i + L + 1}^{j - L - 1} E_{i k} Z_{k j}$ (46a) (46b) $+ \sum_{k = j + L + 1}^{n} E_{i k} Z_{k j} + \sum_{k = j}^{j + L} E_{i k} Z_{k j} + \sum_{k = j - L}^{j - 1} E_{i k} Z_{k j}$ (46b)

In the following, we derive the upper bounds for every summation on the right-hand side of the equation in (Equation46(46a) ${(E Z)}_{i j} = \sum_{k = 1}^{i - L - 1} E_{i k} Z_{k j} + \sum_{k = i - L}^{i} E_{i k} Z_{k j} + \sum_{k = i + 1}^{i + L} E_{i k} Z_{k j} + \sum_{k = i + L + 1}^{j - L - 1} E_{i k} Z_{k j}$ (46a) ). The four summations in the top row represent four different cases. The summations in the second row are aligned with the first row in such a way that each summation has the same upper bound as that of the summation on top.

Case I, $1 \leq k < i - L .$ It is obvious that $| j - k | / s > (j - i) / s$

From (Equation16(16) $e^{- x^{2}} < c (α, x) < e^{- | x |} .$ (16) ) in Lemma 1, we have (47a) $| \sum_{k = 1}^{i - L - 1} E_{i k} Z_{k j} | \leq \sum_{k = 1}^{i - L - 1} c (ρ e, | i - k | / s) c (ρ e, | j - k | / s)$ (47a) (47b) $= \sum_{k = 1}^{i - L - 1} \frac{{(ρ e)}^{| i - k | / s}}{{(| i - k | / s)}^{| i - k | / s}} c (ρ e, | j - k | / s)$ (47b) (47c) $\leq \sum_{k = 1}^{i - L - 1} \frac{{(ρ e)}^{| i - k | / s}}{{(L / s)}^{| i - k | / s}} c (ρ e, | j - i | / s), (because | i - k | \geq L because | j - i | < | j - k |)$ (47c) (47d) $\leq \frac{1}{1 - {(\frac{s ρ e}{L})}^{1 / s}} c (ρ e, | j - i | / s)$ (47d)

It equals one half of the first term on the right-hand side of (Equation45(45a) $| {(E Z)}_{i j} | \leq \frac{2}{1 - {(s ρ e / L)}^{1 / s}} c (ρ e, | j - i | / s)$ (45a) ). Similarly, one can prove the same inequality for $j + L + 1 \leq k \leq n .$

Case II, $i - L \leq k \leq i .$ In this case we have (48a) $| \sum_{k = i - L}^{i} E_{i k} Z_{k j} | \leq β \sum_{k = i - L}^{i} c (ρ e, | j - k | / s)$ (48a) (48b) $\leq β \sum_{k = i - L}^{i} c (ρ e, | j - i | / s)$ (48b) (48c) $= β (L + 1) c (ρ e, | j - i | / s)$ (48c)

It equals one half of the second term on the right-hand side of (Equation45(45a) $| {(E Z)}_{i j} | \leq \frac{2}{1 - {(s ρ e / L)}^{1 / s}} c (ρ e, | j - i | / s)$ (45a) ). Similarly, one can prove the same inequality for $j \leq k \leq j + L .$

Case III, $i + 1 < k \leq i + L .$ Because $L \leq j - i - L \leq j - k,$ then (49a) $| \sum_{k = i + 1}^{i + L} E_{i k} Z_{k j} | \leq β \sum_{k = i + 1}^{i + L} c (ρ e, | j - k | / s)$ (49a) (49b) $\leq β \sum_{k = i + 1}^{i + L} c (ρ e, (j - i - L) / s) (because L \leq j - i - L \leq j - k)$ (49b) (49c) $= β L c (ρ e, (j - i - L) / s),$ (49c)

It equals one half of the third term on the right-hand side of (Equation45(45a) $| {(E Z)}_{i j} | \leq \frac{2}{1 - {(s ρ e / L)}^{1 / s}} c (ρ e, | j - i | / s)$ (45a) ). Similarly, one can prove the same inequality for $j - L \leq k \leq j - 1 .$

Case IV, $i + L < k < j - L .$ If $| j - i | = 2 L + 1,$ this is an empty index set. The coefficient in the corresponding term equals zero. Let’s assume that $| j - i | > 2 L + 1 .$ It is known that ${(1 + α / x)}^{x},$ x > 0, is an increasing function of x. It approaches $e^{α}$ when $x \to + \infty,$ i.e. (50) ${(1 + \frac{α}{x})}^{x} ↑ e .$ (50)

Consider (51a) $| \sum_{k = i + L + 1}^{j - L - 1} E_{i k} Z_{k j} | \leq \sum_{k = i + L + 1}^{j - L - 1} \frac{{(ρ e)}^{(k - i) / s}}{{((k - i) / s)}^{(k - i) / s}} \frac{{(ρ e)}^{(j - k) / s}}{{((j - k) / s)}^{(j - k) / s}}$ (51a) (51b) $= \sum_{k = i + L + 1}^{j - L - 1} \frac{{(ρ e)}^{(k - i) / s} {(1 + \frac{j - k}{k - i})}^{(k - i) / s}}{{(\frac{k - i}{s})}^{(k - i) / s} {(1 + \frac{j - k}{k - i})}^{(k - i) / s}}$ (51b) (51c) $\frac{{(ρ e)}^{(j - k) / s} {(1 + \frac{k - i}{j - k})}^{(j - k) / s}}{{(\frac{j - k}{s})}^{(j - k) / s} {(1 + \frac{k - i}{j - k})}^{(j - k) / s}}$ (51c) (51d) $= \sum_{k = i + L + 1}^{j - L - 1} \frac{{(ρ e)}^{(j - i) / s} {(1 + \frac{j - k}{k - i})}^{(k - i) / s} {(1 + \frac{k - i}{j - k})}^{(j - k) / s}}{{((j - i) / s)}^{(k - i) / s} {((j - i) / s)}^{(j - k) / s}}$ (51d)

Applying (Equation50(50) ${(1 + \frac{α}{x})}^{x} ↑ e .$ (50) ), we have (52a) $| \sum_{k = i + L + 1}^{j - L - 1} E_{i k} Z_{k j} | \leq \sum_{k = i + L + 1}^{j - L - 1} \frac{{(ρ e)}^{(j - i) / s} e^{(j - k) / s} e^{(k - i) / s}}{{((j - i) / s)}^{(j - i) / s}}$ (52a) (52b) $\leq \sum_{k = i + L + 1}^{j - L - 1} \frac{{(ρ e^{2})}^{(j - i) / s}}{{((j - i) / s)}^{(j - i) / s}}$ (52b) (52c) $\leq (j - i - 2 L - 1) c (ρ e^{2}, (j - i) / s)$ (52c)

It equals the 4th term on the right-hand side of (Equation45(45a) $| {(E Z)}_{i j} | \leq \frac{2}{1 - {(s ρ e / L)}^{1 / s}} c (ρ e, | j - i | / s)$ (45a) ). □

The Proof of Theorem 1. In the following, we prove (Equation41(41) $| G_{i j}^{C} (t) | \leq \bar{G} c (ρ e^{2} t, (| j - i | - L) / s), if | j - i | > L_{1}$ (41) ). We first assume that Q = I is identity. Let E(t) denote the exponential matrix $E (t) = e^{A t}$

According to Theorem 2 and the inequality (Equation43(43) $| E_{i, j} | \leq c (ρ e, | i - j | / s), | i - j | > L .$ (43) ), (53) $\begin{matrix} | E {(t)}_{i j} | \leq c (ρ e t, | i - j | / s), & if | i - j | > L_{1} \end{matrix}$ (53) for some $L_{1} > 0 .$ Obviously, the same inequality holds true for the matrix $e^{A^{T} t} .$ From Lemma 2, (54a) $| {(e^{A t} Q e^{A^{T} t})}_{i j} | \leq \frac{2}{1 - {(s ρ e / L)}^{1 / s}} c (ρ e t, | j - i | / s)$ (54a) (54b) $+ 2 β (L + 1) c (ρ e t, | j - i | / s)$ (54b) (54c) $+ 2 β L c (ρ e t, (| j - i | - L) / s)$ (54c) (54d) $+ (| j - i | - 2 L - 2) c (ρ e^{2} t, | j - i | / s), if | j - i | > 2 L$ (54d) where (55a) $L = \max {sρeT, L_{1}},$ (55a) (55b) $β = \max {{(e^{A t})}_{i j} | 1 \leq i, j \leq n, t \in [0, T]}$ (55b)

Because $c (α, x)$ is an increasing function of α and a decreasing function of x, (Equation54(54a) $| {(e^{A t} Q e^{A^{T} t})}_{i j} | \leq \frac{2}{1 - {(s ρ e / L)}^{1 / s}} c (ρ e t, | j - i | / s)$ (54a) ) implies (56a) $| {(e^{A t} Q e^{A^{T} t})}_{i j} | \leq {\bar{G}}_{1} c (ρ e^{2} T, (| j - i | - L) / s), if | j - i | > 2 L$ (56a) for some ${\bar{G}}_{1} > 0 .$ Substituting (Equation56(56a) $| {(e^{A t} Q e^{A^{T} t})}_{i j} | \leq {\bar{G}}_{1} c (ρ e^{2} T, (| j - i | - L) / s), if | j - i | > 2 L$ (56a) ) into the controllability gramian, (57a) $| G_{i j} | \leq \int_{0}^{T} {\bar{G}}_{1} c (ρ e^{2} T, (| j - i | - L) / s) d ξ$ (57a) (57b) $= {\bar{G}}_{1} T c (ρ e^{2} T, (| j - i | - L) / s)$ (57b)

This is equivalent to (Equation41(41) $| G_{i j}^{C} (t) | \leq \bar{G} c (ρ e^{2} t, (| j - i | - L) / s), if | j - i | > L_{1}$ (41) ). If Q is banded with bandwidth s_Q, then Q equals the sum of $2 s_{Q} + 1$ diagonal matrices along the main diagonal and s_Q subdioganols. The subdiagonals in Q do not change the expression in the inequality (Equation56(56a) $| {(e^{A t} Q e^{A^{T} t})}_{i j} | \leq {\bar{G}}_{1} c (ρ e^{2} T, (| j - i | - L) / s), if | j - i | > 2 L$ (56a) ) except that it increases $\bar{G}$ and L. □

Some quantitative characteristics of error covariance for Kalman filters

Abstract

1. Introduction

2. Error covariance and controllability Gramian