
Autoregressive moving average model for matrix time series

Pages 318-335 | Received 08 Dec 2022, Accepted 18 Sep 2023, Published online: 04 Oct 2023

Abstract

In this paper, the autoregressive moving average model for matrix time series (MARMA) is investigated. Using conditional least squares estimation, conditional maximum likelihood estimation, the projection theorem in Hilbert space and the decomposition technique of time series, we study the properties of the MARMA model, including necessary and sufficient conditions for stationarity and invertibility, parameter estimation, model testing and model forecasting.

1. Introduction

Matrix time series are time series whose cross-sectional data are matrices; they arise in a variety of fields such as economics, business, ecology, psychology, meteorology, biology and fMRI (Samadi, 2014). For example, consider two stocks, $A_1$ and $A_2$, as potential investment products, whose prices and volumes are selected as two analysis factors. Denote the price and volume of stock $A_k$ at time $t$ by $P_k(t)$ and $V_k(t)$, $k=1,2$; then a $2\times 2$-dimensional matrix time series can be constructed as follows:
\[
\Big\{ X_t \triangleq \begin{bmatrix} P_1(t) & P_2(t) \\ V_1(t) & V_2(t) \end{bmatrix},\ t=1,2,\dots \Big\}.
\]
Matrix time series have attracted scholars' attention since the beginning of the century. Walden and Serroukh (2002) studied the construction of matrix-valued filters for multi-resolution analysis of matrix time series. Samadi (2014) brought forward and investigated a $p$-order autoregressive model for matrix time series, which is essentially a VAR($p$) model in matrix form. D. Wang et al. (2019) proposed a novel factor model
\[
X_t = R F_t C + \epsilon_t, \quad t=1,2,\dots,
\]
where $X_t$ and $F_t$ are matrix time series. Chen et al. (2021) first proposed a first-order autoregressive model for matrix time series in the bilinear form, denoted by MAR(1),
\[
X_t = A X_{t-1} B + \epsilon_t, \quad t=1,2,\dots, \tag{1}
\]
and investigated its stationarity, causality, parameter estimation methods and the asymptotic properties of the estimators. Wu and Hua (2022) independently proposed the $p$-order autoregressive model for matrix time series in the bilinear form, denoted by MAR($p$),
\[
X_t = \sum_{k=1}^{p} A_k X_{t-k} B_k + \epsilon_t, \quad t=1,2,\dots, \tag{2}
\]
and presented parameter estimation, a model identification criterion and model checking. For more literature on matrix time series, one can refer to H. Wang and West (2009), Zhou et al. (2018), Getmanov et al. (2021) and their references.

It is widely known that the autoregressive moving average (ARMA) model plays a very important role in the theory and applications of one-dimensional time series, and we will show later that a bilinear model has unique advantages for matrix time series. In this paper, the autoregressive moving average model for matrix time series (MARMA) is first proposed and investigated. Necessary and sufficient conditions for the stationarity of MARMA are provided, and parameter estimation is considered via the conditional least squares method and the conditional maximum likelihood method. Finally, an example is presented to show an application of the MARMA model.

2. Preliminaries

Let $(\Omega,\mathcal{F},P)$ be a probability space with a $\sigma$-filtration $\{\mathcal{F}_t, t\in\mathbb{N}\}$ on which the second moment of each variable exists, where $\mathbb{N}=\{1,2,3,\dots\}$.

Definition 2.1

For any given positive integers $m$ and $n$, an $m\times n$-dimensional matrix time series refers to
\[
X=\{(X_{ij}(t))_{m\times n},\ t\in\mathbb{N}\}, \tag{3}
\]
where $\{X_{ij}(t),t\in\mathbb{N}\}$ is a one-dimensional time series on a probability space $(\Omega,\mathcal{F},P)$ for each $i=1,2,\dots,m$ and $j=1,2,\dots,n$.

Definition 2.2

Let $X=\{X(t),t\in\mathbb{N}\}$ be an $m\times n$-dimensional matrix time series defined by (3); then its mean function is
\[
\mu_X(t)\triangleq E[X(t)]=(E[X_{ij}(t)])_{m\times n},\quad t\in\mathbb{N}. \tag{4}
\]
Additionally, its autocovariance function is
\[
\Gamma_X(t,s)\triangleq \Gamma_{\mathrm{vec}(X)}(t,s)=(\sigma_{ij,k\ell}(t,s))_{mn\times mn}, \tag{5}
\]
where $\sigma_{ij,k\ell}(t,s)=\mathrm{cov}(X_{ij}(t),X_{k\ell}(s))$, $i,k=1,2,\dots,m$ and $j,\ell=1,2,\dots,n$; $t,s\in\mathbb{N}$, and $\mathrm{vec}(X(t))$ is the vectorization of $X(t)$ by columns, that is,
\[
\mathrm{vec}(X(t))=[X_{11}(t),X_{21}(t),\dots,X_{m1}(t),X_{12}(t),X_{22}(t),\dots,X_{mn}(t)]'. \tag{6}
\]
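To make Definition 2.2 concrete, here is a minimal numerical sketch (our own illustration, not part of the paper; array shapes and names are assumptions) that forms the column-wise vectorization of a simulated m×n matrix time series and estimates its mean function and lag-zero autocovariance with numpy:

```python
import numpy as np

rng = np.random.default_rng(0)
m, n, T = 2, 3, 500
X = rng.standard_normal((T, m, n))              # X[t] is the m x n observation at time t

# Column-wise vectorization: vec(X(t)) stacks the columns of X(t), as in (6).
vecX = X.transpose(0, 2, 1).reshape(T, m * n)

mu_hat = X.mean(axis=0)                         # estimate of the mean function (m x n)
Gamma0_hat = np.cov(vecX, rowvar=False)         # lag-0 sample autocovariance of vec(X(t)), mn x mn

print(mu_hat.shape, Gamma0_hat.shape)           # (2, 3) (6, 6)
```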

Stationarity and matrix white noise play a very important role in time series analysis. Thus, we introduce the concepts of a stationary matrix time series and matrix white noise in the following.

Definition 2.3

Let $\{X(t),t\in\mathbb{N}\}$ be a matrix time series defined by (3) and $\mathrm{vec}(X(t))$ be the vectorization of $X(t)$ defined by (6). Then $\{X(t),t\in\mathbb{N}\}$ is a stationary matrix time series if and only if $\{\mathrm{vec}(X(t)),t\in\mathbb{N}\}$ is stationary.

Definition 2.4

For any given positive integers $m$ and $n$, let $\epsilon=\{(\epsilon_{ij}(t))_{m\times n},\ t\in\mathbb{N}\}$ be an $m\times n$-dimensional matrix time series; then $\epsilon$ is called an $m\times n$-dimensional matrix white noise if it satisfies the following conditions.

  1. Its mean function $\mu_\epsilon(t)=O_{m\times n}$ for all $t\in\mathbb{N}$, where $O_{m\times n}$ is the $m\times n$-dimensional zero matrix.

  2. Its autocovariance function $\Gamma_\epsilon(t,s)$ defined by Definition 2.2 satisfies
\[
\Gamma_\epsilon(t,s)=\begin{cases} O_{mn}, & t\neq s,\\ \Sigma_{mn}, & t=s,\end{cases}\qquad t,s\in\mathbb{N},
\]
where $O_{mn}$ is the $mn\times mn$-dimensional zero matrix, and
\[
\Sigma_{mn}=\mathrm{diag}(\sigma_{11}^2,\sigma_{21}^2,\dots,\sigma_{m1}^2,\sigma_{12}^2,\sigma_{22}^2,\dots,\sigma_{(m-1)n}^2,\sigma_{mn}^2) \tag{7}
\]
is the $mn\times mn$-dimensional diagonal matrix with diagonal entries $\sigma_{11}^2,\sigma_{21}^2,\dots,\sigma_{m1}^2,\sigma_{12}^2,\sigma_{22}^2,\dots,\sigma_{(m-1)n}^2,\sigma_{mn}^2$.

For any matrix white noise $\{\epsilon(t),t\in\mathbb{N}\}$, if its vectorization by columns, $\{\mathrm{vec}(\epsilon(t)),t\in\mathbb{N}\}$, is Gaussian, then $\{\epsilon(t),t\in\mathbb{N}\}$ is called a matrix Gaussian white noise.

Property 2.1

For any $m\times n$-dimensional matrix time series $\{\epsilon(t),t\in\mathbb{N}\}$, it is an $m\times n$-dimensional matrix white noise if and only if $\{\mathrm{vec}(\epsilon(t)),t\in\mathbb{N}\}$ is an $mn$-dimensional vector white noise.

The proof of Property 2.1 is not difficult, so we omit it.

When investigating the autoregressive moving average model for matrix time series, we will use the Kronecker product, the matrix reshape operation and the derivative of a matrix with respect to a matrix. Thus, we introduce them in the following.

Definition 2.5

(Graham, 2018)

Assume matrices $A=(a_{ij})_{m\times n}$ and $C=(c_{ij})_{p\times q}$; then the $m\times n$ block matrix $(a_{ij}C)_{m\times n}$ is called the Kronecker product of $A$ and $C$, denoted by $A\otimes C$, that is,
\[
A\otimes C=\begin{bmatrix} a_{11}C & a_{12}C & \cdots & a_{1n}C\\ a_{21}C & a_{22}C & \cdots & a_{2n}C\\ \vdots & \vdots & & \vdots\\ a_{m1}C & a_{m2}C & \cdots & a_{mn}C \end{bmatrix}.
\]
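As a quick numerical illustration of Definition 2.5 (a toy example of ours; numpy's kron implements exactly this block construction):

```python
import numpy as np

A = np.array([[1, 2],
              [3, 4]])
C = np.array([[0, 5],
              [6, 7]])

K = np.kron(A, C)                      # 4 x 4 block matrix (a_ij * C)
# The (1,2) block of K equals a_12 * C = 2 * C.
assert np.array_equal(K[0:2, 2:4], 2 * C)
print(K)
```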

Definition 2.6

For any $A=(a_{ij})_{m\times n}$ and positive integers $p$, $q$ satisfying $pq=mn$, the $(p,q)$-order reshaped matrix of $A$, denoted by $\mathrm{Res}(A,p,q)$, is defined by
\[
\mathrm{Res}(A,p,q)=\begin{bmatrix} a_1 & a_{p+1} & \cdots & a_{(q-1)p+1}\\ a_2 & a_{p+2} & \cdots & a_{(q-1)p+2}\\ \vdots & \vdots & & \vdots\\ a_p & a_{2p} & \cdots & a_{pq} \end{bmatrix},
\]
where $a_k=a_{ij}$ for all $k=1,2,\dots,pq$, with $i=k-m[(k-1)/m]$ and $j=[(k-1)/m]+1$, and $[\cdot]$ is the operator of taking the integer part.
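Definition 2.6 amounts to enumerating A by columns and refilling a p×q matrix by columns, i.e., a Fortran-order reshape. A minimal sketch (the helper name res is ours):

```python
import numpy as np

def res(A, p, q):
    """(p, q)-order reshape of A: enumerate A by columns, then refill by columns."""
    a = A.reshape(-1, order='F')           # a_1, ..., a_{mn}: column-major enumeration
    return a.reshape((p, q), order='F')    # refill column by column into a p x q matrix

A = np.arange(1, 7).reshape(2, 3, order='F')   # 2 x 3 matrix with entries 1..6 by columns
print(res(A, 3, 2))
# [[1 4]
#  [2 5]
#  [3 6]]
```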

Definition 2.7

(Graham, 2018)

Let $F=(F_{ij})_{m\times n}$ and $X=(X_{ij})_{p\times q}$ be two matrices, where $m$, $n$, $p$ and $q$ are natural numbers. The derivative of matrix $F$ with respect to matrix $X$ is defined by
\[
\frac{\partial F}{\partial X}=\begin{bmatrix} \frac{\partial F}{\partial X_{11}} & \frac{\partial F}{\partial X_{12}} & \cdots & \frac{\partial F}{\partial X_{1q}}\\ \frac{\partial F}{\partial X_{21}} & \frac{\partial F}{\partial X_{22}} & \cdots & \frac{\partial F}{\partial X_{2q}}\\ \vdots & \vdots & & \vdots\\ \frac{\partial F}{\partial X_{p1}} & \frac{\partial F}{\partial X_{p2}} & \cdots & \frac{\partial F}{\partial X_{pq}} \end{bmatrix},
\]
where the derivative of matrix $F$ with respect to the scalar $X_{ij}$ is defined by
\[
\frac{\partial F}{\partial X_{ij}}=\begin{bmatrix} \frac{\partial F_{11}}{\partial X_{ij}} & \frac{\partial F_{12}}{\partial X_{ij}} & \cdots & \frac{\partial F_{1n}}{\partial X_{ij}}\\ \frac{\partial F_{21}}{\partial X_{ij}} & \frac{\partial F_{22}}{\partial X_{ij}} & \cdots & \frac{\partial F_{2n}}{\partial X_{ij}}\\ \vdots & \vdots & & \vdots\\ \frac{\partial F_{m1}}{\partial X_{ij}} & \frac{\partial F_{m2}}{\partial X_{ij}} & \cdots & \frac{\partial F_{mn}}{\partial X_{ij}} \end{bmatrix},\quad i=1,2,\dots,p,\ j=1,2,\dots,q.
\]

For the derivative of a matrix with respect to a matrix, its product rule and two common formulas are given in Properties 2.2 and 2.3.

Property 2.2

(Graham, 2018)

For any $X=(x_{ij})_{m\times n}$, $Y=(y_{ij})_{n\times u}$ and $Z=(z_{ij})_{p\times q}$, it follows that
\[
\frac{\partial (XY)}{\partial Z}=\frac{\partial X}{\partial Z}(I_q\otimes Y)+(I_p\otimes X)\frac{\partial Y}{\partial Z},
\]
where $I_q$ is the $q\times q$-dimensional identity matrix.

Taking $Y=(y_{ij})_{n\times 1}$ and $X=Y'$ in Property 2.2, we obtain Corollary 2.1.

Corollary 2.1

For any $Y=(y_{ij})_{n\times 1}$ and $Z=(z_{ij})_{p\times q}$, it follows that
\[
\frac{\partial (Y'Y)}{\partial Z}=2\,\frac{\partial Y'}{\partial Z}\,(I_q\otimes Y).
\]

Property 2.3

(Graham, 2018)

For any $A=(a_{ij})_{m\times n}$, $B=(b_{ij})_{n\times u}$ and invertible $X=(X_{ij})_{n\times n}$, it follows that
\[
\frac{\partial\,\mathrm{vec}(AX^{-1}B)}{\partial\,\mathrm{vec}(X)}=-(X^{-1}B)\otimes(AX^{-1})'
\quad\text{and}\quad
\frac{\partial \ln(|X|)}{\partial X}=(X^{-1})'.
\]

Taking $B=(b_{ij})_{n\times 1}$ and $A=B'$ in Property 2.3, we obtain Corollary 2.2.

Corollary 2.2

For any $B=(b_{ij})_{n\times 1}$ and invertible $X=(X_{ij})_{n\times n}$, it follows that
\[
\frac{\partial (B'X^{-1}B)}{\partial X}=-\mathrm{Res}\big((X^{-1}B)\otimes(B'X^{-1})',\,n,\,n\big).
\]

3. Autoregressive moving average model for matrix time series

The autoregressive moving average model for matrix time series is an extension of the vector autoregressive moving average (VARMA) model to matrix time series. However, we cannot build the autoregressive moving average model for matrix time series in the VARMA form
\[
X(t)=\Phi_0+\sum_{i=1}^{p}\Phi_i X(t-i)+\epsilon(t)-\sum_{j=1}^{q}\Psi_j\epsilon(t-j). \tag{8}
\]
The reason is that the form of (8) cannot describe the dependence between the different columns of $X(t)$ according to the rule of matrix multiplication. That is, the $\ell$th column of $X(t)$ is not affected by the $s$th column of $X(t-1),X(t-2),\dots,X(t-p)$ when $s\neq\ell$.

3.1. MARMA (p,q) model

In this section, an autoregressive moving average model for matrix time series (MARMA) is first brought forward, whose degenerate case, the autoregressive model for matrix time series (MAR), is exactly model (2) proposed by Wu and Hua (2022) and an extension of model (1) proposed by Chen et al. (2021).

Definition 3.1

Let $\{X(t),t\in\mathbb{N}\}$ be an $m\times n$-dimensional matrix time series. If $X$ is stationary and for each $t\in\mathbb{N}$ it follows that
\[
X(t)=C+\sum_{k=1}^{p}\Phi_k X(t-k)\Psi_k+\epsilon(t)-\sum_{j=1}^{q}\Theta_j\epsilon(t-j)\Xi_j, \tag{9}
\]
where $C$ is an $m\times n$-dimensional matrix; $\Phi_k$ and $\Theta_j$ are $m\times m$-dimensional matrices, and $\Psi_k$ and $\Xi_j$ are $n\times n$-dimensional matrices for each $k=1,2,\dots,p$ and $j=1,2,\dots,q$, where $p$ and $q$ are two nonnegative integers; and $\{\epsilon(t),t\in\mathbb{N}\}$ is an $m\times n$-dimensional matrix white noise such that $\mathrm{vec}(\epsilon(t))$ is independent of $\mathrm{vec}(X(s))$ for all $s<t$, then $\{X(t),t\in\mathbb{N}\}$ is said to follow a $(p,q)$-order autoregressive moving average model for matrix time series, denoted by MARMA($p,q$).
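For illustration, a centered MARMA(1,1) path can be simulated directly from (9). The following sketch is our own toy example; the coefficient matrices are arbitrary and chosen small so that the stationarity condition of Section 3.3 holds:

```python
import numpy as np

rng = np.random.default_rng(1)
m, n, T = 2, 2, 1000

Phi = np.array([[0.4, 0.1], [0.0, 0.3]])     # m x m
Psi = np.array([[1.0, 0.2], [0.1, 0.5]])     # n x n
Theta = np.array([[0.3, 0.0], [0.1, 0.2]])   # m x m
Xi = np.array([[1.0, 0.0], [0.2, 0.4]])      # n x n

eps = rng.standard_normal((T, m, n)) * 0.1
X = np.zeros((T, m, n))
for t in range(1, T):
    # X(t) = Phi X(t-1) Psi + eps(t) - Theta eps(t-1) Xi   (C = O, p = q = 1)
    X[t] = Phi @ X[t - 1] @ Psi + eps[t] - Theta @ eps[t - 1] @ Xi

print(X.mean(axis=0))   # close to the zero matrix for a stationary, centered model
```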

When $q=0$, the MARMA($p$,0) model (9) degenerates into the form
\[
X(t)=C+\sum_{k=1}^{p}\Phi_k X(t-k)\Psi_k+\epsilon(t), \tag{10}
\]
which is a $p$-order autoregressive model for matrix time series, MAR($p$).

When $p=0$, the MARMA(0,$q$) model (9) degenerates into the form
\[
X(t)=C+\epsilon(t)-\sum_{j=1}^{q}\Theta_j\epsilon(t-j)\Xi_j, \tag{11}
\]
which is called a $q$-order moving average model for matrix time series, denoted by MMA($q$).

If $X=\{X(t),t\in\mathbb{N}\}$ is an $m\times n$-dimensional matrix time series defined by (3) and $X$ is stationary, denote $\mu=E[X(t)]$, $t\in\mathbb{N}$; then it follows from the MARMA($p,q$) model (9) that
\[
\mu=C+\Phi_1\mu\Psi_1+\Phi_2\mu\Psi_2+\cdots+\Phi_p\mu\Psi_p. \tag{12}
\]
Denote $Y(t)=X(t)-\mu$. It yields from (12) and the MARMA($p,q$) model (9) that
\[
Y(t)=\sum_{k=1}^{p}\Phi_k Y(t-k)\Psi_k+\epsilon(t)-\sum_{j=1}^{q}\Theta_j\epsilon(t-j)\Xi_j \tag{13}
\]
holds for all $t\in\mathbb{N}$, and then $Y=\{Y(t),t\in\mathbb{N}\}$ is said to follow a $(p,q)$-order centralized MARMA($p,q$) model.

Every MARMA($p,q$) model can be transformed into a centralized MARMA($p,q$) model, and the two models have the same coefficient parameters. Thus, when estimating the coefficient parameters of the MARMA($p,q$) model (9) we will mainly study the centralized MARMA($p,q$) model (13).

For any MARMA($p,q$) model (9), and for any $c_k\neq 0$ and $d_j\neq 0$, $k=1,2,\dots,p$ and $j=1,2,\dots,q$, it follows that
\[
X(t)=C+\sum_{k=1}^{p}(c_k\Phi_k)X(t-k)\Big(\tfrac{1}{c_k}\Psi_k\Big)+\epsilon(t)-\sum_{j=1}^{q}(d_j\Theta_j)\epsilon(t-j)\Big(\tfrac{1}{d_j}\Xi_j\Big),
\]
that is, the coefficient parameters of the MARMA($p,q$) model (9) are not unique. Thus, we impose the constraint conditions that
\[
\Psi_k=(\psi_{uv})_{n\times n}\ \text{satisfies}\ \max_{1\le u,v\le n}|\psi_{uv}|=1 \tag{14}
\]
and
\[
\Xi_j=(\xi_{uv})_{n\times n}\ \text{satisfies}\ \max_{1\le u,v\le n}|\xi_{uv}|=1 \tag{15}
\]
for all $k=1,2,\dots,p$ and $j=1,2,\dots,q$.

3.2. Relationship between MARMA model and VARMA model

When the column number of the matrix $X(t)$ equals one, i.e., $n=1$, the MARMA($p,q$) model (9) degenerates into a $(p,q)$-order vector autoregressive moving average model, VARMA($p,q$), as follows:
\[
X(t)=C+\sum_{k=1}^{p}\Phi_k X(t-k)+\epsilon(t)-\sum_{j=1}^{q}\Theta_j\epsilon(t-j), \tag{16}
\]
where $\{X(t),t\in\mathbb{N}\}$ is an $m$-dimensional vector time series, $C$ is an $m$-dimensional vector, $\Phi_k$ and $\Theta_j$ are $m\times m$-dimensional matrices for all $k=1,2,\dots,p$ and $j=1,2,\dots,q$, and $\{\epsilon(t),t\in\mathbb{N}\}$ is an $m$-dimensional vector white noise such that $\epsilon(t)$ is independent of $X(s)$ for all $s<t$. Obviously, the VARMA model (16) is a special case of the MARMA model (9).

On the other hand, for any $m\times n$-dimensional matrix time series $\{X(t),t\in\mathbb{N}\}$, its vectorization $\{\mathrm{vec}(X(t)),t\in\mathbb{N}\}$ is an $mn\times 1$-dimensional time series, and the $(p,q)$-order vector autoregressive moving average model VARMA($p,q$) for $\{\mathrm{vec}(X(t)),t\in\mathbb{N}\}$ follows as
\[
\mathrm{vec}(X(t))=A_0+\sum_{k=1}^{p}A_k\,\mathrm{vec}(X(t-k))+\epsilon(t)-\sum_{j=1}^{q}B_j\,\epsilon(t-j), \tag{17}
\]
where $A_0$ is an $mn\times 1$-dimensional vector; $A_k$ and $B_j$ are $mn\times mn$-dimensional matrices for $k=1,2,\dots,p$ and $j=1,2,\dots,q$; and $\{\epsilon(t),t\in\mathbb{N}\}$ is an $mn\times 1$-dimensional white noise such that $\epsilon(t)$ is independent of $\mathrm{vec}(X(s))$ for all $s<t$.

A natural question is why the authors still bring forward the MARMA($p,q$) model (9) for $\{X(t),t\in\mathbb{N}\}$ rather than directly using the VARMA($p,q$) model (17) for $\{\mathrm{vec}(X(t)),t\in\mathbb{N}\}$.

In fact, there are two important reasons why the authors propose the MARMA($p,q$) model (9) for $\{X(t),t\in\mathbb{N}\}$. Firstly, the MARMA($p,q$) model (9) reveals the information structure of $\{X(t),t\in\mathbb{N}\}$ very clearly. Secondly, the MARMA($p,q$) model (9) reduces the number of model parameters far more than the VARMA($p,q$) model (17) for $\{\mathrm{vec}(X(t)),t\in\mathbb{N}\}$. In fact, the parameter number of the MARMA($p,q$) model (9) is $2mn+(p+q)(m^2+n^2)$, whereas the parameter number of the VARMA($p,q$) model (17) is $2mn+(p+q)m^2n^2$. Generally,
\[
2mn+(p+q)(m^2+n^2)\ll 2mn+(p+q)m^2n^2.
\]
For example, if $p=q=1$ and $m=n=10$, then
\[
2mn+(p+q)(m^2+n^2)=600\ll 2mn+(p+q)m^2n^2=20200.
\]
In today's big data era, $m$ and $n$ are often very large; taking $m=n=100$ and $p=q=1$ as an example,
\[
2mn+(p+q)(m^2+n^2)=60000\ll 2mn+(p+q)m^2n^2=200020000.
\]
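The parameter counts above can be checked with a couple of lines (our own arithmetic sketch):

```python
def marma_params(m, n, p, q):
    return 2 * m * n + (p + q) * (m * m + n * n)

def varma_params(m, n, p, q):
    return 2 * m * n + (p + q) * (m * n) ** 2

print(marma_params(10, 10, 1, 1), varma_params(10, 10, 1, 1))       # 600 20200
print(marma_params(100, 100, 1, 1), varma_params(100, 100, 1, 1))   # 60000 200020000
```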

Remark 3.1

The MARMA($p,q$) model (9) greatly reduces the number of model parameters compared with the VARMA($p,q$) model (17).

Although it is not a good idea to replace the MARMA($p,q$) model (9) with the VARMA($p,q$) model (17), in the following we show that there exists a special VARMA($p,q$) model equivalent to the MARMA($p,q$) model, which plays a very important role in the theoretical analysis of the MARMA($p,q$) model (9).

Theorem 3.1

The MARMA($p,q$) model (9) for $\{X(t),t\in\mathbb{N}\}$ is equivalent to the following VARMA($p,q$) model (18) for $\{\mathrm{vec}(X(t)),t\in\mathbb{N}\}$:
\[
\mathrm{vec}(X(t))=\mathrm{vec}(C)+\sum_{k=1}^{p}(\Psi_k'\otimes\Phi_k)\,\mathrm{vec}(X(t-k))+\mathrm{vec}(\epsilon(t))-\sum_{j=1}^{q}(\Xi_j'\otimes\Theta_j)\,\mathrm{vec}(\epsilon(t-j)), \tag{18}
\]
where $\mathrm{vec}(X(t))$ and $\mathrm{vec}(\epsilon(t))$ denote the vectorization of the matrices $X(t)$ and $\epsilon(t)$ by columns, and $\otimes$ is the Kronecker product.

Theorem 3.1 can be proved by the following equivalence relation: for any matrices $Y_{m\times n}$, $A_{m\times m}$, $B_{m\times n}$ and $C_{n\times n}$, it follows that
\[
Y_{m\times n}=A_{m\times m}B_{m\times n}C_{n\times n}\iff \mathrm{vec}(Y_{m\times n})=(C_{n\times n}'\otimes A_{m\times m})\,\mathrm{vec}(B_{m\times n}).
\]
The equivalence relation is not difficult to prove, so we omit its proof and that of Theorem 3.1.
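The vectorization identity behind Theorem 3.1 is easy to verify numerically; the sketch below (ours, with column-wise vec implemented via a Fortran-order reshape) checks that vec(ABC) = (C' ⊗ A) vec(B):

```python
import numpy as np

rng = np.random.default_rng(2)
m, n = 3, 2
A = rng.standard_normal((m, m))
B = rng.standard_normal((m, n))
C = rng.standard_normal((n, n))

vec = lambda M: M.reshape(-1, order='F')   # column-wise vectorization

lhs = vec(A @ B @ C)
rhs = np.kron(C.T, A) @ vec(B)             # vec(ABC) = (C' kron A) vec(B)
assert np.allclose(lhs, rhs)
```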

3.3. Stationarity and invertibility conditions for the MARMA model

According to Theorem 3.1, any MARMA($p,q$) model (9) can be converted into its corresponding VARMA($p,q$) model (18). Furthermore, the VARMA($p,q$) model (18) can be rewritten as
\[
P(B)\,\mathrm{vec}(X(t))=\mathrm{vec}(C)+Q(B)\,\mathrm{vec}(\epsilon(t)),\quad t\in\mathbb{N}, \tag{19}
\]
where
\[
P(B)=I_{mn}-\sum_{k=1}^{p}(\Psi_k'\otimes\Phi_k)B^k, \tag{20}
\]
\[
Q(B)=I_{mn}-\sum_{j=1}^{q}(\Xi_j'\otimes\Theta_j)B^j \tag{21}
\]
and $B$ is the delay operator, i.e., $BX(t)=X(t-1)$ holds for all $t\in\mathbb{N}$.

Theorem 3.2

For the MARMA($p,q$) model (9), the necessary and sufficient condition for stationarity is that every root $\lambda$ of (22) lies inside the unit circle, where
\[
\big|\lambda^p I_{mn}-\lambda^{p-1}(\Psi_1'\otimes\Phi_1)-\lambda^{p-2}(\Psi_2'\otimes\Phi_2)-\cdots-\lambda(\Psi_{p-1}'\otimes\Phi_{p-1})-(\Psi_p'\otimes\Phi_p)\big|=0. \tag{22}
\]
The necessary and sufficient condition for invertibility is that every root $\lambda$ of (23) lies inside the unit circle, where
\[
\big|\lambda^q I_{mn}-\lambda^{q-1}(\Xi_1'\otimes\Theta_1)-\lambda^{q-2}(\Xi_2'\otimes\Theta_2)-\cdots-\lambda(\Xi_{q-1}'\otimes\Theta_{q-1})-(\Xi_q'\otimes\Theta_q)\big|=0. \tag{23}
\]

The proof of Theorem 3.2 is presented in Appendix 1.
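In practice, the root condition (22) can be checked by computing the eigenvalues of the block companion matrix of the AR polynomial, whose eigenvalues are exactly the roots of (22). A hedged sketch (function name and structure are ours; it assumes the vectorized AR coefficients Ψ_k'⊗Φ_k as in (20)):

```python
import numpy as np

def is_stationary(Phis, Psis):
    """Check the root condition of Theorem 3.2 for the AR part of a MARMA model.

    Phis, Psis: lists of the m x m and n x n AR coefficient matrices.
    """
    A = [np.kron(Psi.T, Phi) for Phi, Psi in zip(Phis, Psis)]   # vectorized AR coefficients
    mn, p = A[0].shape[0], len(A)
    companion = np.zeros((p * mn, p * mn))
    companion[:mn, :] = np.hstack(A)                 # first block row: A_1 ... A_p
    companion[mn:, :-mn] = np.eye((p - 1) * mn)      # sub-diagonal identity blocks
    return np.max(np.abs(np.linalg.eigvals(companion))) < 1

Phi = np.array([[0.4, 0.1], [0.0, 0.3]])
Psi = np.array([[1.0, 0.2], [0.1, 0.5]])
print(is_stationary([Phi], [Psi]))   # True for these small coefficients
```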

Corollary 3.1

For the MAR($p$) model (10), the necessary and sufficient condition for stationarity is that every root $\lambda$ of (22) lies inside the unit circle.

Remark 3.2

Corollary 3.1 extends Proposition 1 in Chen et al. (2021).

Corollary 3.2

For the MMA($q$) model (11), the necessary and sufficient condition for invertibility is that every root $\lambda$ of (23) lies inside the unit circle.

3.4. Parameter estimation for MARMA model

In this section, we present the conditional least squares method and the conditional maximum likelihood estimation method for the MARMA($p,q$) model (9).

Let $x_1,x_2,\dots,x_N$ be a series of samples of the centralized matrix time series $X=\{X(t),t\in\mathbb{N}\}$ defined by (3) with $C=O_{m\times n}$, where
\[
x_t=\begin{bmatrix} x_{11}(t) & x_{12}(t) & \cdots & x_{1n}(t)\\ x_{21}(t) & x_{22}(t) & \cdots & x_{2n}(t)\\ \vdots & \vdots & & \vdots\\ x_{m1}(t) & x_{m2}(t) & \cdots & x_{mn}(t) \end{bmatrix},\quad t=1,2,\dots,N, \tag{24}
\]
and the integer $N$ is the sample length.

Once the coefficient parameters of the MARMA($p,q$) model (9) have been obtained, it follows from (12) that
\[
C=\mu-\Phi_1\mu\Psi_1-\Phi_2\mu\Psi_2-\cdots-\Phi_p\mu\Psi_p,
\]
and then the constant matrix $C$ of the MARMA($p,q$) model (9) can be estimated by
\[
\hat{C}=\bar{X}-\hat\Phi_1\bar{X}\hat\Psi_1-\hat\Phi_2\bar{X}\hat\Psi_2-\cdots-\hat\Phi_p\bar{X}\hat\Psi_p,\quad\text{where}\quad \bar{X}=\frac{1}{N}\sum_{t=1}^{N}x_t.
\]
Thus, in the following we always assume that the samples come from a centralized MARMA($p,q$) model (9), i.e., $C=O_{m\times n}$.

We use the VARMA($p,q$) model (19) with $C=O_{m\times n}$, which is equivalent to the centralized MARMA($p,q$) model (9), to estimate the coefficient parameters of the MARMA($p,q$) model (9) by the conditional least squares method.

It yields from (19) with $C=O_{m\times n}$ that
\[
\mathrm{vec}(\epsilon(t))=Q^{-1}(B)P(B)\,\mathrm{vec}(X(t)),\quad t\in\mathbb{N}, \tag{25}
\]
where $Q^{-1}(B)$ is the inverse operator of $Q(B)$, and
\[
P(B)=I_{mn}-\sum_{k=1}^{p}(\Psi_k'\otimes\Phi_k)B^k,\qquad Q(B)=I_{mn}-\sum_{j=1}^{q}(\Xi_j'\otimes\Theta_j)B^j.
\]
For the sake of brevity, denote
\[
G(B)=\sum_{k=0}^{+\infty}G_kB^k=Q^{-1}(B)P(B) \tag{26}
\]
and $P(B)=\sum_{i=0}^{+\infty}P_iB^i$, $Q(B)=\sum_{j=0}^{+\infty}Q_jB^j$, where we stipulate that
\[
P_i=\begin{cases} I_{mn}, & i=0,\\ -\Psi_i'\otimes\Phi_i, & 1\le i\le p,\\ O_{mn}, & i\ge p+1,\end{cases}\qquad
Q_j=\begin{cases} I_{mn}, & j=0,\\ -\Xi_j'\otimes\Theta_j, & 1\le j\le q,\\ O_{mn}, & j\ge q+1.\end{cases} \tag{27}
\]
It follows from (26) that $Q(B)G(B)=P(B)$, which means that
\[
\sum_{i=0}^{k}Q_iG_{k-i}=P_k,\quad k=0,1,2,\dots. \tag{28}
\]
It yields from (28) and (27) that
\[
G_k=\begin{cases} I_{mn}, & k=0,\\[2pt] -\Psi_k'\otimes\Phi_k+\sum_{i=1}^{k\wedge q}(\Xi_i'\otimes\Theta_i)G_{k-i}, & 1\le k\le p,\\[2pt] \sum_{i=1}^{k\wedge q}(\Xi_i'\otimes\Theta_i)G_{k-i}, & k\ge p+1,\end{cases} \tag{29}
\]
where $k\wedge q=\min\{k,q\}$.

In summary, the centralized MARMA($p,q$) model (9) (i.e., with $C=O_{m\times n}$) is equivalent to the representation
\[
\mathrm{vec}(\epsilon(t))=\sum_{k=0}^{+\infty}G_k\,\mathrm{vec}(X(t-k)),\quad t\in\mathbb{N}, \tag{30}
\]
where $G_k$, $k=0,1,2,\dots$, are given by (29).
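The recursion (29) translates directly into code. The sketch below (helper name ours) returns G_0,…,G_K from the coefficient matrices, assuming the vectorized coefficients Ψ_k'⊗Φ_k and Ξ_j'⊗Θ_j of (27):

```python
import numpy as np

def g_coefficients(Phis, Psis, Thetas, Xis, K):
    """Return G_0, ..., G_K from the recursion (29)."""
    A = [np.kron(Psi.T, Phi) for Phi, Psi in zip(Phis, Psis)]      # AR part
    Bm = [np.kron(Xi.T, Theta) for Theta, Xi in zip(Thetas, Xis)]  # MA part
    mn = A[0].shape[0] if A else Bm[0].shape[0]
    p, q = len(A), len(Bm)
    G = [np.eye(mn)]
    for k in range(1, K + 1):
        Gk = -A[k - 1].copy() if k <= p else np.zeros((mn, mn))
        for i in range(1, min(k, q) + 1):
            Gk = Gk + Bm[i - 1] @ G[k - i]
        G.append(Gk)
    # Residual vectors can then be approximated by vec(eps_t) = sum_k G_k vec(x_{t-k}), cf. (30).
    return G
```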

Theorem 3.3

According to the conditional least squares method, the parameters of the MARMA($p,q$) model (9) satisfy the following matrix derivative equations:
\[
\begin{cases}
\displaystyle\sum_{t=p+1}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Phi_i}\Big(I_m\otimes\big(\tilde{x}_t+\sum_{\ell=1}^{t-1}G_\ell\tilde{x}_{t-\ell}\big)\Big)=O_m, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=p+1}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Psi_i}\Big(I_n\otimes\big(\tilde{x}_t+\sum_{\ell=1}^{t-1}G_\ell\tilde{x}_{t-\ell}\big)\Big)=O_n, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=p+1}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Theta_j}\Big(I_m\otimes\big(\tilde{x}_t+\sum_{\ell=1}^{t-1}G_\ell\tilde{x}_{t-\ell}\big)\Big)=O_m, & j=1,2,\dots,q,\\[4pt]
\displaystyle\sum_{t=p+1}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Xi_j}\Big(I_n\otimes\big(\tilde{x}_t+\sum_{\ell=1}^{t-1}G_\ell\tilde{x}_{t-\ell}\big)\Big)=O_n, & j=1,2,\dots,q,
\end{cases}
\]
where $\tilde{x}_t=\mathrm{vec}(x_t)$ and $G_k$ is given by (29).

The proof of Theorem 3.3 is presented in Appendix 2.

Corollary 3.3

According to the conditional least squares method, the parameters of the MAR($p$) model (10) satisfy the following matrix derivative equations:
\[
\begin{cases}
\displaystyle\sum_{t=p+1}^{N}\frac{\partial \big((\Psi_i'\otimes\Phi_i)\tilde{x}_{t-i}\big)'}{\partial \Phi_i}\Big(I_m\otimes\big(\tilde{x}_t-\sum_{\ell=1}^{p}(\Psi_\ell'\otimes\Phi_\ell)\tilde{x}_{t-\ell}\big)\Big)=O_m,\\[4pt]
\displaystyle\sum_{t=p+1}^{N}\frac{\partial \big((\Psi_i'\otimes\Phi_i)\tilde{x}_{t-i}\big)'}{\partial \Psi_i}\Big(I_n\otimes\big(\tilde{x}_t-\sum_{\ell=1}^{p}(\Psi_\ell'\otimes\Phi_\ell)\tilde{x}_{t-\ell}\big)\Big)=O_n,
\end{cases}\qquad i=1,2,\dots,p,
\]
where $\tilde{x}_t=\mathrm{vec}(x_t)$.

Theorem 3.4

Assume the innovations are Gaussian with mean $O_{m\times n}$ and covariance matrix $\Sigma_{mn}$. According to the conditional maximum likelihood estimation method, the parameters of the centralized MARMA($p,q$) model (9) satisfy the following matrix derivative equations:
\[
\begin{cases}
\displaystyle\sum_{t=2}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Phi_i}\big(I_m\otimes[\Sigma_{mn}^{-1}H(t,G)]\big)=O_m, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=2}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Psi_i}\big(I_n\otimes[\Sigma_{mn}^{-1}H(t,G)]\big)=O_n, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=2}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Theta_j}\big(I_m\otimes[\Sigma_{mn}^{-1}H(t,G)]\big)=O_m, & j=1,2,\dots,q,\\[4pt]
\displaystyle\sum_{t=2}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Xi_j}\big(I_n\otimes[\Sigma_{mn}^{-1}H(t,G)]\big)=O_n, & j=1,2,\dots,q,\\[4pt]
\displaystyle\frac{1}{N}\sum_{t=1}^{N}\mathrm{Res}\big([\Sigma_{mn}^{-1}H(t,G)]\otimes[\Sigma_{mn}^{-1}H(t,G)],\,mn,\,mn\big)=\Sigma_{mn}^{-1},
\end{cases}
\]
where $H(t,G)=\tilde{x}_t+\sum_{k=1}^{t-1}G_k\tilde{x}_{t-k}$ and $G_k$ is given by (29).

The proof of Theorem 3.4 is presented in Appendix 3.

Corollary 3.4

Assume the innovations are Gaussian with mean $O_{m\times n}$ and covariance matrix $\Sigma_{mn}$. According to the conditional maximum likelihood estimation method, the parameters of the centralized MAR($p$) model (10) satisfy the following matrix derivative equations:
\[
\begin{cases}
\displaystyle\sum_{t=i+1}^{N}\frac{\partial \big((\Psi_i'\otimes\Phi_i)\tilde{x}_{t-i}\big)'}{\partial \Phi_i}\big(I_m\otimes[\Sigma_{mn}^{-1}H(t,\Phi,\Psi)]\big)=O_m, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=i+1}^{N}\frac{\partial \big((\Psi_i'\otimes\Phi_i)\tilde{x}_{t-i}\big)'}{\partial \Psi_i}\big(I_n\otimes[\Sigma_{mn}^{-1}H(t,\Phi,\Psi)]\big)=O_n, & i=1,2,\dots,p,\\[4pt]
\displaystyle\frac{1}{N}\sum_{t=1}^{N}\mathrm{Res}\big([\Sigma_{mn}^{-1}H(t,\Phi,\Psi)]\otimes[\Sigma_{mn}^{-1}H(t,\Phi,\Psi)],\,mn,\,mn\big)=\Sigma_{mn}^{-1},
\end{cases}
\]
where $H(t,\Phi,\Psi)=\tilde{x}_t-\sum_{s=1}^{(t-1)\wedge p}(\Psi_s'\otimes\Phi_s)\tilde{x}_{t-s}$ and $(t-1)\wedge p=\min\{t-1,p\}$.

Remark 3.3

The matrix derivative equations in Theorems 3.3 and 3.4 are very complex. In particular, the coefficients $G_k$ in (29), $k=1,2,\dots$, are defined by a recursion whose implied parameters are themselves to be estimated. Thus, it is difficult to obtain closed-form solutions, but approximate solutions can be obtained by numerical computation.

3.5. Hypothesis testing for the MARMA model

Let $x_1,x_2,\dots,x_N$ be a series of samples of the centralized matrix time series $X=\{X(t),t\in\mathbb{N}\}$ defined by (3) with $C=O_{m\times n}$ and $x_t=(x_{ij}(t))_{m\times n}$ for all $t=1,2,\dots,N$. Additionally, assume $x_1,x_2,\dots,x_N$ are Gaussian. In this section, we test whether $x_1,x_2,\dots,x_N$ follow the MARMA($p,q$) model (9).

The null hypothesis and the alternative hypothesis are as follows:

  • H0: $X=\{X(t),t\in\mathbb{N}\}$ follows the MARMA($p,q$) model (9);

  • H1: $X=\{X(t),t\in\mathbb{N}\}$ does not follow the MARMA($p,q$) model (9).

When H0 holds, denote
\[
\tilde{\epsilon}(t)=\sum_{k=0}^{t-1}G_k\tilde{x}_{t-k},\quad t=1,2,\dots,N, \tag{31}
\]
where $\tilde{(\cdot)}=\mathrm{vec}(\cdot)$. It follows from Corollary 5.3 of Karl and Simar (2015) that
\[
T^2=N\,\bar{\tilde{\epsilon}}'\,(S_{\tilde{\epsilon}}^2)^{-1}\,\bar{\tilde{\epsilon}}\sim T^2(mn,N-1),
\]
where
\[
\bar{\tilde{\epsilon}}=\frac{1}{N}\sum_{t=1}^{N}\tilde{\epsilon}_t\quad\text{and}\quad S_{\tilde{\epsilon}}^2=\frac{1}{N-1}\sum_{t=1}^{N}\tilde{\epsilon}_t\tilde{\epsilon}_t'. \tag{32}
\]
It follows from Theorem 5.9 of Karl and Simar (2015) that
\[
\frac{N-mn}{(N-1)mn}\,T^2(mn,N-1)\sim F(mn,N-mn),
\]
that is,
\[
F=\frac{N(N-mn)}{(N-1)mn}\,\bar{\tilde{\epsilon}}'\,(S_{\tilde{\epsilon}}^2)^{-1}\,\bar{\tilde{\epsilon}}\sim F(mn,N-mn).
\]
Summarizing the above deduction, we obtain Theorem 3.5 for hypothesis testing on the MARMA($p,q$) model (9).

Theorem 3.5

For any given significance level $\alpha\in(0,1)$, if $F<F_{\alpha/2}(mn,N-mn)$ or $F>F_{1-\alpha/2}(mn,N-mn)$, then reject the hypothesis that $\{X(t),t\in\mathbb{N}\}$ follows the MARMA($p,q$) model (9); otherwise, accept that $\{X(t),t\in\mathbb{N}\}$ follows the MARMA($p,q$) model (9), where
\[
F=\frac{N(N-mn)}{(N-1)mn}\,\bar{\tilde{\epsilon}}'\,(S_{\tilde{\epsilon}}^2)^{-1}\,\bar{\tilde{\epsilon}},
\]
and $\bar{\tilde{\epsilon}}$, $S_{\tilde{\epsilon}}^2$ and $\tilde{\epsilon}_t$ are given by (32) and (31).
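A sketch of the test in Theorem 3.5 (ours; it assumes the vectorized residuals of (31) have already been collected into an N×mn array eps_tilde):

```python
import numpy as np
from scipy import stats

def marma_f_test(eps_tilde, alpha=0.05):
    """Two-sided F test of Theorem 3.5 on the vectorized residuals."""
    N, mn = eps_tilde.shape
    eps_bar = eps_tilde.mean(axis=0)
    S2 = (eps_tilde.T @ eps_tilde) / (N - 1)                       # as in (32)
    F = N * (N - mn) / ((N - 1) * mn) * eps_bar @ np.linalg.solve(S2, eps_bar)
    lo = stats.f.ppf(alpha / 2, mn, N - mn)
    hi = stats.f.ppf(1 - alpha / 2, mn, N - mn)
    return F, (lo, hi), not (F < lo or F > hi)                     # True means "do not reject"
```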

3.6. Forecasting for the MARMA model

Let $\{X_t,t\in\mathbb{N}\}$ be an $m\times n$-dimensional matrix time series defined by (3) that follows the MARMA($p,q$) model (9), or equivalently, let $\{\mathrm{vec}(X_t),t\in\mathbb{N}\}$ follow the VARMA($p,q$) model (18), that is,
\[
\mathrm{vec}(X_t)=\mathrm{vec}(C)+\sum_{k=1}^{p}(\Psi_k'\otimes\Phi_k)\,\mathrm{vec}(X_{t-k})+\mathrm{vec}(\epsilon_t)-\sum_{j=1}^{q}(\Xi_j'\otimes\Theta_j)\,\mathrm{vec}(\epsilon_{t-j}), \tag{33}
\]
where $\{\epsilon_t,t\in\mathbb{N}\}$ is an $m\times n$-dimensional matrix white noise.

Denote by $\hat{X}_t(\ell)$ the forecast of $X_{t+\ell}$ given that $X_1,X_2,\dots,X_t$ are known, which is referred to as the $\ell$-step forecast. It follows from (33) and the projection theorem in Hilbert space that
\[
\mathrm{vec}(\hat{X}_t(\ell))=\mathrm{vec}(C)+\sum_{k=1}^{p}(\Psi_k'\otimes\Phi_k)\,\mathrm{vec}(\tilde{X}_t(\ell-k))-\sum_{j=1}^{q}(\Xi_j'\otimes\Theta_j)\,\mathrm{vec}(\tilde{\epsilon}_t(\ell-j)), \tag{34}
\]
where
\[
\mathrm{vec}(\tilde{X}_t(k))=\begin{cases}\mathrm{vec}(X_{t+k}), & k\le 0,\\ \mathrm{vec}(\hat{X}_t(k)), & k\ge 1,\end{cases}\qquad
\mathrm{vec}(\tilde{\epsilon}_t(k))=\begin{cases}\mathrm{vec}(\epsilon_{t+k}), & k\le 0,\\ \mathrm{vec}(O_{m\times n}), & k\ge 1.\end{cases}
\]
It yields from the equivalence of the MARMA($p,q$) model (9) and the VARMA($p,q$) model (18) that
\[
\hat{X}_t(\ell)=C+\sum_{k=1}^{p}\Phi_k\tilde{X}_t(\ell-k)\Psi_k-\sum_{j=1}^{q}\Theta_j\tilde{\epsilon}_t(\ell-j)\Xi_j, \tag{35}
\]
where
\[
\tilde{X}_t(k)=\begin{cases}X_{t+k}, & k\le 0,\\ \hat{X}_t(k), & k\ge 1,\end{cases}\qquad
\tilde{\epsilon}_t(k)=\begin{cases}\epsilon_{t+k}, & k\le 0,\\ O_{m\times n}, & k\ge 1.\end{cases}
\]
In the following, we study the interval estimation for the MARMA($p,q$) model (9) and assume the innovations are Gaussian. Equivalently, $\{\mathrm{vec}(X_t),t\in\mathbb{N}\}$ follows the VARMA($p,q$) model (19), that is, $P(B)\,\mathrm{vec}(X_t)=\mathrm{vec}(C)+Q(B)\,\mathrm{vec}(\epsilon_t)$, where $P(B)$ and $Q(B)$ are defined by (20) and (21), and $\{\mathrm{vec}(\epsilon_t),t\in\mathbb{N}\}$ is a vector white noise.

Denote
\[
\Pi(B)=\sum_{k=0}^{+\infty}\Pi_kB^k=P^{-1}(B)Q(B),
\]
and then
\[
\mathrm{vec}(X_t)=P^{-1}(B)\,\mathrm{vec}(C)+\sum_{k=0}^{+\infty}\Pi_k\,\mathrm{vec}(\epsilon_{t-k}), \tag{36}
\]
where
\[
\Pi_k=\begin{cases} I_{mn}, & k=0,\\[2pt] -\Xi_k'\otimes\Theta_k+\sum_{i=1}^{k\wedge p}(\Psi_i'\otimes\Phi_i)\Pi_{k-i}, & 1\le k\le q,\\[2pt] \sum_{i=1}^{k\wedge p}(\Psi_i'\otimes\Phi_i)\Pi_{k-i}, & k\ge q+1,\end{cases} \tag{37}
\]
with $k\wedge p=\min\{k,p\}$.

For any $\ell>0$, it follows from (36) and the construction of $\mathrm{vec}(\hat{X}_t(\ell))$ that
\[
\mathrm{vec}(X_{t+\ell})-\mathrm{vec}(\hat{X}_t(\ell))=\sum_{k=0}^{\ell-1}\Pi_k\,\mathrm{vec}(\epsilon_{t+\ell-k}), \tag{38}
\]
and then
\[
\mathrm{vec}(X_{t+\ell})-\mathrm{vec}(\hat{X}_t(\ell))\sim N\Big(O_{mn\times 1},\ \sum_{k=0}^{\ell-1}\Pi_k\Sigma_{mn}\Pi_k'\Big). \tag{39}
\]
For any given $\alpha\in(0,1)$, it yields from (39) that the confidence interval of $\mathrm{vec}(X_{t+\ell})$ with confidence level $1-\alpha$ is
\[
\Big(\mathrm{vec}(\hat{X}_t(\ell))-U_{1-\frac{\alpha}{2}}\sqrt{\mathrm{diag}\Big(\sum_{k=0}^{\ell-1}\Pi_k\Sigma_{mn}\Pi_k'\Big)},\ \ \mathrm{vec}(\hat{X}_t(\ell))+U_{1-\frac{\alpha}{2}}\sqrt{\mathrm{diag}\Big(\sum_{k=0}^{\ell-1}\Pi_k\Sigma_{mn}\Pi_k'\Big)}\Big),
\]
where $\mathrm{diag}(\cdot)$ denotes the vector composed of the main diagonal elements and $\sqrt{\cdot}$ takes the square root of every element. It yields from the equivalence of the MARMA($p,q$) model (9) and the VARMA($p,q$) model (19) that the confidence interval of $X_{t+\ell}$ with confidence level $1-\alpha$ is
\[
\Big(\hat{X}_t(\ell)-U_{1-\frac{\alpha}{2}}\,\mathrm{Res}\Big(\sqrt{\mathrm{diag}\Big(\sum_{k=0}^{\ell-1}\Pi_k\Sigma_{mn}\Pi_k'\Big)},m,n\Big),\ \ \hat{X}_t(\ell)+U_{1-\frac{\alpha}{2}}\,\mathrm{Res}\Big(\sqrt{\mathrm{diag}\Big(\sum_{k=0}^{\ell-1}\Pi_k\Sigma_{mn}\Pi_k'\Big)},m,n\Big)\Big).
\]
In summary, we obtain the following results.

Theorem 3.6

Assume $\{X_t,t\in\mathbb{N}\}$ follows the MARMA($p,q$) model (9).

(1) For any $\ell>0$, the $\ell$-step point forecast is
\[
\hat{X}_t(\ell)=C+\sum_{k=1}^{p}\Phi_k\tilde{X}_t(\ell-k)\Psi_k-\sum_{j=1}^{q}\Theta_j\tilde{\epsilon}_t(\ell-j)\Xi_j,
\]
where
\[
\tilde{X}_t(k)=\begin{cases}X_{t+k}, & k\le 0,\\ \hat{X}_t(k), & k\ge 1,\end{cases}\qquad
\tilde{\epsilon}_t(k)=\begin{cases}\epsilon_{t+k}, & k\le 0,\\ O_{m\times n}, & k\ge 1.\end{cases}
\]
(2) For any $\ell>0$ and $\alpha\in(0,1)$, the $\ell$-step interval forecast with confidence level $1-\alpha$ is
\[
\Big(\hat{X}_t(\ell)-U_{1-\frac{\alpha}{2}}\,\mathrm{Res}\Big(\sqrt{\mathrm{diag}\Big(\sum_{k=0}^{\ell-1}\Pi_k\Sigma_{mn}\Pi_k'\Big)},m,n\Big),\ \ \hat{X}_t(\ell)+U_{1-\frac{\alpha}{2}}\,\mathrm{Res}\Big(\sqrt{\mathrm{diag}\Big(\sum_{k=0}^{\ell-1}\Pi_k\Sigma_{mn}\Pi_k'\Big)},m,n\Big)\Big),
\]
where $U_{1-\frac{\alpha}{2}}$ is the lower $1-\frac{\alpha}{2}$ quantile of the standard normal distribution, $\mathrm{Res}(\cdot)$ is the reshape function of Definition 2.6, $\mathrm{diag}(\cdot)$ is the vector composed of the main diagonal elements, $\sqrt{\cdot}$ takes the square root of every element, and
\[
\Pi_k=\begin{cases} I_{mn}, & k=0,\\[2pt] -\Xi_k'\otimes\Theta_k+\sum_{i=1}^{k\wedge p}(\Psi_i'\otimes\Phi_i)\Pi_{k-i}, & 1\le k\le q,\\[2pt] \sum_{i=1}^{k\wedge p}(\Psi_i'\otimes\Phi_i)\Pi_{k-i}, & k\ge q+1.\end{cases}
\]
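The interval forecast of Theorem 3.6(2) can be sketched as follows (our own helpers; pi_coefficients implements the recursion for Π_k, forecast_interval assembles the element-wise interval, and Sigma_mn together with the ℓ-step point forecast X_hat are assumed to be available):

```python
import numpy as np
from scipy import stats

def pi_coefficients(Phis, Psis, Thetas, Xis, K):
    """Pi_0, ..., Pi_K from the recursion in Theorem 3.6(2)."""
    A = [np.kron(Psi.T, Phi) for Phi, Psi in zip(Phis, Psis)]
    Bm = [np.kron(Xi.T, Theta) for Theta, Xi in zip(Thetas, Xis)]
    mn = (A or Bm)[0].shape[0]
    p, q = len(A), len(Bm)
    Pi = [np.eye(mn)]
    for k in range(1, K + 1):
        Pik = -Bm[k - 1].copy() if k <= q else np.zeros((mn, mn))
        for i in range(1, min(k, p) + 1):
            Pik = Pik + A[i - 1] @ Pi[k - i]
        Pi.append(Pik)
    return Pi

def forecast_interval(X_hat, Pis, Sigma_mn, ell, alpha=0.05):
    """Element-wise (1 - alpha) interval around the ell-step point forecast X_hat (m x n)."""
    m, n = X_hat.shape
    V = sum(Pis[k] @ Sigma_mn @ Pis[k].T for k in range(ell))   # forecast error covariance
    half = stats.norm.ppf(1 - alpha / 2) * np.sqrt(np.diag(V)).reshape((m, n), order='F')
    return X_hat - half, X_hat + half
```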

3.7. Supplementary notes for the MARMA model

3.7.1. Model identification for the MARMA model

According to Theorem 3.1, the MARMA($p,q$) model (9) is equivalent to the VARMA($p,q$) model (18). Thus, we can use the model identification methods for the VARMA model to identify the order of the MARMA model, such as
\[
\mathrm{AIC}(p,q)=\ln(|\Sigma_{mn}(p,q)|)+\frac{2}{N}(p+q)(m^2+n^2),\qquad
\mathrm{BIC}(p,q)=\ln(|\Sigma_{mn}(p,q)|)+\frac{\ln(N)}{N}(p+q)(m^2+n^2),
\]
or alternatively,
\[
\mathrm{AIC}(p,q)=-2\ln(L)+2(p+q)(m^2+n^2),\qquad
\mathrm{BIC}(p,q)=-2\ln(L)+\ln(N)(p+q)(m^2+n^2),
\]
where $N$ is the length of the observation sequence and $\ln(L)$ is the logarithm of the likelihood function.
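A small sketch of the first pair of criteria (our own helper; Sigma_hat denotes the residual covariance matrix of a candidate MARMA(p,q) fit and N the sample length):

```python
import numpy as np

def aic_bic(Sigma_hat, N, m, n, p, q):
    k = (p + q) * (m * m + n * n)               # number of AR/MA coefficient parameters
    logdet = np.linalg.slogdet(Sigma_hat)[1]    # ln |Sigma_hat|
    return logdet + 2.0 * k / N, logdet + np.log(N) * k / N
```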

3.7.2. MARIMA model

For any matrix time series $\{X(t)=(X_{ij}(t))_{m\times n},t\in\mathbb{N}\}$ defined by (3), the difference operator $\Delta$ for matrix time series is defined by
\[
\Delta X(t)=X(t)-X(t-1),\qquad \Delta^kX(t)=\Delta^{k-1}X(t)-\Delta^{k-1}X(t-1),\quad k=2,3,\dots, \tag{40}
\]
and $\Delta$ defined by (40) has the same effect as the difference operator for vector time series. That is, if $\{X(t)=(X_{ij}(t))_{m\times n},t\in\mathbb{N}\}$ is nonstationary, then we can try to eliminate the nonstationarity by applying $\Delta$ defined by (40). If there exists a positive integer $d$ such that $\{\Delta^dX(t),t\in\mathbb{N}\}$ is stationary while $\{\Delta^{d-1}X(t),t\in\mathbb{N}\}$ is nonstationary, and $\{\Delta^dX(t),t\in\mathbb{N}\}$ follows a MARMA($p,q$) model (9), then $\{X(t)=(X_{ij}(t))_{m\times n},t\in\mathbb{N}\}$ is said to follow a $(p,d,q)$-order autoregressive integrated moving average model for matrix time series, denoted by MARIMA($p,d,q$).

4. An application of the MARMA model

In this section, we model the time series of daily closing prices and daily volumes of Haitong Securities Company Limited (abbreviated as Haitong Securities; stock code: 600837) and Ping An Insurance (Group) Company of China, Ltd. (abbreviated as Ping An; stock code: 601318). The data are downloaded from the China Stock Market & Accounting Research Database (CSMAR), and the time window is from January 2, 2018 to December 31, 2021, which includes 973 records for each stock.

For the sake of clarity, we denote the time series by
\[
\Big\{\begin{bmatrix} P_1(t) & P_2(t)\\ V_1(t) & V_2(t)\end{bmatrix},\ t=1,2,3,\dots\Big\},
\]
where $P_1(t)$ and $V_1(t)$ are the daily closing price and daily volume of Haitong Securities, and $P_2(t)$ and $V_2(t)$ are the daily closing price and daily volume of Ping An.

4.1. Data preprocessing

We first conduct the Kwiatkowski, Phillips, Schmidt and Shin (KPSS) test, i.e., ‘kpsstest’ function in the software MATLAB R2020b, to test the stationarity of the daily closing prices and daily volumes of Haitong Securities and Ping An, and the results show that the daily closing prices and daily volumes of Haitong Securities and Ping An are nonstationary.

In the following we consider the logarithmic rates (log rates) of the daily closing prices and daily volumes of Haitong Securities and Ping An. Denote
\[
\Big\{R(t)=\begin{bmatrix} R_{11}(t) & R_{12}(t)\\ R_{21}(t) & R_{22}(t)\end{bmatrix},\ t=2,3,4,\dots\Big\}, \tag{41}
\]
where
\[
R_{1k}(t)=\ln\Big(\frac{P_k(t)}{P_k(t-1)}\Big)\quad\text{and}\quad R_{2k}(t)=\ln\Big(\frac{V_k(t)}{V_k(t-1)}\Big),\quad k=1,2.
\]
That is, $R_{11}(t)$ is the logarithmic rate of the daily closing price of Haitong Securities, $R_{21}(t)$ the logarithmic rate of the daily volume of Haitong Securities, $R_{12}(t)$ the logarithmic rate of the daily closing price of Ping An, and $R_{22}(t)$ the logarithmic rate of the daily volume of Ping An.
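Constructing the log-rate series (41) from raw prices and volumes is straightforward; the sketch below uses placeholder arrays (variable names and data are ours, not the CSMAR data):

```python
import numpy as np

# P1, V1: Haitong Securities closing prices and volumes; P2, V2: Ping An.
# Each is a 1-D numpy array of length T; the example values here are placeholders.
P1, V1 = np.array([10.0, 10.2, 10.1]), np.array([1.0e6, 1.2e6, 0.9e6])
P2, V2 = np.array([60.0, 59.5, 60.4]), np.array([2.0e6, 1.8e6, 2.1e6])

lograte = lambda x: np.diff(np.log(x))        # ln(x_t / x_{t-1})

# R[t] is the 2 x 2 matrix [[R11, R12], [R21, R22]] at each time point.
R = np.stack([np.stack([lograte(P1), lograte(P2)], axis=1),
              np.stack([lograte(V1), lograte(V2)], axis=1)], axis=1)
print(R.shape)   # (T - 1, 2, 2)
```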

We conduct the Kwiatkowski, Phillips, Schmidt and Shin (KPSS) test, i.e., ‘kpsstest’ function in the software MATLAB R2020b, to test the stationarity of the logarithmic rates of daily closing prices and daily volumes of Haitong Securities and Ping An, and the results show that the logarithmic rates of daily closing prices and daily volumes of Haitong Securities and Ping An are stationary.

Additionally, we conduct a Ljung-Box Q test, i.e., ‘lbqtest’ function in the software MATLAB R2020b, to test the pure randomness of the logarithmic rates of daily closing prices and daily volumes of Haitong Securities and Ping An, and the results show that the logarithmic rates of daily closing prices or daily volumes of Haitong Securities and Ping An are not purely random.

In conclusion, for the stocks of Haitong Securities and Ping An, their daily closing prices and daily volumes are nonstationary, but their logarithmic rates of daily closing prices and daily volumes are stationary, and their logarithmic rates of daily closing prices or daily volumes are not purely random.

4.2. Modelling of MARMA (p,q)

We use the Bayesian information criterion (BIC) to select the model, and the results show that MARMA(4,0) is the best. Using the conditional least squares method and a MATLAB R2020b program, we establish the following MARMA(4,0) model for $\{R(t),t=1,2,3,\dots\}$ defined by (41):
\[
R(t)=\hat{C}+\hat{\Phi}_1R(t-1)\hat{\Psi}_1+\hat{\Phi}_2R(t-2)\hat{\Psi}_2+\hat{\Phi}_3R(t-3)\hat{\Psi}_3+\hat{\Phi}_4R(t-4)\hat{\Psi}_4+\epsilon(t), \tag{42}
\]
where
\[
\hat{\Phi}_1=\begin{bmatrix}2.11\times10^{-2} & 1.32\times10^{-3}\\ 3.4371 & 0.3527\end{bmatrix},\quad
\hat{\Psi}_1=\begin{bmatrix}1 & 0.3708\\ 0.4014 & 0.0354\end{bmatrix},\quad
\hat{\Phi}_2=\begin{bmatrix}0.0339 & 2.21\times10^{-3}\\ 1.1721 & 0.1271\end{bmatrix},\quad
\hat{\Psi}_2=\begin{bmatrix}0.4132 & 0.4468\\ 1 & 0.2409\end{bmatrix},
\]
\[
\hat{\Phi}_3=\begin{bmatrix}0.0101 & 9.82\times10^{-4}\\ 0.3575 & 0.1739\end{bmatrix},\quad
\hat{\Psi}_3=\begin{bmatrix}1 & 0.4715\\ 0.1625 & 0.2516\end{bmatrix},\quad
\hat{\Phi}_4=\begin{bmatrix}0.0440 & 8.75\times10^{-4}\\ 0.4732 & 0.1012\end{bmatrix},\quad
\hat{\Psi}_4=\begin{bmatrix}0.9537 & 0.6853\\ 0.4385 & 1\end{bmatrix},
\]
\[
\hat{\mu}=\begin{bmatrix}6.66\times10^{-5} & 3.75\times10^{-4}\\ 4.86\times10^{-4} & 1.64\times10^{-3}\end{bmatrix},\qquad
\hat{C}=\begin{bmatrix}3.27\times10^{-5} & 3.48\times10^{-4}\\ 1.27\times10^{-3} & 3.40\times10^{-3}\end{bmatrix},
\]
and the covariance matrix of the residuals $\{\epsilon(t),t\in\mathbb{N}\}$ is
\[
\Sigma_\epsilon=\begin{bmatrix}4.40\times10^{-4} & 2.74\times10^{-3} & 2.17\times10^{-4} & 1.30\times10^{-3}\\ 2.74\times10^{-3} & 0.1386 & 1.31\times10^{-3} & 5.60\times10^{-2}\\ 2.17\times10^{-4} & 1.31\times10^{-3} & 3.15\times10^{-4} & 1.31\times10^{-3}\\ 1.30\times10^{-3} & 5.60\times10^{-2} & 1.31\times10^{-3} & 0.1077\end{bmatrix}. \tag{43}
\]

4.3. Evaluation on MARMA (p,q)

To save space, we do not show the model test, model optimization or forecasting for the MARMA(4,0) model (42), but instead present a comparison of the MARMA model and the ARMA model in this subsection. We first establish an ARMA($p,q$) model for each of $R_{11}(t)$, $R_{21}(t)$, $R_{12}(t)$ and $R_{22}(t)$, and obtain the following models:
\[
\begin{aligned}
R_{11}(t)&=1.53\times10^{-8}+0.0204R_{11}(t-1)+0.0058R_{11}(t-2)+0.0625R_{11}(t-3)-0.0393R_{11}(t-4)+e_{11}(t),\\
R_{21}(t)&=2.98\times10^{-4}-0.3994R_{21}(t-1)-0.2598R_{21}(t-2)-0.1918R_{21}(t-3)-0.1096R_{21}(t-4)+e_{21}(t),\\
R_{12}(t)&=5.17\times10^{-7}-0.0002R_{12}(t-1)-0.0041R_{12}(t-2)+0.0476R_{12}(t-3)-0.0699R_{12}(t-4)+e_{12}(t),\\
R_{22}(t)&=2.52\times10^{-4}-0.4842R_{22}(t-1)-0.3860R_{22}(t-2)-0.2660R_{22}(t-3)-0.1562R_{22}(t-4)+e_{22}(t),
\end{aligned} \tag{44}
\]
where the covariance matrix of the residuals $\{e(t)=(e_{11}(t),e_{21}(t),e_{12}(t),e_{22}(t))',t\in\mathbb{N}\}$ is
\[
\Sigma_e=\begin{bmatrix}4.49\times10^{-4} & 2.69\times10^{-3} & 2.22\times10^{-4} & 1.24\times10^{-3}\\ 2.69\times10^{-3} & 0.1692 & 1.40\times10^{-3} & 7.22\times10^{-2}\\ 2.22\times10^{-4} & 1.40\times10^{-3} & 3.22\times10^{-4} & 1.32\times10^{-3}\\ 1.24\times10^{-3} & 7.22\times10^{-2} & 1.32\times10^{-3} & 0.1369\end{bmatrix}. \tag{45}
\]
It follows from (43) and (45) that the residual variances and covariances of the MARMA(4,0) model (42) are almost consistently smaller than those of the ARMA(4,0) models (44).

In practice, we are more concerned about the residual variance, i.e., the variance of each element of the residual. Using (43) and (45), we compute the relative change of the residual variance of the MARMA(4,0) model (42) with respect to the residual variance of the ARMA(4,0) models (44) as follows:
\[
\begin{bmatrix}
\dfrac{\mathrm{var}(\epsilon_{11}(t))-\mathrm{var}(e_{11}(t))}{\mathrm{var}(e_{11}(t))} & \dfrac{\mathrm{var}(\epsilon_{12}(t))-\mathrm{var}(e_{12}(t))}{\mathrm{var}(e_{12}(t))}\\[8pt]
\dfrac{\mathrm{var}(\epsilon_{21}(t))-\mathrm{var}(e_{21}(t))}{\mathrm{var}(e_{21}(t))} & \dfrac{\mathrm{var}(\epsilon_{22}(t))-\mathrm{var}(e_{22}(t))}{\mathrm{var}(e_{22}(t))}
\end{bmatrix}
=\begin{bmatrix}-1.93\% & -2.24\%\\ -18.10\% & -21.31\%\end{bmatrix}.
\]
That is, the MARMA(4,0) model (42) reduces every residual variance relative to the ARMA(4,0) models (44). In particular, the relative reduction of the volumes' residual variances exceeds 10%, which suggests that the MARMA model can substantially improve prediction accuracy.

5. Conclusion

We proposed an autoregressive moving average model for matrix time series (MARMA), which is an extension of the autoregressive model for matrix time series (MAR). Like the MAR model, the MARMA model retains the original matrix structure and provides a much more parsimonious model than the approach of vectorizing the matrix into a long vector and fitting a vector autoregressive model. Compared with the MAR model, the MARMA model is capable of modelling an unknown process with fewer parameters.

For the MARMA model, the necessary and sufficient conditions for stationarity and invertibility are established. Parameter estimation is investigated via the conditional least squares method and the conditional maximum likelihood estimation method. Point forecasting and interval forecasting are presented by using the projection theorem in Hilbert space and the decomposition technique of time series. Additionally, model identification, model testing and possible extensions are discussed.

There are many directions in which to extend the scope of the MARMA model. A random environment, such as a Markov environment, might be imposed on the MARMA model to depict the impact of environmental change. Additionally, sparsity or group sparsity might be imposed on the coefficient matrices to achieve further dimension reduction. Furthermore, the idea of MARMA can be applied to yield modelling, volatility modelling, weather forecast modelling and animal migration modelling.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This paper is partially supported by the basic scientific research business expenses of Universities in Xinjiang, China [Grant Number XQZX20230057] and the National Natural Science Foundation of China [Grant Number 11671142].

References

  • Chen, R., Xiao, H., & Yang, D. (2021). Autoregressive models for matrix-valued time series. Journal of Econometrics, 222(1), 539–560. https://doi.org/10.1016/j.jeconom.2020.07.015
  • Getmanov, V. G., Chinkin, V. E., Sidorov, R. V., Gvishiani, A. D., Dobrovolsky, M. N., Soloviev, A. A., Dmitrieva, A. N., Kovylyaeva, A. A., & Yashin, I. I. (2021). Methods for recognition of local anisotropy in muon fluxes in the URAGAN hodoscope matrix data time series. Physics of Atomic Nuclei, 84(6), 1080–1086. https://doi.org/10.1134/S106377882113010X
  • Graham, A. (2018). Kronecker products and matrix calculus with applications. Dover Publications Inc.
  • Karl, W., & Simar, L. (2015). Applied multivariate statistical analysis (4th ed.). Springer-Verlag.
  • Samadi, S. (2014). Matrix time series analysis [PhD Dissertation]. The University of Georgia.
  • Walden, A., & Serroukh, A. (2002). Wavelet analysis of matrix-valued time series. Proceedings: Mathematical, Physical and Engineering Sciences, 458(2017), 157–179.
  • Wang, D., Liu, X., & Chen, R. (2019). Factor models for matrix-valued high-dimensional time series. Journal of Econometrics, 208(1), 231–248. https://doi.org/10.1016/j.jeconom.2018.09.013
  • Wang, H., & West, M. (2009). Bayesian analysis of matrix normal graphical models. Biometrika, 96(4), 821–834. https://doi.org/10.1093/biomet/asp049
  • Wu, S. J., & Hua, N. (2022). Autoregressive model of time series with matrix cross-section data (in Chinese). Acta Mathematica Sinica, 65(6), 1093–1104.
  • Zhou, L. H., Du, G. W., Tao, D. P., Chen, H. M., Cheng, J., & Gong, L. B. (2018). Clustering multivariate time series data via multi-nonnegative matrix factorization in multi-relational networks. IEEE Access, 2018(6), 74747–74761. https://doi.org/10.1109/Access.6287639.

Appendices

Appendix 1. Proof of Theorem 3.2

In order to obtain the stationarity and invertibility conditions for the MARMA($p,q$) model (9), we first give the following lemma.

Lemma A.1

For any square matrices $A_1,A_2,\dots,A_k$, the operator
\[
G(B)=I-A_1B-\cdots-A_{k-1}B^{k-1}-A_kB^k
\]
is invertible if and only if every root $\lambda$ of (A1) satisfies $|\lambda|<1$, where $k$ is a natural number and $B$ is the delay operator:
\[
\big|\lambda^kI-\lambda^{k-1}A_1-\cdots-\lambda A_{k-1}-A_k\big|=0. \tag{A1}
\]

Proof.

The degree-$k$ polynomial with matrix coefficients
\[
G(z)=I-A_1z-\cdots-A_{k-1}z^{k-1}-A_kz^k
\]
can be factorized into $k$ linear factors with matrix coefficients over the complex field as follows:
\[
G(z)=(I-C_1z)(I-C_2z)\cdots(I-C_kz),
\]
where $C_1,C_2,\dots,C_k$ are determined by
\[
\sum_{1\le j_1<j_2<\cdots<j_u\le k}C_{j_1}C_{j_2}\cdots C_{j_u}=(-1)^{u-1}A_u,\quad u=1,2,\dots,k. \tag{A2}
\]
Thus,
\[
G(B)=(I-C_1B)(I-C_2B)\cdots(I-C_kB). \tag{A3}
\]
For any $i=1,2,\dots,k$, it is easy to prove that $I-C_iB$ is invertible if and only if $\rho(C_i)<1$, that is, all roots of $|\lambda I-C_i|=0$ are in the unit circle. It follows from (A3) that $G(B)$ is invertible if and only if all $I-C_iB$, $i=1,2,\dots,k$, are invertible. Thus, $G(B)$ is invertible if and only if all roots of $|\lambda I-C_i|=0$ are in the unit circle for all $i=1,2,\dots,k$. According to the properties of determinants, $G(B)$ is invertible if and only if all roots of
\[
\big|(\lambda I-C_1)(\lambda I-C_2)\cdots(\lambda I-C_k)\big|=0 \tag{A4}
\]
are in the unit circle. It yields from (A2) that
\[
(\lambda I-C_1)(\lambda I-C_2)\cdots(\lambda I-C_k)=\lambda^kI-\lambda^{k-1}A_1-\cdots-\lambda A_{k-1}-A_k.
\]
Thus, $G(B)$ is invertible if and only if all roots of
\[
\big|\lambda^kI-\lambda^{k-1}A_1-\cdots-\lambda A_{k-1}-A_k\big|=0
\]
are in the unit circle.

Proof of Theorem 3.2.

For the VARMA($p,q$) model (19), $P(B)\,\mathrm{vec}(X(t))=\mathrm{vec}(C)+Q(B)\,\mathrm{vec}(\epsilon(t))$, $t\in\mathbb{N}$. It follows from the concept of stationarity that the necessary and sufficient condition for stationarity is that the operator $P(B)$ is invertible. According to Lemma A.1, the operator $P(B)$ is invertible if and only if every root $\lambda$ of (22) satisfies $|\lambda|<1$. Thus, the VARMA($p,q$) model (19) is stationary if and only if every root $\lambda$ of (22) satisfies $|\lambda|<1$. Note that the VARMA($p,q$) model (19) is equivalent to the MARMA($p,q$) model (9), so the MARMA($p,q$) model (9) is stationary if and only if every root $\lambda$ of (22) satisfies $|\lambda|<1$.

The necessary and sufficient condition for invertibility can be obtained by an argument similar to that for stationarity, so we omit it.

Appendix 2. Proof of Theorem 3.3

Noting that $\{\mathrm{vec}(\epsilon(t)),t\in\mathbb{N}\}$ is an $mn\times 1$-dimensional white noise, the objective function of the conditional least squares method for representation (30) is
\[
J(\Phi_1,\dots,\Phi_p,\Psi_1,\dots,\Psi_p,\Theta_1,\dots,\Theta_q,\Xi_1,\dots,\Xi_q)
=\sum_{t=p+1}^{N}\Big(\mathrm{vec}(x_t)+\sum_{k=1}^{t-1}G_k\,\mathrm{vec}(x_{t-k})\Big)'\Big(\mathrm{vec}(x_t)+\sum_{k=1}^{t-1}G_k\,\mathrm{vec}(x_{t-k})\Big), \tag{A5}
\]
where we take $x_t=O_{m\times n}$ for all $t\le 0$.

Lemma A.2

$J(\Phi_1,\dots,\Phi_p,\Psi_1,\dots,\Psi_p,\Theta_1,\dots,\Theta_q,\Xi_1,\dots,\Xi_q)$ defined by (A5) attains its minimum value with respect to $\Phi_k$, $\Psi_k$, $\Theta_j$ and $\Xi_j$ for all $k=1,2,\dots,p$ and $j=1,2,\dots,q$.

Proof.

It yields from analysing (29) that $J(\Phi_1,\dots,\Xi_q)$ defined by (A5) is a multivariate polynomial in $\Phi_k$, $\Psi_k$, $\Theta_j$ and $\Xi_j$ for all $k=1,2,\dots,p$ and $j=1,2,\dots,q$. Moreover, $J$ defined by (A5) is obviously greater than or equal to zero, so it is bounded below. Thus, $J$ defined by (A5) attains its minimum value with respect to $\Phi_k$, $\Psi_k$, $\Theta_j$ and $\Xi_j$ for all $k=1,2,\dots,p$ and $j=1,2,\dots,q$.

Proof of Theorem 3.3.

It follows from Lemma A.2 that, according to the conditional least squares method, the parameters of the MARMA($p,q$) model (9) satisfy the following matrix derivative equations:
\[
\begin{cases}
\displaystyle\frac{\partial}{\partial\Phi_i}\sum_{t=p+1}^{N}\Big(\mathrm{vec}(x_t)+\sum_{k=1}^{t-1}G_k\,\mathrm{vec}(x_{t-k})\Big)'\Big(\mathrm{vec}(x_t)+\sum_{\ell=1}^{t-1}G_\ell\,\mathrm{vec}(x_{t-\ell})\Big)=O_m, & i=1,2,\dots,p,\\[6pt]
\displaystyle\frac{\partial}{\partial\Psi_i}\sum_{t=p+1}^{N}\Big(\mathrm{vec}(x_t)+\sum_{k=1}^{t-1}G_k\,\mathrm{vec}(x_{t-k})\Big)'\Big(\mathrm{vec}(x_t)+\sum_{\ell=1}^{t-1}G_\ell\,\mathrm{vec}(x_{t-\ell})\Big)=O_n, & i=1,2,\dots,p,\\[6pt]
\displaystyle\frac{\partial}{\partial\Theta_j}\sum_{t=p+1}^{N}\Big(\mathrm{vec}(x_t)+\sum_{k=1}^{t-1}G_k\,\mathrm{vec}(x_{t-k})\Big)'\Big(\mathrm{vec}(x_t)+\sum_{\ell=1}^{t-1}G_\ell\,\mathrm{vec}(x_{t-\ell})\Big)=O_m, & j=1,2,\dots,q,\\[6pt]
\displaystyle\frac{\partial}{\partial\Xi_j}\sum_{t=p+1}^{N}\Big(\mathrm{vec}(x_t)+\sum_{k=1}^{t-1}G_k\,\mathrm{vec}(x_{t-k})\Big)'\Big(\mathrm{vec}(x_t)+\sum_{\ell=1}^{t-1}G_\ell\,\mathrm{vec}(x_{t-\ell})\Big)=O_n, & j=1,2,\dots,q.
\end{cases}
\]
Using the derivative of a scalar with respect to a matrix, it yields from Corollary 2.1 that
\[
\begin{cases}
\displaystyle\sum_{t=p+1}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Phi_i}\Big(I_m\otimes\big(\tilde{x}_t+\sum_{\ell=1}^{t-1}G_\ell\tilde{x}_{t-\ell}\big)\Big)=O_m, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=p+1}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Psi_i}\Big(I_n\otimes\big(\tilde{x}_t+\sum_{\ell=1}^{t-1}G_\ell\tilde{x}_{t-\ell}\big)\Big)=O_n, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=p+1}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Theta_j}\Big(I_m\otimes\big(\tilde{x}_t+\sum_{\ell=1}^{t-1}G_\ell\tilde{x}_{t-\ell}\big)\Big)=O_m, & j=1,2,\dots,q,\\[4pt]
\displaystyle\sum_{t=p+1}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Xi_j}\Big(I_n\otimes\big(\tilde{x}_t+\sum_{\ell=1}^{t-1}G_\ell\tilde{x}_{t-\ell}\big)\Big)=O_n, & j=1,2,\dots,q.
\end{cases}
\]

Appendix 3. Proof of Theorem 3.4

It yields from (30) that
\[
\mathrm{vec}(X(t))=-\sum_{k=1}^{+\infty}G_k\,\mathrm{vec}(X(t-k))+\mathrm{vec}(\epsilon(t)),\quad t\in\mathbb{N}. \tag{A6}
\]
For the sake of brevity, we denote $\tilde{X}_t=\mathrm{vec}(X(t))$, $t\in\mathbb{N}$, and $\tilde{x}_k=\mathrm{vec}(x_k)$, $k=1,2,\dots,N$. It yields from (A6) that
\[
\tilde{X}_t\mid\{\tilde{X}_{t-1},\tilde{X}_{t-2},\dots\}\sim N\Big(-\sum_{k=1}^{+\infty}G_k\tilde{X}_{t-k},\ \Sigma_{mn}\Big),\quad t\in\mathbb{N}, \tag{A7}
\]
where $\Sigma_{mn}$ is defined by (7).

Let $X(t)=O_{m\times n}$ for all $t\le 0$. It follows from (A7) that
\[
\tilde{X}_1\sim N(O_{mn\times 1},\Sigma_{mn}) \tag{A8}
\]
and
\[
\tilde{X}_t\mid\{\tilde{X}_{t-1},\tilde{X}_{t-2},\dots,\tilde{X}_1\}\sim N\Big(-\sum_{k=1}^{t-1}G_k\tilde{X}_{t-k},\ \Sigma_{mn}\Big),\quad t\in\mathbb{N}. \tag{A9}
\]
Thus, the likelihood function of $x_1,x_2,\dots,x_N$ is
\[
\begin{aligned}
&L(x_1,x_2,\dots,x_N;\Phi_1,\dots,\Phi_p,\Psi_1,\dots,\Psi_p,\Theta_1,\dots,\Theta_q,\Xi_1,\dots,\Xi_q)\\
&\quad=L(\tilde{x}_1,\tilde{x}_2,\dots,\tilde{x}_N;\Phi_1,\dots,\Phi_p,\Psi_1,\dots,\Psi_p,\Theta_1,\dots,\Theta_q,\Xi_1,\dots,\Xi_q)\\
&\quad=f(\tilde{x}_1)f(\tilde{x}_2\mid\{\tilde{x}_1\})f(\tilde{x}_3\mid\{\tilde{x}_2,\tilde{x}_1\})\cdots f(\tilde{x}_N\mid\{\tilde{x}_{N-1},\tilde{x}_{N-2},\dots,\tilde{x}_1\})\\
&\quad=(2\pi)^{-\frac{Nmn}{2}}|\Sigma_{mn}|^{-\frac{N}{2}}\exp\Big\{-\frac{1}{2}\sum_{t=1}^{N}\Big(\tilde{x}_t+\sum_{k=1}^{t-1}G_k\tilde{x}_{t-k}\Big)'\Sigma_{mn}^{-1}\Big(\tilde{x}_t+\sum_{k=1}^{t-1}G_k\tilde{x}_{t-k}\Big)\Big\},
\end{aligned}
\]
where $f(\cdot)$ denotes the probability density function, and we stipulate that $\sum_{k=1}^{0}(\cdot)$ equals the zero vector or zero matrix as needed. Therefore, the log-likelihood function of $x_1,x_2,\dots,x_N$ is
\[
\begin{aligned}
&\ell(x_1,x_2,\dots,x_N;\Phi_1,\dots,\Phi_p,\Psi_1,\dots,\Psi_p,\Theta_1,\dots,\Theta_q,\Xi_1,\dots,\Xi_q)\\
&\quad=\ln\big(L(x_1,x_2,\dots,x_N;\Phi_1,\dots,\Phi_p,\Psi_1,\dots,\Psi_p,\Theta_1,\dots,\Theta_q,\Xi_1,\dots,\Xi_q)\big)\\
&\quad=-\frac{Nmn}{2}\ln(2\pi)-\frac{N}{2}\ln(|\Sigma_{mn}|)-\frac{1}{2}\sum_{t=1}^{N}\Big(\tilde{x}_t+\sum_{k=1}^{t-1}G_k\tilde{x}_{t-k}\Big)'\Sigma_{mn}^{-1}\Big(\tilde{x}_t+\sum_{k=1}^{t-1}G_k\tilde{x}_{t-k}\Big). \tag{A10}
\end{aligned}
\]
Using the derivative of a scalar with respect to a matrix, it yields from (A10) that
\[
\begin{cases}
\displaystyle\sum_{t=1}^{N}\frac{\partial}{\partial\Phi_i}\big(H(t,G)'\Sigma_{mn}^{-1}H(t,G)\big)=O_m, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=1}^{N}\frac{\partial}{\partial\Psi_i}\big(H(t,G)'\Sigma_{mn}^{-1}H(t,G)\big)=O_n, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=1}^{N}\frac{\partial}{\partial\Theta_j}\big(H(t,G)'\Sigma_{mn}^{-1}H(t,G)\big)=O_m, & j=1,2,\dots,q,\\[4pt]
\displaystyle\sum_{t=1}^{N}\frac{\partial}{\partial\Xi_j}\big(H(t,G)'\Sigma_{mn}^{-1}H(t,G)\big)=O_n, & j=1,2,\dots,q,\\[4pt]
\displaystyle N\frac{\partial\ln(|\Sigma_{mn}|)}{\partial\Sigma_{mn}}+\sum_{t=1}^{N}\frac{\partial}{\partial\Sigma_{mn}}\big(H(t,G)'\Sigma_{mn}^{-1}H(t,G)\big)=O_{mn},
\end{cases} \tag{A11}
\]
where $H(t,G)=\tilde{x}_t+\sum_{k=1}^{t-1}G_k\tilde{x}_{t-k}$. It yields from Corollary 2.2 and Property 2.3 that
\[
\begin{cases}
\displaystyle\sum_{t=2}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Phi_i}\big(I_m\otimes[\Sigma_{mn}^{-1}H(t,G)]\big)=O_m, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=2}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Psi_i}\big(I_n\otimes[\Sigma_{mn}^{-1}H(t,G)]\big)=O_n, & i=1,2,\dots,p,\\[4pt]
\displaystyle\sum_{t=2}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Theta_j}\big(I_m\otimes[\Sigma_{mn}^{-1}H(t,G)]\big)=O_m, & j=1,2,\dots,q,\\[4pt]
\displaystyle\sum_{t=2}^{N}\sum_{k=1}^{t-1}\frac{\partial (G_k\tilde{x}_{t-k})'}{\partial \Xi_j}\big(I_n\otimes[\Sigma_{mn}^{-1}H(t,G)]\big)=O_n, & j=1,2,\dots,q,\\[4pt]
\displaystyle\frac{1}{N}\sum_{t=1}^{N}\mathrm{Res}\big([\Sigma_{mn}^{-1}H(t,G)]\otimes[(\Sigma_{mn}^{-1})'H(t,G)],\,mn,\,mn\big)=(\Sigma_{mn}^{-1})'.
\end{cases}
\]