1,656
Views
1
CrossRef citations to date
0
Altmetric
GENERAL & APPLIED ECONOMICS

Robust tests for ARCH in the presence of a misspecified conditional mean: A comparison of nonparametric approaches

& ORCID Icon | (Reviewing editor)
Article: 1862445 | Received 17 Jun 2020, Accepted 04 Dec 2020, Published online: 06 Jan 2021

Abstract

This study compares the size and power of autoregressive conditional heteroskedasticity (ARCH) tests that are robust to the presence of a misspecified conditional mean. The approaches employed are based on two nonparametric regressions for the conditional mean: an ARCH test with a Nadaraya-Watson kernel regression and an ARCH test using a polynomial approximation regression. The two approaches do not require the specification of a conditional mean and can adapt to various nonlinear models, which are unknown a priori. The results reveal that the ARCH tests are robust to the misspecfied conditional mean models. The simulation results show that the ARCH tests based on the polynomial approximation regression approach have better properties of the size and power than those using the Nadaraya-Watson kernel regression approach for various nonlinear models.

Jel classification:

PUBLIC INTEREST STATEMENT

The presence of heteroskedasticity significantly impacts estimations and inferences in a time series analysis. Autoregressive conditional heteroskedasticity (ARCH) model is a simple and popular volatility model and continues to be widely used in the literature. Many studies evidence the overrejection of no ARCH effects when the nonlinearity of the conditional mean regression model is neglected. To appropriately test for ARCH, it is necessary to avoid the misspecified model of the conditional mean. This study compares the size and power of ARCH tests that do not depend on the conditional mean model. The tests are applicable to various nonlinear conditional mean models and are robust to the misspecified conditional mean model. The Monte Carlo simulation results evidence that ARCH tests that are based on the polynomial approximation regression approach have better size and power properties than those using the Nadaraya-Watson kernel regression approach when DGPs are various nonlinear models.

1. Introduction

The presence of heteroskedasticity significantly impacts estimations and inferences in a time series analysis. Becker and Hurn (Citation2009) and Pavlidis et al. (Citation2010), for example, demonstrate that the presence of heteroskedasticity frequently leads to over-rejections of the null hypothesis when testing the null for the linearity of a conditional mean model against the alternative hypothesis of nonlinear time series models. Pavlidis et al. (Citation2013) show that causality tests on the conditional mean demonstrate spurious causality relationships in the presence of multivariate heteroskedasticity. These facts indicate that tests for heteroskedasticity in data-generating processes (DGP) play an important role in time-series analyses.

The most representative model for heteroskedasticity is Engle’s (Citation1982) autoregressive conditional heteroskedasticity (ARCH) model. ARCH is a simple and popular volatility model and continues to be widely used in the literature. When testing for heteroskedasticity, a regression model for the assumed conditional mean is first estimated. Next, ARCH is examined to use statistics such as the Lagrange multiplier (LM). If the conditional mean regression model is correctly specified, the ARCH test performs well. However, a misspecified conditional mean severely impedes the ARCH tests. Lumsdaine and Ng (Citation1999) examine the properties of ARCH tests under a misspecified conditional mean. They show that the misspecification of the conditional mean over-rejects the null hypothesis for homoskedasticity. Similarly, Balke and Kapetanios (Citation2007) clarify the influence of the neglected nonlinearity of the conditional mean on ARCH tests. Their analysis evidences the over-rejection of no ARCH effects when the nonlinearity of the conditional mean regression model is neglected. Erroneous ARCH tests based on misspecified conditional mean models and the failure to obtain sufficient reliability for the derived results increasingly impede model constructions and statistical evaluation. To appropriately test for ARCH, it is necessary to avoid the misspecified model of the conditional mean. Thus, it is important to investigate which ARCH tests are robust to the misspecification of the conditional mean.

This study clarifies which ARCH tests that do not depend on the conditional mean model have better size and power. To this end, we first introduce ARCH tests using nonparametric regression approaches to avoid the misspecification of the conditional mean and next investigate the size and power of the introduced tests in various linear and nonlinear models with/without ARCH effects. The tests are applicable to various nonlinear conditional mean models and are robust to the misspecified conditional mean model. We employ two nonparametric approaches to avoid the misspecification of the conditional mean model. First is a regression using the Nadaraya-Watson kernel estimator, which is a representative nonparametric method. Nadaraya (Citation1964) and Watson (Citation1964) propose the method using a kernel density function in a regression analysis that does not depend on the model. McMillan (Citation2001) and Exterkate et al. (Citation2016) show that the Nadaraya-Watson estimator is useful under various nonlinear models. Second is the regression analysis using a polynomial approximation of a general unknown nonlinear model. Stone (Citation1977) and Katkovnik (Citation1979) propose the local polynomial estimator on the basis of a polynomial approximation. Balke and Kapetanios (Citation2007) develop a method to approximate unknown models using a neural network. Péguin-Feissolle et al. (Citation2013) introduce a causality test that is based on a Taylor approximation of a general nonlinear model and is applicable to various nonlinear models. These approaches are relevant from the viewpoint of a polynominal approximation.

In this study, we examine rejection frequencies under the null and alternative hypotheses for the introduced ARCH tests using Monte Carlo simulations. The simulation analyzes the influence of the lag length, the bandwidth selection for the Nadaraya-Watson estimator, and the approximation order for the polynominal approximation method on the results. The conditional mean models investigated in this study are linear autoregressive, threshold autoregressive, smooth transition autoregressive, Markov switching, and bi-linear models. These are popular nonlinear models used for empirical analysis and tend to cause spurious ARCH effects because as shown by Lumsdaine and Ng (Citation1999), Van Dijk et al. (Citation1999), and Balke and Kapetanios (Citation2007), it is difficult to distinguish between nonlinear models with homoskedastic variance and linear models with an ARCH effect. The Monte Carlo simulation results evidence that ARCH tests that are based on the polynomial approximation regression approach have better size and power properties than those using the Nadaraya-Watson kernel regression approach when DGPs are various nonlinear models. Empirical applications to economic variables support the simulation results.

The remainder of this paper is organized as follows: Section 2 presents the influence of a misspecified conditional mean on the ARCH tests and proposes ARCH tests using nonparametric regression approaches for the conditional mean. Section 3 presents the size and power of the tests under nonlinear models. Section 4 illustrates empirical applications. Section 5 concludes the paper.

2. ARCH tests using nonparametric regression approaches for conditional mean

We consider the following DGP with lag order m:

(1) yt=f(yt1,,ytm;β)+ut,t=1,,T(1)

where f(;) is an unknown function and β is a parameter vector. ut is a disturbance term with mean zero and variance denoted by

(2) ut=σtt;σt2=γ0+i=1pγiuti2,(2)

where  t are independently and identically distributed (iid) random variables with mean zero and variance equal to one. Although the conditional variance could have model misspecification similar to the conditional mean, standard heteroskedastic tests have the ability to find linear ARCH effects even if the true conditional variance is generalized ARCH (GARCH) with or without nonlinear parts.Footnote1 On the other hand, spurious ARCH effects tend to be observed when the conditional mean has model misspecifications. The misspecification of the conditional mean has clear impacts on the inference of variance, as shown by Lumsdaine and Ng (Citation1999) and Balke and Kapetanios (Citation2007). Thus, we focus on investigating the influence that the model misspecification of the conditional mean has on ARCH effects.

The null hypothesis of homoskedasticity to test for the ARCH effect is denoted by

(3) H0:γ1==γp=0,(3)

and the alternative hypothesis is

(4) H1:atleastoneγi0,i=1,,p.(4)

Even if we assume a GARCH model to be heteroskedastic, the testing procedure is the same as that in by Lee (Citation1991) and Gel and Chen (Citation2012). Therefore, we focus only on the ARCH test. Engle’s (Citation1982) standard ARCH test uses the auxiliary regression of squared residuals:

(5) uˆt2=γ0+γ1uˆt12++γpuˆtp2+ηt,(5)

where ηt is an error term. The LM test statistics is given by

(6) LM=TdˆWˆ(WˆWˆ)1Wˆdˆdˆdˆ,(6)

where dˆ=(dˆ1,,dˆT), dˆt=(uˆt2/σˆu1), σˆ2=(1/T)t=1Tuˆt2, Wˆ=(wˆ1,,wˆT), and wˆt=(1,uˆt12,,uˆtp2). The LM test statistic (6) is equivalent to TR2, where R2 is the coefficient for the determination of (5)Footnote2. Under the null hypothesis of no ARCH effects, the asymptotic distribution of (6) is χ2(p).

When true DGP are denoted by (1), suppose that we estimate the following misspecified model:

(7) yt=g(yt1,,ytm˜;α)+ut,(7)

where g(;) is a misspecified function, m˜ is the lag length, and α is a parameter vector for the misspecified model. Accordingly, the residual is denoted by

(8) uˆt=ut+f(yt1,,ytm;β)gˆ(yt1,,ytm˜;α)=ut+et,(8)

where et=f(yt1,,ytm;β)gˆ(yt1,,ytm˜;α). The squared residual for uˆt is

(9) uˆt2=ut2+2utet+et2.(9)

EquationEquation (9) means that the ARCH test correctly performs when etp0, whereas the ARCH test is subject to a model misspecification and leads to unreliable results when etp0 does not hold. For example, when true DGP (1) are a threshold autoregressive (TAR) model and misspecified estimation model (7) is a linear AR model, et includes nonlinearity. As highlighted by Lumsdaine and Ng (Citation1999) and Balke and Kapetanios (Citation2007), such a misspecification results in a spurious ARCH effect. Therefore, a regression approach that does not depend on a specific model is necessary to avoid model misspecification and spurious ARCH effects.

The first approach that is robust to model misspecification is a nonparametric regression that is based on the Nadaraya-Watson kernel estimator. We consider the following conditional mean regression regression model:

(10) yt=m(yt1,,yts)+ut,t=1,,T,(10)

where m() is the unknown regression function without any parametric form. The regression function for yt on Yt=(yt1,,yts) is

(11) z(yt1,,yts)=E(yt|Yt=y).(11)

The most representative method to estimate the function is the Nadaraya-Watson estimator. The estimator is denoted by

(12) zˆ(yt1,,yts)=t=1TK(Yt1y1h1,,Ytsyshs)ytt=1TK(Yt1y1h1,,Ytsyshs)(12)

where hs denotes the bandwidth to determine the smoothness of the kernel function. The kernel funcion K(y) satisfies the following:

(13) K(y)dy=1,yK(y)dy=0,y2K(y)dy>0.(13)

This study uses the Gaussian kernel denoted byFootnote3:

(14) K()=12πexp(y22).(14)

We use two bandwidth selections for hs that are derived by minimizing the integrated mean squared error (IMSE). First is Silverman’s (Citation1986) plug-in method. The bandwidth obtained using the plug-in method is based on the following equation:

(15) hs=c0T1/1+s,(15)

where c0 is a constant that depends on the kernel function. The optimal bandwidth selection for the Gaussian kernel is denoted by

(16) hsopt1.06σsT1/(s+4),(16)

where σs is the standard deviation for yts. The modified hsopt that is robust to outliers is written as

(17) h˜sopt=1.06min(σˆs,Qˆs/1.34)T1/(s+4),(17)

where Qˆs is the estimate for the interquartile range of yts.Footnote4

Second is the cross-validation procedure developed by Rudemo (Citation1982). When using the Gaussian kernel, we consider the following mean squared error called the cross-validation criterion:

(18) CV(h)=1Ti=1T(yizˆ(Yi))2,(18)

where h=(h1,,hs) and zˆ(Yi) is a leave-one-out estimator that excludes ith observation. The optimal bandwidth h for the cross-validation procedure is determined by minimizing CV(h). Stone (Citation1984) shows that bandwidth h for the cross-validation can asymptotically select the optimal bandwidth from an IMSE viewpoint and has probability convergence to the bandwidth for the plug-in method. While bandwidth hs for the plug-in method depends on the assumed kernel density function, the cross-validation is not required to assume the kernel density function and can obtain a consistent estimator for the bandwidth that minimizes IMSE. It is possible that the residuals obtained using Nadaraya-Watson estimator (12) with bandwidth selection (17) or (18) have similar properties. Accordingly, the above-mentioned nonparametric regression approach is robust to the model misspecification of the conditional mean and thus, the ARCH test is correctly performed.Footnote5

The next approach adopted to avoid misspecification is a polynomial approximation of a general unknown nonlinear model. When we apply a kth-order Taylor approximation to true model (1), the regression model is denoted by

(19) yt=β0+j=1lβjytj+j1=1lj2=j1lβj1j2ytj1ytj2++j1=1lj2=j1ljk=jk1lβj1jkytj1ytjk+t,(19)

where l is the lag length and  t is an error term that includes the remainder term of the Taylor series approximation. We assume lk as a simple notation. If the true model is a linear AR model, all βj1j2 and βj1jk are zero. In contrast, if the true model is nonlinear, one βj1j2 or βj1jk is not zero at least. We investigate this using a standard Wald test. For example, (19) with l=2 and k=2 can be written as

(20) yt=β0+j=12βjytj+j1=12j2=j12βj1j2ytj1ytj2+t.(20)

The difference between the true model and the polynomial approximation regression model reduces because the polynomial regression can approximate various nonlinear models including the TAR and Markov switching models. When testing for ARCH effects under an unknown (true) model, using residuals obtained from polynomial approximation regression (19) can be advantageous since it is possible that they show size and power properties similar to those of the true model. Therefore, the ARCH test using the residuals from the polynomial approximation regression does not appear to be influenced by model misspecification.

3. Size and power properties of ARCH tests using nonparametric regression models

This section examines the size and power of the ARCH tests using nonparametric regression models for the conditional mean presented in Section 2. We conduct Monte Carlo simulations to compare the rejection frequencies of the test statistics under various conditional mean models with and without ARCH effects. The simulations are based on 10,000 replications; a significance level of 5%; and sample sizes with T=100, 250, and 500.Footnote6 To avoid the effect of initial conditions, data with T+100 are generated. We discard the initial 100 samples and use the data with sample size T. We compare ARCH tests (6) using the following regression models for the conditional mean: the AR model denoted AR(q), polynomial approximation model (19) with second- and third-order Taylor approximation denoted as T2(q) and T3(q), and nonparametric regression model (12) with plug-in method (17) and cross-validation method (18) denoted as NPpl(q) and NPcv(q). We set lag length q to q=1 or q=2.Footnote7 The AR model is used as a benchmark for comparison.

First, we consider the following AR processes to examine the influence of lag length on the tests’ performance.

(21) yt=β0+β1yt1+β2yt2+ut,(21)
(22) ut=σtt,(22)
(23) σt2=γ0+γ1ut12,(23)

where uti.i.d.N(0,1). β0 is set to β0=0. presents the rejection frequencies for the ARCH tests obtained from each regression model for the conditional mean. We use the following DGP:

Table 1. Rejection frequencies under AR models

DGP1-1: yt=0.2yt1+ut,

DGP1-2: yt=0.7yt1+ut,

DGP1-3: yt=0.7yt10.2yt2+ut,

DGP1-4: yt=0.7yt10.5yt2+ut.

These DGP have homoskedastic errors with γ0=1 and γ1=0 for (23). The rejection frequencies presented in indicate the empirical size of the ARCH tests on the basis of each regression model.

For DGP1-1 and DGP1-2, which have lag order one, most of the tests have a small under-rejection but reasonable size performance, except for NPpl(2) and NPcv(2). NPpl(2) and NPcv(2) report over-rejections for DGP1-1 and DGP1-2. The rejection frequencies of NPpl(2) for DGP1-1 with T=500 and of NPcv(2) for DGP1-2 with T=500 are 0.143 and 0.101. An additional lag for the nonparametric regression of the conditional mean using the Nadaraya-Watson estimator leads to size distortions in the ARCH tests. In contrast, AR(2), T2(2), and T3(2) do not report overrejections for DGP1-1 and DGP1-2. The results show that the additional lag for AR and polynomial approximation regression does not impact the size of the ARCH tests. However, a lower lag length clearly influences the empirical size of all the tests. We see that the ARCH tests based on AR(1), T2(1), T3(1), NPpl(1), and NPcv(1) over-reject the null hypothesis of homoskedastic variance under DGP1-3 or DGP1-4, which have a lag order of two. For example, the rejection frequencies of AR(1), T2(1), T3(1), NPpl(1), and NPcv(1) for DGP1-4 with T=250 are 0.127, 0.116, 0.097, 0.115, and 0.113, respectively. The size distortions in DGP1-4 are greater than those in DGP1-3. The influence of the lower lag length on the empirical size depends on the persistence parameter of DGP. Compared with the size distortions for the model with a lower lag length, those for the model with an additional lag length are smaller. The lower lag length tends to cause the size distortions because the conditional variance includes the influence of extra lags and leads to spurious ARCH effects. In contrast, when the conditional mean regression includes additional lags, they are not significant and the conditional variance does not include the influence of additional lag length. Accordingly, we present size and power properties for the models with two lags below.

We examine the empirical size of the ARCH tests under the following conditional mean generated by the TAR models.

DGP2-1: yt=(0.7yt10.2yt2)I(yt10)+(0.1yt10.2yt2)I(yt1<0)+ut,

DGP2-2: yt=(0.7yt10.2yt2)I(yt10)+(0.5yt10.2yt2)I(yt1<0)+ut,

DGP2-3: yt=(0.7yt1+0.2yt2)I(yt10)+(0.7yt10.7yt2)I(yt1<0)+ut,

DGP2-4: yt=(0.7yt10.2yt2)I(Δyt10)+(0.1yt10.2yt2)I(Δyt1<0)+ut,

DGP2-5: yt=(0.7yt10.2yt2)I(Δyt10)+(0.5yt10.2yt2)I(Δyt1<0)+ut,

DGP2-6: yt=(0.7yt1+0.2yt2)I(Δyt10)+(0.7yt10.7yt2)I(Δyt1<0)+ut,

where I() is an indicator function that takes the value of 1 if I() is ture and 0 if I() is not true. ut denotes a homoskedastic error similar to that from DGP1-1 to 1-4. While DGP2-1, 2-2, and 2-3 are standard TAR models whose indicator functions depend on yt1, DGP2-4, 2-5, and 2-6 are momentum threshold autoregressive (MTAR) models wherein the threshold is the difference Δyt1. These TAR models allow for asymmetric adjustments. In addtion, MTAR can capture the spiky properties of the process.

illustrate the sample path for DGP2-1 with homoskedastic errors and the ARCH effect γ0=0.3 for (23). clearly shows the volatile behavior generated by the ARCH effect. However, illustrates that the sample path for DGP2-3 demonstrates a similar volatile movement even if the error is homoskedastic. As shown in , it is generally difficult to distinguish between the nonlinear conditional mean model with the homoskedastic error and the linear AR model with ARCH effect. Such a similarlity between the TAR model with homoskedastic errors and the linear AR model with ARCH effects may produce spurious size and power properties.

Figure 1. Sample path for DGP1-1

Figure 1. Sample path for DGP1-1

Figure 2. Sample path for DGP1-1 with γ1=0.3

Figure 2. Sample path for DGP1-1 with γ1=0.3

Figure 3. Sample path for DGP2-3

Figure 3. Sample path for DGP2-3

tabulates the simulation results. AR(2) reports over-rejections for the null hypothesis of no ARCH effects. For DGP2-2 and DGP2-5, which have strong asymmetry, the size distortions of AR(2) are significantly large. These results indicate that the use of the AR model for the conditional mean leads to spurious ARCH effects when the true DGP are based on the TAR or MTAR model. In additon, the over-rejections increase with large sample size. Unlike the performance of AR(2), the polynomial approximation regression models T2(2) and T3(2) and nonparametric regression models NPpl(2) and NPcv(2) perform better. For example, the rejection frequencies of AR(2), T2(2), T3(2), NPpl(2), and NPcv(2) for DGP2-2 with T=250 are 0.373, 0.040, 0.033, 0.042, and 0.051, respectively. T3(2) has a more reasonable size compared with those for T2(2), NPcl(2), and NPcv(2). T2(2), NPcl(2), and NPcv(2) report size distortions in certain cases. The rejection frequencies of T2(2), NPcl(2), and NPcv(2) for DGP2-3 with T=500 are 0.096, 0.139, and 0.104, respectively. Thus, the polynomial approximation regression model T3(2) is a more appropriate approach to test for ARCH than other approaches under the TAR or MTAR model.

Table 2. Rejection frequencies under TAR models

presents the rejection frequencies for each test under smooth transition autoregressive (STAR) models generated by the followings:

Table 3. Rejection frequencies under STAR models

DGP3-1: yt=0.7yt10.2yt2+(0.5yt10.2yt2)(1exp(0.1yt12))+ut,

DGP3-2: yt=0.7yt10.2yt2+(yt10.2yt2)(1exp(0.1yt12))+ut,

DGP3-3: yt=0.7yt10.2yt2+(yt10.2yt2)(1exp(yt12))+ut,

DGP3-4: yt=0.7yt10.2yt2+(0.5yt10.2yt2)(1+exp(0.1yt1))1+ut,

DGP3-5: yt=0.7yt10.2yt2+(yt10.2yt2)(1+exp(0.1yt1))1+ut,

DGP3-6: yt=0.7yt10.2yt2+(yt10.2yt2)(1+exp(yt1))1+ut,

where ut denotes homoskedastic errors similar to those in . STAR models have the time-varying properties of the conditional mean. DGP3-1, 3-2, and 3-3 impose symmetry constraints on the time-varying properties, whereas DGP3-4, 3-5, and 3-6, which are logistic STAR models, allow asymmetry. DGP3-2 and 3-5 produce a smoother and more marginal change than DGP3-3 and 3-6. We observe that AR(2), T2(2), and NPpl(2) partially reject the null hypothesis of no ARCH effects. The rejection frequencies of AR(2) is higher than those of the other regression models for DGP3-2 and 3-6. T2(2) shows size distortions for DGP3-2. NPpl(2) reports a slight over-rejection with T=500. In contrast, the shape of the transition function does not have a clear impact on the empirical size of T3(2) and NPcv(2). T3(2) and NPcv(2) can capture the properties of STAR models and allows the ARCH test to perform well.

In addition, we present the results of each test for the other nonlinear processes:

DGP4-1: yt=(0.7yt10.2yt2)st+(0.3yt10.2yt2)(1st)+ut, p00=p11=0.7,

DGP4-2: yt=(0.7yt10.2yt2)st+(0.3yt10.2yt2)(1st)+ut, p00=p11=0.98,

DGP4-3: yt=(0.7yt1+0.2yt2)st+(0.3yt10.2yt2)(1st)+ut, p00=p11=0.98,

DGP4-4: yt=0.1yt1ut1+0.1yt2ut2+ut,

DGP4-5: yt=0.3yt1ut1+0.1yt2ut2+ut,

DGP4-6: yt=0.1yt1ut10.1yt2ut2+ut,

where uti.i.d.N(0,1) and st is a random variable that takes the value of 0 or 1. DGP4-1, 4-2, and 4-3 are Markov switching processes and st determines the behavior. Whether st takes the value of 0 or 1 depends on the transition probabilities p11 and p00. p11=P(st+1=1|st=1) denotes the change probability from state st=1 to state st+1=1. Similarly, the transition probabilities are denoted by p00=P(st+1=0|st=0), p10=1p00=P(st+1=1|st=0), and p01=1p11=P(st+1=0|st=1), respectively. They are set to p11=p00=0.7 for DGP4-1 and p11=p00=0.98 for DGP4-2 and 4-3. While DGP4-1 has frequent switches in the AR parameters, DGP4-2 and 4-3 show persistent switches. DGP4-4, 4-5, and 4-6 are bilinear models that are used to model rare, volatile, or outburst processes.

AR(2) that neglects nonlinearity causes the spurious ARCH effect, which is similar to the results in . The results for the nonparametric regression models using the Nadaraya-Watson estimator depend on the bandwidth selection. NPpl(2) under-rejects the null hypothesis for DGP4-1, DGP4-2, and DGP4-5 and over-rejects that for DGP4-3, 4-4, and 4-6. NPcv(2) performs well for DGP4-2, DGP4-4, and DGP4-6 and over-rejects the null hypothesis for DGP4-1, 4-3, and 4-5. T2(2) has relatively reasonable emirical sizes for T=100 and 200, but reports size distortions for DGP4-1, 4-2, 4-3, 4-5, and 4-6 with T=500. Here as well, we find that T3(2) generally performs better.

The simulation results from evidence that the model misspecification of the conditional mean causes size distortions for the null hypothesis of no ARCH effects. The ARCH tests using the AR regression model are sensitive to the presence of the nonlinear conditional mean and show high over-rejections. This can be attributed by neglected nonlinearity and difficulties in distinguishing between the nonlinearity of the conditional mean and the ARCH effects. While the nonparametric regression models using the Nadaraya-Watson estimator partially perform well, the rejection frequencies strongly depend on DGP and the bandwidth selection. By contrast, the size properties of T3(2) outperform those of other models and are close to the nominal size at 5%. Therefore, T3(2) can approximate the (unknown) linear and nonlinear conditional mean models well and produce reliable ARCH tests.

report the nominal power and size-corrected power properties for the ARCH tests. We use DGP1-3, DGP2-1, 2-4, 3-1, 3-4, 4-1, and 4-4 for power comparison. Each DGP has an ARCH effect denoted by

(24) ut=σtt,(24)
(25) σt2=γ0+γ1ut12,(25)

Table 4. Rejection frequencies under MS and bilinear models

Table 5. Nominal power properties for ARCH tests

Table 6. Size-corrected power properties for ARCH tests

where γ0 and γ1 are set to γ0=1 and γ1=(0.1,0.3). The powers of AR(2) are clearly higher than those of other models in . We have a relatively reasonable evaluation of the power for DGP1-3 because the size properties of AR(2) and other tests are close to the nominal level 0.05 (). However, we cannot correctly evaluate the high nominal powers of AR(2) for other DGP. The higher powers of AR(2) are influenced by size distortions presented in tables from 2 to 4. The power properties of the nonparametric models are more appropriately interpreted because T2(2) and T3(3) do not overreject the null hypothesis for DGP in and the size distortions of NPpl(2) and NPcv(2) are smaller than those of AR(2). In comparison, we observe that the polynomial approximation models T2(2) and T3(2) perform better than NPpl(2) and NPcv(2). Note that the powers of NPpl(2) are quite small when the ARCH effect is γ1=0.1. For γ1=0.3, the nonparametric regression models report sufficient power to identify the ARCH effects.

We compare the power properties among the models without the influences of size distortions. demonstrates the size-corrected power. The powers of AR(2) in are lower than those in because the size distortions are corrected. AR(2) still performs well even if the size is corrected. The ability to detect ARCH effects in the nonlinear models for T2(2) is high, similar to that of AR(2). While the powers of T3(2) is slightly smaller than those of T2(2) because T3(2) has additional regression parameters for the conditional mean, it has sufficient power to find the ARCH effect. The rejection frequencies of NPpl(2) and NPcv(2) for γ1=0.1 are inferior to those of other models in . While they relatively perform well for γ1=0.3 with T=100, other models have better power properties, particularly for T=250 and 500.

The comparison of the ARCH tests using each regression model for the conditional mean indicates that the presence of the nonlinear conditional mean has influences of size and power properties on the ARCH tests. The AR regression models have higher over-rejection of the null hypothesis of no ARCH effects for the nonlinear conditional mean models. The ARCH tests based on AR models for the nonlinear conditional mean are not effective from the viewpoints of size and power. This is because size-corrected tests are needed and the true model is generally unknown a priori. The nonparametric regression models using the Nadaraya-Watson estimator tend to have slight size distortions and low power. The polynomial approximation model T2(2) shows slight over-rejection depending on the nonlinear conditional mean and sample size, although it has better power properties for the ARCH effect with the nonlinear conditional mean. T3(2) has a reasonable size and power properties and yields reliable results for the ARCH tests irrespective of the conditional mean models.

4. Empirical applications

In this section, we apply the tests to economic variables. We employ six economic data including those of Real Gross Domestic Product (GDP) for U.S., M2 for U.S., Federal Funds rate (FF), Producer Price Index (PPI) for U.S., Crude Oil Prices (COP) for West Texas Intermediate, and Global Price Index of all commodities (GPI). We use first difference of logarithm except FF. The data are obtained from Federal Reserve Bank of St. Louis. Each data consist of 294 (1Q 1947-2Q 2020), 478 (November 1980–August 2020), 794 (July 1954–August 2020), 1291 (January 1913–July 2020), 416 (January 1986–August 2020), and 343 (January 1992–July 2020) observations, respectively. The lag lengths of ARCH tests are selected by the Akaike information criterion (AIC).

Empirical results are presented in , which tabulates nominal p values of each test. The results support simulation evidence demonstrated in Section 3. For M2 and FF, the p values of all the tests are small. It is possible that the results are influenced by size distortions because Monte Carlo simulation in Section 3 demonstrates that when time series prosesses have nonlinear mean properties, AR(2) has overrejections of the null hypothesis, and tends to lead spurious ARCH effects. T2(2) NPpl(2), and NPcv(2) also have overrejections depending on DGP. However, as shown in Section 3, T3(2) has reasonable size properties regardless of DGP. The p values of T3(2) are sufficient small. This indicates that M2 and FF have ARCH effects. While AR(2), T2(2), T3(2), and NPcv(2) have small p values for GDP and PPI similar to those for M2 and FF, NPpl(2) has large p values. The results may be due to poor power performance in the presence of small ARCH effects as shown in .

Table 7. Empirical results

For COP and GPI, p values of AR(2) are small, whereas those of T3(2) and NPpl(2) are large. It is possible that this is because AR(2) has overrejection when DGP have nonlinear mean properties. T3(2) has superior size properties for DGP with nonlinear mean properties. The results indicate that COP and GPI have nonlinear mean properties without ARCH effects. The empirical applications support our simulation results and show that T3(2) can appropriately find ARCH effects, whereas there are cases in which other tests lead to unreliable results.

5. Summary and conclusion

This study compares the size and power of the ARCH tests that are robust to misspecified conditional mean models. ARCH tests are important for statistical modeling because the presence of ARCH affects the statistical inference of the conditional mean regression model and the analysis of volatility. However, it is difficult to determine the correct specified conditional mean model and possible to employ a misspecified conditonal mean model. This may lead to unreliable results. Therefore, it is neccesary to compare robust ARCH tests to various unknown conditional mean model and clarify their size and power properties. The approaches employed in this study are based on two nonparametric regressions: an ARCH test using the Nadaraya-Watson kernel regression and an ARCH test with the polynomial approximation. The two approches can adapt to various nonlinear models. Since a true model is generally unknown a priori, they are robust to misspecfied models. The Monte Carlo simulations evidence that the ARCH tests based on the polynomial regression approach have a better size and power properties than those using the Nadaraya-Watson kernel regression approach for various nonlinear conditional mean models. In particular, the test using the regression approach based on the third-order Taylor approximation has a reasonable and acceptable size and sufficient power for any time series models. The simulation and empirical results further show that the ARCH test using the polynomial approximation approach is useful when testing if DGPs have an ARCH effect and for ARCH without model specifications when the conditional mean model is unknown a priori. Of course, if researchers know DGP a priori or the purpose of researchers is the specification of the conditional mean, nonparametric regressions employed in this study are not useful and parametric approaches are appropriate. However, when it is difficult to specify the parametric model a priori and the purpose of researchers is the test for heteroskedasticity, our analysis is more useful. Robust univariate and multivariate ARCH tests that do not depend on the model specification of the conditional variance in addition to the conditional mean are left for further study.

Additional information

Funding

This research was supported by KAKENHI [Grant number: 16K03604].

Notes on contributors

Daiki Maki

Daiki Maki is a professor of Econometrics at the Faculty of Commerce, Doshisha University. He is an expert at time series analysis. His research interests cover nonlinear time series analysis, nonstationary time series analysis, and financial econometrics.

Yasushi Ota

Yasushi Ota is an associate professor of Finance at the Faculty of Management, Okayama University of Science. His field of expertise is financial theory. His research interests cover option pricing and mathematical finance.

Notes

1. Bühlmann and McNeil (Citation2002) and Meister and Kreiẞ (Citation2016) introduce nonparametric GARCH models. In addition, studies of nonlinear GARCH models include those of González-Rivera (Citation1998), Pan et al. (Citation2008), and Bauwens et al. (Citation2010).

2. Catani and Ahlgren (Citation2017) propose an LM test for ARCH using high-dimentional vector autoregressive models. In addition, Gel and Chen (Citation2012) introduce bootstrap ARCH tests.

3. Other kernel functions include uniform, Epanechnikov, biweight, and triweight kernel functions. In general, while the type of kernel functions does not have a large impact on the estimation results, the selection of bandwidth significantly influences the estimation results.

4. Sheather and Jones (Citation1991) propose another bandwidth selection that is based on the plug-in method. There are other bandwidth selection methods including those of Hart and Vieu (Citation1990), Ziegler (Citation2006), Li et al. (Citation2013), Cheng et al. (Citation2019), Jones et al. (Citation1996), and Köhler et al. (Citation2014) provide survey of bandwidth selection. The performance of the Nadaraya-Watson kernel regression strongly depends on the bandwidth selection. Since we only employ two bandwidth selections, the results in this study are somewhat limited. The compasison of robust ARCH tests using the Nadaraya-Watson kernel regression using various bandwidth is left for further study.

5. Shimizu (Citation2014) introduces the estimation of nonparametric AR(1)-ARCH(1) using wild bootstrap. Shin and Hwang (Citation2015) apply stationary bootstrap to estimate nonparametric AR(1)-ARCH(1).

6. If you are interested in simulation results of other levels of significance, these results are available upon request. The results are similar to those of 5% level of siginificance.

7. Zambom and Kim (Citation2017) propose lag selection in the nonparametric conditional heteroskedastic models. Compared to conventional methods, this method more appropriately selects lag length for various nonlinear models. We fix lag length in this study to investigate the statistical performance of the nonparametric regression models.

References

  • Balke, A. P., & Kapetanios, G. (2007). Testing for ARCH in the presence of nonlinearity of unknown form in the conditional mean. Journal of Econometrics, 137(2), 472–18. https://doi.org/10.1016/j.jeconom.2005.08.007
  • Bauwens, L., Preminger, A., & Rombouts, J. V. (2010). Theory and inference for a Markov switching GARCH models. Econometrics Journal, 13(2), 218–244. https://doi.org/10.1111/j.1368-423X.2009.00307.x
  • Becker, R., & Hurn, S. (2009). Testing for nonlinearity in mean in the presence of heteroskedasticity. Economic Analysis and Policy, 39(2), 313–326. https://doi.org/10.1016/S0313-5926(09)50023-9
  • Bühlmann, P., & McNeil, A. J. (2002). An algorithm for nonparametric GARCH modelling. Computational Statistics & Data Analysis, 40(4), 665–683. https://doi.org/10.1016/S0167-9473(02)00080-4
  • Catani, P. S., & Ahlgren, N. J. C. (2017). Combined Lagrange multiplier test for ARCH in vector autoregressive models. Econometrics and Statistics, 1, 62–84. https://doi.org/10.1016/j.ecosta.2016.10.006
  • Cheng, T., Gao, J., & Zhang, X. (2019). Nonparametric localized bandwidth selection for Kernel density estimation. Econometric Reviews, 38(7), 733–762. https://doi.org/10.1080/07474938.2017.1397835
  • Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica, 50(4), 987–1007. https://doi.org/10.2307/1912773
  • Exterkate, P., Groenen, P. J. K., Heij, C., & van Dijk, D. (2016). Nonlinear forecasting with many predictors using kernel ridge regression. International Journal of Forecasting, 32(3), 736–753. https://doi.org/10.1016/j.ijforecast.2015.11.017
  • Gel, Y. R., & Chen, B. (2012). Robust Lagrange multiplier test for detecting ARCH/GARCH effect using permutation and bootstrap. The Canadian Journal of Statistics, 40(3), 405–426. https://doi.org/10.1002/cjs.11149
  • González-Rivera, G. (1998). Smooth-transition GARCH models. Studies in Nonlinear Dynamics & Econometrics, 3(2), 61–78. https://doi.org/10.2202/1558-3708.1041
  • Hart, J. D., & Vieu, P. (1990). Data-driven bandwidth choice for density estimation based on dependent data. The Annals of Statistics, 18(2), 873–890. https://doi.org/10.1214/aos/1176347630
  • Jones, M. C., Marron, J. S., & Sheather, S. J. (1996). A brief survey of bandwidth selection for density estimation. Journal of the American Statistical Association, 91(433), 401–407. https://doi.org/10.1080/01621459.1996.10476701
  • Katkovnik, V. Y. (1979). Linear and nonlinear methods of nonparametric regression analysis. Soviet Automatic Control, 5, 35–46.
  • Köhler, M., Schindler, A., & Sperlich, S. (2014). A review and comparison of bandwidth selection methods for kernel regression. International Statistical Review, 82(2), 243–274. https://doi.org/10.1111/insr.12039
  • Lee, J. H. H. (1991). A Lagrange multiplier test for GARCH models. Economics Letters, 37(3), 265–271. https://doi.org/10.1016/0165-1765(91)90221-6
  • Li, Q., Lin, J., & Racine, J. S. (2013). Optimal bandwidth selection for nonparametric conditional distribution and quantile functions. Journal of Business and Economic Statistics, 31(1), 57–65. https://doi.org/10.1080/07350015.2012.738955
  • Lumsdaine, R. L., & Ng, S. (1999). Testing for ARCH in the presence of a possibly misspecified conditional mean. Journal of Econometrics, 93(2), 257–279. https://doi.org/10.1016/S0304-4076(99)00011-1
  • McMillan, D. G. (2001). Nonlinear predictability of stock market returns: Evidence from nonparametric and threshold models. International Review of Economics and Finance, 10(4), 353–368. https://doi.org/10.1016/S1059-0560(01)00093-4
  • Meister, A., & Kreiẞ, J. P. (2016). Statistical inference for nonparametric GARCH models. Stochastic Processes and Their Applications, 126(10), 3009–3040. https://doi.org/10.1016/j.spa.2016.03.010
  • Nadaraya, E. A. (1964). On estimating regression. Theory of Probability and Its Application, 9(1), 141–142. https://doi.org/10.1137/1109020
  • Pan, J., Wan, H., & Tong, H. (2008). Estimation and tests for power-transformed and threshold GARCH models. Journal of Econometrics, 142(1), 352–378. https://doi.org/10.1016/j.jeconom.2007.06.004
  • Pavlidis, E., Paya, I., & Peel, D. (2010). Specifying smooth transition regression models in the presence of conditional heteroskedasticity of unknown form. Studies in Nonlinear Dynamics and Econometrics, 14(3), 1–40. https://doi.org/10.2202/1558-3708.1702
  • Pavlidis, E., Paya, I., & Peel, D. (2013). Nonlinear causality tests and multivariate conditional heteroskedasticity: A simulation study. Studies in Nonlinear Dynamics and Econometrics, 17(3), 297–312. https://doi.org/10.1515/snde-2012-0067
  • Péguin-Feissolle, A., Strikholm, B., & Teräsvirta, T. (2013). Testing the Granger noncausality hypothesis in stationary nonlinear models of unknown functional form. Communications in Statistics-Simulation and Computation, 42(5), 1063–1087. https://doi.org/10.1080/03610918.2012.661500
  • Rudemo, M. (1982). Empirical choice of histograms and kernel density estimators. Scandinavian Journal of Statistics, 9(2), 65–78. https://www.jstor.org/stable/4615859
  • Sheather, S. J., & Jones, M. C. (1991). A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society, Series B, 53(3), 683–690. https://doi.org/10.1111/j.2517-6161.1991.tb01857.x
  • Shimizu, K. (2014). Bootstrapping the nonparametric ARCH regression model. Statistics & Probability Letters, 87, 61–69. https://doi.org/10.1016/j.spl.2014.01.002
  • Shin, D. W., & Hwang, E. (2015). Stationary bootstrapping for the nonparametric AR-ARCH model. Communications for Statistical Applications and Methods, 22(5), 463–473. https://doi.org/10.5351/CSAM.2015.22.5.463
  • Silverman, B. W. (1986). Density estimation for statistics and data analysis. Champman & Hall/CRC Monographs on Statistics & Applied Probability, Springer.
  • Stone, C. J. (1977). Consistent nonparametric regression. Annals of Statistics, 5(4), 595–645. https://doi.org/10.1214/aos/1176343886
  • Stone, C. J. (1984). An asymptotically optimal window selection rule for kernel density estimates. Annals of Statistics, 12(4), 1285–1297. https://doi.org/10.1214/aos/1176346792
  • Van Dijk, D., Franses, P. H., & Lucas, A. (1999). Testing for ARCH in the presence of additive outliers. Journal of Applied Econometrics, 14(5), 539–562. https://doi.org/10.1002/(SICI)1099-1255(199909/10)14:5<539::AID-JAE526>3.0.CO;2-W
  • Watson, G. S. (1964). Smooth regression analysis. Sankhya, Series A, 26, 359–372. https://www.jstor.org/stable/25049340
  • Zambom, A. Z., & Kim, S. (2017). Lag selection and model selection specification testing in nonparametric autoregressive conditional heteroscedastic models. Journal of Statistical Planning and Inference, 186, 13–27. https://doi.org/10.1016/j.jspi.2017.02.004
  • Ziegler, K. (2006). On local bootstrap bandwidth choice in kernel density estimation. Statistics and Decisions-International Journal Stochastic Methods and Models, 24, 291–302. https://doi.org/10.1524/stnd.2006.24.2.291