372
Views
0
CrossRef citations to date
0
Altmetric
Financial Economics

Forecasting Ethereum’s volatility: an expansive approach using HAR models and structural breaks

Article: 2300925 | Received 10 Jul 2023, Accepted 27 Dec 2023, Published online: 29 Jan 2024

Abstract

Cryptocurrencies have become a popular investment option and the Ethereum has become a mainstream cryptocurrency because of the additional functionality that can be accomplished with the backing of the powerful Ethereum network compared to Bitcoin. The high volatility of Ethereum offers both profits and risks, making it crucial to improve the forecasting ability for its price volatility. The results of this study could be useful for investors and policymakers who are interested in understanding and managing the risks associated with investing in Ethereum. Several studies have explored similar topics using heterogeneous autoregressive (HAR) models for cryptocurrencies, but this paper offers a more expansive approach. This paper employs five-minute high-frequency data to construct 4 HAR models to predict the volatility of Ethereum, taking into account the impact of structural breaks, Bitcoin, SP500 and VIX. The model that considers all factors outperforms other models for out-of-sample predictions for the 1-week forecasting. Due to the nature of the Ethereum price, the HAR-RV model has achieved a perfect fit in 1-day and 1-month forecasting. Therefore, other models have a very small improvement in fitness and prediction accuracy. This paper contributes to the understanding of Ethereum’s volatility and its impact on the cryptocurrency market.

1. Introduction

With the maturity of blockchain technology, cryptocurrencies have become an attractive investment option for numerous investors. Ethereum is in the spotlight as a mainstream currency. Ethereum, unlike its competitor Bitcoin, can not only perform the same functions as Bitcoin which are tradability and storage values of cryptocurrency, but also carry out extra tasks including developing and running applications, signing smart contracts, and carrying out different kinds of transactions that bitcoin lacks relying on Ethereum network. From November 28, 2019 to January 5, 2023, the price of Ethereum experienced a trough of $122.17 and climbed to a peak of $4,644.43. Such high volatility offers both profits and risks. There is growing evidence that Ethereum offers substantial diversification to investors when included in portfolios (Bhosale & Mavale, Citation2018). However, the high volatility of Ethereum, partly due to the immaturity of the cryptocurrency market and the lack of regulation, presents both profits and risks for investors. To effectively manage these risks, it is crucial to improve the forecasting ability for the price volatility of Ethereum.

The study (Ftiti et al., Citation2021) found that the heterogeneous autoregressive (HAR) model is appropriate for predicting future volatility. Furthermore, a seminal study in this area is the work of Shen et al. (Citation2020) showing that structural breaks should also be considered when predicting the volatility of cryptocurrency. Besides, a qualitative study by Imran et al. described how the major cryptocurrencies affected each other by using high-frequency data. Mensi et al. (Citation2019) show evidence of co-movements in time frequency space with leading relationships of Ethereum with Bitcoin. Moreover, Assaf et al. (Citation2022) quantify information flows between Bitcoin and Ethereum. This indicates that analyzing Ethereum’s volatility is crucial for the cryptocurrency market. The paper (Akyildirim et al., Citation2020) claims the existence of time-varying positive interrelationships between the conditional correlations of cryptocurrencies and financial market stress (VIX). The inclusion of VIX in the analysis allows for a better understanding of how financial market stress affects the volatility of Ethereum. Ghorbel et al. (Citation2022) shed light on the relationship between the stock market and cryptocurrencies. To build upon their research, we extend our analysis by incorporating the SP500 index, thereby considering the potential impact of traditional stock market dynamics on the cryptocurrency market.

Forecasting the volatility of Ethereum is an extremely complicated topic that necessitates consideration of a multitude of variables and the underlying economic logic. Previous studies have explored similar topics using heterogeneous autoregressive (HAR) models for cryptocurrencies (Ftiti et al., Citation2021) and have considered structural breaks (Mensi et al., Citation2019). Opposed to just one or two factors examined in previous studies, such as Omura et al. (Citation2023) and Shen et al. (Citation2020), this paper considers multiple factors.

In particular, Shen et al. (Citation2020) have studied Bitcoin by using HAR model and considering structural breaks and jumps. Based on the similarity of the research hypothesis, we replicated the logical framework of their study, for instance, the structure of the paper and some of the methods. But we took into account other factors. Therefore, more complex comparison methods are needed, making our study more comprehensive than theirs.

This study employs 5-minute high-frequency data to construct 4 Heterogeneous Autoregressive models (HAR) to predict Ethereum’s volatility, incorporating the influence of structural breaks, Bitcoin, and the Volatility Index (VIX). Our results indicate that the model incorporating structural breaks outperforms other models in both in-sample analysis and out-of-sample forecasts, in the 1-week horizon. Given the peculiar price dynamics of Ethereum, the HAR-RV model exhibits an excellent fit in both 1-week and 1-month forecasts, rendering any further enhancement of the all-inclusive model’s fitness and prediction accuracy trivial. However, considering the influence of structural breaks, the VIX, S&P 500 volatility, and Bitcoin price volatility marginally improves the 1-month forecast accuracy, as indicated by the out-of-sample results.

The subsequent sections of this paper are organized as follows. Section 2 presents a detailed overview of the database employed in this study. Section 3 discusses the detection and interpretation of structural breaks. We introduce the HAR-RV model as the benchmark model alongside three novel models, in Section 4. The analysis of in-sample and out-of-sample performance is evaluated in Section 5, and the robustness of the models is examined in Section 6. Finally, Section 7 concludes with a concise summary of the primary findings and recommendations for future research.

2. Data description

Considering the accuracy of volatility prediction, this paper chose 5-min as the sampling frequency. The transaction data is the price of Ethereum in USD from Bloomberg. The full sample period is from November 28, 2019 to January 5, 2023. After computing the Realized Volatility and subtracting the vacant value, 1119 trading days from December 12, 2019 to January 5, 2023 are obtained. Ethereum trades 24 hours a day, seven days a week, and therefore we have a continuous time series throughout our sample period.

To calculate the realized volatility from the 5‐minute data, Andersen and Bollerslev (Citation1997) defined the daily realized volatility as the sum of the squared intraday returns  rt,i at a given sampling frequency 1/N: RVt=i=1N rt,i2, where  rt,i=100*(lnPt,ilnPt,i1) and Pt,i  is the ith intraday closing price at day t.

illustrates that the price of ETH exhibited a steep ascent beginning in January 2021 and reached its peak on July 10, 2021. Throughout this period, the volatility remained high and exhibited some anomalous fluctuations. Notably, between mid-April and mid-May 2021, the price experienced a rapid increase followed by a sharp decline, indicating uncommon and dramatic fluctuations. From May 2022, the volatility stabilized gradually. Nevertheless, certain points of uncharacteristic volatility persisted.

Figure 1. The price of ETH.

Figure 1. The price of ETH.

depicts the Autocorrelation Function (ACF) of daily realized volatility. The ACF is a common technique used to determine the parameters of common time series models, such as p in an autoregressive model (AR) and expresses the average relationship between data points in a time series and their preceding data points. Based on , we conjecture that standard time series models are inadequate for predicting ETH prices.

Figure 2. Autocorrelation function (ACF) of RV.

Figure 2. Autocorrelation function (ACF) of RV.

reveals the existence of volatility aggregation and asymmetry, as well as volatility structural breaks, which refer to critical events that led to significant changes in economic conditions, such as policy transitions and natural disasters. In the following section, we will introduce the ICSS test, which aims to identify structural breaks. Large economic events, such as a new wave of regulations implemented by the Chinese government, can trigger substantial price volatility. Additional events are summarized in the subsequent section. In other words, when modelling volatility, it is crucial to consider the impact of structural breaks to improve forecasting accuracy.

Figure 3. Daily realized volatility.

Figure 3. Daily realized volatility.

displays the mean, extreme values, and standard deviation of ETH price volatility, S&P500 volatility, the VIX index, and Bitcoin volatility. As shown, the volatility of ETH price ranges from 2.709 to 1330.177, with a wide range and intense volatility. These findings reinforce the argument that the volatility of the ETH market fluctuates more dramatically than the SP500 volatility and the VIX index. Additionally, the variance of Bitcoin price volatility is larger than that of Ethereum price volatility. In summary, the mean and variance of Bitcoin, which is also a cryptocurrency, are relatively similar to those of Ethereum, while the other two index funds exhibit relatively more consistent performance.

Table 1. Descriptive statistics of variables.

3. Structural breaks

Liu and Maheu (Citation2008) and Choi et al. (Citation2010) provide strong evidence of the presence of structural breaks in high-frequency realized volatility. Furthermore, Shen et al. (Citation2020) present a significant analysis and discussion on the structural breaks in the volatility of cryptocurrencies. While the focus of their study is on Bitcoin, their findings are still illustrative of the importance of considering structural breakpoints in similar problems. These structural breakpoints indicate the response of the volatility to unexpected information. This specific information may continue to affect Ethereum in the future. Therefore, considering structural breaks may improve the in-sample and, more importantly, the out-of-sample performance of HAR-RV models.

3.1. ICSS algorithm

To identify the breakpoints, we apply the iterative cumulative sum of squares (ICSS) approach which is developed by Inclan and Tiao (Citation1994). This algorithm calculates the standard deviations between the change points to determine the number of structural breaks. Firstly, the cumulative sum of squares Ck for all potential breakpoints observations 1 through k were found by using the model error terms. Ck=t=1ket2,k=1,2,T.

The cumulative sum of squares should be normalized and centralized, using the partial series CSS, Ck, and the full series CSS. Dk=CkCTkT.

Furthermore, the potential breakpoint should be found, k*, which is the location in the series of the maximum absolute value of the centralized cumulative sum of squares Dk. Finally, the IT with the following formula must be checked, if IT exceeds the critical value of the limiting distribution, then k∗ represents a statistically significant breakpoint. In this paper, 1.628 (Gong & Lin, Citation2021) was selected as the approximate threshold at the 99th percentile of the asymptotic distribution of IT. IT=T2*Dk.

plots the daily realized volatility of ETH with breakpoints and ± 3 standard deviation bands. reports the number and start time of structural breaks in volatility that are identified by the ICSS algorithm. The price volatility differs significantly at different stages.

Figure 4. The daily realized volatility of ETH with breakpoints and ± 3 standard deviation bands.

Figure 4. The daily realized volatility of ETH with breakpoints and ± 3 standard deviation bands.

Table 2. Structural breaks of the price volatility of ETH.

3.2. Identification

The consistency of the ICSS algorithm is demonstrated through its ability to identify specific events that correspond to structural breaks. For example, on April 16, 2020, the scaling down of Facebook-backed Libra caused panic in the cryptocurrency market, leading to the occurrence of a structural breakpoint. Surprisingly, volatility remained stable within an acceptable range until January 4th, 2021, when the interest from investors seeking fast gains and protection against inflation caused a rapid surge in the prices of various digital currencies, including Ethereum, leading to another structural breakpoint. Subsequently, the price of Ethereum experienced a rapid rise, culminating in a structural breakpoint on February 24th, since traditional financial institutions began accepting it as a routine payment vehicle and asset. However, on May 19th, Ethereum experienced a sudden drop, with a volatility break point attributed to institutional investors shifting their focus from cryptocurrencies to gold, amidst growing regulatory concerns. Notably, regulatory concerns from authorities in China and the United States contributed to the weakness observed. On 28th May and 23rd June, two new breakpoints emerged respectively, revealing the significant fluctuation in the price of Ethereum. On September 7th, Saldova established cryptocurrencies as legal tender, leading to heightened volatility and a significant break point in the cryptocurrency market. On September 20th of the same year, signs of stress in China’s credit markets led to bearish sentiment in global markets, triggering a massive sell-off in cryptocurrencies with substantial price fluctuations and a breakpoint. Tightening monetary policy in response to inflationary pressures has dissuaded investors from speculative assets, resulting in ETH's volatility once again occurring a breaking point on May 9th, 2022. On June 16th, due to the precarious and often perilously unstable junction between emerging technologies and traditional currency, the instability of ETH's price led to a corresponding breakpoint. The period we chose happened to coincide with the COVID-19 pandemic. From a macroeconomic perspective, we cannot overlook the sudden uncertainty brought about by this unprecedented event. This helps explain the occurrence of numerous structural breaks in the short term. Additionally, the rapid development of the cryptocurrency market and its own unique characteristics further contribute to the emergence of a significant number of structural break points.

4. Method

This section presents four HAR-type models designed to forecast the volatility of ETH, consisting of original HAR-type models (HAR-RV) as benchmark models, and three improved HAR-type models (i.e., HAR-RV-VBS, HAR-RV-SB, HAR-RV-SB-VBS) that incorporate structural breaks, the VIX index, the S&P500 index, and the price volatility of Bitcoin.

The first model introduced is the classical HAR-RV model, which employs a linear form to determine whether the RV provides sufficient information. Subsequently, the HAR-RV-SB model is developed by incorporating the effects of structural breaks. Additionally, the HAR-RV-VBS model is presented, which incorporates the VIX and S&P500 indices, representing investor sentiment and market conditions, respectively, along with the price volatility of Bitcoin, to enhance forecasting accuracy. Finally, an improved HAR-RV-SB-VBS model is established by considering all the above factors simultaneously.

The subsequent sections provide detailed explanations of each of these models.

4.1. HAR-RV

We use the simple AR-type model established by Corsi (Citation2008). In the data description section, we mentioned how to calculate daily realized volatility. Therefore, the weekly RV and the monthly RV of the trading day t, denoted as RVt,w and RVt,m, respectively, are defined as follow. RVtw=RVtd+RVt1d++RVt6d7; RVtm=RVtd+RVt1d++RVt28d29.

In addition, the linear form of the HAR model is expressed as RVt+1d=βdRVtd+βwRVtw+βmRVtm.

4.2. HAR-RV-VBS

In this section, we aim to investigate the impact of the S&P500 index, the VIX index, and the volatility of Bitcoin price on the volatility forecasting of ETH price through the implementation of the HAR-RV-VBS model.

Sovbetov (Citation2018) found that S&P500 index has a positive long-run effect on Ethereum, and Yousaf and Ali also found that S&P 500 impacted the volatility of the Returns of Ethereum. Therefore, we hypothesize that the price volatility of ETH is influenced by the S&P500 index, and thus we introduce the historical volatility of S&P500 into the model.

In addition, it is well-established that the prices of cryptocurrencies are interrelated, with Bitcoin and Ethereum exhibiting co-movements (Mensi et al., Citation2019). Furthermore, Beneki et al. (Citation2019) found that Bitcoin volatility has a delayed response on Ethereum returns, while Meynkhard (Citation2020) suggested that Bitcoin plays a significant role in guiding the cryptocurrency market. Therefore, we argue that incorporating Bitcoin price volatility into the model will enhance our ability to predict the price volatility of ETH.

Moreover, the VIX index is widely regarded as one of the most crucial market uncertainty indices (Adrian et al., Citation2019). Leirvik (Citation2022) found that the VIX index has a negative impact on the returns of Ethereum. Thus, we theorize that including the VIX index into the model could improve the volatility forecasting model for ETH.

In conclusion, building upon the HAR-RV model, we further introduce the S&P500 index, VIX index, and Bitcoin price volatility into our model. As a result, we present the HAR-RV-VBS model, which can be expressed as follows. RVt+1d=βdRVtd+βwRVtw+βmRVtm+βvVIXt+βbicRt2+βspRt2.

4.3. HAR-RV-SB

Structural breaks are a ubiquitous feature in the high-frequency volatility of financial markets. Ignoring structural breaks tends to overestimate the long memory of volatility and consequently results in unreliable forecasting outcomes. Therefore, when modeling volatility, it is imperative to account for the effects of structural breaks. To this end, we identify 11 structural break points through a test conducted in Section 3 and incorporate them into the HAR-RV model in the form of dummy variables.

Building upon the HAR-RV model, we establish a linear form of the HAR-RV-SB model by incorporating the effects of structural breaks, as follows: RVt+1d=βdRVtd+βwRVtw+βmRVtm+i=1nβiDi.

The variable n represents the number of structural break points, while Di (i = 1, 2, …, n) denotes the dummy variable for the ith structural break point in the price volatility of ETH, which is obtained through the ICSS algorithm. We assign the value of 1 to Di after the ith structural break point to reflect the notion that the corresponding event at that break point affects price volatility from that day forward. Otherwise, the Di value should be assigned as 0. Comparing the coefficients of the dummy variables allows for a comparison of the influence of each event. As outlined in Section 3, the volatility sequence contains 12 break points, thus n = 12.

4.4. HAR-RV-SB-VBS

In order to comprehensively examine the factors that influence the volatility forecasting of the Ethereum market, we develop a HAR-RV-SB-VBS model class that incorporates structural breaks, the VIX index, the volatility of the S&P500, and the volatility of Bitcoin prices into the corresponding original HAR-type models. By doing so, we aim to better capture the complexities and nuances of the Ethereum market and enhance our ability to forecast its volatility. As a result, we present the HAR-RV-SB-VBS model, which can be expressed as follows. RVt+1d=βdRVtd+βwRVtw+βmRVtm+βvVIXt+βbicRt2+βspRt2+i=1nβiDi.

5. Result analysis

5.1. In-sample analysis

In this section, we employ the ordinary least squares (OLS) method to estimate the parameters of four heterogeneous autoregressive models of realized volatility (HAR-RV) type, following Corsi (Citation2008). Furthermore, we compare the predictive performance of four HAR models, namely, HAR-RV, HAR-RV-SB, HAR-RV-VBS, and HAR-RV-SB-VBS, with the aim of testing the impact of structural breaks, the VIX index, and the volatility of both SP500 and Bitcoin prices. summarizes the parameter estimates for the four models when forecasting the volatility of ETH at three different horizons, i.e., daily, weekly, and monthly.

Table 3. Parameter estimation results of models.

The estimation results from the HAR-RV model indicate that only weekly volatility is statistically significant in predicting one-day volatility. However, for the 1-week forecast, the realized volatility (RV) at all three time periods is significantly positive. The adjusted R-squared value of 0.2549 suggests that the HAR-RV model is not adequately informative for 1-day volatility prediction. Moreover, the findings reveal that the monthly volatility of the ETH market contains a substantial amount of information for predicting long-term RV. This result implies the existence of long memory in the RV of ETH. In other words, the ETH market is heterogeneous.

The findings from the estimation results of the HAR-RV-VBS model suggest that the coefficients associated with the daily, weekly, and monthly lags are statistically significant associated with the 1-day and 1-week volatilities of ETH price. The parameter estimation results further reveal that the VIX index exerts a positive and significant influence on the short-term and mid-term forecasting results. Additionally, our analysis shows that the S&P500 volatility contains relevant information for short-term forecasting. Furthermore, the volatility of Bitcoin price appears to have a statistically significant effect on ETH volatility at all three forecasting horizons, indicating that it is important to take into account when predicting ETH volatility. Therefore, we conclude that the inclusion of the VIX index, Bitcoin price volatility, and S&P500 volatility in the forecasting model can improve the accuracy of the ETH volatility forecasts.

The estimation results of the HAR-RV-SB model indicate that most of the coefficients of daily, weekly, and monthly volatility are statistically significant, which is consistent with the results of the HAR-RV model. Additionally, we conduct an analysis of the coefficients associated with the structural breaks (Di). This analysis reveals that only half of the coefficients of Di are statistically significant, indicating that financial events have limited impacts on ETH volatility. Through a thorough examination of coefficients, we discover that the structural breaks which are statistically significant provide crucial information. It is noteworthy that the majority of the events that affect long-term volatility are also present in short-term forecasting. Therefore, we can infer that the detection of structural breaks may not be entirely precise, as Ethereum’s volatility experiences drastic fluctuations. Consequently, it is still essential to consider the impact of structural breaks on volatility.

The estimation results of the HAR-RV-SB-VBS model demonstrate that structural breaks, the VIX index, and the volatility of BTC and SP500 exhibit analogous effects as in our analyses using the HAR-RV-SB and HAR-RV-VBS models. This finding strengthens the argument that these factors should not be disregarded when predicting the volatility of ETH. The impact of some structural breaks, however, does not remain statistically significant according to the HAR-RV-SB-VBS model, contrary to the HAR-RV-SB model. We can speculate that the Bitcoin volatility, the VIX index and SP500 volatility may have the same information with the structural breaks. Additionally, illustrates the in-sample adjusted R-squared values of the four models for varying time horizons, further highlighting the necessity of considering these factors.

Figure 5. The in-sample adjusted R-squared values.

Figure 5. The in-sample adjusted R-squared values.

5.2. Out-of-sample analysis

This paper evaluates the forecasting power of four models for the volatility of ETH through the use of different loss functions, referencing Witt and Witt (Citation1992). It is important to note that when conducting out-of-sample forecasting, we do not consider structural breakpoints that were not present at the time of the forecast. Specifically, we employ the mean square error (MSE), root mean squared error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE) and quasi‐likelihood (QLIKE) loss function to evaluate the performance of the models. A smaller value for these loss functions indicates better forecasting performance. Furthermore, we use R-squared (R2) as a metric to reflect the goodness of fit of the model. The numerical value of R2 indicates the extent to which an independent variable in a regression model explains the variation of a dependent variable. A value closer to 1 implies better predictive power of the model. The definitions of R-squared, MSE, RMSE, MAE, MAPE and QLIKE are provided below, R2=1i=1n(ŷiyi)2i=1n(y¯yi)2, MSE=1ni=1n(ŷiyi)2; RMSE=1ni=1n(ŷiyi)2; MAE=1ni=1n|ŷiyi|; MAPE=1ni=1n|ŷiyiyi|; QLIKE=1ni=1n(yiŷilogyiŷi1); where ŷi is the predicted value and yi is the real value.

To ensure the accuracy of our findings, we conduct out-of-sample predictions and calculate loss functions for both 15-day and 30-day horizons. and summarize the results obtained. As shown in , the ability to forecast daily volatility remains inadequate, regardless of the model utilized. This outcome is in line with the findings of . Except for the HAR-RV-SB model, which exhibits a higher out-of-sample forecast R-squared value than the benchmark model HAR-RV, all the loss function values are larger than those of the benchmark model, indicating greater forecast bias. Nonetheless, our analysis surprisingly reveals that the HAR-RV-SB-VBS model has a lower loss function value for 1-week forecasting, both in-sample and out-of-sample testing. In other words, for 1-week forecasts, the HAR-RV-SB-VBS model provides the most accurate predictions of ETH prices.

Table 4. Loss function values of different models (15 days).

Table 5. Loss function values of different models (30 days).

provides a visual comparison of the predicted and actual values of the four models, which yields further insights into the prediction outcomes. The comparison reveals that the HAR-RV model has a high level of forecasting accuracy for both the 1-day and 1-month forecasts. Conversely, the information added to the model did not significantly improve the accuracy of daily volatility forecasts. Furthermore, the figure shows that forecasting mid-term volatility for Ethereum necessitates considering the effects of structural breakpoints, the VIX index, S&P 500 volatility, and Bitcoin price volatility.

Figure 6. Predict results of 4 models.

Figure 6. Predict results of 4 models.

Although the loss function approach enables us to determine which model performs better from different perspectives, it has a limitation in that it does not indicate whether the results can be considered trustworthy.

5.3. DM test

The DM test, introduced by Diebold and Mariano (Citation1995), is a statistical method widely used to compare the forecasting performance of two models (Sadorsky, Citation2006). This technique is based on the loss functions which can quantify the difference between the forecasted and actual values. The null hypothesis of the DM test is that the two models have the same predictive accuracy. The DM statistic is calculated as follows: DM=dt¯Var̂(dt)TN(0,1). where dt¯ denotes the mean of dt, and dt is the difference of the loss function. In this part, we use MSE, RMSE, MAE, MAPE and QLIKE as the loss function. Var̂(dt) represents the robust estimate of the asymptotic variance for dt, with heteroscedasticity and autocorrelation. If the DM test statistic exceeds the critical value, the predictive performance of the models is deemed significantly different. When DM-value is greater than 0, the new model is better than the previous one. Therefore, the DM test is a valuable tool for assessing the relative forecasting power of different models. Nevertheless, in this study, we have actually employed the modified test proposed by Harvey et al. (Citation1997), which enhances robustness.

To evaluate the performance of various Ethereum forecasting models, we established the HAR-RV model as a benchmark and conducted a DM-test to investigate whether the forecasting accuracy of the HAR-RV-VBS, HAR-RV-SB, and HAR-RV-SB-VBS models was significantly improved. The results of the DM-test are presented in .

Table 6. DM test results for out-of-sample forecasting.

reveals that the various HAR-type models exhibit distinct forecasting accuracy. Regardless of the loss function selected (MSE, MAE, MAPE, RMSE and QLIKE), all three models perform worse than the HAR-RV model in terms of 1-day and 1-month ETH volatility forecasting. This outcome aligns with in-sample forecasting. In addition, different loss functions do not get consistent conclusions. Based on the DM-test results of QLIKE, we can find that In 1-week forecasting, the HAR-RV-SB-VBS model outperforms the benchmark model.

In summary, we posit that considering all factors within the original HAR-type model (i.e., HAR-RV-SB-VBS) yields the most precise out-of-sample forecasting of all models we have established in 1-week forecasting. This finding indicates that the combination of the market information effectively enhances the accuracy of 1-week ETH volatility forecasting. Additionally, the HAR-RV model is the optimal model for ETH 1-month and 1-day forecasting. Incorporating other factors into the prediction may disturb the efficacy of the HAR-RV model.

5.4. Model Confidence Set (MCS) Procedure

To evaluate the performance of various Ethereum forecasting models, we employ the Model Confidence Set (MCS) Procedure. The MCS test, introduced by Hansen et al. (Citation2011), is based on the loss function. It offers an alternative approach to assessing predictive performance compared to simple comparisons of loss functions. It addresses the issue of ‘outliers leading to a significant increase in the loss function’, resulting in more robust and reliable results. A large amount of research, such as Huang et al. (Citation2021) and Liu and Lee (Citation2021), have demonstrated the robustness of this method.

In simpler terms, the MCS test is a significance test where a set of volatility forecasting models, denoted as M0. The test aims to remove models with poor predictive ability from the set M0. In each test, the null hypothesis assumes that two volatility forecasting models within M0 have the same predictive ability for future volatility, which can be expressed as follows: H0,M:E(di,uv)=0.

The variable di,uv represents the difference in loss functions (MSE) between two volatility forecasting models, u and v. The models are continuously tested using the equivalence test δM and the elimination rule eM. The testing process ends when no models are eliminated from the set. The MCS test involves multiple complex statistical measures, including the maximum deviation to the average loss (maxT), the traditional quadratic-form test (chi), the chi test with sample size correction (F), the deviation from the common average (ComAve), the range-based statistic (Range), and the semi-quadratic statistic (SemiQ). Due to the small differences in the results of these statistical measures, this study only employs the commonly used range-based statistic as the test statistic, which can be defined as: Range=maxu,vM|di,uv¯|var(di,uv)

If the range-based statistic (Range) exceeds a given critical value, the null hypothesis is rejected. The true distribution of the Range statistic is complex as it depends on the ‘aversion parameter’. In empirical applications, the value of the range-based statistic (Range) and the corresponding test can be obtained through bootstrapping. Typically, when the p-value is greater than the commonly set threshold of 0.1, it indicates that the criterion model exhibits good predictive ability. A higher p-value suggests a greater likelihood of strong predictive ability.

presents the results of the MCS test, conducting consistent findings with the DM test. The data provides evidence that in mid-term forecasting, the volatility of Ethereum is significantly influenced by both traditional financial market information (S&P500 and VIX), Bitcoin and structural breaks.

Table 7. MCS test results for out-of-sample forecasting.

6. Robustness test

This section aims to assess the robustness of both the in-sample and the out-of-sample forecasting by altering the sample range. While the forecasting results are consistent with those presented in Sections 5 and 6, this section will solely focus on the robustness of the results obtained.

6.1. In-sample

To determine the reliability of forecasting outcomes across different samples, a division of the total 722 samples into two sub-samples was conducted. Subsample 1 contains samples 1 to 360, while Subsample 2 comprises samples 360 to 722. Following this, an in-sample regression analysis was performed on both subsamples, and the adjusted R-squares were calculated as an indicator of the model’s predictive accuracy. The results of this analysis are presented in .

Table 8. Adjusted R-squares of sample 1 and sample 2.

displays the results of the estimation of 1-day volatility by using four models on two subsamples, namely subsample 1 and subsample 2. The adjusted R-squares in subsample 1 and subsample 2 exhibit considerable disparity and lack robustness, implying that these models are sensitive to the selection of data in estimating 1-day volatility. Conversely, the adjusted R-squares in the medium-term and long-term volatility estimation demonstrate consistent and stable results for both subsamples, indicating that each model performs reliably in these horizons. Consequently, we infer that the four models are robust for the medium-term and long-term volatility estimation of Ethereum.

6.2. Out of sample

To assess the accuracy of out-of-sample forecasts within the Ethereum market, the rolling window was altered from 1046 days to 846 days, covering the period from December 12, 2019 to April 8, 2022. Consequently, a predicted sample of observations spanning the time period from April 8, 2022 to March 30, 2018 was obtained. The outcomes of this analysis are presented in , which showcases the results of four loss functions.

Analysis of reveals that, on the whole, the loss function values exhibit a marginal decrease following the change in window size; however, the overall trend remains consistent with the previous outcomes.

Table 9. Loss function values for out-of-sample forecasting.

7. Conclusion

This study aims to provide a novel perspective on Ethereum (ETH) price forecasting, building upon existing literature (Ftiti et al., Citation2021) by utilizing a HAR-type model that considers the impact of various factors on ETH prices. The improved HAR-type model in this study incorporates structural breaks to enhance the robustness and validity of predictions. Furthermore, the study expands upon the HAR-RV model by investigating the impact of the VIX and S&P500 indices, which represent investor sentiment and market direction, respectively, contributing to the existing literature. Besides, the volatility of Bitcoin which is closely connected with ETH volatility is also considered. We establish three novel models, namely HAR-RV-SB, HAR-RV-VBS, and HAR-RV-SB-VBS, which are utilized in both in-sample and out-of-sample analysis of the RV of Ethereum prices. Based on our analysis, we propose several conclusions.

Our results demonstrate that the HAR-RV model achieves a perfect fit in 1-week and 1-month forecasting due to the characteristics of Ethereum itself. The model that considers all factors has only a small improvement in fitness and prediction accuracy. However, in out-of-sample forecasting, considering the effects of structural breakpoints, VIX, S&P 500 volatility, and Bitcoin price volatility improves the accuracy of 1-week forecasts, albeit marginally.

Despite these findings, three main limitations of this study should be acknowledged. Firstly, we did not consider simple models such as HAR-RV-VIX and HAR-RV-SP, which do not sufficiently analyze the impact of individual factors but rather include all factors, resulting in increased complexity. Secondly, we need to consider the duration of the impact of different structural breakpoints. Some breakpoints occur merely due to unforeseen financial events and do not have a lasting influence. Finally, we did not decompose the volatility component further to capture the volatility of Ether more accurately, which could lead to reduced accuracy in predictions. Future research may consider incorporating the leverage effect into a model for predicting ETH price volatility and try to use some Machine Learning models like short- and long-term memory networks.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

  • Adrian, T., Crump, R. K., & Vogt, E. (2019). Nonlinearity and flight‐to‐safety in the risk‐return trade‐off for stocks and bonds. The Journal of Finance, 74(4), 1931–1973. https://doi.org/10.1111/jofi.12776
  • Akyildirim, E., Corbet, S., Lucey, B., Sensoy, A., & Yarovaya, L. (2020). The relationship between implied volatility and cryptocurrency returns. Finance Research Letters, 33, 101212. https://doi.org/10.1016/j.frl.2019.06.010
  • Andersen, T. G., & Bollerslev, T. (1997). Heterogeneous information arrivals and return volatility dynamics: Uncovering the long‐run in high frequency returns. The Journal of Finance, 52(3), 975–1005. https://doi.org/10.2307/2329513
  • Assaf, A., Bilgin, M. H., & Demir, E. (2022). Using transfer entropy to measure information flows between cryptocurrencies. Physica A: Statistical Mechanics and Its Applications, 586, 126484. https://doi.org/10.1016/j.physa.2021.126484
  • Beneki, C., Koulis, A., Kyriazis, N. A., & Papadamou, S. (2019). Investigating volatility transmission and hedging properties between Bitcoin and Ethereum. Research in International Business and Finance, 48, 219–227. https://doi.org/10.1016/j.ribaf.2019.01.001
  • Bhosale, J., & Mavale, S. (2018). Volatility of select crypto-currencies: A comparison of Bitcoin, Ethereum and Litecoin. Annual Research Journal of SCMS, Pune, 6(1), 132–141.
  • Choi, K., Yu, W. C., & Zivot, E. (2010). Long memory versus structural breaks in modeling and forecasting realized volatility. Journal of International Money and Finance, 29(5), 857–875. https://doi.org/10.1016/j.jimonfin.2009.12.001
  • Corsi, F. (2008). A simple approximate long-memory model of realized volatility. Journal of Financial Econometrics, 7(2), 174–196. https://doi.org/10.1093/jjfinec/nbp001
  • Diebold, F. X., & Mariano, R. S. (1995). Comparing predictive accuracy. Journal of Business & Economic Statistics, 13(3), 253–263.
  • Ftiti, Z., Louhichi, W., & Ben Ameur, H. (2021). Cryptocurrency volatility forecasting: What can we learn from the first wave of the COVID-19 outbreak? Annals of Operations Research, 1–26. https://doi.org/10.1007/s10479-021-04116-x
  • Ghorbel, A., Frikha, W., & Manzli, Y. S. (2022). Testing for asymmetric non-linear short-and long-run relationships between crypto-currencies and stock markets. Eurasian Economic Review, 12(3), 387–425. https://doi.org/10.1007/s40822-022-00206-8
  • Gong, X., & Lin, B. (2021). Effects of structural changes on the prediction of downside volatility in futures markets. Journal of Futures Markets, 41(7), 1124–1153. https://doi.org/10.1002/fut.22207
  • Hansen, P. R., Lunde, A., & Nason, J. M. (2011). The model confidence set. Econometrica, 79(2), 453–497.
  • Harvey, D. I., Leybourne, S. J., & Newbold, P. (1997). Testing the equality of prediction mean squared errors. International Journal of Forecasting, 13(2), 281–291. https://doi.org/10.1016/S0169-2070(96)00719-4
  • Huang, Y., Dai, X., Wang, Q., & Zhou, D. (2021). A hybrid model for carbon price forecasting using GARCH and long short-term memory network. Applied Energy, 285, 116485. https://doi.org/10.1016/j.apenergy.2021.116485
  • Inclan, C., & Tiao, G. C. (1994). Use of cumulative sums of squares for retrospective detection of changes of variance. Journal of the American Statistical Association, 89(427), 913–923. https://doi.org/10.2307/2290916
  • Leirvik, T. (2022). Cryptocurrency returns and the volatility of liquidity. Finance Research Letters, 44, 102031. https://doi.org/10.1016/j.frl.2021.102031
  • Liu, C., & Maheu, J. M. (2008). Are there structural breaks in realized volatility? Journal of Financial Econometrics, 6(3), 326–360. https://doi.org/10.1093/jjfinec/nbn006
  • Liu, M., & Lee, C. C. (2021). Capturing the dynamics of the China crude oil futures: Markov switching, co-movement, and volatility forecasting. Energy Economics, 103, 105622. https://doi.org/10.1016/j.eneco.2021.105622
  • Mensi, W., Al-Yahyaee, K. H., & Kang, S. H. (2019). Structural breaks and double long memory of cryptocurrency prices: A comparative analysis from Bitcoin and Ethereum. Finance Research Letters, 29, 222–230. https://doi.org/10.1016/j.frl.2018.07.011
  • Mensi, W., Rehman, M. U., Al-Yahyaee, K. H., Al-Jarrah, I. M. W., & Kang, S. H. (2019). Time frequency analysis of the commonalities between Bitcoin and major cryptocurrencies: Portfolio risk management implications. The North American Journal of Economics and Finance, 48, 283–294. https://doi.org/10.1016/j.najef.2019.02.013
  • Meynkhard, A. (2020). Effect of bitcoin volatility on altcoins pricing. In Software Engineering Perspectives in Intelligent Systems: Proceedings of 4th Computational Methods in Systems and Software 2020 (Vol. 14, pp. 652–664). Springer International Publishing.
  • Omura, A., Cheung, A., & Su, J. J. (2023). Does natural gas volatility affect Bitcoin volatility? Evidence from the HAR-RV model. Applied Economics, 56(4), 414–425. https://doi.org/10.1080/00036846.2023.2168608
  • Sadorsky, P. (2006). Modelling and forecasting petroleum futures volatility. Energy Economics, 28(4), 467–488. https://doi.org/10.1016/j.eneco.2006.04.005
  • Shen, D., Urquhart, A., & Wang, P. (2020). Forecasting the volatility of Bitcoin: The importance of jumps and structural breaks. European Financial Management, 26(5), 1294–1323. https://doi.org/10.1111/eufm.12254
  • Sovbetov, Y. (2018). Factors influencing cryptocurrency prices: Evidence from bitcoin, ethereum, dash, litcoin, and monero. Journal of Economics and Financial Analysis, 2(2), 1–27.
  • Witt, S. F., & Witt, C. A. (1992). Modeling and forecasting demand in tourism. Academic Press Ltd.
  • Yousaf, I., & Ali, S. (2020). Discovering interlinkages between major cryptocurrencies using high-frequency data: new evidence from COVID-19 pandemic. Financial Innovation, 6(1), 45. https://doi.org/10.1186/s40854-020-00213-1