key: cord-0828972-wry0uopq authors: Ceylan, Zeynep title: The impact of COVID‐19 on the electricity demand: a case study for Turkey date: 2021-03-25 journal: Int J Energy Res DOI: 10.1002/er.6631 sha: 75c3fa36e4bbff5b10f1f0a9258a691142f7d0c7 doc_id: 828972 cord_uid: wry0uopq Due to the extraordinary impact of the Coronavirus Disease 2019 (COVID‐19) and the resulting lockdown measures, the demand for energy in business and industry has dropped significantly. This change in demand makes it difficult to manage energy generation, especially electricity production and delivery. Thus, reliable models are needed to continue safe, secure, and reliable power. An accurate forecast of electricity demand is essential for making a reliable decision in strategic planning and investments in the future. This study presents the extensive effects of COVID‐19 on the electricity sector and aims to predict electricity demand accurately during the lockdown period in Turkey. For this purpose, well‐known machine learning algorithms such as Gaussian process regression (GPR), sequential minimal optimization regression (SMOReg), correlated Nyström views (XNV), linear regression (LR), reduced error pruning tree (REPTree), and M5P model tree (M5P) were used. The SMOReg algorithm performed best with the lowest mean absolute percentage error (3.6851%), mean absolute error (21.9590), root mean square error (29.7358), and root relative squared error (36.5556%) values in the test dataset. This study can help policy‐makers develop appropriate policies to control the harms of not only the current pandemic crisis but also an unforeseeable crisis. Coronavirus Disease 2019 (COVID-19) outbreak has dealt a significant blow to the ordinary course of life and caused a massive economic slowdown. 1, 2 Global economic recession and widespread negative growth figures are observed all over the world. According to the experts, the cost of COVID-19 to the world economy is estimated to be about $ 1 trillion in 2020. 3 To understand the potential negative economic impact of COVID-19, Carlsson-Szlezak et al. (2020) examined three potential economic transmission channels. 4 The first is the indirect hit to the wealth effect. Significant financial difficulties have arisen in people's lives due to the damage of the COVID-19 to the economy. With the decline in household wealth, saving rates increased, and consumption expenditures fell. The second is the direct effect associated with reduced consumption habits of goods and services. The third is the supply-side effect, unlike the channel demand, shocks mentioned above. The virus has negatively affected supply chains, labor demand, and employment by disrupting production, causing longer layoffs and increased unemployment. 5 The slowdown in production progress has led to business closures and operations to cease, as well as deeper financial problems, driving many firms to bankruptcy. 6, 7 The way organizations and people use energy has also changed significantly due to the socio-economic effects of the epidemic. 8 The reduced social and economic activity caused by the COVID-19 pandemic has affected all aspects of life, including the electricity sector. With the transition of educational institutions to distance learning mode, many public events and non-critical services started to be performed remotely. Many commercial and professional services started to be carried out from homes all over the world. As expected, this trend resulted in lower electricity consumption in the commercial and public sectors and an increase in residential electricity use. In addition to commercial and public sectors, the transportation sector has come under difficult conditions. For example, the cessation of the aviation industry and the disruption of related services at all airports resulted in a drastic drop in electricity demand. The reduction in traffic has also affected electricity demand, as many types of public transport run on electricity. COVID-19 outbreak has also had a significant impact on energy production systems and the use of energy resources worldwide. Conventional power generation systems are generally affected only in terms of generation capacity. However, another essential type of energy source that has been significantly affected by the epidemic is the renewable energy generation sector, which has become widespread in recent years. Solar or wind energy systems, which require equipment, materials, and labor, are among the most affected renewable energy sectors. The economic effects of the epidemic slow down the expansion of existing basic clean energy technologies while limiting or preventing new investments. With the epidemic, except for a few types (eg, electric vehicles, lighting, and so on), renewable energy investments have been stopped. Also, serious decreases in fuel prices due to transportation restrictions and the closure of countries reduce the demand for biomass or biomass-derived biofuels such as pellets, 9 biogas, 10, 11 biochar, 12 vegetable oil-biodiesel, 13 which are other clean, sustainable and renewable resources. The impact of the economic contraction in the sector necessitates the support of the governments for investments in renewable energy resources. For this reason, it is a critical issue to examine energy production during the pandemic process and to reshape energy production policies after the pandemic. In the literature, many studies are conducted that examine the impact of the pandemic in various energy sectors. However, a few of these studies focus on the effects on the electrical power industry. Table 1 shows some studies on the impact of the COVID-19 epidemic and associated lockdowns on electricity consumption patterns in different countries. For example, Carvalho et al. (2020) applied the joinpoint analysis to analyze the effect of mobility T A B L E 1 The change in electricity consumption of different countries due to COVID-19 Province/Country Impact of COVID-19 on electricity consumption In the first quarter of 2020, the electricity consumption decreased by 0.9% compared to 2019, and residential, industrial, and commercial sectors decreased by 0.3%, 0.4%, and 2.2%. 15 Ontario, Canada The electricity demand declined by 14%, totaling 1267 GW for the month of April-2020. 16 European countries From 6 to April 13, 2020, Spain (25%) experienced the highest reduction in electricity demand, followed by Italy (17.7%), Belgium (15.6%), the United Kingdom (14.2), and Netherlands (11.6%). The overall electricity demand has declined by less than 10%. The average share of net electricity production using Renewable Energy Resources increased above 55% in the first half of 2020, compared to 47% for the same period in 2019. The demand for electricity decreases by 0.65% when the population of infected people increases by 1%. restrictions caused by the COVID-19 pandemic on the Brazilian electricity demand patterns. 14 Rayash and Dincer (2020) conducted a similar analysis for the province of Ontario, Canada. The results of the study showed a 14% decrease in monthly electricity demand, with the highest daily 25% decrease. 15 Bahmanyar et al. (2020) compared the impact of different COVID-19 restrictions on electricity use in different European countries. 16 Gillingham et al. (2020) used the polynomial regression and the two-step augmented regression model to provide insight into the effects of the pandemic on energy consumption in the United States. 17 Kanitkar (2020) examined how lockdown measures affect electricity consumption in India. They used the linear input-output analysis to forecast the economic losses from the virus. 18 Snow et al. (2020) focused on household electricity use in Australia during the COVID-19 pandemic. The study results revealed a significant increase in household electricity consumption during the COVID-19 lockdown period, such as cooking and the use of digital devices. 19 Similar to the study by Kanitkar (2020), Santiago et al. (2021) conducted a detailed analysis to determine how the containment measures affect the daily electricity consumption in Spain. In the study, the changes in the CO 2 emission and electricity prices caused by the electricity demand change were also evaluated. 20 Unlike the studies in the literature, Edomah and Ndulue (2020) separately examined the impact of COVID-19 electricity consumption on residential, commercial, and industrial sectors in Lagos, Nigeria. 21 In another study, Ghiani et al. (2020) discussed the effects of the COVID-19 on the entire power system in Italy. They also conducted a statistical analysis to compare the different day-ahead electricity market price distributions pre-and post-the lockdown measures. The study results showed that the decrease in electricity consumption had a rapid effect on the dayahead electricity market. 22 Halbrügge et al. (2021) applied data visualization and descriptive statistics to reveal how German electricity systems were affected during the COVID-19 pandemic. The analysis results of the study revealed that COVID-19 caused not only noticeable changes in electricity consumption in Germany but also in pricing, import, and export. 23 developed regressive and neural network models to determine the change in electricity and petroleum consumption in China, the first country to suffer from COVID-19. The study results indicated that the severity of the epidemic significantly affects the demand for electricity and petroleum, both directly and indirectly. 24 The studies mentioned above generally discuss the impact of the COVID-19 epidemic on electricity consumption. However, studies to predict sudden and dramatic changes in electricity demand during the pandemic are quite limited. that a significant decrease in electricity consumption is strongly associated with the number of COVID-19 cases. 25 Lu et al. (2021) developed a hybrid multiobjective optimizer-based model to predict the electricity consumption of the US during the epidemic. According to the study results, the highest prediction accuracy was obtained by including the daily number of infections in the developed model. 26 Chen et al. (2020) developed a prediction model that considers mobility data as a measure of economic activities to estimate the electrical load of regions in the United States and Europe. 27 Alhajeri et al. (2020) used regression analysis and the genetic algorithm to predict the electricity peak load and power generation of Kuwait during the pandemic. They also made a quantitative evaluation of the economic and environmental impacts of partial and full lockdowns. 28 29 The previous studies emphasize the importance and necessity of energy forecasting to help electricity companies manage their resources effectively and ensure a safe electricity supply to the end-user during the pandemic. Although previous studies have examined various effects of epidemic-like emergencies, research on the impact of the epidemic on the electricity sector is still needed due to the different severity of the epidemic in different countries. Presenting a case study on Turkey, this paper highlights the importance and vulnerability of the energy sector due to the COVID-19 pandemic with a specific focus on electricity demand. This study aims to develop a robust and useful method that can accurately predict the electricity consumption gap during the COVID-19 lockdown period when there is an obvious and significant change in global trends. Considering the capability of the existing models, this uncontrollable and unpredictable contingency increases complexity and disrupts the predictive performance of classical models. Therefore, machine learning (ML) methods are a better option for capturing the change in electricity consumption with their nonlinear modeling capabilities. As far as the author aware, this is the first comprehensive study to use and compare different ML approaches to predict electricity consumption that deviates from its usual trend due to the COVID-19. Nowadays, as the effects of the virus continue, such studies can guide policy-makers to manage the effects of the economic recession during and after the pandemic. Thus, they can better anticipate the supply and demand reactions of the markets and ensure that these processes are overcome with the least damage. The rest of the paper is organized as follows: Section 2 describes the case study, the data, and the methodology. Section 3 contains results and discussions, and Section 4 presents the conclusions and suggestions for future studies. The methodology section of the study, which consists of five subsections, is shown in detail in Figure 1 . Section 2.1 presents the case study for Turkey. In this subsection, the daily electricity consumption data of Turkey (starting from March 16, 2020 to June 1, 2020) is examined to better understand the changing dynamics of electricity consumption habits due to COVID-19. The daily electricity consumption data of Turkey was obtained from the Turkish Electricity Transmission Corporation (TEIAS) website. 25 Section 2.2 presents some essential aspects of the data set used in the study. Details about the data preprocessing stage and the predictive variables used are given in this subsection. Section 2.3 presents the description and basic theoretical aspects of different ML techniques used within the scope of the study. Section 2.4 F I G U R E 1 Flowchart of research methodology provides the performance evaluation criteria and statistical significance tests used in the study. Finally, Section 2.5 details the method used to rank the performance of the models. The COVID-19 disease was declared as a pandemic by the World Health Organization (WHO) on 11 March 2020. 30 On the same day, the Turkish Ministry of Health reported the first case in Turkey. 31 Immediately after the detection of the first COVID-19 case in Turkey, it was decided to be shut down the schools to curb the pandemic. Then, the government of Turkey declared a state of emergency, bars, nightclubs, theaters, cinemas, gyms, and cafes were closed. These measures were in full force during part of March 2020 and the entire month of April and May. Travel restrictions, curfews, unpaid leave practices, and working hours regulations have led to changes in sectoral demand, thus affecting both production and production demand. Many industrial centers have reduced production levels due to the lack of demand and interruptions in their supply chain. As a result, the public was subject to lockdown, which resulted in a sharp drop in economic output. The reduced social and economic activity caused by the COVID-19 pandemic had a dramatic impact on the country's electricity demand. As seen in Figure 2 , the electricity demand of Turkey fell sharply in May compared to March and April. Total electricity consumption in April and May 2020 is 10.2% and 10.3% lower than the lowest April and May values recorded between 2016 and 2019, respectively. Compared to the same month of 2019, demand decreased by 15.4% and 16.7% in April and May. In Figure 3 , the weeks during lockdown measures in Turkey are compared with the corresponding weeks in 2019. Week 1 is 16-22 March 2020, which corresponds to 18-24 March 2019, and so forth (see Appendix). COVID-19 outbreak in Turkey has exceeded 100 cases as of 18 March 2020. In the same week, it was entered into a new era by the end of the working days in Turkey. Therefore, electricity consumption started to decrease as of 23rd March, and the demand decreased by 5.7% compared with the same week of the previous year. In the second week of April, when the full-curfew was imposed, electricity consumption decreased by 16.9%. Electricity demand fell to its lowest level in the last week of May, and electricity demand was 26.3% lower than the reference week of 2019. As seen in Figure 4 , the electricity demand fell to Sunday levels rather than a typical high-demand weekday pattern. A 48-hour curfew was imposed on the weekends in major cities where about 65 million people live. The electricity consumption hit the lowest level on Sunday at around 457 272 MW-hours, marking a nearly 34.7% decrease compared to the corresponding day last year. Figure 5 shows daily electricity consumption from the beginning of March until the start of new normalization on 1 June 2020, and the corresponding days in 2019. This trend clearly shows that national electricity demand declined since mid-March when the first COVID-19 case was confirmed. Week In this study, Waikato Environment for Knowledge Analysis-version 3.9.4 (WEKA 3.9.4) software was used for time series forecasting. WEKA is developed by the University of Waikato, New Zealand, equipped with various ML algorithms written in Java programming language. 32 WEKA includes a time series forecasting package that enables the selection of the basic learning model and parameters, generation of lagged variables, creation of variables derived from a date-time stamp, the addition of "overlay" data, evaluation options, and output control. Lagged variables are effective tools that capture the relationship between past and current values of a series. 33 Since the data used in this study is daily, the minimum and maximum lag lengths were set as 1 and 7, respectively. A value of "1" means a lagged variable that holds target values at time − 1, while a value of 7 is a lagged variable that holds target values at time − 7. All-time periods between minimum and maximum are turned into lagged variables. A fine-tuning lag selection was also performed to achieve high predictive performance by entering a random interval within the minimum and maximum variables. Additional data that may be related to the prediction can improve the performance of learning algorithms. WEKA software uses the intervention attributes (also called overlay data) as inputs to the model. Thus, within the scope of this study, time series forecasting has been performed with the overlay data. Daily electricity consumption data in 2019 has been taken into consideration in the time series forecasting. The average temperature data for Turkey has also been taken into consideration from the Turkish State Meteorological Service Official website. 34 Moreover, consumer behavior differs significantly from the regular pattern on certain days, such as national and religious holidays and special days. Thus, national holidays and curfews during the lockdown period were included in the analysis (1-yes; 0-no). WEKA provides many parameters and options to derive time-dependent attributes in time series analysis. Thus, in this study, new attributes such as "day of the week" and "weekends" that will affect the time series forecasting performance were derived. Consumption data were collected daily in units of GWh. To check the predictive ability of the learning algorithms, the electricity consumption data was divided into two sets, namely training and testing sets. In this regard, all learning algorithms have been trained on the first 70% (54 samples) of the instances and tested on the last 30% (23 samples) of the instances. Thus, daily electricity consumption data between 16 March 2020 and 8 May 2020 were used as the training set. On the other hand, test data includes the data from 9 May 2020 to 31 May 2020. In this study, various function-and tree-based ML algorithms in the WEKA software are used as a base learner for time series forecasting. Function-based algorithms make predictions based on mathematical models. On the other hand, tree-based learning models are used for making predictions via a tree structure. Gaussian Process Regression (GPR), Sequential Minimal Optimization Regression (SMOReg), Correlated Nyström Views (XNV), and Linear Regression (LR) from function-based models and M5P model tree (M5P) and Reduced Error Pruning Tree (REPTree) from tree-based models were used. The mentioned ML algorithms are briefly explained in the subsections. Table 3 shows the default values of the hyperparameters used in regression models in this study. Gaussian process regression (GPR) is a powerful nonparametric kernel-based probabilistic model. It was introduced by Rasmussen and Williams. 35 GPR is well adopted and widely used in various applications in engineering problems. 36 GPR implements Gaussian processes (GP) for regression without hyperparameter-tuning. GP is a set of random variables such that any finite number of them have a joint Gaussian distribution. The GP is specified by a mean μ(x) and a covariance (kernel) function k (x, x 0 ). Given a function space f(x) = φ(x) T ω, and defining [f(x (1) ), f(x (2) ), …, f(x (n) )] as the variable set, which follows a Gaussian distribution, the GPR model can be given as: where E[.] denotes expectation. The function values f(x) are not achievable in most applications. In practice, only the noisy observations are available, which can be expressed by 35 ; where ε is the additive white noise. It is assumed to be independent and identically distributed normal random noises with mean 0 (zero) and variance σ 2 , εÑ 0, σ 2 n À Á . Any finite number of the observed values can form an individual GP as stated by 35 : where δ ij is the Kronecker delta, which is set as 1 when i = j or 0 otherwise. The objective of the GPR model is to infer the function value f à and its variance cov(f * ) based on the new test point x * . In this sense, X * denotes the input matrix of the test dataset and N * is the size of test dataset. According to the GPR assumption, the observed values and the function values at new test points follow the joint Gaussian process distribution: where I denotes the n × n unit matrix, K(X, X) is the covariance matrix of the training dataset, K(X * , X * ) is the covariance matrix of test dataset, and K(X, X * ) is the covariance matrix obtained from the training and test The minimum proportion of the variance 0.001 The amount of data used for pruning 3 dataset, such that K(X * , X) = K(X, X * ) T . Thus, we can obtain the predictive distribution: The main advantage of this method is that it has good generalization ability and is suitable for dealing with high dimensional spaces and small samples, and complex nonlinear problems. 37 Support vector machine (SVM) is a supervised ML algorithm developed by Cortes and Vapnik in 1995. 38 It has been widely used in recent years as a successful machine learning approach for solving real-life engineering problems. 39, 40 It is called support vector classification (SVC) and support vector regression (SVR) for classification and regression analysis, respectively. 41 The aim of this algorithm is to find the optimal separating hyperplane that provides the maximum margin between the hyperplane and the support vectors. The schematic diagram of the SVR algorithm is given in Figure 6 . 42 The kernel function is used to construct a mapping from input vectors to a high dimensional feature space. These functions can be of various types, such as linear, polynomial, sigmoid, and radial basis function. Suppose there are N training elements {(x i , y i )}, for i = 1, …, N, y i ∊ R, x i ∊ R D , the SVM function can be represented, as shown in Equation (10) where w is weight, b is the deviation vector. The SVR algorithm formula can be expressed as a constrained optimization problem as following equation. 43 subject to: where ε is the insensitive loss function, ξ i , ξ 0 i are the nonnegative slack variables, C is the penalty factor (also known complexity parameter), and y i is i th target value. The sequential minimum optimization (SMO) technique was introduced by Platt. 44 SMO significantly improved the computation efficiency and accuracy of the SVM model. 45 Smola and Scholkopf 43 improved the original SMO algorithm to solve regression problems. Then, Shevade et al. 46 made further modifications to the SMO regression (SMOreg) model to achieve better performance. The improved SMOreg algorithm enhances the computation speed of SVM regression, even with small sample size data sets. Besides, it can reduce the risk of overfitting problems, which helps find the generalized pattern of the real-life problems. Correlated Nyström views (XNV) was first proposed by McWilliams et al. 47 Function XNV is a semi-supervised kernel-based modeling algorithm. It is assumed that the input set x i contains n feature vectors and that only a small fraction of them is labeled x i , y i f g n i = 1 . As shown in the pseudo-code algorithm below, the working procedure of this algorithm follows two main steps 48 : Step 1: Generating random views. Two view data sets are generated using random features that are not computationally expensive. In this stage, the Nyström method is used for constructing random features. Step 2: Multi-view regression. Canonical correlation analysis (CCA) is applied to the two-view data set to bias the optimization procedure toward useful features. Finally, linear regression penalized with a canonical norm is applied. According to McWilliams et al., 47 the function XNV outperforms a state-of-the-art algorithm for semisupervised learning. It is a fast algorithm that can be used for regression and classification tasks. 49 The main advantage is that it significantly improves predictive performance and reduces performance variability across various real-world datasets in a shorter processing time. Linear regression (LR) is a well-known and simple regression algorithm. LR presents a mathematical model of the relationship between a dependent variable and one or more independent variables. LR model can be expressed as in Equation (15) 50 : where ω is the weight, a is the attribute, and y is predicted the class. The algorithm iteratively attributes weights to the model and calculates the value of the class. The algorithm then tries to minimize the sum of the squares of the differences between the actual class value and the predicted class value using Equation (16) . 2.3.5 | M5P model tree M5P model tree (M5P) is one of the decision tree deduction tools in machine learning. 51 The original M5 algorithm was first introduced by Quinlan 52 and then rearranged into M5P (M5 0 ) by Wang and Witten. 53 It is a widely used decision tree algorithm by many researchers for prediction purposes. [54] [55] [56] There are three main steps in the M5P algorithm: (1) constructing the tree, (2) pruning the tree, and (3) smoothing the tree. 57 The first step involves identifying different data classes and building a tree. M5P model trees split the input progressively. In the M5P algorithm, a split process starts based on the SD of the target value, which measures error at the node. The SD reduction (SDR) is expressed as follows 58 : where T denotes the case that reaches the node, jTj is the number of cases that reach the node, n is the number of branches, T i is the case belonging to the i th branch, jT i j is the number of cases related to the i th branch, σ is the SD. The splitting process is performed recursively by selecting the assignment with the maximum expected error reduction. The exact process is applied to the subsets. 54 The splitting process in M5P ends when a slight change is observed in the output value or some examples are preserved. After the tree is constructed, a linear regression technique is then created for each subset of samples at the leaves. 45 In the second step, the pruning process begins after the tree is constructed to overcome the overfitting problem by eliminating the undesired subtree. The third step involves the smoothing process to compensate for the sharp discontinuities between adjacent linear models at the pruned tree leaves. Reduced error pruning tree (REPTree) is a fast decision tree learner with low computational cost. 59 It is a popular method for classification/regression problems because of the simple configuration. 60 It builds a classification tree using the information gain or makes a regression tree from the variance. 61 At first, the model uses regression tree logic to create multiple trees in various iterations. The best tree is chosen from multiple trees. Then, the model uses the reduced error pruning technique with backfitting as a pruning procedure in order to improve generalization. Finally, the algorithm sorts the values of the numerical attributes using the embedded method and then handles the missing values using a C4.5 algorithm. 62 In this study, four performance metrics, namely mean absolute percent error (MAPE%), mean absolute error (MAE), root mean square error (RMSE), and root-relative square error (RRSE%), were used to evaluate and compare models. MAPE is the percentage ratio of the mean absolute prediction errors to the absolute actual data. MAE is applied to reveal how close estimations are to the final results. RMSE gives information about the shortterm performance of the prediction models. RRSE is used to evaluate the error rate. The model with the lowest MAPE%, MAE, RMSE, and RRSE% values is considered the best candidate model. The detailed equations are defined in Table 4 . In addition to the performance criteria mentioned above, two non-parametric statistical tests, the Wilcoxon signed-rank test 63 and the Friedman ranking test 64 were used to see any differences among the forecasting models. The following hypotheses were formulated: H0. There is no significant difference in the results of the ML models used in this study. H1. There is a significant difference in the results of the ML models used in this study. The Wilcoxon signed-rank test is based on the "one by one" rule, while the Friedman test (also referred to as analysis of variance by ranks) is used for multiple comparisons. 40, 65 Both statistical techniques were performed on the testing dataset. Besides, the two-tailed test was conducted with a significance level of α = 0.05. Statistical analysis was carried out with SPSS Version 20.0 statistic software package. Evaluating the algorithms according to a single criterion sometimes misleads the decision to choose the best algorithm from the available candidate algorithms. Therefore, the regression models used in this study were ranked using the Technique for Order Preference by Similarity to Ideal Solutions (TOPSIS), one of the popular Multi-Criteria Decision Making (MCDM) approaches. In other words, the problem was handled as an MCDM problem that includes six models and four criteria. TOPSIS was introduced by Hwang and Yoon in 1981. 66 The underlying principle of the TOPSIS method is to determine the best alternative with the minimum distance from the positive ideal solution and the maximum distance from the negative ideal solution. : y t and x t are the predicted and measured values at time point t, respectively. Also, x is the mean of measured values, and n is the number of time points. Assuming an MCDM problem with m alternatives (regression models) and n criteria (model performance measures) the TOPSIS procedure is as follows. 67 Develop a normalized decision matrix (r ij ) using Equation (18): where X is the m × n decision matrix of performance ratings of each alternative at each criterion. Construct a weighted normalized decision matrix (v ij ) by assigning weights to each criterion as follows: v ij = w j × r ij i = 1,2,…, m and j = 1,2,::, n ð19Þ where w i is the weight of j-th criteria and P n j = 1 w j = 1: Compute the positive ideal (A + ) and negative ideal solution (A − ) using Equations (20) and (21), respectively. where h and l indicate the sets of indices j corresponding to the benefit criteria and negative criteria. Compute the separation measures S + i from positive ideal and S − i from negative ideal solution by using Euclidean distance as follows: Determine the relative closeness of each alternative by calculating the performance score (p i ): Rank the alternatives according to the scores p i . In this section, the results of the time series analysis are given. Section 3.1 shows the performance comparison of models, statistical test results, and residual analysis. Finally, a detailed discussion is provided in Section 3.2. As discussed previously, the objective of the study is to examine the predictability of the electricity demand of Turkey via six different ML algorithms during the COVID-19 lockdown period. To evaluate the performance of the models, 70% of the dataset was used for training and the remaining 30% for testing. The reliability of each model was measured by four accuracy criteria, namely, MAPE (%), MAE, RMSE, and RRSE (%). To select the best performing model according to the evaluation criteria mentioned above, the TOPSIS method was used. This provides a more systematic ranking system that makes the algorithm selection process more applicable and robust. Equal weight was assigned to each performance metric such that the sum of all weights is equal to 1. Table 6 shows the ranking of the algorithms according to their relative closeness values. Rank 1 is the higher preferred algorithm, while rank 6 is less preferred. Accordingly, the SMOReg algorithm is the best model for predicting electricity consumption change due to the COVID- 19 The baseline model is built by using the SMOreg algorithm is shown; Figure 7A ,B shows the violin plot and the histogram of residual values to compare the performance of models in the test dataset. In Figure 7A , the center of each violin represents a box plot, and the black dot inside the box represents the mean of the error. Vertical length indicates the error range of each model. The curve around the violin plot shows the error distribution in each model. Accordingly, the model with smaller metrics values in Table 5 has an error with a smaller variation. As seen in Figure 7B , the residuals are concentrated around zero since the SMOReg model has minimum errors and no outliers. For both figures, it can be said that the SMOReg model outperforms the XNV and LR models slightly and is significantly higher than the GPR, M5P, and REPTree models. As shown in Table 7 , the considerable differences among the models were also analyzed using the Friedman test and Wilcoxon signed-rank tests at the level of 95% confidence. Using the Friedman test, it was observed that the P-value is less than the significant level of .05. Thus, it can be stated that there is a statistically significant difference in the performance among all regression models (χ 2 = 40.216, df = 5, p = 0.000). On the other hand, the Wilcoxon signed-rank test was also performed to determine whether there was a significant one-to-one difference between SMOreg and other models. The Pvalues of the model pairs are less than .05, and the zvalues are below −1.960 (Critical values for 5% twotailed: ±1.96). Accordingly, the null-hypothesis between the SMOreg model and any of the compared ML algorithms can be rejected. The results of both statistical tests confirm the existence of a difference between the models used in this study. To better understand the predictive success of the SMOReg model, the actual values were plotted against the predicted values in Figure 8A ,B. Using the data recorded from TEIAS, the electricity demand forecast is formulated with a 95% confidence level. It can be said that the SMOReg technique successfully captures actual electricity consumption data, especially when the data suddenly decreased. When the curve of the electricity load data reaches a peak or trough, the SMOreg model can adopt these changes well. The nation and/or provincial lockdown implemented to slow down the spread of infection has affected the operational activities of businesses and industries, which resulted in a significant decrease in electricity demand. This unprecedented situation makes it difficult to forecast electricity demand. This unexpected change disrupts the predictive performance of existing models, making the electrical system difficult to manage. In this uncertain and unclear period, policy-makers need reliable models to monitor the impact of COVID-19 in real-time and develop policies that can respond quickly to this change. Thus, accurate forecasting of electricity demand is essential for the management, operation, and control of the power system. Therefore, forecasting the future trend of the power demand with reliable and robust models is crucial. As in other countries, restrictive measures were taken to slow the spread of COVID-19 in Turkey since mid-March 2020. Taking the daily electricity demand in Turkey as a case study, this study discusses the impact of the COVID-19 outbreak on electricity consumption. It provides a comprehensive comparison of ML techniques for predicting electricity demand during the lockdown period. Based on the analysis results, it can be seen clearly that the SMOReg model yielded better results than the benchmark models to predict the daily electricity consumption during the pandemic. This may be due to the advantage of SMOReg to cope with higher dimensional feature spaces compared to other regression algorithms. 38, 68 Another strength of the SMOReg model is that it can be successfully applied to different data sets. Given that the impacts of COVID-19 vary from country to country and governments take different degrees of restrictions at different periods, it should be noted that electricity consumption patterns are likely to vary. Therefore, this model can be used in other countries and regions to monitor the extent of the disruption caused by the pandemic in real-time. This only requires data on electrical load, temperature, and COVID-19 restrictions, and such information is available to the public in almost all developed countries in the world. However, there are some limitations in this study. First, it can be said that electricity demand patterns will gradually return to their normal trajectories as countries begin to ease lockdown and shutdowns. Although the reduction in electricity demand will not be persistent, the trajectory of the epidemic is still unclear. With the second wave of the epidemic, if the lockdowns are renewed, the measures may be reactivated, which directly affects the consumption behavior of electricity demand. Thus, the study results will need to be updated periodically to ensure high prediction accuracy. Second, this study addresses only the short-term impacts of COVID-19 on the electricity sector during the lockdown period. Therefore, this study does not consider the potential for long-term and recurrent lockdowns with cumulative and increasingly adverse economic impacts. Finally, while satisfactory short-term forecast results are provided with national electricity load change data in the study, individual effects of the epidemic in various sectors such as the transport, industrial sector, residential, and commercial and public services are not evaluated. Analyzing changes in electricity consumption patterns on a sector basis can be helpful to evaluate the impact of COVID-19 measures on particular sectors. The lockdown measures implemented by many countries to control the spread of COVID-19 have direct and indirect effects on human life and the world economy. The electricity sector is an essential part of economic growth in every country. Thus, it is essential to determine a suitable model that can estimate fluctuations in electricity consumption. In this context, this study provides a comprehensive comparison of ML techniques to estimate the short-term impact of COVID-19 on the economy using daily electricity consumption data of Turkey. The findings are summarized as follows: • There has been a significant reduction in electricity demand in Turkey in May 2020. In this regard, demand decreased approximately 17% compared to 2019 levels. • The most significant decrease in electricity usage was on the weekends, with a decline of 34.7% compared to the same period of 2019. • The SMOReg model outperformed the GPR, XNV, LR, M5P, and REPTree models in predicting electricity consumption during the COVID-19 lockdown period. Consequently, this study discusses the severity and impact of COVID-19 on the electrical power industry as a case study. The results of this study showed that the SMOReg method could be used as a stable and robust method for short-term estimation of electricity demand in any crisis (ie, pandemics, economic collapses, etc.). Therefore, this study can be useful for governments and policy-makers to develop appropriate policies. While still experiencing the pandemic, it is essential to comprehensively examine the crisis and analyze its comprehensive and multidimensional effects. Many more issues remain to be investigated regarding the impact of the COVID-19 pandemic and the resulting lockdown measures on the energy sector. For example, it can be useful to examine the effects of measures during the COVID-19 lockdown period on fossil fuel demand and renewable energy. From an environmental perspective, the estimation of CO 2 emission levels associated with energy demand during the epidemic could be another critical subject for future research. No funding to declare. The data that support the findings of this study are available from the corresponding author upon reasonable request. ORCID Zeynep Ceylan https://orcid.org/0000-0002-3006-9768 +0.0517 * (normalized) * (normalized) Lag_electricity_consumption_2020 − 1 +0.0469 * (normalized) Lag_electricity_consumption_2020 − +0.0995 * (normalized) Lag_electricity_consumption_2020 − 3 +0.0714 * (normalized) Lag_electricity_consumption_2020 − 4 +0.1818 * (normalized) Lag_electricity_consumption_2020 − 5 +0.0805 * (normalized) date − remapped * Lag_electricity_consumption_2020 − 1 −0.1165 * (normalized) date − remapped * Lag_electricity_consumption_2020 − * (normalized) date − remapped * Lag_electricity_consumption_2020 − 3 −0.0994 * (normalized) date − remapped * Lag_electricity_consumption_2020 − 4 +0.1065 * (normalized) date − remapped * Lag_electricity_consumption_2020 − 5 The impact of COVID-19 on the electricity sector in Spain: an econometric approach based on prices Estimation of COVID-19 prevalence in Italy COVID-19: impact analysis and recommendations for power sector operation What Coronavirus Could Mean for the Global Economy Examining the economic impact of COVID-19 in India through daily electricity consumption and nighttime light intensity Bankruptcy prevention: new effort to reflect on legal and social changes Systematic review of variables applied in bankruptcy prediction models of Visegrad group countries Classification of the social distance during the COVID-19 pandemic from electricity consumption using artificial intelligence Experimental determination of bed temperatures during wood pellet combustion Advances in nutrient management make it possible to accelerate biogas production and thus improve the economy of food waste processing. Energy Sources, Part A Recover Util Environ Eff Advances in the agrochemical utilization of fermentation residues reduce the cost of purpose-grown phytomass for biogas production. Energy Sources, Part A Recover Util Environ Eff Analysis of czech subsidies for solid biofuels Economic analysis of the pressure shockwave disintegration process Effects of the COVID-19 pandemic on the Brazilian electricity consumption patterns Analysis of the electricity demand trends amidst the COVID-19 coronavirus pandemic The impact of different COVID-19 containment measures on electricity consumption in Europe The short-run and longrun effects of Covid-19 on energy and the environment The COVID-19 lockdown in India: impacts on the economy and the power sector Drivers behind residential electricity demand fluctuations due to COVID-19 restrictions Electricity demand during pandemic times: the case of the COVID-19 in Spain Energy transition in a lockdown: an analysis of the impact of COVID-19 on changes in electricity demand in Lagos Nigeria Impact on electricity consumption and market pricing of energy and ancillary services during pandemic of COVID-19 in Italy How did the German and other European electricity systems react to the COVID-19 pandemic? When pandemics impact economies and climate change: exploring the impacts of COVID-19 on oil and electricity demand in China Prediction-based analysis on power consumption gap under long-term emergency: a case in China under COVID-19 A hybrid multi-objective optimizer-based model for daily electricity demand prediction considering COVID-19 Using mobility for electrical load forecasting during the COVID-19 pandemic Energy demand in the State of Kuwait during the Covid-19 pandemic: technical, economic, and environmental perspectives Impact of the COVID-19 pandemic on electricity demand and load forecasting. Sustainability WHO j World Health Organization _Dil=2. Cited October Weka 3-Data mining with open source machine learning software in Java Time series analysis and forecasting with Weka-Pentaho data mining-Pentaho Wiki Turkish State Meteorological Service Official Web Sites Gaussian Processes for Machine Learning Estimation of municipal waste generation of Turkey using socio-economic indicators by Bayesian optimization tuned Gaussian process regression A robust combination approach for short-term wind speed forecasting and analysis-combination of the ARIMA (autoregressive integrated moving average), ELM (extreme learning machine), SVM (support vector machine) and LSSVM (least square SVM) forecasts using a GPR (Gaussian Process Regression) model Support-vector networks A new design of evolutionary hybrid optimization of SVR model in predicting the blast-induced ground vibration A support vector regression model hybridized with chaotic krill herd algorithm and empirical mode decomposition for regression task Prediction of medical waste generation using SVR, GM (1,1) and ARIMA models: a case study for megacity Istanbul Support vector regression model based on empirical mode decomposition and auto regression for electric load forecasting A Tutorial on support vector regression Fast training of support vector machines using sequential minimal optimization Time series prediction of CO 2 , TVOC and HCHO based on machine learning at different sampling points Improvements to the SMO algorithm for SVM regression Correlated random features for fast semi-supervised learning Randomized Nyström features for fast regression: an error analysis Comprehensive preference learning and feature validity for designing energy-efficient residential buildings using machine learning paradigms Influential factors analysis on LEED building markets in U.S. East coast cities by using support vector regression M5P-tree and nonlinear regression approaches with statistical evaluations to predict the compressive strength of cement-based mortar modified with fly ash Learning with continuous classes Induction of Model Trees for Predicting Continuous Classes Development of expert systems for the prediction of scour depth under live-bed conditions at river confluences: application of different types of ANNs and the M5P model tree Evaluating the ability of artificial neural network and PCA-M5P models in predicting leachate COD load in landfills Prediction of the compressive strength of normal and high-performance concretes using M5P model tree algorithm Bedload transport rate prediction: application of novel hybrid data mining techniques Neural networks and m5 model trees in modelling water level-discharge relationship Data mining: practical machine learning tools and techniques with Java implementations Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles A comparative evaluation of software effort estimation using REPTree and K* in handling with missing values Reptree classifier for identifying link spam in web search engines Individual comparisons of grouped data by ranking methods The use of ranks to avoid the assumption of normality implicit in the analysis of variance Periodogram estimation based on LSSVR-CCPSO compensation for forecasting ship motion Methods for multiple attribute decision making. Multiple Attribute Decision Making A new energy efficiency evaluation solution for ethylene cracking furnace based on integrated TOPSIS-DEA model How to cite this article: Ceylan Z. The impact of COVID-19 on the electricity demand: a case study for Turkey