key: cord-1048353-jf5u20ea
authors: Dhamodharavadhani, S; Rathipriya, R
title: COVID-19 mortality rate prediction for India using statistical neural networks and gaussian process regression model
date: 2021-03-03
journal: Afr Health Sci
DOI: 10.4314/ahs.v21i1.26
sha: fab0b286707df3c722a27994f3220c72b8b7d49a
doc_id: 1048353
cord_uid: jf5u20ea

The primary purpose of this research is to identify the best COVID-19 mortality model for India using regression models and to estimate the future COVID-19 mortality rate for India. Specifically, Statistical Neural Networks (Radial Basis Function Neural Network (RBFNN) and Generalized Regression Neural Network (GRNN)) and Gaussian Process Regression (GPR) are applied to develop the COVID-19 Mortality Rate Prediction (MRP) model for India. Two types of dataset are used in this study: one is COVID-19 death cases, a time series; the other is COVID-19 confirmed cases and death cases, where death cases are the dependent variable and confirmed cases are the independent variable. Hyperparameter optimization, or tuning, is applied to these regression models; it is the process of identifying a set of optimal hyperparameters for a learning process with minimal error. Here, sigma (σ) is a hyperparameter whose value constrains the learning process of the above models and is chosen to minimize the Root Mean Squared Error (RMSE). The performance of the models is evaluated using the RMSE and R² values, which show that the GPR model performs better than the GRNN and RBFNN.

At the end of December 2019, a human infection caused by a novel coronavirus (nCoV), also called the Wuhan virus or 2019-nCoV, was first reported in Wuhan, China 1 . One of the biggest challenges of this epidemic is the human-to-human transmission of nCoV. The number of coronavirus (COVID-19) infected cases is increasing at an exponential rate worldwide.
On 30 January 2020, the World Health Organization (WHO) issued a worldwide health emergency notice 2 , stating that 2019-nCoV is of critical global concern. The morbidity and mortality rates for COVID-19 were uncertain at the early stage 3 , especially for the young and the elderly. WHO has estimated the reproduction number (R0) of nCoV to be 2.7. To control the massive and rapid spread of nCoV, public health authorities took preventive measures: they imposed curfews or lockdowns on affected cities in China, the United States, India, and other countries. These measures enforce social distancing and limit the human-to-human spread of this novel virus.

Over the last decade, machine learning techniques have gained momentum and now play a vital role in many research domains. Notably, they have had a tremendous impact on data analytics and data science. They enable a better understanding of the data, allow predictions about the future based on past or empirical data, and automatically categorize data (known as classification). Machine Learning (ML) techniques can also be used to develop standard mortality models. Generally, ML tools are used to predict the mortality rate of epidemic diseases in advance, which helps public healthcare authorities to design an effective and efficient plan to reduce deaths. Deprez et al. 4 used machine learning algorithms to fit and assess mortality models by detecting the weaknesses of different mortality models. Artificial Neural Networks (ANNs) 5 have been used to identify and forecast latent mortality variables with higher predictive accuracy. In 6 , the authors used neural networks to extend the Lee-Carter model to predictions for multiple populations. Gaussian process models have been used extensively in engineering-based optimization applications (Razavi et al. 7 ).
A combination of GPR and an adaptive neuro-fuzzy inference system (ANFIS) 8 was used in groundwater level forecasting. In 9 , an extensive comparative study was carried out between several surrogate models, including GPR, using a simulation-optimization methodology with uncertain parameters. The authors concluded that the GPR models and their ensemble were efficient methods with respect to prediction accuracy. A GRNN model was built in 10 as a new computational method for predicting the incidence of infectious diseases. Han et al. 11 developed a GRNN with a one-dimensional input layer and output layer to predict the incidence of blood and sexually transmitted infections. In 12 , the authors compared the Back Propagation Neural Network (BPNN), Generalized Regression Neural Network (GRNN), and Radial Basis Function Neural Network (RBFNN) for the prediction of evaporation. The results showed that the GPR is a successful technique compared with artificial neural network approaches. In 13 , a large-scale comparison was presented for major machine learning models such as the multilayer perceptron, Bayesian neural networks, radial basis functions, generalized regression neural networks (also called kernel regression), K-nearest neighbor regression, CART regression trees, support vector regression, and Gaussian processes for time series prediction. The authors observed that the performance of these models depended heavily on the data set, with results differing from one data set to another. The RBF and GRNN 14 were applied to data from patients with heart disease for the medication outcome. The results showed that RBF performed well for prescribing medicine for the patient. In 15 , the authors claimed that the Gaussian process approach performed better than the standard generalized linear model (GLM) for the phenomenological forecasting of dengue disease incidence. Gholam Ali Montazer et.
al. 16 reviewed various learning methods for determining network parameters such as the widths, centers, and synaptic weights of the RBF neural network. In 17 , generalized regression neural networks were proposed as an automated technique for forecasting time series. This technique is intended to be an efficient and fast tool for automatically forecasting a large number of time series. These works illustrate the applications of GPR, GRNN, and RBFNN in various research domains.

In this work, the GPR model with optimized hyperparameters (covariance, mean), GRNN with an optimized hyperparameter (spread), and RBFNN with an optimized hyperparameter (spread) are applied to develop COVID-19 mortality models for two types of dataset. To evaluate the performance of these models, RMSE, a quantitative measure, is used, and the MRP model with the lowest RMSE is selected as the best model for predicting the COVID-19 mortality rate for India. The purpose of this study is to predict mortality rates against multiple COVID-19 confirmed cases using machine learning techniques that capture patterns that cannot be identified by a standard statistical mortality model.

The rest of the paper is organized as follows. Section 2 details the methods and materials for COVID-19 mortality rate forecasting for India. The outcomes and analysis of this study are given in Section 3. This work is summarized in Section 4 with potential future research.

The goal of hyperparameter optimization in regression models is to find the hyperparameters of a given regression technique that return the best output on a validation set while training and testing the model 15 . This is shown in equation (1):

$$H^{*} = \underset{o \in O}{\arg\min}\; f(o) \qquad (1)$$

where f(o) is the objective score to minimize, here the RMSE calculated on the validation dataset; H* is the set of hyperparameters that gives the lowest RMSE score; and o is any value in the problem domain O.
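The search for the hyperparameter set with the lowest validation RMSE can be sketched as a simple grid search. This is only an illustration under stated assumptions: the regressor is a generic Gaussian-kernel smoother with a single hyperparameter sigma, the function names (`rmse`, `kernel_predict`, `tune_sigma`), the candidate grid, and the synthetic data are all hypothetical and are not taken from the paper.

```python
import numpy as np


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((actual - predicted) ** 2)))


def kernel_predict(x_train, y_train, x_query, sigma):
    """Gaussian-kernel weighted average of the training responses.

    A stand-in regressor with one hyperparameter (sigma); it is not the
    paper's GPR, GRNN, or RBFNN implementation.
    """
    x_train = np.asarray(x_train, dtype=float)
    x_query = np.asarray(x_query, dtype=float)
    sq_dist = (x_query[:, None] - x_train[None, :]) ** 2
    weights = np.exp(-sq_dist / (2.0 * sigma ** 2))
    # Guard against a zero denominator when every weight underflows.
    return (weights @ np.asarray(y_train, dtype=float)) / np.maximum(
        weights.sum(axis=1), 1e-300
    )


def tune_sigma(x_train, y_train, x_val, y_val, candidates):
    """Return (best_sigma, best_rmse): the candidate with the lowest validation RMSE."""
    scores = [
        rmse(y_val, kernel_predict(x_train, y_train, x_val, s)) for s in candidates
    ]
    best = int(np.argmin(scores))
    return candidates[best], scores[best]


# Synthetic, purely illustrative data: every fourth point is held out for validation.
x = np.arange(40, dtype=float)
y = 0.5 * x ** 1.5
train = np.arange(40) % 4 != 0
candidates = [0.25, 0.5, 1.0, 2.0, 4.0, 8.0]  # hypothetical search domain O

best_sigma, best_rmse = tune_sigma(x[train], y[train], x[~train], y[~train], candidates)
print(f"selected sigma = {best_sigma}, validation RMSE = {best_rmse:.4f}")
```

A grid search is the simplest way to realize the minimization; the surrogate-based search discussed for Gaussian processes would replace the exhaustive loop with a model-guided choice of the next candidate.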
Even though hyperparameter optimization is costly in terms of computational time, it yields better prediction accuracy than traditional regression models.

A Gaussian process is a machine learning technique used to make predictions under uncertainty. It is defined as a collection of random variables, any finite set of which is jointly Gaussian distributed 15 . These random variables represent the value of a function f(x) at input x in regression problems. The process is written as {f(x) : x ∈ X} and is defined by the mean function μ(x) and the covariance function k(x, x′), so it can be represented as

$$f(\cdot) \sim \mathcal{GP}\big(\mu(\cdot),\, k(\cdot,\cdot)\big) \qquad (2)$$

Usually a zero-mean Gaussian process is used, and equation (2) is then the prior distribution over functions f(·). A detailed explanation is found in 16. In the literature, this model is called a surrogate for the objective function. The surrogate can be optimized more easily than the objective function. A GP-based method determines the next set of hyperparameters to evaluate on the actual objective function by choosing the hyperparameters that perform best on this surrogate function. A covariance function, as defined in equation (3), is used to represent the covariance between pairs of random variables in GPR. (3) Here, hyperparameters are

The structure of a GRNN, with two layers, is comparatively simple and fixed. The first is the pattern layer, and the second is the summation layer. When the input is passed through each unit in the pattern layer, the input-response relationship is "memorized" and stored within the unit. As a result, the number of units in the pattern layer equals the number of observations in the training set. In each pattern unit, a Gaussian PDF is applied to the network input, as defined in equation (4):

$$\theta = \exp\left(-\frac{\lVert A - t \rVert^{2}}{2\sigma^{2}}\right) \qquad (4)$$

where θ (Theta) is the output of the pattern unit, A is the network input, t is the training vector stored in the unit, and σ (Sigma) is a positive parameter known as the "spread", "smooth parameter", or smoothing factor.
Once θ (Theta) is determined for every pattern unit, the calculation is passed to the summation layer,

$$P = \frac{\sum_{i} Q_{i}\,\theta_{i}}{\sum_{i} \theta_{i}}$$

where P is the conditional prediction and Q_i is the response stored in the i-th training sample.

RBFNN 30 is an artificial neural network that uses radial basis functions as the activation functions, as shown in equation (5). RBFNN is a three-layer feed-forward neural network. The first layer is linear and only transmits the input signal, while the next layer is nonlinear and uses Gaussian functions. The third layer forms a linear combination of the Gaussian outputs. Only the weights between the hidden layer and the output layer are adjusted during training.

The nonlinear autoregressive (NAR) neural network is a type of ANN that is suitable for estimating future values of the input variable. The NAR network helps to forecast future values of a time series. It uses a feedback mechanism based on its own history, in which a predicted value serves as input for new predictions at later points in time. Equation (6) predicts the series y(t) given d past values of y(t):

$$y(t) = f\big(y(t-1),\, y(t-2),\, \ldots,\, y(t-d)\big) \qquad (6)$$

RMSE is the square root of the mean of the squared differences between predicted and actual COVID-19 death cases 8. It is given in equation (7):

$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_{i} - \hat{y}_{i}\right)^{2}} \qquad (7)$$

The correlation coefficient (R²) measures the linear relationship between the predicted and actual COVID-19 death cases 8. It is given in equation (8):

$$R^{2} = \frac{\left[\sum_{i=1}^{n}\left(y_{i}-\bar{y}\right)\left(\hat{y}_{i}-\bar{\hat{y}}\right)\right]^{2}}{\sum_{i=1}^{n}\left(y_{i}-\bar{y}\right)^{2}\,\sum_{i=1}^{n}\left(\hat{y}_{i}-\bar{\hat{y}}\right)^{2}} \qquad (8)$$

where $y_i$ is the actual COVID-19 death case value, $\hat{y}_i$ is the predicted COVID-19 death case value, $\bar{y}$ is the mean of the actual COVID-19 death case values, $\bar{\hat{y}}$ is the mean of the predicted COVID-19 death case values, and $n$ is the total number of data points.

In this paper, three models (GPR, GRNN, and RBFNN) are constructed with the appropriate model parameter values and applied to the two datasets to validate the predicted results. Figure 1 illustrates the proposed methodology.
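The GRNN pattern and summation layers and the two evaluation metrics described above can be sketched in a few lines of NumPy. This is a minimal sketch under assumptions rather than the authors' implementation: the pattern unit is taken to be a Gaussian of the squared distance scaled by 2σ², R² is taken to be the squared Pearson correlation between actual and predicted values, and the function names and the small confirmed/death arrays are hypothetical illustration values.

```python
import numpy as np


def _as_2d(values):
    """Treat a 1-D sequence as a column of one-dimensional samples."""
    values = np.asarray(values, dtype=float)
    return values.reshape(-1, 1) if values.ndim == 1 else values


def grnn_predict(train_inputs, train_responses, query_inputs, sigma):
    """GRNN prediction: Gaussian pattern layer followed by a summation layer."""
    t = _as_2d(train_inputs)                      # one stored vector per pattern unit
    a = _as_2d(query_inputs)                      # network inputs
    q = np.asarray(train_responses, dtype=float)  # stored responses
    sq_dist = ((a[:, None, :] - t[None, :, :]) ** 2).sum(axis=2)
    theta = np.exp(-sq_dist / (2.0 * sigma ** 2))  # pattern layer output (assumed form)
    # Summation layer: weighted average of stored responses.
    # Note: queries very far from all training points can underflow to 0/0.
    return (theta @ q) / theta.sum(axis=1)


def rmse(actual, predicted):
    """Root mean squared error."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((actual - predicted) ** 2)))


def r_squared(actual, predicted):
    """Squared Pearson correlation between actual and predicted values (assumed definition)."""
    a = np.asarray(actual, dtype=float)
    p = np.asarray(predicted, dtype=float)
    da, dp = a - a.mean(), p - p.mean()
    return float((da @ dp) ** 2 / ((da @ da) * (dp @ dp)))


# Hypothetical illustration values, not data from the paper.
confirmed = np.array([100.0, 200.0, 400.0, 800.0, 1600.0])
deaths = np.array([3.0, 6.0, 13.0, 25.0, 50.0])

predicted = grnn_predict(confirmed, deaths, confirmed, sigma=150.0)
print("RMSE:", rmse(deaths, predicted))
print("R^2 :", r_squared(deaths, predicted))
```

A small sigma makes each prediction follow its nearest stored response closely, while a large sigma averages over many pattern units; this is the trade-off that the spread tuning controls.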
In general, residuals or errors are an inevitable part of any predictive or regression model, and the GPR, GRNN, and RBFNN models are no exception. To provide a predictive model with high accuracy, this study explores a hybrid approach that combines the regression methods with the non-linear auto-regressive neural network (NAR-NN) time series forecasting model. Trends in the residuals are detected and predicted by the NAR-NN model. Combining the predicted residual values of each model with the predicted value of that model provides greater predictive accuracy. The following steps are used to develop the hybrid model:

Step 1: Download data from the website and pre-process the dataset. Create Dataset 1 (D1) and Dataset 2 (D2).
Step 2: Initialize model parameters and define hyperparameters for GPR, GRNN, and RBFNN.
Step 3: Input datasets D1 and D2 into the GPR model, GRNN model, and RBFNN model respectively, and predict COVID-19 death cases (PredNew) for 'n' periods ahead or for a given set of confirmed cases.
Step 4: Extract the residuals produced by these models and convert them into time-series data.
Step 5: Input these residuals into the NAR-NN time series forecasting model and predict the residual values (Ferr) for the three models separately. This is shown graphically in figure 3.
Step 6: Add Ferr to PredNew to generate an optimized prediction value.
Step 7: Return the optimized predicted values as output.

Table 1 shows the parameter setup for all three models. Figure 2 illustrates the operating principle of the hyperparameter tuning used with the GPR model, the GRNN model, and the RBFNN model to achieve the minimum RMSE value for each of these models. Similarly, figure 3 describes the working principle of the NAR-NN model for error forecasting for these models.
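Steps 4 to 6 of the hybrid procedure can be sketched as follows. To keep the example self-contained, a linear autoregression fitted by least squares stands in for the NAR-NN; this is a deliberate simplification, not the network used in the paper. The function names, the lag order `d`, and the numbers in the usage example are hypothetical.

```python
import numpy as np


def fit_ar(series, d):
    """Least-squares fit of r(t) = c + sum_k w_k * r(t-k), k = 1..d.

    A linear stand-in for the NAR-NN residual forecaster.
    """
    s = np.asarray(series, dtype=float)
    # Each row holds the d previous values, most recent first.
    lagged = np.array([s[i - d:i][::-1] for i in range(d, len(s))])
    design = np.column_stack([np.ones(len(lagged)), lagged])
    coef, *_ = np.linalg.lstsq(design, s[d:], rcond=None)
    return coef


def forecast_ar(series, coef, steps):
    """Iterated forecast: each predicted value is fed back as an input."""
    d = len(coef) - 1
    history = [float(v) for v in series]
    forecasts = []
    for _ in range(steps):
        lags = np.array(history[-d:][::-1])
        nxt = float(coef[0] + coef[1:] @ lags)
        forecasts.append(nxt)
        history.append(nxt)
    return np.array(forecasts)


def hybrid_forecast(actual, base_fit, base_future, d=2):
    """Add forecast residuals (Ferr) to the base model forecasts (PredNew)."""
    residuals = np.asarray(actual, dtype=float) - np.asarray(base_fit, dtype=float)  # Step 4
    coef = fit_ar(residuals, d)                                                      # Step 5
    f_err = forecast_ar(residuals, coef, len(base_future))
    return np.asarray(base_future, dtype=float) + f_err                              # Step 6


# Hypothetical numbers: observed deaths, in-sample base predictions, and
# two base-model forecasts for the following days.
observed = np.array([12.0, 15.0, 19.0, 24.0, 30.0, 37.0])
in_sample = np.array([11.0, 14.5, 18.0, 23.5, 28.5, 36.0])
pred_new = np.array([44.0, 52.0])

print(hybrid_forecast(observed, in_sample, pred_new, d=2))
```

Residuals are taken as actual minus predicted, so adding the forecast residual moves the base prediction toward the observed trend.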
This section presents and discusses the results of Gaussian Process Regression and two statistical neural networks (SNNs), GRNN and RBFNN, for dataset 1 and dataset 2, and compares the performance of these models. The benchmark performance metrics used to estimate the accuracy of the COVID-19 death case predictions are the Root Mean Squared Error (RMSE), an error measure, and the correlation coefficient (R²), an accuracy measure. Hyperparameter tuning can improve the predictive efficiency of these three models. The goal of hyperparameter tuning is to optimize the hyperparameter values of each model so as to minimize its RMSE. Based on the RMSE value, the best model is selected for COVID-19 death case prediction and mortality rate prediction.

Table 2 shows the values of the performance metrics RMSE and R² for the three models. Comparing these models, the GPR model has a low RMSE value and a high R² value for both datasets. This signifies that the GPR model performs better than the two statistical neural networks for the COVID-19 dataset. The spread value (σ) of GRNN and RBFNN is 4 and 1.76, respectively. Table 3 and Table 4

Figure 4 shows the number of predicted death cases versus days from May 1, 2020, to May 20, 2020. From this point of view, the two SNNs have nearly equal efficiency, but their RMSE values are higher than those of the GPR models. Figure 5 shows the number of death cases predicted against the number of COVID-19 confirmed cases. From this perspective, there is no significant difference between the standard and hybrid models for the dataset (D1), as the differences between their RMSE values are negligible. However, in this case, the GPR models perform better than the SNNs. Table 8 shows the calculated Mortality Rate Prediction (MRP) for COVID-19 predicted death cases using the dataset (D1).
MRP is defined as in equation (9). (9)

Figure 6 illustrates the predicted curve for COVID-19 death cases versus the number of confirmed cases for India. Here, the X-axis indicates the number of confirmed cases, and the Y-axis shows the number of death cases predicted. The GPR model shows a gradual decrease in the number of death cases, while the SNN models show an increasing pattern. Figure 7 displays the predicted curve for COVID-19 death cases versus the number of days since the first COVID-19 case for India. Here, the X-axis indicates the number of days, and the Y-axis shows the number of death cases predicted. For the dataset (D2) also, the GPR model shows a gradual decrease in the number of death cases, while the SNN models show an increasing pattern. The performance of these models is compared based on RMSE value, as shown in figure 8 . The results show that GPR performs better than the statistical neural networks for both types of COVID-19 dataset.

This article proposed a methodology that hybridizes the regression model (GPR) and the SNN (GRNN and RBFNN) models with the NAR-NN time series forecasting model to achieve higher predictive accuracy in the prediction of COVID-19 death cases. The NAR-NN time series forecasting model was used to predict the errors that should be added to the predicted value. The Gaussian Process Regression (GPR) model yielded a relatively good result for the two datasets in terms of optimized predicted values for death cases in the COVID-19 epidemiological data. The proposed method can provide a predictive tool for assessing the current state and severity of infection, and can help government and health care workers make better decisions to reduce the mortality rate in India.

WHO | Novel Coronavirus - China. WHO. Archived from the original on 23
Emergency Committee regarding the outbreak of novel coronavirus (2019-nCoV).
World Health Organization (WHO)
How China's Coronavirus Is Spreading and How to Stop It, Foreign Policy
Novel COVID-19 Mortality Rate Prediction (MRP) Model for India Using Regression Model With Optimized Hyperparameter
SEIR model FOR COVID-19 Epidemic using DELAY differential equation
COVID-19 Mortality Rate Prediction for India Using Statistical Neural Network Models
Machine learning techniques for mortality modeling
A neural-network analyzer for mortality forecast
A Neural Network Extension of the Lee-Carter Model to Multiple Populations
Multistep ahead groundwater level time-series forecasting using Gaussian Process Regression and ANFIS, Advanced Computing and Systems for Security
Arbitrary walk with minimum length based route identification scheme in graph structure for opportunistic wireless sensor network
Gaussian Processes for Regression: A Quick Introduction
Application of Gaussian process regression for bearing degradation assessment, Information Science and Service Science and Data Mining (ISSDM)
Opportunistic Forward Routing Using Bee Colony Optimization
Dynamical systems identification using Gaussian process models with incorporated local models
Recursive Gaussian process: on-line regression and learning
Distributed prognostic health management with gaussian process regression
Dynamical systems identification using Gaussian process models with incorporated local models
A Comparative Study of Activation Functions and Training Algorithm of NAR Neural Network for Crop Prediction
Multi-crop Selection Model Using Binary Particle Swarm Optimization
Recursive Gaussian process: on-line regression and learning
A pilot study on climate data analysis tools and software
Gaussian Processes for Machine Learning
Recursive Gaussian process regression
Gaussian process dynamical models for the human motion
Machine learning for estimation of building energy consumption and performance: a review
Review of Applications of Generalized Regression Neural Networks in Identification and Control of Dynamic Systems
Region-Wise Rainfall Prediction Using MapReduce-Based Exponential Smoothing Techniques
Usage of Probabilistic and General Regression Neural Network for Early Detection and Prevention of Oral Cancer. The Scientific World Journal
Prediction on the incidence of blood and sexually transmitted diseases with models of ARIMA and GRNN
Investigating the Predictive Performance of Gaussian Process Regression in Evaluating Reservoir Porosity and Permeability. Energies
An Empirical Comparison of Machine Learning Models for Time Series Forecasting
Generalized Regression Neural Network and Radial Basis Function for Heart Disease Diagnosis
Variable Selection Method for Regression Models Using Computational Intelligence Techniques
Phenomenological forecasting of disease incidence using heteroskedastic Gaussian processes: A dengue case study
Enhanced Logistic Regression (ELR) Model for Big Data
Automatic Time Series Forecasting with GRNN: A Comparison with Other Models

The authors have no conflict of interest.