key: cord-0834595-dy05qibe authors: Hassan, Farman; Albahli, Saleh; Javed, Ali; Irtaza, Aun title: A Robust Framework for Epidemic Analysis, Prediction and Detection of COVID-19 date: 2022-05-06 journal: Front Public Health DOI: 10.3389/fpubh.2022.805086 sha: 711631bd126929c8243d80daf4a2798de27016a0 doc_id: 834595 cord_uid: dy05qibe Covid-19 has become a pandemic that affects lots of individuals daily, worldwide, and, particularly, the widespread disruption in numerous countries, namely, the US, Italy, India, Saudi Arabia. The timely detection of this infectious disease is mandatory to prevent the quick spread globally and locally. Moreover, the timely detection of COVID-19 in the coming time is significant to well cope with the disease control by Governments. The common symptoms of COVID are fever as well as dry cough, which is similar to the normal flu. The disease is devastating and spreads quickly, which affects individuals of all ages, particularly, aged people and those with feeble immune systems. There is a standard method employed to detect the COVID, namely, the real-time polymerase chain reaction (RT-PCR) test. But this method has shortcomings, i.e., it takes a long time and generates maximum false-positive cases. Consequently, we necessitate to propose a robust framework for the detection as well as for the estimation of COVID cases globally. To achieve the above goals, we proposed a novel technique to analyze, predict, and detect the COVID-19 infection. We made dependable estimates on significant pandemic parameters and made predictions of infection as well as potential washout time frames for numerous countries globally. We used a publicly available dataset composed by Johns Hopkins Center for estimation, analysis, and predictions of COVID cases during the time period of 21 April 2020 to 27 June 2020. We employed a simple circulation for fast as well as simple estimates of the COVID model and estimated the parameters of the Gaussian curve, utilizing a parameter, namely, the least-square parameter curve fitting for numerous countries in distinct areas. Forecasts of COVID depend upon the potential results of Gaussian time evolution with a central limit theorem of data the Covid prediction to be justified. For gaussian distribution, the parameters, namely, extreme time and thickness are regulated using a statistical Y(2) fit for the aim of doubling times after 21 April 2020. Moreover, for the detection of COVID-19, we also proposed a novel technique, employing the two features, namely, Histogram of Oriented Gradients and Scale Invariant Feature Transform. We also designed a CNN-based architecture named COVIDDetectorNet for classification purposes. We fed the extracted features into the proposed COVIDDetectorNet to detect COVID-19, viral pneumonia, and other lung infections. Our method obtained an accuracy of 96.51, 92.62, and 86.53% for two, three, and four classes, respectively. Experimental outcomes illustrate that our method is reliable to be employed for the forecast and detection of COVID-19 disease. In March 2020, the World Health Organization (WHO) confirmed a widespread of a novel Corona Virus called COVID-19, a pandemic. COVID-19 is caused by a virus named severe acute respiratory syndrome coronavirus 2 (SARS Cov2). Initially, the pandemic started in Wuhan, China; however, it spread quickly to a large part of the globe (1). COVID-19 spreads via breathing drops of the diseased individual, which are generated when the infected person sneezes or coughs. The droplets of an infected person can also contaminate large surfaces that increase the spread more quickly. The infected person may suffer respiratory illness either severe or mild; however, the severe may need the support of ventilation (2) . People of old age and those having chronological illnesses are prone to COVID-19 infection. Therefore, many countries shut their international borders and imposed strict presentation measures to avoid a quick spread of the COVID-19 (3) . Researchers and scientists have developed different vaccines to combat the pandemic by sequencing ribonucleic acid (RNA) from COVID-19. The organizations of vaccines employed both conventional and leading-edge technology with six different platforms of vaccine, such as deoxyribonucleic acid (DNA), messenger RNA (mRNA), viral vector-based, subunit or protein, inactivated virus, and a live attenuated virus. However, the developed vaccines can significantly reduce the quick spread and enhance immunity by producing antibodies. The vaccines have shown 95% effectiveness; however, some issues were encountered while managing the vaccines, i.e., the hesitancy of vaccine, complacency, and logistical challenges of the supply chain. Most importantly, the vaccines are not to cure rather a prevention measure against COVID-19 (4). Although, vaccines are produced, however, detection is crucial as it assists in easily tracking the persons who were in touch with the infected person. The quick spread of the pandemic is significantly avoidable by tracing these people. In the initial stage, the infection manifests as an infection of the lungs; hence, the researchers utilized the lung's x-rays and computed tomography (CT) images to detect the lungs infection (5) . Numerous models have been designed to predict the infectious disease that quickly spread similar to the COVID- 19 . Recently, a model named susceptible-infected-removed (SIR) (6, 7) has been employed for estimating the spread and fatality rate of COVID. Distinct variations of these systems are either very simple so that they cannot accurately generate the predictions, or either very complex for understanding. The early forecasting of certain attributes for COVID-19, namely, the highest quantity of positive cases, the fatality rate per day, forecasting peak number, the exact time of new severe sick people per day (SSPs), is believed to be significant for each country, especially those that are expected to witness exponential growth. More specifically, the quick and dependable forecasting of COVID is significant for the policymakers to enhance the monitoring of pandemic drift and to take precautionary measures for avoiding the shortage of life-saving resources in medical centers as well as in emergency services. In this work, we present the development as well as utilization of the Gaussian model as a beneficial, simple, and effective description of fatalities due to COVID-19 with time and the recent works in the USA (8) and Germany (9) . Distinct from the prior study, we chose to use a knowledgeable regular death rates algorithm (10) as evaluated input data. Moreover, we also presented the Gaussian doubling times principle as an amount of an increased rate (11) as an alternative to the growing infections. The Gaussian distribution function assessment has a significant role to resolve various problems in plasma kinetic theory named drift-Maxwellian (12) or counter streaming bi-Maxwellian (13) velocity distribution function. The above-mentioned terms are called plasma physics. Accurate and timely detection of COVID-19 is important for controlling the quick spread of this disease among people. It has become more crucial to detect the COVID-19-infected people after the vaccination to quarantine the people and to prevent the spread. The PT-PCR is believed to be a standard detection method for COVID; however, PT-PCR generates a lot of false positives due to various reasons, namely, stages of the disease, technique of collecting specimens, disadvantages of methodology that sustainably delay the control and detection process. The sensitivity and specificity of the initial standard testing method have been dejected in these works (14) (15) (16) (17) . Hence, we required a unique automatic diagnostic method, which can assist to stop the quick spread of COVID-19 (18) . Medical experts, clinicians, technologists, and researchers are putting their efforts to early detect the patients with COVID-19. In 2019, more than 755 research articles were published as reported by PubMed (19) , while, in the first 3 months of 2020, more than 1,245 articles were published. Deep learning (DL) and artificial intelligence methods are utilized by scientists for the detection of COVID-19 using CT and chest x-rays images (CXI). DL techniques (20) (21) (22) (23) (24) (25) (26) have shown extraordinary results in research applications and are commonly employed due to the enhanced performance comparative to the conventional techniques. Compared to machine learning and conventional techniques, features are not selected manually. On the other hand, the DL model can be trained by changing the configurations and parameters to learn the prominent features from the dataset. The research community has examined the DL techniques to explore the medical imaging field before the COVID-19 pandemic. DL attained maximum attention to detect COVID-19 using CXI. Researchers reported detailed methods (27, 28) to detect COVID-19 through computer vision and artificial intelligence. For many papers, transfer learning-based techniques are the go-to methods. In transfer learning, the pre-trained models on the ImageNet dataset are employed for performing the transfer learning. Even though methods are the same, however, distinct architectures are employed in works (29) . Distinct variants are employed even if the architectures are the same. Crossvalidation is also considered in transfer learning. Additionally, techniques with novel CNN models are also employed that use the significance of transfer learning when the available data are small for training. In (30) , a CNN-based architecture named COVID-Net was designed for the detection of COVIDinfected patients through CXI. Authors also introduced a dataset COVIDx that has three classes, i.e., normal, COVID, and viral pneumonia (VP). COVID-Net is based on the two phases of projections, such as depth-wise representation, expansion, and extension. Initially, the CNN was trained on ImageNet as well as on the COVIDx dataset. In (31) , a model that comprises three portions, such as a backbone, a classification head, and an anomaly detection head, was developed for the detection of COVID-infected people. The backbone part was used on ImageNet for extracting the high-level feature from CXI, and the extracted features were passed into other two parts of the network such as classification and anomaly heads to generate a score. A cumulative score of "one" was also used for every prediction. In (32), a capsule network-based model named COVID-CAPS was designed for the detection of COVID-19 through CT scans and CXI. It was reported that the benefit of employing a capsule network is it performs good, while the training data are small. The COVID-CAPS was trained using the dataset (33) . In (34) , a CNN-based model, namely, DeTraC was developed that comprises three stages, such as feature extraction, decomposition, and the third stage, a class composition. The backbone architecture was employed to obtain features from images, followed by using SGD optimizer and, finally, a class to categorize images into normal or COVID-19 infected. In (35) , COVIDLite was developed that employed a depth-wise separable CNN to classify the CXI for the detection of COVID-19. Similarly, this (36) also employed depth-wise separable convolutional layers in the XceptionNet architecture (37) and named it a Fast COVID-19 detector. To improve the color fidelity, white balancing was used, while, to expand the visibility and optimize the white balance, preprocessing was executed. In (38) , the CNN model was designed that comprises a block of convolutional layers, having 16 filters, a batch normalization layer, an activation function ReLU, two fully connected layers, followed by a SoftMax layer. In (39) , a set of customized CNN models was employed for the prediction of an infection graph. Additionally, viral and bacterial pneumonia were also detected using the CNN-based model. In (40) , a tailored CNN was employed that takes the fused set of features by employing two models, namely, Xception and ResNet50V2. A fused set of features was fed into the convolutional and classification layer for the classification purposes. Similarly, in (41) , deep features were obtained by employing the MobileNet, and the deep features are fed into the global pooling and fully connected layer. The performance of the model was evaluated by transfer learning, training from the scratch, and fine-tuning the network. The CoroNet (41) was used to classify the xray images into four distinct classes, such as normal, viral and bacterial pneumonia, and COVID-19. Xception was used as a base model; however, the last two layers, such as dropout and two fully connected layers, were added. In (42) , DarkCovidNet was designed for COVID-19 detection, which is based on the Darknet-19 (43) . DarkCOVIDNet used a smaller number of layers than Darknet-19. Two layers, such as average pooling and SoftMax, were added for classification. In (44) , a four-stage technique, namely, exemplar-based pyramid feature producing, relief, iterative principal component analysis, and classification, was developed to detect patients with COVID-19. The feature extraction was emphasized by the initial three stages, while, in the last stage, a deep neural network and artificial neural network were used for classification purposes. In (45) , CovXNet with depth-wise convolutional layers was developed for binary as well as a multi-class classification problem. The model was trained from the scratch as well as used numerous modifications, such as fine-tuning, transfer learning. In this work, we addressed the challenges that are associated with predicting and detecting COVID earlier by proposing a novel framework to reliably analyze, predict, and detect COVID-19. Moreover, the proposed framework is capable of effectively detecting VP, as well as extra lung infections. Major contributions of the proposed study are given as follows: • We used Gaussian doubling times for best analysis in addition to the prediction of COVID-19 globally. • We developed an innovative COVID detector, which employs two features, namely, scale invariant feature transform (SIFT) and histogram of oriented gradients (HOG). • We developed a novel CNN-based architecture called COVIDDetectorNet to effectively detect the patients with COVID-19 and patients suffering from VP, and other infections of the lungs. • The proposed COVID detection technique has capability to detect normal, COVID, VP, as well as other lung infections. • To detect COVID and other lungs abnormalities, we have performed rigorous experiments on the publicly available dataset, namely, the COVID Radiography dataset. The rest of this manuscript is structured as follows: Section 2 Materials and Methods has a detailed explanation of our proposed working mechanism to detect the COVID-19 and estimate an infection rate. Section 3 Proposed Method gives an explanation of experimental outcomes, while, finally, Section 4 Results and Analysis has concluded the work. This section provides an in-depth summary of data and techniques for the COVID-19 infections, forecasting in Asian countries and globally. Moreover, the detailed discussion of the proposed CNN-based architecture, i.e., COVIDDetectorNet is presented to detect COVID-infected people, VP, and other lung infections. For forecasting the infection rate, we collected the data through a real-time inquiry from Johns Hopkins University as well as additional suppliers, namely, WHO, to examine and make a forecast about the pandemic for worst-hit countries. Currently, the COVID data are gathered from numerous sources, such as media reports, online news, as well as official reports of governments, etc. It is significant to consider the data of all sources as it will be helpful to examine the diverse data to have a clear as well as a comprehensive image of an epidemic and its implications. The statistics and literature (46) demonstrate that there are three stages of a pandemic, namely, the total of infected people grows exponentially, the peak of an epidemic, and the quick decline in the infectious rate (9) . Therefore, we employed a Gaussian curve to illustrate the progress of a pandemic. K p represents the amount of COVID-affected individuals' each day p, which is illustrated through a Gaussian curve as follows: In the above Equation (1), P 0 represents the highest amount of infectious cases each day D, while shows a standard deviation of a curvature. Change in the level of infection is computed through separating K(p) w.r.t p. Hence, change in relative rate R(p) is given the following Equation (2). The number of cases per day can be computed by Equation (3) Similarly, relative change can be computed by the Equation (4). The doubling time in terms of D and is computed by combining the equations (2) and (4) as follows; when at p = 0, We required calculation of doubling time so as to obtain two values, namely, D and . It is computed, applying the Equation (4). In the Equation (7), the E (Y) denotes the amount of COVID cases on Day Y, while P represents a rolling window. In this work, we used a rolling window of 7. Doubling Time for Worldwide Cases Figure 1 depicts doubling time for worldwide infection cases from 21 April 2020. We selected this date because the doubling rate was stabilized globally from the above-mentioned date, as every country started releasing the data publicly. Moreover, we analyzed the data till 27 June 2020 and the analyzed data assumingly it has an error of 20%. We used the data (9) . In order to obtain the value of D, we analyzed the doubling rate at p = 0, From Equations (5) and (8) We computed the value of D for the Gaussian curve of worldwide cases by computing a Y 2 fit using Equation (10). The n p l in Equation (10) represents the analyzed doubling rate, C D, p l shows the estimated doubling rate, and δ p l represents an error for the analyzed rate with almost 20% and by employing the Equation (9); we got the following expression: From the analysis till 27 June 2020, M = 67; hence, D is a singlefree parameter, and the freedom degree is computed by M − 1 = 66, while the lowest value of Y 2 min is equal to 63.28 by using D Frontiers in Public Health | www.frontiersin.org equals to 109.5 days from p = 0 on 21 April 2020 as illustrated in Figure 2A . Ratio Y 2 min /(M − 1) is equal to 0.96 that signifies that model is performing well on the data because the ratio is