key: cord-0715294-4vape5ps authors: Abirami, R. Sudha; Kumar, G. Suresh title: Comparative Study Based on Analysis of Coronavirus Disease (COVID-19) Detection and Prediction Using Machine Learning Models date: 2021-11-20 journal: SN Comput Sci DOI: 10.1007/s42979-021-00965-2 sha: cbb3d1c749dcb08f34372caceb6e2ee03bb4ec01 doc_id: 715294 cord_uid: 4vape5ps As the number of COVID-19 cases increases day by day, the situation and livelihood of people throughout the world deteriorates. The goal of this study is to use machine learning models to identify disease and forecast whether or not a person is infected with the virus or another common illness. More articles about COVID-19 will be released starting in 2020, but we still do not have a reliable prediction mechanism to diagnose the disease with 100% accuracy. This comparison is done to see which model is the most effective in detecting and predicting disease. Despite the fact that we have immunizations, we require a best-prediction strategy to assist all humans in surviving. Researchers claimed that the supervised learning method predicts more accurately than the unsupervised learning method in the majority of studies. Supervised learning is the process of mapping inputs to derived outputs using a set of variables and created functions. This will also help us to optimize performance criteria using experience. It is further divided into two categories: classification and regression. According to recent studies, classification models are more accurate than other models. Coronaviruses (CoV) are a large family of viruses that cause illness ranging from common cold to more severe diseases. These are positive-stranded RNA (+ ssRNA) viruses with a crown-like appearance. They are mainly susceptible to mutation and recombination and are, therefore, highly diverse. They reside in bats and wild birds, and can spread to other animals and hence to humans. The virus which causes COVID-19 is thought to have originated in bats [1] . Outbreaks of COVID-19 have been reported in various types of workplaces and job categories. All workers should be protected from acquiring COVID-19 because of their work. The prevention of COVID-19 in work settings should be combined with measures for protecting physical and mental health, safety and wellbeing of workers from other occupational hazards in the operation, closures and reopening of workplaces [2] . COVID-19 is likely spread in the following ways; When the virus travels in respiratory droplets when an infected person coughs, sneezes, talks, sings or breathes near you (within six feet). This is thought to be the main way it is spread. Another way is through close contact (touching, shaking hands) with an infected person. And moreover, by touching surfaces that the virus has landed on, then touching our eyes, mouth, or nose before washing the hands [3] . Due to the huge growth of machine learning techniques, predicting and diagnosing diseases made it easy in current days. Machine learning can learn through machines and makes a proper prediction by self-learning. Several techniques of machine learning were utilized to identify the affected individuals with personalized characteristics. Machine learning is an exclusive technology that is used to allow a computer to train itself and make correct decisions and can be employed for predicting coupled information from previous records, geography, and demographics, climate as well as outbreak severity [4] . Machine learning (ML) is one of the most advanced concepts of artificial intelligence (AI), and provides a strategic approach to developing automated, complex and objective algorithmic techniques for multimodal and dimensional biomedical or mathematical data analysis [5] . This study presents the machine learning model as a method for predicting the transmission of COVID-19 pandemic in an easily understandable way using statistical visualization graphs e.g., normal distribution. It determines the predictive value of the technique with quality and density of collected data of WHO. The resulting predictions will reduce the population's anxiety and prepares their mentality for accepting and dealing with the next phases of the pandemic. This prediction study proposed a model for COVID-19 prediction and it has achieved a classification accuracy of 91.62%. Even more important fact is it yields a sensitivity of around 95% for COVID-positive cases [6] . From all the researches made so far stated that, the machine learning models can be used to predict and detect this disease (Fig. 1 ). It will help the people to reduce their panic about the symptoms caused by normal cold or fever. This pandemic will end definitely once all of us must be vaccinated and as well as ensuring the guidelines stated by WHO while in workplace, traveling, in social gathering etc., Even though we used to research on this virus alleviation and mutiny, we need the prediction about causes of spread, symptoms, death rate, recovery rate, and remedies. The remaining paper contains the following subsections; "Related Work" for work done by other researchers after reception of this disease, "Machine Learning Methods" to describe the various machine learning models, "Case Study Due to COVID-19 pandemic, the need of disease analysis has been raised to detect whether a person is infected with virus or not. In the year 2021, Corona 2.0 obscured many people lives. To avoid this situation in future, health care providers must have some prediction techniques to save human life [1, 2]. Supervised machine learning is used to get better prediction accuracy. These are some papers given the results regarding Corona disease prediction and detection. In recent months, different techniques using image processing are done by various researchers. In this paper, a major review on image acquisition, segmentation, diagnosis, avoidance, and management are presented. An analytical comparison of the various proposed algorithm by researchers for corona virus has been carried out. ML methods train the input data and analyze the output data statistically. The application of ML includes the detection of infected persons and the temperature of the person [7] . The machine learning models was trained to predict drug structures that could potentially treat COVID-19. The model is employed with AI neural network with multiple hidden layers and neural systems such as repetitive neural networks (RNN), convolutional neural networks (CNN), deep belief networks (DBN), and completely associated feed-forward systems that supported to investigate drug molecule design and re-positioning of drug molecules [8, 9] . While the RT-PCR is the silver bullet test for confirming the COVID-19 infection, it is limited by the lack of reagents, time-consuming, and the need for specialized labs. This study develops six predictive models for COVID-19 diagnosis using 6 different classifiers (i.e., BayesNet, Logistic, IBk, CR, PART, and J48) based on 14 clinical features. This study retrospects 114 cases from the Taizhou hospital of Zhejiang Province in China. The results showed that the CR meta-classifier is the most accurate classifier for predicting the positive and negative COVID-19 cases with an accuracy of 84.21% [10] . Generally, a huge amount of data is accumulating regarding the COVID-19 pandemic, which makes hot research topics for machine learning researchers. This study considered machine learning approaches to predict the spread of the COVID-19 in many countries. The experimental results of the proposed model showed that the overall R2 is 0.99 from the perspective of confirmed cases. A machine learning model has been developed to predict the estimation of the spread of the COVID-19 infection in many countries and the expected period after which the virus can be stopped [11, 12] . The increasing interest in developing artificial intelligence (AI) applications has addressed several medical problems. Some of the AI applications in healthcare are AIenabled clinical support system, machine learning for drug discovery [13] , diagnosing tissue samples, and AI-assisted image analysis for radiology etc. However, such applications remain insufficient given the high potential threat posed by this virus to global public health. This systematic review addresses automated AI applications based on data mining and machine learning (ML) algorithms for detecting and diagnosing COVID-19. We aimed to obtain an overview of various deep learning techniques included 3D and 2D analysis of the chest CT images a Hybrid-Covid network, an IRRCNN model with NABLA-N network and Covid-Net and CheXNet network architecturebased detection. The accuracy of the CheXNet architecturebased detection had the highest accuracy. [14] . Sensing the threatening impacts of COVID-19, researchers of computer science have started using various techniques and approaches of machine learning and deep learning to detect the presence of the disease using X-rays and CT images. Artificial intelligence-centered tools can be designed and developed quickly for adapting the existing AI models and for leveraging the ability to modify and associating them with the preliminary clinical understanding to address the new group of COVID-19 and the novel challenges associated with it [15] . Many research done so far have emphasized that we need a more effective technique to detect and forecast infectious people based on all of these facts. The following section contains a diagram (Fig. 2) that depicts various machine learning models to enhance the research in prediction and detection of crucial COVID-19 in early days from the basic symptoms. Machine learning is typically a system of automated data processing algorithms that help to make decisions more natural and enhance performance based on the results. The "Learning" here means that the algorithm can garner new information and insights without being explicitly programmed. There are several models of machine learning present. From those, some primary paradigms are described as follows [16]: Supervised machine learning: It is defined as, use of labeled datasets to train algorithms that can classify data or predict outcomes accurately. As input data are fed into the model, the weights are adjusted until the model is properly fitted, which happens during the cross-validation phase. Organizations can use supervised learning to tackle a range of real-world problems at scale, such as spam classification in a distinct folder from our email. Unsupervised machine learning: These algorithms can be used to find patterns as the inputs are unlabeled. Unlabeled datasets are analyzed and clustered using machine learning methods. Without the need for human intervention, these algorithms uncover hidden patterns or data groupings. It is the best solution for exploratory data analysis, cross-selling techniques, consumer segmentation, and image identification because of its capacity to detect similarities and differences in information. Reinforcement machine learning: Similar to supervised ML, but in this case, instead of a labeled output, there are rewards and the algorithm's goal is to maximize rewards. Reinforcement learning is a machine learning training strategy that rewards desirable behaviors while penalizing undesirable ones. A reinforcement learning agent can perceive and comprehend its surroundings, act, and learn through trial and error in general. Semi-supervised machine learning: Input data are a mixture of labeled and unlabelled, so the model must learn structures to organize as well as make predictions. Semisupervised learning is a type of machine learning that tries to solve issues involving both labeled and unlabeled data. Semi-supervised learning makes use of mathematical concepts such as clustering and classification method features. From these types, supervised learning is intensively helpful in finding solution for real-world computational problems. The algorithms predict the outcome results from unlabeled data. Different types of supervised learning are regression, classification, naïve Bayesian Model, decision trees, random forest model, neural networks, support vector machines. The diagram (Fig. 2) below showed the major types of machine learning models. The purpose of this study is to recognize which machine learning method works well in predicting and detecting diseases with highest accuracy. The entire world faces a big crisis due to COVID-19. During this pandemic, more researches have been made to identify the infectious persons in a proper way at right time. From those researches, this comparison table illustrates that the supervised learning plays a vital role in prediction using symptoms, clinical features and chest X-rays. Some researchers used classification model with different algorithms for example, shallow singlelayer perceptron neural network, Gaussian process regression, Bayes Net, logistic, J48, multinomial Naive Bayes classifier, support vector machine etc., [17] [18] [19] [20] [21] [22] [23] [24] Some researchers used regression, recurrent neural network and convolutional neural network. From all the experiments and studies, it is noted that supervised learning techniques are alone used to obtain more accurate results. To develop an effective algorithm for detection of corona virus presence in human beings, the above table (Table 1) would be helpful. The results were compared and it is observed that classification model is best among all the others. It yields higher accuracy, more specificity when compared with regression and convolutional neural networks. Furthermore, this study would be useful to develop a new algorithm for detecting this deadly disease in efficient manner. The research may be fruitful by concentrating on new classification model. Thus, this comparative case study ensures that the machine learning models are efficient in a way to predict and detect the infectious persons. In that, supervised learning algorithms are more powerful when compared to other algorithms. This study focused to show that this prediction will be helpful to end this epidemic in future. Our conclusion is machine learning applications in the field of disease diagnostics showed promising results with higher accuracy, and specificity. In particular, supervised machine learning takes a leading role in disease diagnosis. This machine learning technique yield accurate results when compared to other techniques. The researchers have used so many models such as, linear regression, naïve Bayes classifier, support vector machine, decision tree and ResNet50 etc., to train and test the data set for predicting and detecting Coronavirus disease in an effective manner. This study will be expanded into developing a new algorithm based on the models available in terms of supervised machine learning for earlier diagnosis of diseases especially COVID-19. Data availability Introduction about COVID-19 used in this study was collected from official website of WHO (World Health Organization). Code availability Not applicable. Conflict of interest There are no conflicts of interest. Intelligent system for COVID-19 prognosis: a stateof-the-art survey Machine learning for detection and diagnosis of disease The COVID-19 pandemic: prediction study based on machine learning models Novel corona virus (COVID-19) diagnosis using computer vision and artificial intelligence techniques: a review Automatic COVID-19 detection from X-ray images using ensemble learning with convolutional neural network Available online at http:// iaeme. com/ Home/ issue/ IJARET? Volume= 11& Issue= 11 Predicting the COVID-19 infection with fourteen clinical features using machine learning classification algorithms Role of biological data mining and machine learning techniques in detecting and diagnosing the novel coronavirus (COVID-19): a systematic review COVID-19: nature, outbreak and role of machine learning A computer-based disease prediction and medicine recommendation system using machine learning approach Machine learning and deep learning approaches to analyze and detect COVID-19: a review Prediction of COVID-19 cases using machine learning for effective public health management Predicting mortality risk in patients with COVID-19 using machine learning to help medical decisionmaking Machine learning algorithms: a review COVID-19 epidemic analysis and prediction using machine learning algorithms. Emerg Technol Battling Covid-19 Utilization of machine-learning models to accurately predict the risk for critical COVID-19 Machine learning for coronavirus covid-19 detection from chest x-rays Automatic detection of coronavirus disease X-ray and CT images: a machine learning-based approach Coronavirus disease (COVID-19) cases analysis using machine-learning applications Alleviation of antimicrobial therapy in ICU during COVID-19 second wave-a review paper A machine learning forecasting model for COVID-19 pandemic in India Analysis and prediction of COVID-19 pandemic in Bangladesh by using ANFIS and LSTM network Machine learning based approaches for detecting COVID-19 using clinical text data Supervised machine learning models for prediction of COVID-19 infection using epidemiology dataset COVID-19 infection detection from chest X-ray images using hybrid social group optimization and support vector classifier Detection of COVID-19 using CXR and CT images using Transfer Learning and Haralick features Machine learning-based model to predict the disease severity and outcome in COVID-19 patients. Hindawi Sci Program