key: cord-0710139-pz5nycyf authors: Göreke, Volkan; Sarı, Vekil; Kockanat, Serdar title: A novel classifier architecture based on deep neural network for COVID-19 detection using laboratory findings date: 2021-03-19 journal: Appl Soft Comput DOI: 10.1016/j.asoc.2021.107329 sha: a4c3d74a6bf813fbc0dff33bfbc9af506de4b5b0 doc_id: 710139 cord_uid: pz5nycyf Unfortunately, Coronavirus disease 2019 (COVID-19) is spreading rapidly all over the world. Along with causing many deaths, it has substantially affected the social life, economics, and infrastructure worldwide in a negative manner. Therefore, it is very important to be able to diagnose the COVID-19 quickly and correctly. In this study, a new feature group based on laboratory findings was obtained considering ethnical and genetic differences for interpretation of blood data. Then, using this feature group, a new hybrid classifier architecture based on deep learning was designed and COVID-19 detection was made. Classification performance indicators were obtained as accuracy of 94.95%, f1-score of 94.98%, precision of 94.98%, recall of 94.98% and AUC of 100%. Achieved results were compared with those of the deep learning classifiers suggested in literature. According to these results, proposed method shows superior performance and can provide more convenience and precision to experts for diagnosis of COVID-19 disease. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a novel zoonotic coronavirus that causes acute respiratory disease in humans. The disease known as COVID-19, has affected the whole world not only by being very contagious and causing many deaths but also affecting the social life, economics, and infrastructure [1, 2] . Therefore, it is clinically very important to be able to predict the progress of the disease. Recent studies have shown that the interpretation of blood findings have a substantial clinical application value in the progress of contagious diseases [3, 4] . At the same time, Guan et al. have detected anomalies in the blood findings by examining the clinical attributes of 1000 COVID-19 patients [5] . The studies have also determined that the blood values were able to have ethnical and genetic differences. Different types of blood findings may be more determinant in the detection of the disease in societies in different locations of the world [6] . Therefore, considering the genetic differences related to this disease, for which there is insufficient scientific data yet, is important in increasing the success in detection. For the doctors, the lack of experience about a very recent disease of which properties are not completely known, has led to difficulties in diagnosis. Due to all these limitations, an artificial intelligence (AI) system is needed to make right decisions in the diagnosis. AI has been actively used in healthcare systems to provide clinical decision support [7] . Especially in real world classification problems which medical datasets are used, data sets are called unbalanced data sets, where the number of samples between classes are different from each other. Studies to be carried out on these unbalanced training data sets negatively affect classifier performance [8] . In studies based on deep learning, eliminating this problem on data sets as a pre-process, is a more professional approach that helps to obtain more accurate and high-performance results. The most widely used class imbalance measure in the literature is calculated as the ratio of the sample numbers of the largest majority class and the smallest minority class and is called the imbalance ratio. The higher this ratio is the greater the imbalance scope of the dataset and causes over fitting problem in the classification process and decreases performance [9, 10] . A widely used method to eliminate the imbalance between J o u r n a l P r e -p r o o f data classes encountered by Deep Learning classifier models is SMOTE (Synthetic Minority Oversampling Technique) method. In this method, the number of samples of the class with a small number of samples is increased with synthetic data [11] . In addition to this, the training data increased by this method provides important objective gains for deep learning. These gains are expressed as increased stability and reduced over fitting [12] . In this study, firstly, a new feature group was obtained from blood findings. COVID-19 prediction was realized with three deep learning classifiers with different architectures using this feature group. The results were compared with the results in literature and classic laboratory findings. Contributions of the proposed model can be listed as follows: (1) With the proposed two-layer deep learning-based classifier architecture, it has achieved a superior performance than classical deep learning approaches. In particular, the preweighting vector, which gives the maximum classification accuracy, has been optimized using artificial bee colony (ABC) algorithm and the performance has been improved. (2) Ethnic and genetic differences affect the diagnosis of the disease based on blood data. Using the proposed feature acquisition method, it is ensured to obtain the features related to the value with higher discrimination in each different region. So, the performance of the classifier is also increased. (3) As a result, the proposed model can be used to help doctors in early diagnosis of the COVID-19 disease. (4) A novel classifier architecture based on deep neural network using for COVID-19 detection gives better results for classification. The paper is organized as follows. In Section 2, deep learning concept was explained. In Section 3, the studies about COVID-19 prediction using laboratory findings with artificial intelligence methods, were reviewed. In Section 4, firstly, an algorithm was developed to obtain a new feature set based on these findings and COVID-19 prediction was realized using three deep learning classifiers with different architectures. Accuracy, precision, recall, f1-score, and AUC parameters were compared with the results obtained by the classical findings. Afterwards, a hybrid classifier architecture based on deep learning J o u r n a l P r e -p r o o f Journal Pre-proof was designed and classification performance was measured with the same measurement parameters mentioned above. In Section 5, the conclusions of the study were interpreted, and future work is presented. Deep learning has gained popularity because of the performance improvements of hardware units such as graphics cards and the decrease in the unit prices. The increase in the number of training data and studies about machine learning and information processing have also contributed to deep learning. Different deep learning architectures such as artificial neural network (ANN), convolutional neural network (CNN) and recurrent neural network (RNN) have been widely used in many fields such as image classification, natural language processing and speech recognition [13, 14] . ANN is an information processing approach inspired by the human biological nervous system. It consists of neurons, activation functions, input, output, and hidden layers. Every layer in ANN contains a set of neurons that form a hierarchy. The output of the previous layer is the input to the next layer. Each of the layers learn more complicated relationships from the input data. CNN is a deep learning algorithm developed for visual data processing such as images and videos. CNN has various layer types fulfilling various duties. These are called convolution layer, activation function layer, pooling layer, fully connected layer, and dropout layer [13] . In RNN structures, the result depends not only on the current inputs but also on the other inputs. These networks generate their output with the combination of current and previous data [15] . Swarm intelligence is a research area that models the behavior of self-organizing populations such as flock of birds, ant colonies or bee colonies mathematically. In recent years, the development of swarm intelligence algorithms and the effective results of them in solving engineering problems in many research areas has attracted the attention of many researchers and practitioners. Especially Artificial Bee Colony (ABC) algorithm came to the fore with its successful, easy and effective use. Karaboga introduced the ABC algorithm that is modeled food search behavior of honey bees. Using literature benchmark functions, the performance of the ABC algorithm has been compared with those of the well-J o u r n a l P r e -p r o o f Journal Pre-proof known meta-heuristic algorithms such as differential evolution (DE) and particle swarm optimization (PSO) [16, 17] . Also, ABC algorithm has been efficiently employed in different research areas such as signal and image processing [18] [19] [20] . Especially in recent times, it has been successfully applied in hybridized approaches based deep learning and machine learning techniques [21, 22] . Initially, computer aided diagnosis systems (CADs) that are implemented via deep learning classifiers with laboratory findings as inputs, were reviewed. Literature review was realized in Science Direct, IEEE, PubMed, Google Scholar, and arxiv.org with keywords such as COVID-19 blood artificial intelligence/deep learning, COVID-19 laboratory data deep learning, and the results were summarized below. Alakus and Turkoglu used a laboratory data set that contains 600 samples to compare deep learning approaches for prediction of COVID-19. The experimental results were obtained as 92.30% accuracy, 93% f1-score, 92.35% precision, 93.68% recall and 90% AUC [29] . The aim of this study is to obtain a new feature group based on laboratory findings using the developed algorithm and to detect COVID-19 with a new hybrid classifier based on deep learning that uses the new feature group as input parameters. Monocyte, and Platelet in the prediction of the disease using SPSS. Additionally, it has also been encountered that the medical experts have rated some of these laboratory findings together such as Neutrophils-to-lymphocytes ratio [30] . According to the researches, blood findings may give different results in different ethnic and genetic groups [6] . Therefore, the parameters that depend on the blood findings can have differences in discrimination due to the ethnical and genetic factors when used in deep learning systems. For example, while Monocyteto-lymphocytes ratio [30] is a more important parameter in the diagnosis of COVID-19 using artificial intelligence blood findings sampled from a European country, Neutrophils-to-lymphocytes ratio is more significant in Asian genetics. By applying the algorithm given below, first the data set was divided into patient and healthy groups in terms of each laboratory finding (parameters). Distribution test was applied for each parameter. p values were calculated using Student's test for the data with normal distribution, and Mann-Whitney U test for the others [31] . As shown in Table 1 , the finding with the highest p value was serum glucose. Different parameters may be more determinant in populations belonging to different locations. Thanks to this method, genetic and ethnic differences are also taken into account. In this study, first the rating method of Sun et al. [30] that was applied for some of the findings, was applied to all findings using Serum Glucose which has the highest p value among classes. To the best of our knowledge, there is no study dealing with this aspect of the subject for computer-aided diagnostic systems. The algorithm to obtain a new feature group is given below. The block diagram of the proposed architecture is given in Figure 1 . In architecture, the pre-processing layer functions as the balancing layer using the SMOTE technique. The data set based on classical findings includes 520 samples diagnosed with COVID-19 and 80 healthy samples. Here, the ratio of the majority sample number to the minority sample is 6.5. By the SMOTE technique, this ratio was reduced to 1 [9, 10] . The SMOTE method is based on the algorithm which is given below [32] . In the literature, multi-layer architectures have been used in various areas in the artificial intelligencebased classification problems. In a study, the system architecture was designed, as each ANFIS output is an input to the next ANFIS layer [33] . Uzunhisarcıklı and Göreke proposed a two-layer fuzzy inference system for mass detection in ultrasound images [34] . Liu et al. used a CNN neural network to identify diabetic retinopathy from eye fundus images. In their work, they used a pre-weighting coefficient for the convolution layer outputs of the neural network [35] . In this study, pre-weighting vector was applied for the input layer of the deep neural network and a twolayer classifier in hybrid structure was proposed. In Figure 1 , X1… Xn represent the new feature group in the pre-weighting layer which is called as Layer 1. In Layer 1, the weights of the input parameter were determined by calculating a pre-weighting coefficients. Calculation of these coefficients ( 1 ... n k k ) is an optimization problem and the ABC algorithm was efficiently used to optimize them. The pseudo-code structure of the basic ABC algorithm is given in Algorithm 3. Apply greedy selection process between i k and i v for the employed bees. Calculate the fitness of the solution i k using 1/ (1 ( )), For In the software, SVM code was run inside the ABC algorithm code. The developed software calculates the optimum pre-weighting vector that will provide the best accuracy in the SVM classifier. In the pre-weighting vector optimization, ABC algorithm was compared with particle swarm optimization (PSO) and genetic algorithm (GA). For a fairly comparison, population size and maximum iteration number were selected 20 and 500. Lower and upper bound were determined as 1 and 3. Also, competitor algorithms were run 20 times with different initial values. In Table 2 The layer called as Layer 2 represents the three deep learning classifiers with different basic architectures. In this study, these three deep learning classifiers were designed separately and their performance was measured. The RNN type that provides the best performance was selected. In this study, three deep learning classifiers with basic architectures were used. The hyper-parameter values of each architectures are given in Table 3 . Normally, in the artificial intelligence studies with relatively small datasets, cross-validation method is mostly preferred. But especially in medical clinical application, this cross-validation approach gives fewer clear results [29] . In this study, to have a clearer result, the tests were realized using train-test split approach and cross-validation approach was not preferred. Also, split ratio was determined as 20. The performances of deep learning classifiers with new features were measured with accuracy, recall, precision, and f1-score metrics. The mathematical representations of these metrics are given with Here TP (true positive) is the number of patients diagnosed as patient, TN (true negative) is the number of healthy individuals diagnosed as healthy, FP (false positive) is the number of healthy individuals diagnosed as patient and, FN (false negative) is the number of patients diagnosed as healthy [36] . Additionally, receiving operating characteristic (ROC) curve gives the change between the recall and FP / (TN+FP) ratio [37] , and it is used in the calculation of AUC (area under the curve). AUC is interpreted as the possibility of a higher test measure of a random patient than a random healthy J o u r n a l P r e -p r o o f individual [38] . The reliability and the validity of the model is proven by measuring the accuracy of the model [39] . f1-score is used for determining that the classifier has small FP and FN values [26] . Tests have been realized on an i7 2.6 GHz computer using Microsoft Visual Studio Community 2017 IDE and Python Anaconda 5.2.0 environment. In the first stage of the test process, the performance of the new feature group against classical laboratory findings was measured with classifiers in standard architecture. In the second step, the new feature group was classified using the classifiers in the proposed architecture and its performance was compared with the studies in the literature. By the implemented algorithm, the new feature set was applied as the input to three deep learning classifiers with basic architectures. The performance of each classifier according to the label information in the data set was measured by accuracy, recall, precision and f1-score metrics and the ROC graphs were plotted. The results are given in Table 4 . ROC graphs obtained using novel features are shown in The performance results of deep learning classifiers using laboratory findings for the detection of COVID-19 are given in Table 5 [29] . When the results of Table 4 and Table 5 are compared, it is seen that results in Table 4 outperform the results in Table 5 . The averages of each performance measure of three different classifiers using both classical and the novel features are given in Table 6 . The ROC graphic obtained with the proposed method for ANN, CNN and RNN architecture are shown in Figure 5 -7. The performance comparison of the proposed method against the studies in the literature is given in Table 7 . J o u r n a l P r e -p r o o f and Turkoglu for the classifiers with the same architecture. When Table 4 and Table 5 are compared, the new features that depend on the laboratory findings have better results than the classical findings for the prediction of COVID-19. Considering the results given in Table 7 , it can be stated that the proposed method was obtained more successful than in literature works. This study depends on the samples taken from Hospital Israelita Albert Einstein. In the future work, using data sets obtained from the groups with different genetics (different countries etc.), the effect of the biological and genetic differences in the prediction of COVID-19 will be searched. J o u r n a l P r e -p r o o f These differences can determine more clearly that the different findings are more effective on different groups for the detection of COVID-19. Therefore, the parameter detection process of CAD systems that implement deep learning can be shortened for different regions, and it can also have a positive contribution on the CAD performance. It helps the doctors to have more accurate answers in the decision process. Factors such as the environment of the swab sample from the mouth and the qualification of the sample affects the accuracy of the RT-PCR test used worldwide for the diagnosis of the disease [40, 41] . In this regard, using computer aided diagnosis tools such as deep neural networks implemented in this study can help doctors in the early diagnosis of the disease. With the early diagnosis of the disease, more successful results can be achieved in the treatment and the mortality rate can be decreased. COVID-19: Psychological and Psychosocial Impact, Fear, and Passion Finding an Accurate Early Forecasting Model from Small Dataset: A Case of 2019-nCoV Novel Coronavirus Outbreak Combination of the Preoperative Systemic Immune-Inflammation Index and Monocyte-Lymphocyte Ratio as a Novel Prognostic Factor in Patients with Upper-Tract Urothelial Carcinoma The significance of neutrophil-lymphocyte ratio, platelet-lymphocyte ratio and lymphocyte-monocyte ratio in predicting peripheral arterial disease, peripheral neuropathy, osteomyelitis and amputation in diabetic foot infection Clinical characteristics of coronavirus disease 2019 in China Distributions of Ethnic Differences in Blood Counts are Suggestive of Genetic Variability The potential for artificial intelligence in healthcare Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance Adjusting the imbalance ratio by the dimensionality of imbalanced data Imbalanced data learning by minority class augmentation using capsule adversarial networks Effective data generation for imbalanced learning using conditional generative adversarial networks Data augmentation for deep-learning-based electroencephalography Comparison of Type-2 Fuzzy Inference Method and Deep Neural Networks for Mass Detection from Breast Ultrasonography Images Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) network On the performance of artificial bee colony (ABC) algorithm A comparative study of artificial bee colony algorithm Acceleration harmonics estimation and elimination with MABC-RLS algorithm: Simulation and experimental analyses on shaking table Adaptive FIR Filtering Using ABC Algorithm: a Noise Reduction Application on Mitral Valve Doppler Signal A novel 2D-ABC adaptive filter algorithm: A comparative study Towards improving the convolutional neural networks for deep learning using the distributed artificial bee colony method Combined artificial bee colony algorithm and machine learning techniques for prediction of online consumer repurchase intention Clinical Predictive Models for COVID-19: Systematic Study Artificial intelligence-enabled rapid diagnosis of patients with COVID-19 Use of Machine Learning and Artificial Intelligence to predict SARS-CoV-2 infection from Full Blood Counts in a population Towards an artificial intelligence framework for data-driven prediction of coronavirus clinical severity COVID-19 diagnosis prediction in emergency care patients: a machine learning approach Detection of COVID-19 Infection from Routine Blood Exams with Machine Learning: A Feasibility Study Comparison of deep learning approaches to predict COVID-19 infection Abnormalities of peripheral blood system in patients with COVID-19 in Whitney test of distributional effects in a multivalued treatment SMOTE: Synthetic Minority Oversampling Technique Multi-layer architecture for adaptive fuzzy inference system with a large number of input features A novel classifier model for mass classification using BI-RADS category in ultrasound images based on Type-2 fuzzy inference system Referable diabetic retinopathy identification from eye fundus images with weighted path for convolutional neural network A deep learning framework for automatic diagnosis of unipolar depression Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation Evidence Based Emergency Medicine; Part 5 Receiver Operating Curve and Area under the Curve Evaluating information: validity, reliability, accuracy, triangulation Virological assessment of hospitalized patients with COVID-2019 Sensitivity of Chest CT for COVID-19: Comparison to RT-PCR