key: cord-1019660-ts9lzr1b authors: Shanbehzadeh, Mostafa; Nopour, Raoof; Kazemi-Arpanahi, Hadi title: Developing an artificial neural network for detecting COVID-19 disease date: 2022-01-31 journal: J Educ Health Promot DOI: 10.4103/jehp.jehp_387_21 sha: d8c22cd89177a4280ef33fdbd9445d32c6d27aac doc_id: 1019660 cord_uid: ts9lzr1b BACKGROUND: From December 2019, atypical pneumonia termed COVID-19 has been increasing exponentially across the world. It poses a great threat and challenge to world health and the economy. Medical specialists face uncertainty in making decisions based on their judgment for COVID-19. Thus, this study aimed to establish an intelligent model based on artificial neural networks (ANNs) for diagnosing COVID-19. MATERIALS AND METHODS: Using a single-center registry, we studied the records of 250 confirmed COVID-19 and 150 negative cases from February 9, 2020, to October 20, 2020. The correlation coefficient technique was used to determine the most significant variables of the ANN model. The variables at P < 0.05 were used for model construction. We applied the back-propagation technique for training a neural network on the dataset. After comparing different neural network configurations, the best configuration of ANN was acquired, then its strength has been evaluated. RESULTS: After the feature selection process, a total of 18 variables were determined as the most relevant predictors for developing the ANN models. The results indicated that two nested loops' architecture of 9-10-15-2 (10 and 15 neurons used in layer 1 and layer 2, respectively) with the area under the curve of 0.982, the sensitivity of 96.4%, specificity of 90.6%, and accuracy of 94% was introduced as the best configuration model for COVID-19 diagnosis. CONCLUSION: The proposed ANN-based clinical decision support system could be considered as a suitable computational technique for the frontline practitioner in early detection, effective intervention, and possibly a reduction of mortality in patients with COVID-19. E merging and new pathogens are major threats to global public health. This is principally true for virus-induced diseases that are extremely contagious due to widespread person-to-person transmission and have asymptomatic infectivity periods. [1] [2] [3] Since December 2019, a new strand of coronavirus named severe acute respiratory syndrome coronavirus-2 (COVID-19) was detected in Wuhan District, China, and the outbreak continues to spreading aggressively worldwide. It is thought that the SARS-CoV-2 outbreak has animal origins that slipped from animal species into the human population. The complex and highly contagious nature of COVID-19 had led the World Health Organization (WHO) to pronounce this disease a global health crisis. [4, 5] The WHO and other health officials have recommended some safeguard measures including implementing physical distancing, wearing personal protective equipment, and sanitizing the hands to avoid and reduce the spread of the disease. [6, 7] Despite severe preventive measures and lockdown policies, COVID-19 has now become a pandemic on a global scale, which made a tremendous impact on the health and safety of people all over the world, affecting their lives and causing an escalating number of deaths. In addition, many indirectly devastating outcomes are derived from this pandemic leading to psychological distress and socio-economic crises in many societies. [8] [9] [10] [11] Rapid transmission and high rate of mortality particularly in susceptible populations such as the elderly, and people with underlying medical problems, make it necessary to seek early detection and isolation of positive cases as rapidly and accurately as possible for containing the transmission of the virus, especially for individuals with no sign of symptoms in an early stage. [12] [13] [14] [15] [16] [17] These vulnerabilities emphasize the need for early and accurate diagnosis methods for COVID-19 and prompt confinement of the infected people in the absence of a specific vaccine or treatment. [18, 19] In this situation, many governments and public health authorities across the world have been searching for new and innovative technologies as alternative solutions to screening, monitoring, and tracing infected persons. Artificial intelligence (AI) may be a unique preparation to take up this challenge. [20, 21] AI is a broad field that refers to the capability of a machine to learn from experience, adjust to new inputs, and simulating human intelligence tasks. [22] Machine learning is a subset of AI and that it can be fueled with a huge dataset for automatically extracting high-quality models. [23] Artificial neural network (ANN) is biologically system with an adaptive, self-learning, and computational construction simulating the functions of human neurons. [24, 25] This technique can be trained to recognize and categorize complex patterns of diseases through an iterative learning process. Once proper training is achieved, the ANNs try to forecast with greater accuracy than traditional statistical techniques. Due to its capabilities to identify multifarious nonlinear relations between predictor variables and corresponding outcome variables, it has been effectively applied in clinical decision support system (CDSS) to provide solutions for different numerous problems. [26] [27] [28] [29] [30] Some studies [31] [32] [33] [34] [35] [36] showed that the ANN-based prediction models using clinical or laboratory data can be significantly helpful in a timely, effective, and economical diagnosis of the disease. It can discriminate the COVID-19 from other similar conditions with better accuracy compared to traditional approaches. The ANNs provide timely screening, identify the disease at early asymptomatic phases, and promptly confinement the infected cases. Some other studies are focusing on the deep/convolutional neural network technique to detect any distinguish features from chest X-ray images of COVID-19 patients for identification of disease. [37] [38] [39] [40] [41] [42] [43] It can also provide an automated medical diagnostic system to support health-care specialists for enhanced decision-making with the aim of detection and management of COVID-19 disease. Other applications of ANNs in the management of the COVID-19 epidemic include prognosis, prediction, and risk assessment of individuals for disease outcomes (individual level), [33, 34, [44] [45] [46] [47] [48] [49] [50] the ANNs also can be used in the prediction of disease outbreak trends at the macrolevel (community), [51] [52] [53] and finally, it also employed to predict the hospital resources utilization (bed occupancy, length of stay, etc.). [54, 55] The present study aimed to establish an intelligent system based on back-propagation ANN for earlier diagnosis of COVID-19 by training on a retrospectively collected dataset (clinical and laboratory data) specifically for frontline practitioners. This retrospective study was conducted in 2020, consisted of four sequential steps as follows. This retrospective and the single-center study was conducted in Ayatollah Taleghani Hospital, which is the focal center for COVID-19 special care and treatment in South West of Khuzestan, Iran. The experimentation is ethic compliant and has been approved a certificate of ethics (code: IR.ABADANUMS. REC.1400.008) by the Ethics Committee board of the Abadan University of Medical Sciences. A total of 4369 supposed COVID-19 cases were referred to this center, February 9, 2020, to October 20, 2020. Of those, 2814 cases were identified as suspicious. By applying the predefined exclusion criteria, 435 cases remained. After a quantitative analysis of medical records, 35 incomplete records that had a lot of missing data (more than 70% missing) were excluded from the analysis; and 400 records have remained. After the test, 250 and 150 cases were confirmed as positive and negative reverse transcription-polymerase chain reaction respectively. A flowchart to represent the patient selection methodology is given in Figure 1 . It is effective in reducing the input number for processing, or finding the most meaningful inputs to reduce the dimensions of the dataset for increasing the data mining performance and its calculation capabilities. In our study, we had the 38 independent variables including different criteria for COVID-19 diagnosis such as demographic (age, sex, and body mass index), clinical findings (respiratory rate, body temperature, fever, cough, weakness, dyspnea, disability, chest pain, throat pain, rhinorrhea, headache, rhinophyma, tremor, digestive sign, loss of sensation, lung lesion existence, lung lesion appearance, and statue and pulmonary infection), epidemiological factors (job risk, travel to high-risk regions, contact history, contact type, contact number, exposure time, geographic living, and contact with susceptible people), and medical and personal history (history of drinking alcohol, in taking Vitamin D, smoking, in taking blocker, history of acute respiratory distress syndrome, and history of pregnancy) and oxygen saturation in the blood. Considering the qualitative variables that existed in the research database, the phi coefficient correlation has been used to investigating the meaningful relationship between the inputs (diagnostic criteria) and outputs (negative or positive COVID-19) variables, statistically. P < 0.05 was considered for a statistically meaningful level. IBM SPSS Statistics V25.0 (Armonk, NY: IBM Corp., USA) was used for this purpose. Selecting the ANN characteristics and efficient model is the key prerequisite to improving model performance. The ANN model used in this research was a standard feed-forward, back-propagation neural network (BPNN) with three input, intermediate (hidden), and output layers. The BPNN is a deep learning method in ANN with more than one hidden layer (multi-layered preceptors [MLP] ). [56, 57] BPNN is the best technique for training in MLP of ANN. This method is often done by optimizing the learning algorithm and the weight of neurons by calculating the decreasing gradient of the cost function. It is a kind of multilayer feed-forward neural network which uses supervised learning technique for diseases prediction. [58] [59] [60] All data were entered into the MLP as a new and the most common design tool for layered feed-forward neural networks [ Figure 2 ]. An MLP architecture includes three layers (an input layer, a hidden layer, and an output layer). Each node in MLP or generally in every ANN uses a stimulating method for communicating to other nodes that this process can be simulated with nonlinear stimulating function in the ANNs. MLP uses a supervised learning technique called repetition for training. [61] [62] [63] This training algorithm stabilizes the weights of the neurons according to the error that existed between the features of the real and target class to make a suitable relationship between the input and output classes by the nonlinear connection between neurons. [64] Furthermore, the Levenberg-Marquardt was used in this research because of its popularity in error reduction and increasing the efficiency in the calculation process. [57] The MLP activation function (tansig function) was implemented in the MATLAB 2013a that was used in this study as an ANN activation method and physiologic connection between ANN's neurons like human's NN. [60] Developing artificial neural network architecture In this study, to determine the best configuration of the ANN, we used the different types of the ANNs configuration by different hidden layers with the number of the neurons that existed in them for data processing and performance evaluation based on different evaluation criteria such as sensitivity, specificity, and accuracy. In this step, the datasets were split into both training and testing. About 70% of cases were for training and 30% for the testing process. Finally, the final architecture of ANN for COVID-19 diagnosis was acquired based on measuring and comparing the sensitivity, specificity, and accuracy of different ANN configuration types. Our conditional threshold in ANN's COVID-19 diagnosis was the 0.5 value; the uninfected people was considered <0.5 (0.5 ≤ x) and positive outputs were classified more than 0.5 (0.5 > x). The result of determining the most important diagnostic criteria based on the phi coefficient at P < 0.05 is demonstrated in Table 1 . Based on the information provided in Table 1 , the cough (φ = 0.621) (P = 0.00405), fever (φ = 0.545) (P = 0.00512), lung lesion existence (φ = 0.6) (P = 0.00258), and body temperature (φ = 0.554) (P = 0.005405) had obtained the most amount of correlation coefficient at P < 0.05; therefore, in this research, they were considered as the most important diagnostic for diagnosing the COVID-19. In general, the 18 diagnostic criteria acquired the determined correlation coefficient at P < 0.05. After comparing different configurations of ANNs by evaluating the three mentioned comparison criteria [ Table 2 ], the most common architecture of ANN was obtained [ Figure 3 ]. Indeed, the architecture of 18-28-20-2 (28 and 20 neurons in hidden layer 1 and hidden layer 2, respectively) had been gotten as the best configuration for designing the ANN for diagnosing the COVID-19 disease. The receiver operating characteristic (ROC) plot of the ANN is depicted in Figure 5 , and the result of calculating the area under the curve (AUC) demonstrated that these ANNs had an efficient classification strength (AUC = 0.982) in diagnosing the COVID-19 and non-COVID-19 cases with being closed the curve to the true positive rate (perfect classifier than the random type); on the other hand, the AUC plot indicated that the ANN diagnosing model had a high diagnostic power with the high TP and TN rate and low FN and FP rate. This curve also was the best in terms of efficiency among all the ANN configurations. In Figure 7 , the Clinical Decision Support System User Interface for COVID-19 diagnosis was designed by MATLAB v 2013a software (The MathWorks, Inc., Natick, Massachusetts, USA), in which, the users such as a physician could enter the data about their patients, then the system suggests the best recommendation about having COVID-19 disease or not. The high risk of infection, vague characteristics, the uncertainty of nature, long incubation period, vigorous progression, and difficulties for conduct laboratory tests make COVID-19 a critical public health issue that raised intense attention internationally. [65] In this situation, a timely and accurate diagnosis can provide a better plan for health policymakers and clinicians to mitigate disease outbreaks and improve patient survival probability. [56] To this end, developing intelligent models for COVID-19 diagnosis is very crucial in determining their likely new cases at an early stage. [23, 66] The purpose of this study was to develop an intelligent model for detecting the presence or absence of COVID-19 based on ANN techniques. So far, several types of research have been focused on applying and evaluating the ANN techniques in COVID-19 early prognosis, risk assessment, and trend [11] and Moftakhar et al., [71] compared the prediction performance of statistical (regression) and computational (ANN) models in COVID-19 diagnosis, and finally, the ANN exhibits better performance than the regression model. The results of the current study illustrated that the designed ANN model can appropriately identify the COVID-19 cases using parameters that are readily available in clinical practice. To that end, the data were balanced and then used as contributor predictors for the ANNs. Later, the models were developed and their performance was evaluated. The key findings of our study, first, identify the most important clinical predictors using logistic regression, and then a promising performance level with an AUC of 0.982. In the first step, we identified 18 significant predictors [ Table 1 ] which were independently associated with COVID-19. However, the sensitivity, specificity, and accuracy were 96.4%, 90.6%, and 94%, respectively. The ANN model has robust error tolerance; thus, it can be extensively used in the fields of prediction and analysis. [72] Furthermore, leveraging the potential of an ANN-based CDSS would assist health-care providers to make better decisions concerning COVID-19 (diagnosis, classification, etc.). Despite standard statistical approaches (e.g., logistic regression) that need further modeling processes, ANNs do not necessitate distributional assumptions. [73] In addition, contrasting to traditional statistical-based prediction methods, this study offers a new technique for modeling complex nonlinear relationships in spatial epidemiology. Such a prediction model can be employed even for analyzing noisy, imbalanced, and inadequate datasets. Several limitations need to be addressed. First, the dataset was obtained from a single center that limits the external validity of the results; thus, future multi-central datasets and external validation possibly will improve the developed model. Second, only the data of 400 patients were included to devise the model. It is considered a small population and the probability of an overfitting problem. To overcome these limitations and improve the results, we recommend prospective, multicenter teamwork, with a great dataset. In this research, by introducing a scientific and noninvasive evidence-based method, we will be able to propose the best ANN configuration for COVID-19 detection based on the most effective diagnostic criteria. The proposed configuration appears to have a higher performance than the conventional evaluation approaches, and also can be used by physicians to improve their diagnostic performance. We rely on that, in future, an ANN-based CDSS risk assessment will be existing for use in the health-care facilities, which will be straightforward for clinicians to use. We anticipate that this technique may apply to wider fields of medicine, facilitating the complex and nonlinear information processing about patients, and leading to the establishment of personalized risk profiles. We have created and tested an ANN model for COVID-19 diagnosis based merely on patient history and exposure parameters commonly available in inpatient medical records. Our results reveal that ANN can offer high specificity and good sensitivity for the diagnosis of COVID-19. The results also disclosed that ANN could discriminate COVID-19 from other viral pneumonia and flu-like diseases with high accuracy. While our neural network could be potentially used as a clinical tool for COVID-19 diagnosis, further development with more clinical variables included and Identification of COVID-19 can be quicker through artificial intelligence framework using a mobile phone-based survey when cities and towns are under quarantine A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: A study of a family cluster Intermediate versus standard-dose prophylactic anticoagulation and statin therapy versus placebo in critically-ill patients with COVID-19: Rationale and design of the INSPIRATION/INSPIRATION-S studies Di Napoli R. Features, evaluation and treatment coronavirus (COVID-19) World Health Organization declares global emergency: A review of the 2019 novel coronavirus (COVID-19) Physical activity guideline for social distancing during COVID-19 Knowledge of COVID-19 and its implications in dental treatment, and practices of personal protective equipment among dentists: A survey-based assessment Will COVID-19 generate global preparedness? Prediction of epidemic trends in COVID-19 with logistic model and machine learning technics From SARS to COVID-19: A previously unknown SARS-CoV-2 virus of pandemic potential infecting humans -Call for a One Health approach Modeling and prediction of COVID-19 in Mexico applying mathematical and computational models A COVID-19 risk assessment decision support system for general practitioners: Design and development study COVID-19 detection with multi-task deep learning approaches Utility of artificial intelligence amidst the COVID 19 pandemic: A review Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease-2019 (COVID-19): The epidemic and the challenges COVID-19 and diabetes: Knowledge in progress Cancer patients and research during COVID-19 pandemic: A systematic review of current evidence The optimal diagnostic methods for COVID-19 The diagnostic methods in the COVID-19 pandemic, today and in the future Artificial intelligence approach to predict the COVID-19 patient's recovery Development and evaluation of an AI system for COVID-19 diagnosis Considerations for development and use of AI in response to COVID-19 Artificial Intelligence for infectious disease Big Data Analytics A machine learning approach for handling big data produced by high resolution mass spectrometry after data independent acquisition of small molecules -Proof of concept study using an artificial neural network for sample classification Machine learning and artificial neural network prediction of interfacial thermal resistance between graphene and hexagonal boron nitride Simple scoring system and artificial neural network for knee osteoarthritis risk prediction: A cross-sectional study Artificial neural network modeling of novel coronavirus (COVID-19) incidence rates across the continental United States Within the lack of chest COVID-19 X-ray dataset: A novel detection model based on GAN and deep transfer learning Computer vision detection of foreign objects in walnuts using deep learning Prioritizing and analyzing the role of climate and urban parameters in the confirmed cases of COVID-19 based on artificial intelligence applications Predicting the dynamics of the COVID-19 pandemic in the united states using graph theory-based neural networks A novel classifier architecture based on deep neural network for COVID-19 detection using laboratory findings Artificial neural networks for prediction of covid-19 in Saudi Arabia Using artificial neural network with prey predator algorithm for prediction of the COVID-19: The case of Brazil and Mexico Artificial neural network modeling of novel coronavirus (COVID-19) incidence rates across the continental United States Neural network based country wise risk prediction of COVID-19 A deep convolutional neural network for COVID-19 detection using chest X-rays Automatic COVID-19 detection from X-ray images using ensemble learning with convolutional neural network COVID-19) detection from chest radiology images using convolutional neural networks Fast automated detection of COVID-19 from medical images using convolutional neural networks Convolutional neural network model based on radiological images to support COVID-19 diagnosis: Evaluating database biases Classification of Medical Images of Patients With COVID-19 Using Transfer Learning Technology of Convolutional Neural Network Journal of Physics: Conference Series Chest X-ray image phase features for improved diagnosis of COVID-19 using convolutional neural network COVID 19 Prediction from X Ray Images Using Fully Connected Convolutional Neural Network COVID-19 mortality rate prediction for India using statistical neural network models Genetic Optimization of Ensemble Neural Network Architectures for Prediction of COVID-19 Confirmed and Death Cases. Germany: Studies in Computational Intelligence A comparison: Prediction of death and infected COVID-19 cases in Indonesia using time series smoothing and LSTM neural network Forecasting of COVID-19 cases based on prediction using artificial neural network curve fitting technique Machine learning models for the prediction the necessity of resorting to ICU of COVID-19 patients Machine Learning to Predict ICU Admission, ICU Mortality and Survivors' Length of Stay among COVID-19 Patients: Toward Optimal Allocation of ICU Resources COVID-19 Outbreak Prediction Using Quantum Neural Networks. Singapore: Advances in Intelligent Systems and Computing Detection and Spread Prediction of COVID-19 from Chest X-ray Images using Convolutional Neural Network-Gaussian Mixture Model Generated time-series prediction data of COVID-19′s daily infections in Brazil by using recurrent neural networks Development and validation of a machine learning model for predicting illness trajectory and hospital resource utilization of COVID-19 hospitalized patients -a nationwide study Machine Learning to Predict ICU Admission, ICU Mortality and Survivors' Length of Stay among COVID-19 Patients: Toward Optimal Allocation of ICU Resources Evaluation of feed-forward backpropagation and radial basis function neural networks in simultaneous kinetic spectrophotometric determination of nitroaniline isomers Backpropagation learning algorithm based on Levenberg Marquardt Algorithm Identifying microbe-disease association based on a novel back-propagation neural network model Forecasting of bioaerosol concentration by a Back Propagation neural network model Tansig activation function (of MLP network) for cardiac abnormality detection Comparison of soil and water assessment tool (SWAT) and multilayer perceptron (MLP) artificial neural network for predicting sediment yield in the Nagwa agricultural watershed in Jharkhand Interpolating monthly precipitation by self-organizing map (SOM) and multilayer perceptron (MLP) Multilayer perceptrons for the classification of brain computer interface data Backpropagation artificial neural network and central composite design modeling of operational parameter impact for sunset yellow and azur (II) adsorption onto MWCNT and MWCNT-Pd-NPs: Isotherm and kinetic study Forecasting the prevalence of COVID-19 outbreak in Egypt using nonlinear autoregressive artificial neural networks Statistical Methods in Spatial Epidemiology Prognostic modeling of COVID-19 using artificial intelligence in the United Kingdom: Model development and validation Convolutional capsnet: A novel artificial neural network approach to detect COVID-19 disease from X-ray images using capsule networks Can AI help in screening viral and COVID-19 pneumonia? Diagnosis and detection of infected tissue of COVID-19 patients based on lung x-ray image using convolutional neural network approaches Exponentially increasing trend of infected patients with COVID-19 in Iran: A comparison of neural network and ARIMA forecasting models The application of artificial neural networks and logistic regression in the evaluation of risk for dry eye after vitrectomy Applications of machine learning and artificial intelligence for Covid-19 (SARS-CoV-2) pandemic: A review This article is the result of a research project approved by the Research Committee at Abadan Faculty of Medical Sciences (Iran) (ethic code number: IR.ABADANUMS. REC.1400.008). All subjects signed an informed consent form before participating in the study. We thank the research deputy of the Abadan faculty of medical sciences for financially supporting this project. Nil. There are no conflicts of interest.