key: cord-1032896-yz3w8eu6
authors: Sun, Junding; Li, Xiang; Tang, Chaosheng; Wang, Shui-Hua; Zhang, Yu-Dong
title: MFBCNNC: Momentum factor biogeography convolutional neural network for COVID-19 detection via chest X-ray images
date: 2021-09-15
journal: Knowl Based Syst
DOI: 10.1016/j.knosys.2021.107494
sha: b35c3fb63ae758d8e3dd97ea10cb8143f67acbc5
doc_id: 1032896
cord_uid: yz3w8eu6

AIM: By October 6, 2020, coronavirus disease 2019 (COVID-19) had been diagnosed worldwide in 33,557,427 people and had caused 1,037,862 deaths. Detecting COVID-19 and pneumonia from chest X-ray images is of great significance for controlling the development of the epidemic. Current COVID-19 and pneumonia detection systems may suffer from two shortcomings: the hyperparameters of the models are not selected appropriately, and the generalization ability of the models is poor.

METHOD: To solve the above problems, our team proposed an improved intelligent global optimization algorithm, based on biogeography-based optimization, that automatically optimizes the hyperparameter values of the models according to the detection objective. During optimization, after the immigration suitable index vector and the emigration suitable index vector are selected, we add a comparison operation on their values. According to the numerical relationship between them, the corresponding operation is performed, which improves the migration operation of biogeography-based optimization. The improved algorithm (momentum factor biogeography-based optimization) performs the automatic optimization better. In addition, our team proposed two frameworks, the biogeography convolutional neural network and the momentum factor biogeography convolutional neural network, and two methods for detecting COVID-19 based on these frameworks.

RESULTS: Our method used three convolutional neural networks (LeNet-5, VGG-16, and ResNet-18) as the basic classification models for detecting COVID-19, Normal, and Pneumonia in chest X-ray images. The accuracy of LeNet-5, VGG-16, and ResNet-18 improved by 1.56%, 1.48%, and 0.73%, respectively, after biogeography-based optimization was used to optimize the hyperparameters of the models, and by 2.87%, 6.31%, and 1.46%, respectively, after momentum factor biogeography-based optimization was used.

CONCLUSION: Under the same experimental conditions, momentum factor biogeography-based optimization is superior to biogeography-based optimization in optimizing the hyperparameters of the convolutional neural networks. Experimental results show that momentum factor biogeography-based optimization can improve the detection performance of state-of-the-art approaches in terms of overall accuracy. In future research, we will continue to use and improve other global optimization algorithms to enhance the application ability of deep learning in medical pathological image detection.

with an accuracy of 78%, a sensitivity of 80%, a specificity of 53%, and AUC of 71%. In the literature above, the accuracy of the CNN models is lower than that of the more complex network models.

Although the detection accuracy of ResNet-50 and UNet++ is relatively high, they are verified only on the binary classification problem, which is relatively simple. It has practical significance only for distinguishing COVID-19 from Normal in chest X-ray images, and it has less reference value for detecting COVID-19, Normal, and Pneumonia in chest X-ray images.

that the accuracy of VGG-16 can be increased by 7.2% by using Bayesian inference. Feng et al. [13] used VB-Net to segment the chest images and applied a random forest method, based on the size of the infected area, to the segmented images to classify the COVID-19 images. Zheng et al. [14] proposed a model that uses a 3D deep neural network to detect COVID-19 in CT images. Although many optimization methods are used in the models mentioned above, naive Bayes requires the assumption that the features are independent of one another. When a random forest needs many decision trees, its time complexity and space complexity are relatively large, and if the samples are noisy, the random forest algorithm is prone to overfitting. Besides, the above models perform well only on the datasets they were built for. When the category of the data changes, the accuracy of the models is affected. Therefore, the above models have low robustness and poor generalization ability.

To solve the above problems and improve the accuracy of the CNNs on the chest X-ray image dataset for detecting COVID-19, Normal, and Pneumonia, the momentum factor biogeography-based optimization is proposed to optimize the hyperparameters of three convolutional neural networks

The samples of our dataset are shown in Figure 2. The first row is 5 chest X-ray images of COVID-19. The second row is 5 chest X-ray images of Normal, and the last row is 5 chest X-ray images of Pneumonia. It can be seen from Figure 2

To ease the understanding of this paper, Table 13 shows all variables used in our study. Table 14 gives the abbreviations and their full names.

LeNet-5 is a simple convolutional neural network, and its structure is shown in Figure 3. LeNet-5 is mainly used for handwriting recognition [15]. All convolution kernels in LeNet-5 are 5×5 with a stride of 1. All pooling kernels are 2×2 with a stride of 2. Here, the kernel size and the stride of the two convolutional layers in Figure 3 are the optimization objectives of BBO and MF-BBO in the following experiments.
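Because the kernel size and stride of the two convolutional layers are the quantities being tuned, it helps to see how they change the feature-map size. The following is a minimal sketch using the usual output-size rule floor((n + 2p - k) / s) + 1. The 32×32 input, the zero padding, and the function names are assumptions for illustration and are not taken from the paper.

```python
def output_size(n, kernel, stride, padding=0):
    """Spatial size after a convolution or pooling layer (floor convention)."""
    return (n + 2 * padding - kernel) // stride + 1


def lenet5_feature_sizes(n=32, conv_kernel=5, conv_stride=1,
                         pool_kernel=2, pool_stride=2):
    """Feature-map side length after each of the two conv + pool stages.

    n=32 is the classic LeNet-5 input size and is an assumption here.
    """
    sizes = [n]
    for _ in range(2):
        n = output_size(n, conv_kernel, conv_stride)   # convolutional layer
        sizes.append(n)
        n = output_size(n, pool_kernel, pool_stride)   # pooling layer
        sizes.append(n)
    return sizes


print(lenet5_feature_sizes())                              # default kernel 5, stride 1
print(lenet5_feature_sizes(conv_kernel=3, conv_stride=2))  # a candidate an optimizer might try
```

A candidate pair of kernel size and stride that shrinks the feature map to zero or below before the second pooling layer is not a valid configuration, so a check of this kind can be used to reject such candidates before training.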

As shown in Figure 3, each convolutional layer in LeNet-5 is followed by a pooling layer.

Figure 6. Here, the blue curves represent the migration paths of species between habitats. Each of the remaining small icons represents a species habitat.

To describe the algorithm more accurately, this paper introduces the following term: habitat, which is used to describe the sites of species survival, reproduction, and mutation.

When the species number of a habitat is 0, the immigration rate of the habitat is the highest, the emigration rate is 0, and the HSI value is the lowest. When the species number of a habitat reaches the maximum, the immigration rate of the habitat is 0, the emigration rate is the maximum, and the HSI value is the maximum [25]. Therefore, the following formulas can be obtained.

Here, represents the number of species.

represents the maximum number of species. 
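The boundary behaviour described above (highest immigration and zero emigration at zero species, the reverse at the maximum) is what the linear migration model commonly used in biogeography-based optimization produces. The sketch below assumes that linear model. The names `k`, `n`, `I`, and `E` are illustrative and may differ from the paper's notation.

```python
def migration_rates(k, n, I=1.0, E=1.0):
    """Linear BBO migration model (an assumption, not quoted from the paper).

    k: number of species in the habitat, 0 <= k <= n
    n: maximum number of species
    I: maximum immigration rate, E: maximum emigration rate
    Returns (immigration_rate, emigration_rate).
    """
    if not 0 <= k <= n:
        raise ValueError("k must lie between 0 and n")
    immigration = I * (1 - k / n)   # falls linearly as the habitat fills up
    emigration = E * k / n          # rises linearly as the habitat fills up
    return immigration, emigration


for k in (0, 5, 10):
    print(k, migration_rates(k, 10))
```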

Letting ∆t tend to 0 in (4), the following formula can be obtained.

Here, $\dot{P}$ represents the probability of species after the derivative. For simplicity, $\dot{P}$ can be expressed as the multiplication of matrix A and P, as shown below.
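To make the matrix form concrete, the sketch below builds a tridiagonal matrix A for the standard BBO species-count model with linear migration rates and applies it to a probability vector P. This is an assumption about the intended matrix, since the paper's own formula is not reproduced here; `build_A` and its arguments are hypothetical names.

```python
import numpy as np


def build_A(n, I=1.0, E=1.0):
    """Tridiagonal matrix A such that dP/dt = A @ P (standard BBO model, assumed)."""
    k = np.arange(n + 1)
    lam = I * (1 - k / n)          # immigration rate for k species
    mu = E * k / n                 # emigration rate for k species
    A = np.zeros((n + 1, n + 1))
    A[k, k] = -(lam + mu)          # probability leaving the state with k species
    A[k[1:], k[:-1]] = lam[:-1]    # inflow from k-1 species through immigration
    A[k[:-1], k[1:]] = mu[1:]      # inflow from k+1 species through emigration
    return A


A = build_A(4)
P = np.full(5, 0.2)                # uniform probability over 0..4 species
P_dot = A @ P
print(P_dot)
```

Each column of A sums to zero, so the entries of `P_dot` sum to zero and the total probability is conserved over time.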

Figure 9 shows the flow chart of BBO, and Table 1 shows the pseudocode of BBO. As can be seen from Figure 9 and Table 1
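For readers without access to the flow chart, a generic BBO loop can be sketched as follows. This is a minimal, textbook-style version (rank-based linear rates, roulette-wheel choice of the emigrating habitat, uniform mutation, elitism) written under those stated assumptions. It is not a transcription of the paper's Table 1, and all names and default values are illustrative.

```python
import random


def bbo_minimize(cost, dim, bounds, pop_size=20, iters=100,
                 mutation_rate=0.05, elite=2, seed=0):
    """Minimal biogeography-based optimization sketch for minimizing `cost`."""
    rng = random.Random(seed)
    lo, hi = bounds
    pop = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(iters):
        pop.sort(key=cost)  # best habitat (highest HSI) first
        # Rank-based linear rates: good habitats emigrate more and immigrate less.
        mu = [(pop_size - i) / (pop_size + 1) for i in range(pop_size)]
        lam = [1 - m for m in mu]
        new_pop = [h[:] for h in pop]
        for i in range(elite, pop_size):          # the elite habitats stay unchanged
            for d in range(dim):
                if rng.random() < lam[i]:
                    # Pick the emigrating habitat with probability proportional to mu.
                    j = rng.choices(range(pop_size), weights=mu)[0]
                    new_pop[i][d] = pop[j][d]     # migrate one SIV
                if rng.random() < mutation_rate:
                    new_pop[i][d] = rng.uniform(lo, hi)
        pop = new_pop
    return min(pop, key=cost)


def sphere(x):
    return sum(v * v for v in x)


best = bbo_minimize(sphere, dim=3, bounds=(-5.0, 5.0))
print(best, sphere(best))
```

In a hyperparameter-tuning setting, each habitat would hold candidate hyperparameter values and `cost` would train and validate a network, which is far more expensive than the toy `sphere` function used here.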

A feedforward neural network uses backpropagation (BP) to optimize the parameters in the network.

Assuming that the input is (x, y), the loss function obtained after calculation of the feedforward neural network is $\mathcal{L}(y, \hat{y})$. To optimize the parameters in the feedforward neural network, the following formulas need to be calculated.

According to formulas (13) and (14), we still need to calculate 

Here, ( ) represents the identity matrix of the -layer neurons. From formulas (15), (16), and (17), the following formulas can be obtained.

Therefore, the execution sequence of BP in feedforward neural network is as follows: first (10)
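The BP procedure discussed above can be illustrated with a small fully connected network. The sketch below assumes sigmoid activations and a squared-error loss, neither of which is stated in the surviving text, and uses illustrative names throughout. It computes the layer-wise error terms from the output layer backwards and derives the weight and bias gradients from them.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def forward(x, weights, biases):
    """Return pre-activations and activations of every layer."""
    a, activations, zs = x, [x], []
    for W, b in zip(weights, biases):
        z = W @ a + b
        a = sigmoid(z)
        zs.append(z)
        activations.append(a)
    return zs, activations


def loss(x, y, weights, biases):
    """Squared-error loss (an assumption; the paper's loss is not given here)."""
    y_hat = forward(x, weights, biases)[1][-1]
    return 0.5 * np.sum((y_hat - y) ** 2)


def backprop(x, y, weights, biases):
    """Gradients of the loss with respect to every weight matrix and bias."""
    _, acts = forward(x, weights, biases)
    grads_W = [None] * len(weights)
    grads_b = [None] * len(weights)
    # Error term of the output layer: dL/dz = (a - y) * f'(z) for a sigmoid f.
    delta = (acts[-1] - y) * acts[-1] * (1 - acts[-1])
    for l in range(len(weights) - 1, -1, -1):
        grads_W[l] = np.outer(delta, acts[l])   # dL/dW = delta * (previous output)^T
        grads_b[l] = delta                      # dL/db = delta
        if l > 0:
            # Propagate the error term to the previous layer.
            delta = (weights[l].T @ delta) * acts[l] * (1 - acts[l])
    return grads_W, grads_b


rng = np.random.default_rng(0)
weights = [rng.normal(size=(4, 3)), rng.normal(size=(2, 4))]
biases = [rng.normal(size=4), rng.normal(size=2)]
x, y = rng.normal(size=3), np.array([0.0, 1.0])
grads_W, grads_b = backprop(x, y, weights, biases)
print(grads_W[0].shape, grads_W[1].shape)
```

Comparing one entry of `grads_W` with a central finite difference of `loss` is a quick way to confirm that the recursion is implemented correctly.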

When BP is performed in the pooling layer [31], the sizes of all matrices in $\delta^{(l)}$ are restored to the size before pooling. This process is usually called upsampling. Therefore, when the $\delta^{(l)}$ of the pooling layer is known, the following formula should be followed when deriving the $\delta^{(l-1)}$ of the previous layer.

The migration momentum factor is a variable introduced in the MF-BBO proposed in this paper.

The main function of the migration momentum factor is to standardize the value of the migration SIV in the migration operation. The following formula is followed for the SIV migration.

optimization effects on the convolutional neural networks of this paper. Table 3 and Figure 10 show

[Table 3 and Figure 10: iterations Iter19 through Iter487; label: Number; series: BBO, 0<F<1, F=1, 1<F<2, F=2]

This section describes how to apply the improved migration operation to MF-BBO. As shown in Figure 9 and Table 1

It can be seen from Figure 11 and Table 2 

The experimental results are summed and averaged to evaluate, through accuracy, how well the algorithms optimize the hyperparameters of the CNNs. To describe the experimental results more accurately, we perform ten runs of 10-fold cross-validation and introduce the confusion matrix shown in Table 4. Here, TP means that both the true value and the predicted value are positive. FN means that the true value is positive but the predicted value is negative. FP means that the true value is negative but the predicted value is positive. TN means that both the true value and the predicted value are negative.

Table 5 shows the confusion matrix of 10-fold cross-validation for chest X-ray images with three convolutional neural networks using default values of hyperparameters. Here, COV represents the chest X-ray images of COVID-19, Nor represents the chest X-ray images of Normal, and Pne represents the chest X-ray images of Pneumonia. Table 6 shows the four confusion matrix metrics of the three convolutional neural networks.

        COV    Nor    Pne  |   COV    Nor    Pne  |   COV    Nor    Pne
COV    4332    191    107  |  4191    328    111  |  4292    241     97
Nor     411   4167     52  |   285   4282     63  |   156   4343    131
Pne      89    544   3997  |   108    573   3949  |   117    417   4096

Table 9 shows the confusion matrix of three convolutional neural networks optimized by MF-BBO to perform 10-fold cross-validation on the chest X-ray images. Here, COV represents the chest X-ray images of COVID-19, Nor represents the chest X-ray images of Normal, and Pne represents the chest X-ray images of Pneumonia.
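The TP, FN, FP, and TN definitions above extend to the three-class case by treating one class at a time as positive and the other two as negative. The sketch below applies this to the first 3×3 block of counts reported in this section, assuming rows are true classes and columns are predicted classes (each row sums to 4630, which is consistent with that reading). The choice of metrics and the function name are assumptions, since the text does not name the four metrics.

```python
import numpy as np


def per_class_metrics(cm):
    """One-vs-rest metrics from a confusion matrix (rows: true, columns: predicted)."""
    cm = np.asarray(cm, dtype=float)
    total = cm.sum()
    tp = np.diag(cm)                 # correctly predicted samples of each class
    fn = cm.sum(axis=1) - tp         # samples of the class predicted as another class
    fp = cm.sum(axis=0) - tp         # samples of other classes predicted as this class
    tn = total - tp - fn - fp
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "precision": tp / (tp + fp),
        "overall_accuracy": tp.sum() / total,
    }


# Class order: COV, Nor, Pne.
cm = [[4332, 191, 107],
      [411, 4167, 52],
      [89, 544, 3997]]
metrics = per_class_metrics(cm)
for name, value in metrics.items():
    print(name, np.round(value, 4))
```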

In this section, we list the average overall accuracy (OA) of the nine methods over 10 runs. The specific data are shown in Table 11. Figure 17 shows the average accuracy of the nine methods.
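The averaging over ten runs of 10-fold cross-validation can be sketched as below. The splitting scheme (a fresh shuffle per run, folds taken by striding through the shuffled indices) and the `evaluate` callback are assumptions for illustration. The paper does not describe how its folds were drawn.

```python
import random


def repeated_kfold(n_samples, n_folds=10, n_runs=10, seed=0):
    """Yield (run, fold, train_indices, test_indices) for repeated k-fold CV."""
    rng = random.Random(seed)
    for run in range(n_runs):
        idx = list(range(n_samples))
        rng.shuffle(idx)                                   # new random split for each run
        folds = [idx[f::n_folds] for f in range(n_folds)]  # disjoint folds covering all samples
        for f in range(n_folds):
            test = folds[f]
            train = [i for g in range(n_folds) if g != f for i in folds[g]]
            yield run, f, train, test


def average_overall_accuracy(evaluate, n_samples, **kwargs):
    """Mean of evaluate(train, test) over every fold of every run."""
    scores = [evaluate(train, test)
              for _, _, train, test in repeated_kfold(n_samples, **kwargs)]
    return sum(scores) / len(scores)


# Toy usage: a stand-in evaluate() that only reports the test-fold fraction.
print(average_overall_accuracy(lambda tr, te: len(te) / 25, 25, n_folds=5, n_runs=2))
```

In practice `evaluate` would train a network on `train`, predict on `test`, and return the overall accuracy for that fold.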

In this section, we select two kinds of state-of-the-art methods to compare with our methods.

Table 13. Variables used in our study.

The matrix.
The output value of layer l.
The output value of layer l − 1.
Index of cross validation dataset.
Maximum rate of emigration.
Migration momentum factor.
The activation function of layer l.
Number of folds for cross validation.
Index of fold used as test set.
Index of the species number.
Maximum number of species.
The upsampling function of convolutional neural network.
Maximum rate of immigration.
The row of the matrix.
The column of the matrix.
Total number of runs (each run carries out a k-fold cross validation).
Run index (each run carries out a k-fold cross validation).
The padding size of convolution operation.
The padding size of pooling operation.
Maximum rate of mutation.
The stride size of convolution kernel.
The stride size of pooling kernel.
The number of neurons in layer l.
The output of convolution operation.
The output of pooling operation.
Probability that a habitat has a given number of species.
Maximum probability of species.
The probability of species after the derivative.
The identity matrix of the neurons in layer l.
Rate of immigration made by random function.
Rate of emigration made by random function.
The SIV of immigration.
The SIV of emigration.
The SIV of emigration calculated by migration momentum factor.
A time in the BBO process.
∆t: A very short time difference.
Index of ordinal numbers.
Value of HSI.
The size of convolution kernel.
The size of pooling kernel.
The input of the neural network.
The true label of neural network input.
ŷ: The prediction label of neural network.
The output of layer l without activation function.
ℒ: Loss function of feedforward neural network.
The confusion matrix.
δ^(l): The partial derivative of the loss function in layer l.

The chest X-ray image of Pneumonia

UK (RP202G0230)

Hope Foundation for Cancer Research

British Heart Foundation Accelerator Award, UK (AA/18/3/34220)

UK (RP202G0289)

Fundamental Research Funds for the Central Universities

Provincial Key Laboratory for

Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation and Diagnosis for COVID-19

Coronavirus disease (COVID-19) Weekly Epidemiological Update and Weekly Operational Update

3D-printable headlight face shield adapter. Personal protective equipment in the COVID-19 era

Systematic application of COVID-19 nucleic acid tests in general surgery departments in China: An update of current status with nationwide survey data

Evaluation of performance of two SARS-CoV-2 Rapid IgM-IgG combined antibody tests on capillary whole blood samples from the fingertip

Deep learning based detection and analysis of COVID-19 on chest X-ray images

HSMA_WOA: A hybrid novel Slime mould algorithm with whale optimization algorithm for tackling the image segmentation problem of chest X-ray images

Automatic Detection of Coronavirus Disease (COVID-19) Using X-ray Images and Deep Convolutional

AI-assisted CT imaging analysis for COVID-19 screening: Building and deploying a medical AI system in four weeks

An Investigation of CNN Models for Differentiating Malignant from Benign Lesions Using Small Pathologically Proven Datasets

A deep learning algorithm using CT images to screen for Corona Virus Disease (COVID-19). medRxiv

Deep learning Enables Accurate Diagnosis of Novel Coronavirus (COVID-19) with CT images. medRxiv

Estimating Uncertainty and Interpretability in Deep Learning for Coronavirus (COVID-19) Detection

Deep Learning-based Detection for COVID-19 from Chest CT using Weak Label

Using a VGG-16 Network for Individual Tree Species Detection with an Object-Based Approach

Automated Brain Image Classification Based on VGG-16 and Transfer Learning

Deep Residual Learning for Image Recognition

A Hybrid Deep ResNet and Inception Model for Hyperspectral Image Classification

Hybrid energy system design using greedy particle swarm and biogeography-based optimisation. IET Renewable Power Generation

An Efficient Implementation of Divergence State Estimation with Biogeography-Based Optimization (DSEBBO) Framework in FPGA-Based Multiprocessor System

BMDA: applying biogeography-based optimization algorithm and Mexican hat wavelet to improve dragonfly algorithm. Soft Computing

A self learned invasive weed-mixed biogeography based optimization algorithm for RFID network planning. Wireless Networks

NBBO: A new variant of biogeography-based optimization with a novel framework and a two-phase migration operator

KATZ centrality with biogeography-based optimization for influence maximization problem

Combinatorial Optimization

An Improved Biogeography-Based Optimization for Economic/Environmental Dispatch

Game theory-based multi-task scheduling in cloud manufacturing using an extended biogeography-based optimization algorithm. Concurrent Engineering-Research and Applications

Impact of correlation of plug load data, occupancy rates and local weather conditions on electricity consumption in a building using four back-propagation neural network models. Sustainable Cities and Society

Intelligent design in continuous galvanizing process for advanced ultra-high-strength dual-phase steels using back-propagation artificial neural networks and MOAMP-Squirrels search algorithm. International Journal of Advanced Manufacturing Technology

A Novel Biogeography-Based Optimization Algorithm with Momentum Migration and Taxonomic Mutation

Advances in Swarm Intelligence

Cross-validation of the entomophagy attitude questionnaire (EAQ): A study in China on eaters and non-eaters

Transfer Learning with Deep Convolutional Neural Network (CNN) for Pneumonia Detection Using Chest

Junding Sun: Software, Validation, Investigation, Resources, Supervision, Project administration, Funding acquisition. Xiang Li: Conceptualization, Methodology, Software, Data Curation, Writing - Original Draft. Chaosheng Tang: Software, Formal analysis, Investigation, Resources, Visualization, Supervision. Shui-Hua Wang: Methodology, Validation, Formal analysis, Resources, Writing - Review & Editing, Funding acquisition. Yu-Dong Zhang: Conceptualization, Methodology, Formal analysis, Investigation, Resources, Writing - Original Draft, Writing - Review & Editing, Supervision.

We, the undersigned, declare that this manuscript is original, has not been published before, and is not currently being considered for publication elsewhere. We confirm that the manuscript has been read and approved by all named authors and that there are no other persons who satisfied the criteria for authorship but are not listed. We further confirm that the order of authors listed in the manuscript has been approved by all of us. We understand that the Corresponding Author is the sole contact for the Editorial process.