key: cord-0800606-2yrcgvxs
authors: Zhang, Xin; Lu, Siyuan; Wang, Shui-Hua; Yu, Xiang; Wang, Su-Jing; Yao, Lun; Pan, Yi; Zhang, Yu-Dong
title: Diagnosis of COVID-19 Pneumonia via a Novel Deep Learning Architecture
date: 2022-03-31
journal: J Comput Sci Technol
DOI: 10.1007/s11390-020-0679-8
sha: 06b943bb9407065c510a4cb5c74044c16b28b286
doc_id: 800606
cord_uid: 2yrcgvxs

COVID-19 is a contagious infection that has severe effects on the global economy and our daily life. Accurate diagnosis of COVID-19 is of importance for consultants, patients, and radiologists. In this study, we use the deep learning network AlexNet as the backbone and enhance it in the following two aspects: 1) adding batch normalization to help accelerate the training and reduce the internal covariance shift; 2) replacing the fully connected layer in AlexNet with three classifiers: SNN, ELM, and RVFL. We therefore obtain three novel models from the deep COVID network (DC-Net) framework, named DC-Net-S, DC-Net-E, and DC-Net-R, respectively. After comparison, we find that the proposed DC-Net-R achieves an average accuracy of 90.91% on a private dataset (available upon email request) comprising 296 images, while its specificity reaches 96.13%, the best performance among the three proposed classifiers. In addition, we show that our DC-Net-R also performs much better than other existing algorithms in the literature. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11390-020-0679-8.

Since December 2019, the coronavirus disease 2019 (COVID-19) has become a worldwide public health security challenge. The World Health Organization (WHO) has confirmed its pathogen and named it the 2019 novel coronavirus (2019-nCoV), and the International Committee on Taxonomy of Viruses (ICTV) has designated this novel coronavirus as SARS-CoV-2. 2019-nCoV has strong adaptability; it can be transmitted from person to person more effectively and may have increased toxicity compared with influenza [1]. 2019-nCoV can be detected in human respiratory tract epithelial cells within roughly 96 hours after infection. COVID-19 is highly contagious, spreading worldwide at an alarming rate, and the number of confirmed cases keeps increasing. Some patients' disease progresses rapidly, leading to severe and critical illness, and even death. The identification of COVID-19 relies on epidemiology, clinical symptoms, imaging performance, and laboratory tests. Viral nucleic acid (NA) detection has proved effective for diagnosing COVID-19; it offers robust specificity but meagre sensitivity. Moreover, the diagnosis is greatly affected by the samplers and the sampling points. Many cases that test negative on the viral NA test are found positive with computed tomography (CT), including cases with multiple false negatives that are eventually diagnosed positive through repeated sampling [2]. These findings show that preliminary CT screening is important in some cases. In addition, viral NA testing suffers from limited availability of test kits and supplies, and feeding back the test results takes a certain amount of time. These challenges may delay the treatment and isolation of patients, increasing the infection risk of the people around them. CT is a quick and straightforward technique for screening infected patients. Some experts recommend using time-saving chest CT to diagnose suspicious cases instead of reverse transcription polymerase chain reaction (RT-PCR), a viral NA detection method①.
The role of CT in early COVID-19 detection has not been specially researched, although it is crucial for the early identification of suspicious cases, even in asymptomatic patients. When the NA test returns a false negative, the CT examination is especially important: it is one of the most critical approaches for the early diagnosis of pneumonia caused by the novel coronavirus. CT has a large impact on judging the nature, progression, and prognosis of the lesion, evaluating the severity of the disease, and guiding clinical classification. In the face of sudden outbreaks, making full use of CT's advantages deserves discussion, consideration, and investment.

In the meantime, signal processing, artificial intelligence, and deep learning (DL) technologies have been successfully applied in biomedical image analysis, computing, and modelling. Lu [3] proposed radial-basis-function neural networks (RBFNNs) for classifying pathological brains. Based on the extreme learning machine (ELM), Yang [4] presented a kernel-based version (K-ELM) for creating a novel pathological brain detection system; their method is robust and effective. Lu [5] proposed a novel extreme learning machine trained by the bat algorithm (ELM-BA). Li and Liu [6] introduced real-coded biogeography-based optimization (RCBBO) to detect diseased brains. Jiang [7] used a six-layer CNN (6L-CNN) to recognize sign language fingerspelling. Szegedy et al. [8] presented GoogLeNet, which achieved remarkable performance in the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) 2014. Yu and Wang [9] suggested the use of ResNet18 for mammogram abnormality detection. In all, these successful reports show that computers can assist medical image analysis. However, one common challenge for the above systems and models is to improve performance without introducing overcomplicated structures. To effectively select good features in high-dimensional domains, a method named the minimal-redundancy-maximal-relevance (mRMR) criterion was introduced in [10]. The authors [10] claimed that superior features can be selected at low cost, justified by extensive experiments on different datasets with different classifiers. Nevertheless, a deep neural network (DNN) may suffer from overfitting, and its accuracy can still be improved using different architectures and algorithms.

In this study, we use the classical AlexNet as the backbone, and we propose two improvements. 1) We add batch normalization to reduce the internal covariance shift and accelerate the training. 2) We replace the fully connected layer in AlexNet with three classifiers: SNN, ELM, and RVFL. Therefore, we propose three deep neural network architectures, DC-Net-SNN, DC-Net-ELM, and DC-Net-RVFL (abbreviated as DC-Net-S, DC-Net-E, and DC-Net-R), based on three different randomized neural networks, for the task of detecting COVID-19. We select the best of the three as the most suitable model.

The structure of our paper is as follows. Section 2 and Section 3 present the background and the dataset, respectively. Section 4 provides the rationale for our methods. Section 5 configures the experimental settings. Section 6 presents the results and discussions. Finally, Section 7 concludes the paper.

①https://enapp.chinadaily.com.cn/a/202002/06/AP5e3be074a3103a24b1106147.html, June 2020.

The chest CT (CCT) images of patients with COVID-19 show patchy ground-glass (GG) opacity in the subpleural area of both lungs, as well as pulmonary consolidation.
The typical findings of COVID-19 on CCT are GG opacities of the bilateral pulmonary parenchyma, combined with pulmonary consolidation and sometimes round lesions distributed around the lungs [11]. In the computer science community, Adrian Rosebrock② presented a guide for using deep learning (Keras and TensorFlow) to detect COVID-19. This guide helps readers obtain sample diseased and healthy X-ray images, train CNNs to detect COVID-19 automatically, and evaluate the results. Maghdid et al. [12] shared a low-cost technique of using smartphone-embedded sensors to diagnose COVID-19. This approach is particularly helpful since most people carry smartphones every day. Wang and Wong [13] published a DL framework (COVID-Net) adapted for detecting COVID-19 patients based on chest radiography scans; they also used an explainable method to gain more meaningful insight into the vital factors linked with COVID-19 cases. Staff at the University of Delft developed a software package called CAD4COVID③, an AI software that triages suspected COVID-19 patients on chest X-ray scans and indicates the affected lung tissue. More notable work can be found in [14].

Although the CT manifestations of COVID-19 have certain characteristics, similar characteristics can also be found in pulmonary bacterial infections, fungal infections, pulmonary haemorrhage, pulmonary edema, and other viral pneumonia diseases. Therefore, it is difficult to distinguish them at diagnosis. Judging from the current situation, the sensitivity of CT diagnosis is higher than that of NA detection. However, due to the inherent characteristics of image diagnosis, different lesions can show similar image manifestations. These characteristics result in low specificity and, inevitably, overdiagnosis. A patient can have numerous lung lesions due to different causes, causing fluctuations in the rate of disease progression, which requires multiple imaging examinations in a short time and significantly increases the workload of the diagnostician. Therefore, the integration of AI into the diagnosis and treatment process of lung infections or other infectious diseases is worthy of further study. Specific questions in this field include: how to provide doctors with diagnosis and treatment opinions quickly and accurately, how to alleviate the shortage of clinical radiologists, and how to increase efficiency in disease prevention and control. From the viewpoint of computer science, most current AI methods are not yet comparable to radiology experts. The data, models, and codes of recently published papers in the area of AI for COVID-19 are not readily available. Therefore, we expect that our study can contribute greatly to the community. The codes, the data, and the model will be opened to the public upon the acceptance of this paper④.

Using a systematic random sampling method, 66 patients are randomly selected. The new coronavirus pneumonia patients form the observation group: 44 males and 22 females, aged from 23 years to 91 years, with an average age of (49.48 ± 14.71) years. The control group is selected from individuals participating in routine health checkups: 38 males and 28 females, aged from 25 years to 72 years, with an average age of (38.44 ± 10.58) years. Criteria for confirmed COVID-19 include: 1) a positive NA test and 2) complete CT image (CTI) data.
During image acquisition, the CT scanning configuration is set as follows: Philips Ingenuity 64-row spiral CT machine; kilovoltage (kV): 120; milliampere-seconds (mAs): 240; layer thickness: 3 mm; layer spacing: 3 mm; pitch: 1.5; lung window (W: 1500, L: −500); mediastinum window (W: 350, L: 60); thin-layer reconstruction according to the lesion display, with a layer thickness and layer distance of 1 mm for the lung window images. The patients are placed in a supine position, holding their breath after deep inspiration, and are conventionally scanned from the lung apex to the costophrenic angle. Each image has a size of 512 × 512 pixels. All images are transmitted to the medical image PACS for observation, and two radiologists with rich chest diagnostic experience collectively read the radiographs and record the distribution, size, and morphology of the CT manifestations of the lesions. Between one and four slices are chosen per subject. When there are disagreements between the two analyses, a senior doctor is consulted to reach a consensus. The slice-level selection method is as follows: for COVID-19 pneumonia patients, the slices showing the largest lesion size and number are selected; for normal subjects, any slice level can be selected. Fig.1 shows two example CT images of the COVID-19 dataset used in this study. The dataset is available upon email request.

In total, we collect 296 lung window images from CCT. For evaluation, we use the hold-out method: 70% of the images are used for training, and the remaining 30% for testing. A summary of the dataset is presented in Table 1.

②https://www.pyimagesearch.com/2020/03/16/detecting-covid-19-in-x-ray-images-with-keras-tensorflow-and-deep-learning, June 2020.

4 DC-Net

AlexNet is a well-known neural network proposed by Krizhevsky et al. [15] for ILSVRC-2012. It achieved significantly higher accuracy than its competitors in image classification, renewing research interest in using DNNs as universal approximators. AlexNet builds upon the concept of multi-layered convolutional neural networks introduced by LeNet, fueled by the rapid increase of computational power provided by graphics processing units (GPUs). AlexNet has been applied in multiple fields, including biomedical analysis, object detection, etc. We use AlexNet as the backbone because overfitting can be avoided with AlexNet for our binary classification task. Also, the computational cost of AlexNet is low compared with networks of higher complexity. Many studies have chosen AlexNet and outperformed their respective state of the art. For example, Szymak and Gasiorowski [16] used a pre-trained AlexNet for underwater object recognition. Guo et al. [17] employed the AlexNet model for the inversion of the PM2.5 atmospheric refractivity profile. Zhao et al. [18] utilized the AlexNet model to detect surface defects of wind turbine blades. Xiao et al. [19] proposed an improved AlexNet model that achieved a higher accuracy than the ZFNet model on a 23-category classification task. The success of all these methods shows that AlexNet is excellent at feature extraction. The integration of convolution into neural networks is well suited to image classification, as a strong assumption of local spatial coherence can be made on images. As convolutional filters replace the dense connections of the multi-layered perceptron, the number of connections and trainable parameters can also be significantly reduced.
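To make this parameter-count argument concrete, the following back-of-the-envelope comparison (our illustration, not taken from the paper) contrasts a single 3×3 convolution with 64 filters on a 227×227×3 input against a dense layer that maps the flattened input to an output map of the same spatial size.

```python
# Illustration only (not from the paper): parameter counts of a convolutional
# layer versus an equivalent dense mapping on a 227x227x3 input.
conv_params = 64 * (3 * 3 * 3) + 64                  # 64 filters of size 3x3x3 plus biases = 1,792
dense_params = (227 * 227 * 3) * (227 * 227 * 64)    # fully connected weights, roughly 5.1e11
print(f"conv: {conv_params:,} parameters, dense: {dense_params:,} weights")
```

The convolution needs fewer than two thousand parameters, while the dense mapping would require hundreds of billions of weights, which is the reduction referred to above.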
Apart from the CNN structure, the characteristics of AlexNet also include local response normalization (LRN), the rectified linear unit (ReLU), and dropout regularization [15]. Before the development of AlexNet, the standard activation function (AF) was either the sigmoid or the hyperbolic tangent. These functions suffer from saturation, where outputs are limited by asymptotic bounds. Saturation restricts gradient flow during backpropagation and limits the overall capacity of the neural network [20]. The ReLU activation function used in AlexNet has no upper bound and is therefore non-saturating. Local response normalization [15] is a technique used in AlexNet to normalize the unbounded ReLU activations and promote lateral inhibition, which enhances local contrasts and aids generalization. One core challenge in the training of neural networks is overfitting, which occurs when irrelevant fluctuations in the training data are also captured by the neural network, resulting in lower generalization. Dropout reduces overfitting by randomly deactivating neurons in the neural network with a set probability, which is equivalent to generating new sub-networks during the training process. Dropout can effectively limit the co-adaptation of the neurons, and therefore reduce overfitting.

The AlexNet structure, shown in Fig.2(a), is highly versatile and often used in medical imaging, especially medical image classification. Gertych et al. [21] successfully applied an ImageNet pre-trained AlexNet to distinguish lung cancer growth patterns in histological slides. To aid the diagnosis of rheumatoid arthritis, Fukae et al. [22] employed the same neural network in the classification of virtual images generated from clinical information.

We propose the use of a variant of the single-GPU AlexNet structure (Net-0) as the backbone of our approach, due to its ease of implementation. As exhibited in Fig.2(b), we retain the general structure of AlexNet: five convolutional layers and multiple fully connected layers (FCLs). In the original AlexNet, there are three FCLs in total: FCL1, FCL2, and FCL3. We replace the last 1000-neuron layer (FCL3) of the original AlexNet with an FCL of 512 neurons (FCL3*) and add an extra FCL with two neurons (FCL4*) on top of FCL3*, for two reasons: 1) the universal approximation theorem [23]; 2) the fact that two neurons correspond to the two categories of our classification task. This AlexNet variant is called "Net-0".

To further improve the backbone AlexNet proposed in Subsection 4.1, we incorporate batch normalization into the neural network structure. Batch normalization (BN) reduces the internal covariance shift within neural networks and promotes independence between layers. This effect is achieved by scaling each mini-batch of previous-layer outputs with the mean and variance of that particular mini-batch. This scaling adds noise to the mini-batches and therefore also provides regularization. AlexNet with BN can be trained with higher learning rates, as BN bounds the activation values. As seen in Fig.2(c), batch normalization is added to each convolutional layer. This modified structure is named Net-1. We first train Net-1 on the training images to fine-tune the weights for better feature extraction in this classification task. The trained network is named Net-2.
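The following PyTorch sketch is only a hedged illustration of the Net-0 and Net-1 modifications described above, not the authors' implementation (the training-parameter names later listed in Table 2 suggest a MATLAB toolchain). The helper names are ours, and placing BN immediately after each convolution and an activation between FCL3* and FCL4* are assumptions.

```python
# Hedged sketch (not the authors' code) of Net-0 and Net-1 in PyTorch.
import torch.nn as nn
from torchvision import models

def build_net0(num_classes=2):
    """Net-0: ImageNet pre-trained AlexNet whose final 1000-way FC layer is
    replaced by FCL3* (512 neurons) followed by FCL4* (num_classes neurons)."""
    net = models.alexnet(weights="IMAGENET1K_V1")   # requires torchvision >= 0.13
    in_features = net.classifier[6].in_features     # input size of the original FCL3
    net.classifier[6] = nn.Sequential(
        nn.Linear(in_features, 512),                # FCL3*
        nn.ReLU(inplace=True),                      # activation here is our assumption
        nn.Linear(512, num_classes),                # FCL4*
    )
    return net

def add_batch_norm(net):
    """Net-1: insert a BatchNorm2d layer after every convolution of Net-0.
    (Whether BN sits before or after ReLU is not specified in the paper.)"""
    layers = []
    for layer in net.features:
        layers.append(layer)
        if isinstance(layer, nn.Conv2d):
            layers.append(nn.BatchNorm2d(layer.out_channels))
    net.features = nn.Sequential(*layers)
    return net

net1 = add_batch_norm(build_net0())   # Net-1, to be fine-tuned into Net-2
```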
AlexNet has been successfully applied in many fields. However, it requires a large dataset for training to obtain good performance. As it is challenging for us to build a large dataset, we propose to use AlexNet as a pre-trained model for feature extraction only. We then attach three different randomized neural network classifiers to the trained AlexNet: 1) the extreme learning machine (ELM) [24]; 2) the Schmidt neural network (SNN) [25]; 3) the random vector functional-link net (RVFL) [26]. These deep COVID networks are abbreviated as DC-Nets. We compare the three models and select the best one. The three classifiers are chosen because they provide a good solution without iterations, and thus save computation time.

The first proposed model is DC-Net-S, in which the Schmidt neural network (SNN) [25] is used to replace the last three layers in Net-2. Let the $i$-th input sample be $x_i = (x_{i1}, \cdots, x_{in})^{\mathrm{T}} \in \mathbb{R}^n$, $i = 1, \cdots, N$, and let $y_i = (y_{i1}, y_{i2}, \cdots, y_{im})^{\mathrm{T}} \in \mathbb{R}^m$ denote the $i$-th output information. The structure of SNN is shown in Fig.3(a). The SNN has $\hat{N}$ hidden nodes, and the model can be expressed as

$o_i = \sum_{j=1}^{\hat{N}} \lambda_j\, g(\alpha_j^{\mathrm{T}} x_i + \beta_j) + \gamma, \quad i = 1, \cdots, N,$

in which $g(\cdot)$ is the sigmoid function, $\alpha_j$ and $\beta_j$ are randomly initialized and kept fixed during training, $\gamma = (\gamma_1, \gamma_2, \gamma_3, \cdots, \gamma_m)^{\mathrm{T}}$ denotes the output biases, $\lambda_j$ is calculated by the pseudo-inverse, and $o_i = (o_{i1}, \cdots, o_{im})^{\mathrm{T}}$ is the output of the model for the $i$-th sample.

We propose the second model, DC-Net-E, by substituting the last three layers in Net-2 with ELM layers. ELMs are single-hidden-layer feedforward neural networks for regression, clustering, classification, and feature learning [27]. Compared with backpropagation-based networks, ELM excels in generalization performance, as its parameters are more likely to reach a better global solution, and in computation time, as it does not depend on gradient descent; the parameters of the hidden nodes do not need to be tuned. The structure of ELM is shown in Fig.3(b). Given the training set defined in Subsection 4.3, the mapping function of ELM can be expressed as

$o_i = \sum_{j=1}^{\hat{N}} \lambda_j\, g(\alpha_j^{\mathrm{T}} x_i + \beta_j), \quad i = 1, \cdots, N, \quad (1)$

where $\hat{N}$ is the number of hidden neurons, $\alpha_j = (\alpha_{j1}, \alpha_{j2}, \cdots, \alpha_{jn})^{\mathrm{T}}$ stands for the input weight, $\beta_j$ denotes the bias, and $o_i$ is the output of the model for the $i$-th sample. The model is trained to achieve

$\sum_{j=1}^{\hat{N}} \lambda_j\, g(\alpha_j^{\mathrm{T}} x_i + \beta_j) = y_i, \quad i = 1, \cdots, N.$

We rewrite the equation in matrix form as

$M\lambda = Y,$

where $M \in \mathbb{R}^{N \times \hat{N}}$ with $M_{ij} = g(\alpha_j^{\mathrm{T}} x_i + \beta_j)$ is the hidden-layer output matrix, $\lambda = (\lambda_1, \cdots, \lambda_{\hat{N}})^{\mathrm{T}}$, and $Y = (y_1, \cdots, y_N)^{\mathrm{T}}$. It has been proved that any single-hidden-layer network can asymptotically approximate the training samples based on the universal approximation theorem [28]. However, it is a big challenge to find the optimal $\alpha_j$, $\beta_j$, and $\lambda_j$. ELM is one method that can provide a solution to the above model. The pseudocode is explained in Algorithm 1.

Algorithm 1:
Step 1: randomly initialize the values of the input weights $\alpha_j$ and biases $\beta_j$;
Step 2: calculate the output matrix $M$ using (1);
Step 3: compute the output weight $\lambda$ by the pseudo-inverse in (2).
Output: the trained ELM structure

Pseudo-inverse:

$\lambda = M^{\dagger} Y, \quad (2)$

in which $M^{\dagger}$ represents the Moore-Penrose pseudo-inverse of $M$.
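For illustration, the NumPy sketch below mirrors Algorithm 1 under the definitions above: random input weights and biases, the hidden output matrix M, and the output weights from the Moore-Penrose pseudo-inverse. It is our sketch, not the authors' implementation; the function names, the hidden-layer size, and the way the SNN output bias is handled are assumptions.

```python
# Minimal NumPy sketch of Algorithm 1 (ELM training); illustrative only.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_elm(X, Y, n_hidden=512, seed=0):
    """X: (N, n) training features, Y: (N, m) one-hot targets.
    Returns the random input weights/biases and the closed-form output weights."""
    rng = np.random.default_rng(seed)
    alpha = rng.standard_normal((X.shape[1], n_hidden))  # Step 1: random input weights
    beta = rng.standard_normal(n_hidden)                 # Step 1: random biases
    M = sigmoid(X @ alpha + beta)                        # Step 2: hidden output matrix, Eq. (1)
    lam = np.linalg.pinv(M) @ Y                          # Step 3: lambda = M^+ Y, Eq. (2)
    return alpha, beta, lam

def elm_predict(X, alpha, beta, lam):
    return sigmoid(X @ alpha + beta) @ lam

# The SNN used in DC-Net-S additionally has output biases gamma; one way to
# obtain them with the same pseudo-inverse is to append a constant column of
# ones to M (an assumption made for this sketch, not a detail from the paper).
```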
The third proposed model is named DC-Net-R. For this model, the last three layers in Net-2 are replaced with RVFL layers [26], whose structure is shown in Fig.3(c). Different from a traditional single-hidden-layer feedforward network (SLFN), which successively maps its inputs to the outputs until the learned mapping achieves the required accuracy, RVFL first maps the input to the enhancement nodes as expressed in (3), and then the feature vector is formed by concatenating the two spaces [29]. Finally, the output nodes and the concatenated feature space are linked by another mapping function. Here $\alpha_j$, $\beta_j$, and $\lambda$ stand for the weights, biases, and output weights, respectively. In Fig.3(c), the input weights are in blue and the output layer weights are in yellow. For the enhancement layer, we have an AF; the input weights and bias values are both randomly assigned, and the output weights can be attained via the pseudo-inverse:

$V = g(x\alpha + \beta), \quad (3)$

where $x = (x_1, \cdots, x_N)^{\mathrm{T}}$. The loss function of RVFL is expressed as

$E(\lambda) = \sum_{n=1}^{N} \left\| W_n \lambda - y_n \right\|^2, \quad (4)$

where $\lambda$ denotes the output weight values, $V$ is the enhanced pattern vector, $n$ is the pattern index, and $W$ is the concatenation of $x$ and $V$: $W = \mathrm{concat}(x, V)$, with $W_n$ denoting its $n$-th row.

Algorithm 2 shows the pseudocode of the three proposed DC-Net models, and Fig.4 provides their flowchart.

Algorithm 2:
Input: the training set and the test set
Step 1: load AlexNet pre-trained on ImageNet;
Step 2: remove the last FCL and add two new FCLs to obtain "Net-0";
Step 3: based on Net-0, add BN layers to obtain "Net-1";
Step 4: fine-tune Net-1 with the COVID-19 training set to output "Net-2";
Step 5: generate training features via Net-2 from the training images;
Step 6: for k = 1 : 10
  Step 6.1: set the seed randomly;
  Step 6.2: train SNN, ELM, and RVFL using the training features and training labels;
  Step 6.3: combine Net-2 with the trained SNN, ELM, and RVFL;
  Step 6.4: create the three DC-Nets: DC-Net-S, DC-Net-E, and DC-Net-R;
  Step 6.5: input the test images to the three proposed models;
  Step 6.6: generate the performance at the k-th run from the predicted labels and the actual test labels;
Step 7: output the average performance of the three proposed DC-Net models;
Step 8: select the optimal DC-Net model in terms of classification performance.
Output: the optimal DC-Net model and its classification statistics

The first step is to input the training set and the test set and to generate features via Net-1 and Net-2. Note that the original images in the dataset are RGB images of size 512 × 512 pixels; they are therefore resized to 227 × 227 pixels to meet the input requirement of AlexNet. The extracted features are then fed into the three classifiers for training until the stopping criteria are met, and we select the classifier with the best performance. We finally obtain the trained DC-Net and the predicted labels. The performance is evaluated by comparing the predicted labels with the ground-truth labels.

The training parameters are shown in Table 2. We set the MiniBatchSize to 20, MaxEpochs to 20, and InitialLearnRate to $10^{-4}$. We select stochastic gradient descent with momentum (SGDM) [30] as the training algorithm. All the above hyperparameters are selected by trial and error.

To assess the performance of the three proposed DC-Nets, we split the dataset: 70% of the samples are included in the training set, and the other 30% form the testing set. We use hold-out validation because, given the relatively small number of images, our model can be better evaluated with hold-out validation than with cross-validation. Numerous studies are also evaluated on a hold-out testing set due to the small size of their datasets [31]. The AlexNet variant in Subsection 4.2 is first trained with the training set and the parameters shown in Table 2. Using Net-2, we can extract the input to the final FCL, i.e., the activations of FCL3, for each sample. These activations represent the features extracted by the baseline model and are used as input for the proposed algorithms. Each algorithm is trained with the training features and then evaluated on the test set.
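As an illustration of this classifier-training step, the sketch below fits an RVFL head (as in DC-Net-R) on the extracted training features, following (3)-(4) and the concatenation W = concat(x, V). It is our sketch rather than the authors' code; the sigmoid enhancement activation, the number of enhancement nodes, and the variable names are assumptions.

```python
# Illustrative RVFL head for DC-Net-R (not the authors' implementation).
import numpy as np

def train_rvfl(X, Y, n_enhancement=512, seed=0):
    """X: (N, n) features extracted from Net-2, Y: (N, m) targets."""
    rng = np.random.default_rng(seed)
    alpha = rng.standard_normal((X.shape[1], n_enhancement))  # random input weights
    beta = rng.standard_normal(n_enhancement)                 # random biases
    V = 1.0 / (1.0 + np.exp(-(X @ alpha + beta)))             # enhancement nodes, Eq. (3)
    W = np.concatenate([X, V], axis=1)                        # direct link: W = concat(x, V)
    lam = np.linalg.pinv(W) @ Y                               # least-squares solution of Eq. (4)
    return alpha, beta, lam

def rvfl_predict(X, alpha, beta, lam):
    V = 1.0 / (1.0 + np.exp(-(X @ alpha + beta)))
    return np.concatenate([X, V], axis=1) @ lam
```

The direct input-output link (the X block inside W) is the structural difference from the ELM and SNN heads, and is the property later offered as a possible reason for DC-Net-R's higher specificity and precision.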
This hold-out validation process is repeated 10 times, where each run is initialized with a new set of random weights, and the average over the 10 runs is taken as the final result. This procedure effectively limits the effect of any individual stochastic weight initialization on the reported performance. We set the number of runs to 10 because it is a common default setting in many other machine learning studies.

In each evaluation, the predictions of all three DC-Nets are compared with the ground truth. We denote the number of patients accurately classified as TP and misclassified as FN, and the number of healthy controls accurately classified as TN and misclassified as FP. The metrics used to evaluate performance include:

1) sensitivity, the percentage of patients accurately classified,
$\mathrm{Sensitivity} = \frac{TP}{TP + FN};$

2) specificity, the percentage of healthy controls accurately classified,
$\mathrm{Specificity} = \frac{TN}{TN + FP};$

3) precision, the percentage of predicted patients that are actual patients,
$\mathrm{Precision} = \frac{TP}{TP + FP};$

4) accuracy, the percentage of correct classifications,
$\mathrm{Accuracy} = \frac{TP + TN}{TP + FP + TN + FN};$

5) F1 score, a measure of overall classification ability,
$F_1 = \frac{2 \times \mathrm{Precision} \times \mathrm{Sensitivity}}{\mathrm{Precision} + \mathrm{Sensitivity}}.$
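A small helper (our sketch, with hypothetical names) that computes these five metrics from the confusion-matrix counts:

```python
# Sketch: the five evaluation metrics computed from TP, FP, TN, FN counts.
def classification_metrics(tp, fp, tn, fn):
    sensitivity = tp / (tp + fn)                  # patients correctly identified
    specificity = tn / (tn + fp)                  # healthy controls correctly identified
    precision = tp / (tp + fp)                    # predicted patients that are real patients
    accuracy = (tp + tn) / (tp + fp + tn + fn)    # overall correct classification
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {"sensitivity": sensitivity, "specificity": specificity,
            "precision": precision, "accuracy": accuracy, "f1": f1}
```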
6 Results and Discussion

To justify the integration of batch normalization into AlexNet, we train and evaluate both Net-0 and Net-1. The iteration-wise plots of the training accuracy and loss of Net-0 and Net-1 are shown in Figs.5(a) and 5(b), respectively. At the same learning rate, the loss of Net-1 is significantly lower than that of Net-0 in the first few iterations. Net-1 also shows a lower variation in the decline of the loss than Net-0, indicating faster convergence and higher stability. This effect is consistent with the characteristics of BN described in Subsection 4.2 and is further validated by the test results shown in Table 3.

We can see from Table 4 that the mean accuracy of DC-Net-R is 90.91%, while the mean accuracies of DC-Net-E and DC-Net-S are 90.34% and 90.23%, respectively. DC-Net-R shows the best precision and specificity among these algorithms in testing. A possible explanation for this result is the direct connection between input and output in DC-Net-R, as shown in Fig.3(c). A possible explanation for DC-Net-E showing a higher testing accuracy than DC-Net-S is its higher number of trainable parameters: DC-Net-E has a bias for each corresponding weight, whereas DC-Net-S only has a single bias value for all weights.

To validate the efficacy of our DC-Net models, we also compare them with seven other methods from the literature. Unless otherwise specified, we use the parameters in Table 2 when training all models. To provide a fair comparison, we adapt all the referred methods to the classification task at hand, and the CTIs are resized to meet the input requirements of the different networks. The comparison results are given in Table 5, where the best performance is in bold. Our proposed DC-Nets show the best performance in every aspect compared with the other methods, especially the hand-crafted-feature-based methods [3-5]. There are two main reasons why our methods perform best: one is that the activation layers of Net-2 extract more representative high-level features; the other is the structural superiority of RVFL, which allows resilient updates of the parameters. For Net-2-RF (Random Forest) and Net-2-SVM (Support Vector Machine), the features are likewise extracted from Net-2 through FCL3; the forest contains five decision trees. The SVM gives identical results in each run, so there is no variation in its results.

From the viewpoint of radiologists, spiral CCT is still a reliable and rapid technique for diagnosing and screening COVID-19. Nevertheless, due to the large number of patients and the need for multiple reviews in a short time, the huge number of CTIs significantly increases the workload of radiologists. Furthermore, different radiologists have different skill levels and may be affected by subjective factors and external pressure. Misdiagnosis and missed diagnosis may delay diagnosis and patient isolation, which facilitates the spread of the disease and eventually undermines the overall control of the COVID-19 epidemic. Therefore, there is an urgent need to develop a precise computer-aided method to assist clinicians in identifying patients with COVID-19 infection from CTIs.

DL is a critical innovation in the arena of artificial intelligence (AI) and has achieved excellent results in the field of radiology. Existing research has successfully applied DL approaches to detect pneumonia in pediatric chest X-rays (PCXRs) and to further distinguish viral pneumonia from bacterial pneumonia on two-dimensional PCXRs [32]. On low-dose CCT, Ardila et al. [33] built an end-to-end model to detect lung cancer, whose AUC reaches almost 95%. Chae et al. [34] employed CNNs to classify small objects (≤ 2 cm) as lung nodules within CT scan images. However, there are few reports applying deep learning to COVID-19.

We use DL to detect CTIs of COVID-19 pneumonia in this research. Typical CT of COVID-19 patients shows sub-pleural GG-like patterns, which can 1) affect both lungs and 2) be multiple, peripheral, and diffusely distributed. In the imaging modality, there are many characteristics that can identify viral pathogens, and these features are related to their specific pathogenesis [35], which suggests that the deep learning method can be used to extract image features for COVID-19 diagnosis. The GG pattern in CTIs may be one of the key characteristics for identifying lesions. For that reason, we apply the deep learning model to the GG samples in the CTIs. We develop a DL-based lung CT diagnostic framework, DC-Net, to diagnose COVID-19 cases. The system can automatically extract GG samples of COVID-19 pneumonia and other radiological characteristics from the images. We design and evaluate the DL framework used for detection from chest CTIs of COVID-19 patients. The study gathers 66 COVID-19 confirmed patients from the Huai'an Infectious Disease Hospital in China and 66 healthy subjects. From the chest CT scans, we collect 148 CTIs of confirmed COVID-19 cases and CTIs of normal subjects for comparison and modelling. This is a retrospective, multi-cohort, diagnostic study. Our results show that the model has high sensitivity (0.8568), high specificity (0.9613), high precision (0.9570), and high accuracy (0.9091). Due to its powerful ability to extract details, RVFL performs better than ELM and SNN. The high performance of these DL models shows that the CTIs of COVID-19 cases and normal subjects can be satisfactorily distinguished.
The results show that using deep learning methods to extract imaging features is of great value for the diagnosis of COVID-19. The time comparison between manual diagnosis and our DC-Net models is presented in Table 6. A senior radiologist requires 76.5909 seconds on average to make a diagnosis, approximately half of the 142.5909 seconds needed by a junior radiologist. In contrast, our three DC-Nets deliver the diagnosis result within about half a millisecond, which is over 10000 times faster than a senior radiologist. Therefore, the proposed models can be used in real-world situations without long waiting times.

In this study, we proposed three DC-Net models (DC-Net-S, DC-Net-E, and DC-Net-R) for the classification of new coronavirus pneumonia in CTIs. The DC-Net-R structure combines the batch-normalization-enhanced AlexNet variant with the RVFL algorithm. The comparative study among DC-Net-S, DC-Net-E, and DC-Net-R shows DC-Net-R to be a viable and promising choice for this classification task. Our research proves that the deep learning method can automatically identify lesions from CTIs and detect COVID-19 patients effectively for doctors. Artificial intelligence can be used as a preliminary screening tool to reduce the pressure on front-line radiologists and to reduce the misdiagnosis rate of COVID-19 patients. AI can also accelerate radiological diagnosis and has great potential to improve early diagnosis, treatment, and isolation, thereby helping to contain epidemics. This study has one limitation: the training dataset is relatively small and is collected from a single hospital, so it may not form a representative distribution of the general population. In the future, we shall collect more CT examinations from other hospitals to evaluate the detection efficiency of our model.
Acknowledgement We thank Qinghua Zhou from the University of Leicester, who helped with paper writing and English checking.

References
[1] A novel coronavirus outbreak of global health concern.
[2] Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan, China.
[3] A pathological brain detection system based on radial basis function neural network.
[4] A pathological brain detection system based on kernel based ELM. Multimedia Tools and Applications.
[5] A pathological brain detection system based on extreme learning machine optimized by bat algorithm.
[6] Pathological brain detection via wavelet packet Tsallis entropy and real-coded biogeography-based optimization. Fundamenta Informaticae.
[7] Chinese sign language fingerspelling recognition via six-layer convolutional neural network with leaky rectified linear units for therapy and rehabilitation.
[8] Going deeper with convolutions.
[9] Abnormality diagnosis in mammograms by transfer learning based on ResNet18.
[10] Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy.
[11] CT imaging features of 2019 novel coronavirus (2019-nCoV).
[12] A novel AI-enabled framework to diagnose coronavirus COVID-19 using smartphone embedded sensors: Design study.
[13] A tailored deep convolutional neural network design for detection of COVID-19 cases from chest radiography images.
[14] Machine learning analysis of chest CT scan images as a complementary digital test of coronavirus (COVID-19) patients.
[15] ImageNet classification with deep convolutional neural networks.
[16] Using pretrained AlexNet deep learning neural network for recognition of underwater objects.
[17] Inversion of PM2.5 atmospheric refractivity profile based on AlexNet model from the perspective of electromagnetic wave propagation. Environmental Science and Pollution Research.
[18] Detecting surface defects of wind turbine blades using an AlexNet deep learning algorithm.
[19] Scene classification with improved AlexNet model.
[20] Measuring saturation in neural networks.
[21] Convolutional neural networks can accurately distinguish four histologic growth patterns of lung adenocarcinoma in digital slides.
[22] Convolutional neural network for classification of two-dimensional array images generated from clinical information may support diagnosis of rheumatoid arthritis.
[23] A universal approximation theorem for mixture-of-experts models.
[24] A quality diagnosis method of GMAW based on improved empirical mode decomposition and extreme learning machine.
[25] Feedforward neural networks with random weights.
[26] Learning and generalization characteristics of the random vector functional-link net.
[27] Voting extreme learning machine based distributed denial of service attack detection in cloud computing.
[28] Universal approximation theorem for uninorm-based fuzzy systems modeling. Fuzzy Sets and Systems.
[29] Distributed music classification using random vector functional-link nets.
[30] The minimization of empirical risk through stochastic gradient descent with momentum algorithms.
[31] Large scale distributed deep networks.
[32] Visualization and interpretation of convolutional neural network predictions in detecting pneumonia in pediatric chest radiographs.
[33] End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography.
[34] Deep learning for the classification of small (≤ 2 cm) pulmonary nodules on CT imaging: A preliminary study.
[35] Radiographic and CT features of viral pneumonia.

Lun Yao is a vice president of The Fourth People's Hospital of Huai'an, Huai'an. He is a chief physician who graduated from Nanjing Medical College, Nanjing. He has engaged in clinical gastroenterology and infectious diseases for more than 30 years.

Yi Pan received his B.Eng. and M.Eng. degrees in computer engineering from Tsinghua University, Beijing, in 1982 and 1984, respectively, and his Ph.D. degree in computer science from the University of Pittsburgh, Pittsburgh, in 1991. His profile has been featured as a distinguished alumnus in both the Tsinghua Alumni Newsletter and the University of Pittsburgh CS Alumni Newsletter. Dr. Pan's current research interests include bioinformatics, parallel and cloud computing, and big data. He has published more than 450 papers, including over 250 journal papers and 100 IEEE/ACM Transactions/Journal papers, and has edited/authored 43 books. His work has been cited more than 14000 times based on Google Scholar, and his current h-index is 74.

Xin Zhang obtained his Bachelor's degree in medical imaging from Nanjing Medical University, Nanjing, in 2005, and his Master's degree in medical imaging from Qingdao University.