key: cord-0514838-265vmlkr
authors: Arco, Juan E.; Ortiz, Andrés; Ramírez, Javier; Gorriz, Juan M
title: Tiled sparse coding in eigenspaces for the COVID-19 diagnosis in chest X-ray images
date: 2021-06-28
journal: nan
DOI: nan
sha: 0cb982d06c7b7ab02bb84203b893c54760a51dd3
doc_id: 514838
cord_uid: 265vmlkr

The ongoing crisis of the COVID-19 (Coronavirus disease 2019) pandemic has changed the world. According to the World Health Organization (WHO), 4 million people have died due to this disease, whereas there have been more than 180 million confirmed cases of COVID-19. The collapse of the health system in many countries has demonstrated the need to develop tools that automate the diagnosis of the disease from medical imaging. Previous studies have used deep learning for this purpose. However, the performance of this alternative depends heavily on the size of the dataset employed for training the algorithm. In this work, we propose a classification framework based on sparse coding in order to identify the pneumonia patterns associated with different pathologies. Specifically, each chest X-ray (CXR) image is partitioned into different tiles. The most relevant features extracted by PCA are then used to build the dictionary within the sparse coding procedure. Once images are transformed and reconstructed from the elements of the dictionary, classification is performed from the reconstruction errors of the individual patches associated with each image. Performance is evaluated in a real scenario that requires simultaneous differentiation between four classes: control vs bacterial pneumonia vs viral pneumonia vs COVID-19. The accuracy when identifying the presence of pneumonia is 93.85%, whereas 88.11% is obtained in the 4-class classification context. The excellent results and the pioneering use of sparse coding in this scenario evidence the applicability of this approach as an aid for clinicians in a real-world environment.

The ongoing crisis of the COVID-19 (Coronavirus disease 2019) pandemic is still having a terrible effect on the health systems of countries worldwide. The World Health Organization (WHO) has confirmed that 4 million people have died due to this disease, whereas the number of infected people has risen to 180 million (World Health Organization, 2021b). Although the distribution of vaccines is greatly reducing the number of contagions, early detection of the disease remains crucial, since many people are contagious in the pre-symptomatic period (World Health Organization, 2021a). Reverse-transcription polymerase chain reaction (RT-PCR) is considered the gold standard for the detection of COVID-19 (Ai et al., 2020; Azar et al., 2020; He et al., 2020). However, the long time needed to obtain results and its relatively low sensitivity can delay the diagnosis and the subsequent choice of medical treatment. The use of medical imaging can be an alternative solution for the diagnosis of COVID-19, since patients affected by severe COVID-19 usually develop pneumonia. Non-invasive methods such as chest Computed Tomography (CCT) and chest X-ray (CXR) can play a crucial role in the identification of the ground glass opacities (GGO) typically associated with COVID-19. Although CCT leads to images with a higher resolution, the low cost of CXR means that most hospitals have an X-ray machine, making it an excellent tool for the diagnosis of this pathology.

* Corresponding author: jearco@ugr.es

Although medical imaging is extremely useful, it is challenging to discriminate COVID-19 by using only the information that these images offer, for several reasons. Findings and abnormalities associated with COVID-19 can be extremely similar to the ones present in other types of pneumonia. The decision of the radiologist can be highly influenced by his/her expertise, and there is an overlap between pneumonia symptoms and lung structures or abnormalities (Chandra et al., 2020; Maduskar et al., 2015). This leads to a manual and slow diagnostic process, with high inter- and intra-observer variability, that can put patients' health at risk in situations where the health system has collapsed. For this reason, automatic methods can play a crucial role by relieving clinicians when the workload is high and by serving as a B-reader when the diagnosis is not straightforward. Previous studies have relied on algorithms based on artificial intelligence for developing systems that allow the automatic detection of pneumonia (Alizadehsani et al., 2021a; Hemdan et al., 2020; Li et al., 2020b). Alizadehsani et al. (2021a) proposed a semi-supervised method based on Sobel edge detection (Kanopoulos et al., 1988) and generative adversarial networks (Goodfellow et al., 2014) to detect the presence of COVID-19. Ying et al. (2020) presented DeepPneumonia, an automatic tool for the identification of COVID-19 patients based on the identification of GGOs. Other studies compared different architectures by employing transfer learning on the ImageNet dataset (Ajin and Mredhula, 2017). Ezzat et al. (2021) also adopted a transfer learning method based on a pre-trained DenseNet121. In order to maximize performance, they optimized hyperparameters by using a gravitational search algorithm (GSA), leading to an accuracy of 98.28%. Elkorany and Elsharkawy (2021) proposed a deep learning model called COVIDetection-Net for detecting and classifying several types of pneumonia. Arco et al. (2020) introduced a Bayesian perspective in deep learning models in order to provide not only a classification output but also a measure based on uncertainty. Specifically, an ensemble classifier was used and the contribution of each individual classifier to the global system was derived from its uncertainty. This makes it possible to quantify the reliability of the prediction, which is especially interesting in medical contexts. Arco et al. (2021) provided a probabilistic alternative based on eigenlungs derived from Kernel Principal Component Analysis (PCA) and ensemble classification. This work focused on CT images instead of CXR ones, and yielded a classification accuracy of 97.86%. Table 1 provides an overview of recent work focused on the automatic detection of pneumonia.

Table 1. Recent work focused on the automatic detection of pneumonia.
Study | Data | Method | Classification task | Performance
Alizadehsani et al. (2021a) | 1000 CT scans | GAN model | Normal vs COVID | Acc = 99.95
Arco et al. (2020) | 6374 CXR images | Bayesian Deep Learning | Normal vs Bacterial vs Viral vs COVID pneumonia | Acc = 98.06
Arco et al. (2021) | 513 CT scans | Probabilistic Machine Learning | Normal vs COVID | Acc = 97.86
Li et al. (2020a) | 137 CT scans | 3D-Resnet-10 | Severe vs Critical COVID | AUC = 90.9
Li et al. (2020b) | 4356

The performance of deep learning approaches depends heavily on the size of the dataset available. When the number of samples is high, methods based on deep learning usually learn the main features that characterize the classes to be distinguished. When data is limited, deep networks may fail to learn the relationship between samples and labels. Thus, the use of deep learning methods would be restricted to scenarios where thousands of images are available. Despite the growing number of public datasets, the requirements for the correct use of deep learning approaches cannot always be met. In this work, we provide an alternative solution based on sparse coding in order to maximize the identification of the clinical symptoms associated with COVID-19 from CXR images. Each image is divided into a number of square tiles (also known as patches), and features are extracted by using PCA. The most relevant components are then used to build a dictionary, which is the basis of sparse coding. Images are coded and reconstructed according to the elements of the dictionary, obtaining a reconstruction error for each individual patch. Finally, the reconstruction errors of all patches contained in each image are entered into a classifier, which decides the presence (or not) of pneumonia. The performance of this classification framework is evaluated in a range of scenarios of increasing difficulty: from control vs pneumonia patients to a multiclass context that differentiates between four classes: controls vs bacterial pneumonia vs viral pneumonia vs COVID-19 pneumonia. The main contributions of our work can be summarized as follows:

• The pioneering use of sparse coding allows a novel and rapid approach for the diagnosis of pneumonia.

• The application of PCA optimizes the informativeness of the elements included in the dictionary.

• The division into patches allows the spatial identification of the pathology and its cause (bacteria, virus, COVID-19).

• Our approach offers an alternative to deep learning for scenarios where the number of samples is limited.

We have used the dataset available in Kaggle (2020b) for controls and patients who suffered from bacterial or non-COVID-19 pneumonia. According to the information described in Kermany et al. (2018a), the CXR images were selected from retrospective cohorts of pediatric patients aged one to five years from Guangzhou Women and Children's Medical Center, Guangzhou. All CXR images were acquired as part of the patients' routine clinical care. Institutional Review Board (IRB)/Ethics Committee approvals were obtained. The work was conducted in a manner compliant with the United States Health Insurance Portability and Accountability Act (HIPAA) and adhered to the tenets of the Declaration of Helsinki. Kermany et al. (2018a) collected and labeled a total of 5856 CXR images from children, including 4273 characterized as depicting pneumonia and 1583 normal. Of the images diagnosed with pneumonia, 2786 were labeled as bacterial pneumonia, whereas 1487 were labeled as viral pneumonia. The dataset containing COVID-19 patients is available in Kaggle (2020a) and includes 576 CXR images from adults. Figure 1 shows the CXR image of a control (CTL) and of patients suffering from bacterial (BAC), viral (VIR) and COVID-19 (CVD19) pneumonia.

When working with medical images, it is extremely important to apply preprocessing in order to improve the subsequent classification performance. This is especially important in CXR images, where low X-ray radiation and movement during image acquisition result in noisy and low-resolution images. This preprocessing also adapts images to the needs of the neural network. In order to mitigate computational and memory limitations, we downsampled the input images to a final size of 224x224 pixels. We also performed an intensity normalization procedure for each individual image based on standardization. Each image was transformed such that the resulting intensity distribution has a mean of 0 and a standard deviation of 1, as follows:

$$ I' = \frac{I - \mu}{\sigma} $$

where I is the original image, µ and σ are the mean and the standard deviation of its intensities, and I' is the resulting image.
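The preprocessing described above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions: the downsampling method is not specified in the text, so block averaging is used here as a stand-in, and the function names `standardize` and `downsample_by_block_mean`, the `eps` safeguard and the synthetic input are all hypothetical.

```python
import numpy as np

def standardize(image: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Return (I - mu) / sigma, with mu and sigma computed over all pixels of one image."""
    image = image.astype(np.float64)
    mu = image.mean()
    sigma = image.std()
    # eps is an added safeguard against division by zero for a constant image
    return (image - mu) / (sigma + eps)

def downsample_by_block_mean(image: np.ndarray, out_size: int = 224) -> np.ndarray:
    """Downsample a 2D image by averaging equally sized blocks (assumed method)."""
    h, w = image.shape
    fh, fw = h // out_size, w // out_size
    cropped = image[: fh * out_size, : fw * out_size]
    return cropped.reshape(out_size, fh, out_size, fw).mean(axis=(1, 3))

rng = np.random.default_rng(0)
raw = rng.integers(0, 256, size=(448, 448)).astype(np.float64)  # synthetic stand-in for a CXR image
small = downsample_by_block_mean(raw)   # 224 x 224
normalized = standardize(small)         # zero mean, unit standard deviation
```

Because the statistics are computed per image, differences in global brightness between acquisitions are removed before the patches are extracted.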

The idea behind sparse coding is derived from the way the primary visual cortex of the human brain works. The number of neurons in this brain region is much higher than the number of receptor cells in the retina, which suggests that a sparse code is used to efficiently represent natural scenes (Beyeler et al., 2019; Hunt et al., 2013; Vinje and Gallant, 2000). Sparse coding relies on the assumption that data can be represented in terms of a linear combination of basis elements (Wright et al., 2009). Consider a number of samples of class i,

$$ A_i = [v_{i,1}, v_{i,2}, \ldots, v_{i,n_i}] \in \mathbb{R}^{m \times n_i} $$

A new sample $y \in \mathbb{R}^m$ can be approximated by the linear span of the initial number of samples as follows:

$$ y = \alpha_{i,1} v_{i,1} + \alpha_{i,2} v_{i,2} + \cdots + \alpha_{i,n_i} v_{i,n_i} $$

where $\alpha_{i,j} \in \mathbb{R}$, $j = 1, 2, \ldots, n_i$, are the entries of the coefficient vector.
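As a small numerical illustration of this linear-span assumption, the sketch below builds a synthetic matrix of class samples, generates a new sample as a combination of two of them, and recovers the coefficients by least squares. The sizes, the random data and the names `A_i` and `alpha_true` are illustrative assumptions, and least squares is used only to show the idea (it does not enforce sparsity).

```python
import numpy as np

rng = np.random.default_rng(0)
m, n_i = 20, 4                                   # feature dimension, samples in class i
A_i = rng.normal(size=(m, n_i))                  # columns are the samples of class i
alpha_true = np.array([0.5, 0.0, -1.2, 0.0])     # only two samples contribute
y = A_i @ alpha_true                             # new sample lying in the span of A_i

# Recover the coefficients of the linear combination
alpha_hat, *_ = np.linalg.lstsq(A_i, y, rcond=None)
residual = np.linalg.norm(A_i @ alpha_hat - y)   # close to zero: y is in the span
```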

Sparse coding allows the representation of a signal in terms of a linear combination of a few atoms of a dictionary matrix. The main advantage of this technique is that a complex signal can be represented in a very concise manner. A crucial aspect in this process is the way the dictionary is built. The simplest solution is to use each individual image as a dictionary entry. Thus, the size of the dictionary would be exactly the same as the number of images available. This can be problematic when the size is too high, for two main reasons. First, a dictionary with a high number of atoms would lead to a slow process when transforming and reconstructing the images because of the matrix multiplications this process relies on. Second, employing features in the original space can be suboptimal when trying to maximize the differences in the representation of images of different classes. In this work, we propose a patch-based method relying on Principal Component Analysis (PCA) for the creation of the dictionary. Briefly, original images are divided into patches of a fixed size, and each individual patch is then stored as a column of a matrix. PCA (Jolliffe, 1986; Khedher et al., 2015) is then applied to this matrix. Given a set of N samples $x_k$,

$$ x_k = [x_{k1}, x_{k2}, \ldots, x_{kn}] \in \mathbb{R}^n $$

the aim of PCA is to find the projection directions that maximize the variance of a subspace. Thus, vector x is projected from the input space, $\mathbb{R}^n$, to a high-dimensional space, $\mathbb{R}^f$, through a mapping $\Phi$. In the new feature space, $\mathbb{R}^f$, the eigenvalue problem can be described as follows:

$$ C^{\Phi} w^{\Phi} = \lambda w^{\Phi} $$

where $C^{\Phi}$ is the covariance matrix of the mapped samples, and there exist coefficients $\alpha_i$ such that:

$$ w^{\Phi} = \sum_{i=1}^{N} \alpha_i \Phi(x_i) $$

Defining an $N \times N$ matrix K by

$$ K_{ij} = k(x_i, x_j) = \Phi(x_i) \cdot \Phi(x_j) $$

the PCA problem becomes:

$$ N \lambda \alpha = K \alpha $$

where α denotes a column vector with entries $\alpha_1, \ldots, \alpha_N$ (Schölkopf et al., 1998). Finally, vectors in the high-dimensional feature space are projected into a lower-dimensional space spanned by the eigenvectors $w^{\Phi}$. Given a sample x whose projection is $\Phi(x)$ in $\mathbb{R}^f$, the projection of $\Phi(x)$ onto the eigenvectors $w^{\Phi}$ gives the nonlinear principal components corresponding to $\Phi$, as follows:

$$ w^{\Phi} \cdot \Phi(x) = \sum_{i=1}^{N} \alpha_i \left( \Phi(x_i) \cdot \Phi(x) \right) = \sum_{i=1}^{N} \alpha_i k(x_i, x) $$
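The eigenvalue problem on the matrix K can be sketched with NumPy as follows. This is a minimal illustration, not the authors' implementation: it assumes a linear kernel and data centred in input space (a nonlinear kernel would additionally require centring K in feature space), and the function names are hypothetical. With these assumptions the projections coincide, up to sign, with those of ordinary PCA, which is used here as a sanity check.

```python
import numpy as np

def kernel_pca_fit(X, kernel, n_components):
    """Solve N*lambda*alpha = K*alpha and return the leading coefficient vectors as columns."""
    K = kernel(X, X)                          # N x N matrix, K_ij = k(x_i, x_j)
    eigvals, eigvecs = np.linalg.eigh(K)      # eigenvalues of K equal N * lambda
    order = np.argsort(eigvals)[::-1][:n_components]
    mu, A = eigvals[order], eigvecs[:, order]
    # Rescale so that each w = sum_i alpha_i Phi(x_i) has unit norm (alpha^T K alpha = 1)
    return A / np.sqrt(mu)

def kernel_pca_project(X_train, alphas, kernel, X_new):
    """Projection of Phi(x) onto each w: sum_i alpha_i k(x_i, x)."""
    return kernel(X_new, X_train) @ alphas

def linear_kernel(A, B):
    return A @ B.T

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 8))                  # 50 synthetic samples with 8 features
X = X - X.mean(axis=0)                        # centring in input space (enough for a linear kernel)
alphas = kernel_pca_fit(X, linear_kernel, n_components=3)
scores = kernel_pca_project(X, alphas, linear_kernel, X)

# Reference: ordinary PCA scores computed from the SVD of the centred data
_, _, Vt = np.linalg.svd(X, full_matrices=False)
pca_scores = X @ Vt[:3].T
```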

We divided the images into patches instead of using complete images because the patterns associated with pneumonia are usually located in small regions. The sparse coding stage can be summarized as follows:

• Division of the images into patches. We did this operation because patterns associated with pneumonia are usually located in small regions, which means that it could be better to analyze individual patches to perform the diagnosis.

• Creation of a preliminary dictionary where each column corresponds to an individual patch. The size of the resulting matrix will be M×N, where M is the number of pixels contained in each patch and N is the product of the number of patches contained in each image and the total number of images.

• Application of PCA to this matrix to obtain an optimum dictionary by maximizing the variance of the projected components. The eigenvectors will be the elements of the dictionary, so that their size will be derived from the number of principal components preserved.

• Computation of the decomposition coefficient vector $\hat{\alpha}$ by solving the L1-norm minimization problem of sparse coding: $\hat{\alpha} = \arg\min_{\alpha} \|\alpha\|_1$ subject to $\|X\alpha - y\|_2 \leq \varepsilon$, where X is the dictionary matrix, y is the test sample to be transformed and ε is the error tolerance.

• Estimation of the sparse reconstruction error for each patch y by employing the associated sparse coefficients $\hat{\alpha}$: $e = \|X\hat{\alpha} - y\|_2$

These operations can be seen as a feature extraction prior to the classification. Once the reconstruction errors are computed for all images, a new dataset is generated with as many features as patches and as many samples as images. Figure 2 depicts a schematic representation of the dictionary generation. The aim at this point is that the classifier finds the relationship between features and samples. We employed an SVM algorithm for binary classification (CTL vs PNEU) and a Random Forest (RF) classifier for multiclass classification.
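The feature extraction stage summarized above can be sketched end to end with NumPy. Several choices here are assumptions made for illustration and are not stated in the text: tiles are taken as non-overlapping, patches are stored as rows rather than columns, the constrained L1 problem is replaced by its penalized (Lagrangian) form solved with iterative soft-thresholding (ISTA), the reconstruction error is measured with the L2 norm, and the value of `lam` and all function names are hypothetical.

```python
import numpy as np

def extract_patches(image, patch):
    """Split an image into non-overlapping patch x patch tiles, one flattened tile per row."""
    h, w = image.shape
    rows, cols = h // patch, w // patch
    tiles = image[: rows * patch, : cols * patch].reshape(rows, patch, cols, patch)
    return tiles.transpose(0, 2, 1, 3).reshape(rows * cols, patch * patch)

def pca_dictionary(patches, n_components):
    """Dictionary whose columns are the leading principal directions of the patches."""
    centered = patches - patches.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:n_components].T                 # shape (M, n_components)

def sparse_code_ista(D, y, lam=0.1, n_iter=50):
    """Minimise 0.5 * ||D a - y||_2^2 + lam * ||a||_1 by iterative soft-thresholding."""
    step = 1.0 / (np.linalg.norm(D, 2) ** 2)   # inverse Lipschitz constant of the gradient
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        z = a - step * (D.T @ (D @ a - y))
        a = np.sign(z) * np.maximum(np.abs(z) - lam * step, 0.0)
    return a

def reconstruction_errors(image, D, patch, lam=0.1):
    """One reconstruction error per tile; together they form the feature vector of the image."""
    return np.array([
        np.linalg.norm(D @ sparse_code_ista(D, y, lam) - y)
        for y in extract_patches(image, patch)
    ])

rng = np.random.default_rng(0)
images = rng.normal(size=(4, 224, 224))        # synthetic stand-ins for preprocessed CXR images
patch, n_components = 14, 5
all_patches = np.vstack([extract_patches(img, patch) for img in images])
D = pca_dictionary(all_patches, n_components)
features = np.array([reconstruction_errors(img, D, patch) for img in images])
```

With 224x224 images and 14x14 tiles, `features` has one row per image and 256 columns, which is the "as many features as patches" layout that is passed to the classifier.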

The resulting reconstruction errors were then entered as input to the classification process, which was based on an SVM classifier with a linear kernel. This approach employs the hyperplane with the maximum separation between classes to distinguish between them. This separation is known as the margin, and the nearest data points are usually termed support vectors. From a mathematical perspective, it is possible to specify a linear SVM classification rule f by a pair (w, b), as follows:

$$ f(x_i) = \langle w, x_i \rangle + b $$

where w is the weight vector, $x_i$ is the feature vector and b is the bias term. Thus, a point x is classified as positive if

$$ f(x) > 0 $$

The maximum distance between the two classes is obtained by solving the optimisation problem described in Boser et al. (1996):

$$ \min_{w, b, \xi} \; \frac{1}{2}\|w\|^2 + C \sum_{i=1}^{N} \xi_i \quad \text{subject to} \quad y_i \left( \langle w, x_i \rangle + b \right) \geq 1 - \xi_i, \;\; \xi_i \geq 0 $$

where $y_i \in \{-1, +1\}$ is the label of training sample $x_i$, $\xi_i$ are slack variables, N is the number of training samples and C is usually known as the penalty for misclassification, or cost parameter. The solution to the optimisation problem can be written as:

$$ w = \sum_{i=1}^{N} \alpha_i y_i x_i $$

after applying the Lagrangian multipliers $\alpha_i$. Substituting this value of w into the classification rule f, it is possible to rewrite the decision function in its dual form as:

$$ f(x) = \sum_{i=1}^{N} \alpha_i y_i K(x, x_i) + b $$

where $\alpha_i$ and b represent the coefficients to be learned from the examples and $K(x, x_i)$ is the kernel function that characterizes the similarity between samples x and $x_i$. Since classes were unbalanced (e.g. the number of pneumonia patients was higher than the number of controls), we incorporated the weights of the classes into the cost function so that the majority class does not contribute more than the minority one.
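The sketch below illustrates two points from this description: the equivalence between the primal and dual forms of the linear decision function, and one common way of weighting classes in the cost function. The weighting scheme is an assumption (the text does not specify it); the inverse-frequency heuristic n_samples / (n_classes * class_count) is used here. The data, the multipliers `alpha` and the function names are made up for illustration and do not come from a trained model.

```python
import numpy as np

def balanced_class_weights(y):
    """Inverse-frequency weights: n_samples / (n_classes * class_count) (assumed scheme)."""
    classes, counts = np.unique(y, return_counts=True)
    return dict(zip(classes.tolist(), (len(y) / (len(classes) * counts)).tolist()))

def weighted_svm_objective(w, b, X, y, C, weights):
    """0.5 * ||w||^2 + C * sum_i c(y_i) * max(0, 1 - y_i * (<w, x_i> + b))."""
    hinge = np.maximum(0.0, 1.0 - y * (X @ w + b))
    sample_w = np.array([weights[label] for label in y.tolist()])
    return 0.5 * (w @ w) + C * np.sum(sample_w * hinge)

def decision_primal(w, b, x):
    return x @ w + b

def decision_dual(alpha, y, X, b, x):
    # Linear kernel: K(x, x_i) = <x, x_i>
    return np.sum(alpha * y * (X @ x)) + b

X = np.array([[2.0, 1.0], [1.5, 2.0], [3.0, 0.5], [-1.0, -1.0]])
y = np.array([1, 1, 1, -1])                  # three positives, one negative
weights = balanced_class_weights(y)          # the minority class receives the larger weight
alpha = np.array([0.3, 0.0, 0.0, 0.3])       # made-up multipliers with sum_i alpha_i * y_i = 0
b = 0.1
w = (alpha * y) @ X                          # w = sum_i alpha_i * y_i * x_i
x_new = np.array([0.5, -0.2])
f_primal = decision_primal(w, b, x_new)
f_dual = decision_dual(alpha, y, X, b, x_new)
```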

For the multiclass classification (controls vs different types of pneumonia), the reconstruction errors of the different patches of the images were entered into a Random Forest classifier. RF is an ensemble method that combines a number of decision trees in order to improve the performance of individual classifiers. The trees are built from k random vectors, $\Theta_k$, each of which is independent of the past random vectors $\Theta_1, \Theta_2, \ldots, \Theta_{k-1}$ but has the same distribution. The process developed by Breiman (2001) employs bagging for generating each random vector Θ as the N observations randomly drawn from the training set. Once a large number of trees $\{h(x, \Theta_k), k = 1, \ldots\}$ is generated, each one of them casts a vote for the most popular class at input x. The final decision of the classifier is determined by a majority vote of the trees.

One important feature of RF classifiers is related to their convergence and generalization error (Breiman, 2001; Yang et al., 2008). Given a set of classifiers $h_1(x), h_2(x), \ldots, h_k(x)$, and a training set randomly drawn from the distribution of a random vector (X, Y), the margin function is defined as:

$$ mg(X, Y) = av_k\, I(h_k(X) = Y) - \max_{j \neq Y} av_k\, I(h_k(X) = j) $$

where X is the input vector, Y is the corresponding class, $av_k$ denotes the average over the k classifiers and I(·) is the indicator function. The margin measures the extent to which the average number of votes at X, Y for the right class exceeds the average vote for any other class. Thus, the larger the margin, the more confidence in the classification. The generalization error is given by:

$$ PE^{*} = P_{X,Y}\left( mg(X, Y) < 0 \right) $$

where $P_{X,Y}$ indicates that the probability is over the X, Y space. According to Theorem 1.2 in Breiman (2001), as the number of trees increases, for almost surely all sequences $\Theta_1, \ldots$, $PE^{*}$ converges to:

$$ P_{X,Y}\left( P_{\Theta}(h(X, \Theta) = Y) - \max_{j \neq Y} P_{\Theta}(h(X, \Theta) = j) < 0 \right) $$

This limit follows from the Strong Law of Large Numbers. It explains why random forests do not overfit as more trees are added to the ensemble, since they produce a limiting value of the generalization error.

A crucial aspect of RF models is related to the two randomized procedures applied for building the trees (Yang et al., 2008). For a number of cases in the training set, N, and a number of variables in the classifier, M, the number of input variables used for determining the decision at a node of the tree is given by m. This number should be much less than M ($m \ll M$). Briefly, N samples are randomly selected with replacement from the training set as the new training set. For each node of the tree, m of the M variables on which to base the decision at that node are randomly selected. The best split based on these m variables is computed, and each tree is fully grown and not pruned.

Figure 3 shows a schematic representation of the entire classification process. Once the dictionary is built, images are transformed and reconstructed, leading to an error value when the reconstructed images are compared with the original ones. The reconstruction errors for each individual patch are then entered into the classifier, which assigns the final label prediction.
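As a small numerical illustration of the Random Forest margin function and the majority vote, the sketch below computes vote shares, margins and an empirical error estimate from a made-up table of tree predictions. The votes, the three-class setting and the function names are illustrative assumptions; no trees are actually trained.

```python
import numpy as np

def vote_fractions(tree_predictions, n_classes):
    """tree_predictions: (n_trees, n_samples) integer labels -> (n_samples, n_classes) vote shares."""
    n_samples = tree_predictions.shape[1]
    shares = np.zeros((n_samples, n_classes))
    for c in range(n_classes):
        shares[:, c] = (tree_predictions == c).mean(axis=0)
    return shares

def margin(shares, y_true):
    """Vote share of the right class minus the largest share among the other classes."""
    idx = np.arange(len(y_true))
    right = shares[idx, y_true]
    others = shares.copy()
    others[idx, y_true] = -np.inf             # exclude the right class from the maximum
    return right - others.max(axis=1)

preds = np.array([[0, 1, 2],
                  [0, 1, 1],
                  [0, 2, 1],
                  [1, 1, 1]])                 # 4 trees (rows), 3 samples (columns), made-up votes
y_true = np.array([0, 1, 2])
shares = vote_fractions(preds, n_classes=3)
mg = margin(shares, y_true)                   # positive margin means the forest is right
forest_prediction = shares.argmax(axis=1)     # majority vote
empirical_error = np.mean(mg < 0)             # sample estimate of P(mg(X, Y) < 0)
```

In this toy example the third sample receives only one vote out of four for its true class, so its margin is negative and it is the only misclassified sample.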

For all experiments, a 5-fold stratified cross-validation scheme was used to estimate the generalization ability of our method (Kohavi, 1995). We evaluated the performance of the classification frameworks in terms of the following parameters from the confusion matrix:

$$ Acc = \frac{TP + TN}{TP + TN + FP + FN}, \qquad Sens = \frac{TP}{TP + FN}, \qquad Spec = \frac{TN}{TN + FP} $$

where TP is the number of pneumonia patients correctly classified (true positives), TN is the number of control subjects correctly classified (true negatives), FP is the number of control subjects classified as pneumonia (false positives) and FN is the number of pneumonia patients classified as controls (false negatives). We also employed the area under the ROC curve (AUC) as an additional measure of the classification performance (Hajian-Tilaki, 2013; Mandrekar, 2010). In the multiclass scenario, the information derived from parameters such as Sens or Spec cannot be easily interpreted. In this context, we employed a method based on a multiclass One-vs-One scheme to compare every unique pairwise combination of classes (Allwein et al., 2001). The multiclass AUC was computed by averaging the results obtained for each individual comparison. Moreover, a multiclass version of the balanced accuracy was computed, as follows:

$$ Bal\,Acc = \frac{1}{M} \sum_{m=1}^{M} \frac{r_m}{n_m} $$

where M is the number of classes, $n_m$ is the number of samples belonging to class m and $r_m$ is the number of samples belonging to class m that are accurately predicted.
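These performance measures can be computed directly from the confusion-matrix counts and the predicted labels. The sketch below uses the standard definitions of accuracy, sensitivity and specificity together with the multiclass balanced accuracy defined from M, n_m and r_m; the counts, the labels and the function names are made up for illustration and are not results from this work.

```python
import numpy as np

def binary_metrics(tp, tn, fp, fn):
    """Accuracy, sensitivity and specificity from confusion-matrix counts."""
    return {
        "acc": (tp + tn) / (tp + tn + fp + fn),
        "sens": tp / (tp + fn),
        "spec": tn / (tn + fp),
    }

def multiclass_balanced_accuracy(y_true, y_pred):
    """Average over classes of r_m / n_m (correctly predicted samples of class m over its size)."""
    classes = np.unique(y_true)
    per_class = [np.mean(y_pred[y_true == m] == m) for m in classes]
    return float(np.mean(per_class))

metrics = binary_metrics(tp=90, tn=40, fp=10, fn=10)   # hypothetical counts

y_true = np.array([0, 0, 1, 1, 1, 2])                  # hypothetical labels for 3 classes
y_pred = np.array([0, 1, 1, 1, 0, 2])
bal_acc = multiclass_balanced_accuracy(y_true, y_pred)
```

Because each class contributes equally to the average, a classifier cannot obtain a high balanced accuracy by favouring the majority class.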

In this work we propose a classification framework to identify the patterns associated with pneumonia from CXR images. To do so, we define two experiments:

• Experiment 1: Binary classification to distinguish between different groups in three contexts: CTL vs PNEU, which includes all images labelled as CTL and PNEU regardless of the type of pneumonia; BAC vs VIR, which divides the images from patients diagnosed with pneumonia according to the cause of the disease (bacterial or viral); and VIR vs CVD19, where the aim was to identify whether viral pneumonia was caused by COVID-19 or not. In this first experiment, the resulting features from the sparse coding phase were entered into a linear SVM classifier. We varied the size of the patches and the number of resulting components from PCA during the construction of the dictionary in order to evaluate their influence on the performance. The parameters associated with this algorithm were optimized in a grid-search process within the training phase.

• Experiment 2: Multiclass classification by using an RF classifier in order to distinguish between the four different classes contained in the database. This algorithm combines the decisions of individual trees to obtain the final diagnosis of the patient. The process for building the dictionary and the classification framework are identical to those of Experiment 1, except for the aforementioned change in the classifier.

We first explore how performance varies according to two parameters: the number of patches each image is divided into and the number of components retrieved from PCA to build the dictionary. Results are summarized in Table 2 for the four different classification contexts. The maximum accuracy obtained in the CTL vs PNEU scenario is 93.85%, with a patch size of 14x14 and 5 components used to compute the dictionary. It is important to note that there is no clear relationship between these two variables and the resulting accuracy. However, a drop in accuracy appears when the patches are too large (56x56). This can be related to the fact that pneumonia patterns are usually located in small regions of the CXR images. When applying sparse coding, the information extracted can be related to pulmonary affections derived from pneumonia. However, when the size of the patches increases, this information can be due to other sources, such as pulmonary structures that are completely normal, which increases the difficulty of the classification task.
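The link between patch size and the number of features entering the classifier can be made explicit with a short calculation. It assumes non-overlapping square tiles on the 224x224 images, which is an assumption consistent with the partition into tiles but not stated explicitly; only the three patch sizes mentioned in the text are used.

```python
image_side = 224
patch_sides = (14, 16, 56)

# Number of tiles per image, i.e. number of reconstruction-error features per image
tiles_per_image = {p: (image_side // p) ** 2 for p in patch_sides}

for p, n_tiles in tiles_per_image.items():
    print(f"{p}x{p} patches -> {n_tiles} tiles per image")
```

Under this assumption, 14x14 and 16x16 patches give 256 and 196 features per image, whereas 56x56 patches leave only 16, so each feature summarizes a much wider lung region.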

It is important to mention that the performance in the second context (BAC vs VIR pneumonia) is lower than in the first scenario, reflecting the higher difficulty of this classification. Specifically, the maximum accuracy was 88.85%, with a patch size of 16x16 and 8 components. We also observe that the discrimination ability of the proposed system is larger in the VIR vs CVD19 scenario, with a maximum accuracy of 96.36%. This may indicate that the pathology caused by COVID-19 is more severe than, and different from, the one caused by other viruses or bacteria. Finally, the best result in the multiclass context led to an accuracy of 88.11%. Results in terms of different performance measures associated with the situation of maximum accuracy are shown in Table 3.

Figure 4 summarizes the influence of the patch size on the classification performance. The maximum accuracies are obtained with square patches of 14x14 or 16x16 pixels in the different classification contexts. Although using patches that are too small leads to a non-optimal classifier, the accuracy decreases sharply when patches that are too large are employed. This indicates that covering regions that are too wide can be detrimental for the identification of pneumonia, especially in cases where the affection is not severe.

Figure 5 depicts the ROC curves for the different classifiers. The best results are obtained in the VIR vs CVD19 context, since differences between these two groups of patients are clear. However, our system can also distinguish between patients with the same pathology (pneumonia) but different etiology (bacterial or viral). Further discussion about the results and their implications is provided in Section 7.

In this study, we proposed a classification system for the detection of different types of pneumonia from CXR images. This approach is based on the construction of a dictionary that relies on the assumption that an image can be expressed as a linear combination of different atoms. We employed a scheme in which each image was divided into patches and the dictionary was built from the components of maximum variance from the patches of all images. The reconstruction errors obtained from the resulting dictionary were then used as input features of a classifier. We evaluated the performance of this approach in different classification scenarios. In the first context, the two classes generated relatively big differences in the observed pattern (pneumonia vs control), whereas in the second (bacterial vs viral pneumonia) and in the third one (CVD19 vs no-CVD19) these differences were extremely small. Besides, the performance of a multiclass classifier was also evaluated in order to check if this method could simultaneously differentiate between the different pathologies.

Previous studies have employed sparse coding for the processing and analysis of different signals (Ortega et al., 2016; Ortiz et al., 2019). However, most of them have used it within the classification stage instead of as a feature extractor. Specifically, images are reconstructed from atoms of the dictionary corresponding to the different classes. The final label is assigned according to the class that yields the minimum reconstruction error. This alternative, applied in combination with ensemble classification, has shown a high performance in previous works (Shekhar et al., 2014; Xu et al., 2014; Yang et al., 2009). However, it is difficult to use when input images are not analyzed as a whole but divided into patches. Patterns associated with COVID-19 can be distributed in different locations in the image. According to the severity of the infection, they can be widespread or confined to small regions. This last situation can be highly problematic when trying to automate the diagnosis, for one main reason. It is possible that most of the regions within the ensemble are labeled as 'controls' because they are not affected by the pulmonary affection, whereas only a small number of regions are identified as 'covid patient'. In this case, combining the results from individual patches is not straightforward. Employing majority voting is not an optimal solution, especially when a non-severe affection is present. Previous studies have weighted the contribution of individual patches according to a specific measure, e.g. uncertainty in Bayesian frameworks (Arco et al., 2020). There are some scenarios in which two lung regions are labeled with opposite diagnoses and both classifiers' decisions are correct, especially if the pneumonia is not widespread. In order to overcome this issue, features extracted from individual parts of the images are treated as a whole in the classification stage to optimize the diagnosis process.

Another remarkable aspect of the proposed method is the high performance obtained without requiring spatial registration or normalization of the images. The use of artificial intelligence for the automatic detection of different pathologies is widespread, e.g. neurological disorders such as Parkinson's or Alzheimer's (Arco et al., 2016; Castillo-Barnes et al., 2018; Górriz et al., 2020). When analyzing patterns associated with brain anatomy or function, most of these techniques require a spatial correspondence between the images of all subjects. This can be obtained by employing operations based on spatial transformations such as registration or normalization. However, the application of these approaches to CXR images is much harder for several reasons. First, there is a high variability in the size and shape of lungs. More importantly, the position of each patient inside the scanner differs across the acquired images. When trying to apply spatial transformations to mitigate these issues, it is possible to introduce high levels of noise that invalidate the results obtained. We have developed an accurate tool that does not require any additional preprocessing to achieve a high performance. In fact, the information extracted by the sparse coding methodology, together with the computation of the reconstruction errors, performs consistently well even though no spatial correspondence between the different images is computed.

It is worth mentioning that the method proposed in this work can be an excellent option in contexts where the applicability of deep learning approaches is not straightforward. Alternatives based on deep learning have proven to be an ideal solution when applied to medical imaging in a wide range of scenarios. In particular, previous works have demonstrated a high performance when used to detect pneumonia (Alizadehsani et al., 2021b; Kermany et al., 2018b; Mittal et al., 2020; Wang et al., 2021b). The main issue is that this kind of technique requires a high number of training samples in order to learn the features that allow the detection of a specific pathology. The implementation of a global repository of COVID-19 images would address this problem. However, collaboration between different medical centers is not always possible. For this reason, it is important to note that the design of our method allows detecting the presence of pneumonia even when a large amount of data is not available. Another crucial difference between our proposal and deep learning methods is related to the computational burden. Specifically, the number of mathematical operations performed by our approach is considerably lower than the number required in deep learning. This allows the implementation and use of our framework in research centres with reduced computational resources. Moreover, the high performance obtained in the multiclass classification shows that the tool proposed in this work can be successfully employed in a real scenario. These results reveal the usefulness of this technique not only for detecting the presence of pneumonia, but also for properly identifying the cause of this pathology.

The ongoing crisis of the COVID-19 (Coronavirus disease 2019) pandemic has changed the world. Four million people have died due to this disease, whereas there have been more than 180 million confirmed cases of COVID-19. The collapse of the health system in many countries has demonstrated the need to develop tools that automate the diagnosis of the disease from medical imaging. In this paper, we proposed a classification framework based on sparse coding to detect the pneumonia patterns caused by different pathologies. This tool creates a dictionary from the most relevant features extracted by PCA from the individual patches of the CXR images. The images are then transformed and reconstructed, and the resulting reconstruction errors are used as inputs to the classifier. The reduced computational cost compared to deep learning, together with the high performance preserved (88.11% in the multiclass scenario), evidences the applicability of the method as an aid for clinicians in a real context. These results pave the way for the application of sparse coding in a wide range of scenarios, especially when the number of samples available is limited.

Correlation of chest CT and RT-PCR testing for coronavirus disease 2019 (COVID-19) in China: A report of 1014 cases

Diagnosis of interstitial lung disease by pattern classification

Uncertainty-aware semisupervised method using large unlabelled and limited labe

Uncertainty-aware semisupervised method using large unlabelled and limited labe

Reducing multiclass to binary: A unifying approach for margin classifiers

Uncertainty-driven ensembles of deep architectures for multiclass classification. Application to COVID-19 diagnosis in chest X-ray images

Probabilistic combination of eigenlungs-based classifiers for COVID-19 diagnosis in chest CT images

Improving short-term prediction from MCI to AD by applying searchlight analysis

Fractured aluminum nasopharyngeal swab during drive-through testing for COVID-19: radiographic detection of a retained foreign body

Neural correlates of sparse coding and dimensionality reduction

A training algorithm for optimal margin classifier

Robust ensemble classification methodology for I123-ioflupane SPECT images and multiple heterogeneous biomarkers in the diagnosis of Parkinson's disease. Frontiers in Neuroinformatics 12

Automatic detection of tuberculosis related abnormalities in chest X-ray images using hierarchical feature extraction scheme. Expert Systems with Applications 158

Can AI help in screening viral and COVID-19 pneumonia?

Covidetection-net: A tailored COVID-19 detection from chest radiography images using deep learning

An optimized deep learning architecture for the diagnosis of COVID-19 disease based on gravitational search optimization

COVID-19 in CXR: From detection and severity scoring to patient disease monitoring

Deep convolutional neural network-based computer-aided detection system for COVID-19 using multiple lung scans: Design and implementation study

Generative adversarial networks

Artificial intelligence within the interplay between natural and artificial computation: Advances in data science, trends and applications

Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation

Diagnostic performance between CT and initial real-time RT-PCR for clinically suspected 2019 coronavirus disease (COVID-19) patients outside Wuhan, China

COVIDX-Net: A framework of deep learning classifiers to diagnose COVID-19 in X-ray images

Sparse coding can predict primary visual cortex receptive field changes induced by abnormal visual input

Abnormality detection and intelligent severity assessment of human chest computed tomography scans using deep learning: a case study on SARS-CoV-2 assessment

Automatic detection of COVID-19 from chest CT scan and chest X-rays images using deep learning, transfer learning and stacking

Principal Component Analysis and Factor Analysis

Chest X-Ray Images (Covid-19 & Pneumonia)

Chest X-Ray Images (Pneumonia) dataset

Design of an image edge detection filter using the Sobel operator

Automatic detection of coronavirus disease (COVID-19) in X-ray and CT images: A machine learning based approach

Identifying medical diagnoses and treatable diseases by image-based deep learning

Identifying medical diagnoses and treatable diseases by image-based deep learning

Early diagnosis of Alzheimer's disease based on partial least squares, principal component analysis and support vector machine using segmented MRI images

COVID-19 pneumonia diagnosis using a simple 2D deep learning framework with a single chest CT image: Model development and validation

A study of cross-validation and bootstrap for accuracy estimation and model selection

Classification of severe and critical COVID-19 using deep learning and radiomics

Artificial intelligence distinguishes COVID-19 from community acquired pneumonia on chest CT

SVM-based CAD system for early detection of the Alzheimer's disease using kernel PCA and LDA

Automatic tool for Alzheimer's disease diagnosis using PCA and Bayesian classification rules

Automatic detection of pleural effusion in chest radiographs

COVID-19 detection from chest X-ray images using deep learning and convolutional neural networks, in: 11th Hellenic Conference on Artificial Intelligence, Association for Computing Machinery

Receiver operating characteristic curve in diagnostic test assessment

Automated detection of COVID-19 from CT scan using convolutional neural network

Detecting pneumonia using convolutions and dynamic capsule routing for chest X-ray image

Automatic detection of coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks

Classification of motor imagery tasks for BCI with multiresolution analysis and multiobjective feature selection

Empirical functional PCA for 3D image feature extraction through fractal sampling

Automated detection of COVID-19 cases using deep neural networks with X-ray images

Automated detection of COVID-19 cases on radiographs using shape-dependent Fibonacci-p patterns

A fully automated deep learning-based network for detecting COVID-19 from a new and large lung CT scan dataset

Nonlinear component analysis as a kernel eigenvalue problem

Analysis sparse coding models for image-based classification

COVIDGR dataset and COVID-SDNet methodology for predicting COVID-19 based on chest X-ray images

Sparse coding and decorrelation in primary visual cortex during natural vision

COVID-19 classification by FGCNet with deep feature fusion from graph convolutional network and convolutional neural network

Automatically discriminating and localizing COVID-19 from community-acquired pneumonia on chest X-rays

World Health Organization, 2021a. Coronavirus disease 2019 (COVID-19)

World Health Organization, 2021b. WHO coronavirus (COVID-19) dashboard

Robust face recognition via sparse representation

Supervised Bayesian sparse coding for classification

Random forests classifier for machine fault diagnosis

Linear spatial pyramid matching using sparse coding for image classification

Deep learning enables accurate diagnosis of novel coronavirus (COVID-19) with CT images

This work was partly supported by the MINECO/FEDER under the PGC2018-098813-B-C32, RTI2018-098913-B100, CV20-45250 and A-TIC-080-UGR18 projects.