key: cord-0942972-k8v3k9rb authors: Delli Pizzi, Andrea; Chiarelli, Antonio Maria; Chiacchiaretta, Piero; Valdesi, Cristina; Croce, Pierpaolo; Mastrodicasa, Domenico; Villani, Michela; Trebeschi, Stefano; Serafini, Francesco Lorenzo; Rosa, Consuelo; Cocco, Giulio; Luberti, Riccardo; Conte, Sabrina; Mazzamurro, Lucia; Mereu, Manuela; Patea, Rosa Lucia; Panara, Valentina; Marinari, Stefano; Vecchiet, Jacopo; Caulo, Massimo title: Radiomics-based machine learning differentiates “ground-glass” opacities due to COVID-19 from acute non-COVID-19 lung disease date: 2021-08-26 journal: Sci Rep DOI: 10.1038/s41598-021-96755-0 sha: acd1ea01ccc426876478cbf32eb1ea33c7a1f134 doc_id: 942972 cord_uid: k8v3k9rb Ground-glass opacities (GGOs) are a non-specific high-resolution computed tomography (HRCT) finding tipically observed in early Coronavirus disesase 19 (COVID-19) pneumonia. However, GGOs are also seen in other acute lung diseases, thus making challenging the differential diagnosis. To this aim, we investigated the performance of a radiomics-based machine learning method to discriminate GGOs due to COVID-19 from those due to other acute lung diseases. Two sets of patients were included: a first set of 28 patients (COVID) diagnosed with COVID-19 infection confirmed by real-time polymerase chain reaction (RT-PCR) between March and April 2020 having (a) baseline HRCT at hospital admission and (b) predominant GGOs pattern on HRCT; a second set of 30 patients (nCOVID) showing (a) predominant GGOs pattern on HRCT performed between August 2019 and April 2020 and (b) availability of final diagnosis. Two readers independently segmented GGOs on HRCTs using a semi-automated approach, and radiomics features were extracted using a standard open source software (PyRadiomics). Partial least square (PLS) regression was used as the multivariate machine-learning algorithm. A leave-one-out nested cross-validation was implemented. PLS β-weights of radiomics features, including the 5% features with the largest β-weights in magnitude (top 5%), were obtained. The diagnostic performance of the radiomics model was assessed through receiver operating characteristic (ROC) analysis. The Youden’s test assessed sensitivity and specificity of the classification. A null hypothesis probability threshold of 5% was chosen (p < 0.05). The predictive model delivered an AUC of 0.868 (Youden’s index = 0.68, sensitivity = 93%, specificity 75%, p = 4.2 × 10(–7)). Of the seven features included in the top 5% features, five were texture-related. A radiomics-based machine learning signature showed the potential to accurately differentiate GGOs due to COVID-19 pneumonia from those due to other acute lung diseases. Most of the discriminant radiomics features were texture-related. This approach may assist clinician to adopt the appropriate management early, while improving the triage of patients. Study population. The study received formal approval from the Ethical Committee of the University G. d' Annunzio of Chieti-Pescara, Italy; informed consent was waived by the same ethics committee that approved the study (Comitato Etico per la Ricerca Biomedica delle Province di Chieti e Pescara e dell'Università degli Studi "G. d' Annunzio" di Chieti e Pescara, Italy). The study was conducted according to ethical principles laid down by the latest version of the Declaration of Helsinki. We retrospectively included a total of 120 consecutive patients diagnosed with SARS-CoV-2 infection based on RT-PCR who underwent a clinically indicated highresolution chest CT (HRCT) between March 2020 and April 2020 at our institution. Patients were included if they met all the following criteria: (a) baseline HRCT performed at hospital admission, (b) GGO as predominant feature on chest CT scans. Another set of 310 patients (nCOVID) with clinically indicated HRCT for acute respiratory disease performed between August 2019 and April 2020 was retrospectively included in the study (nCOVID) . For this second set, patients were included if they met all the following criteria: (a) GGO as predominant feature on chest CT scans, (b) availability of final diagnosis (clinical, laboratory, or pathology). The presence of a predominant GGOs pattern was assessed by two radiologists (M.M. and R.L.P.) with more than Scientific Reports | (2021) 11:17237 | https://doi.org/10.1038/s41598-021-96755-0 www.nature.com/scientificreports/ 10 years of experience in chest imaging, in consensus. More in detail, the readers assessed the presence of GGOs, consolidations, and "crazy paving" on CT images. In this regard, apart when the GGOs were the only CT finding, the predominant GGOs pattern was defined as present when the GGOs were considered like the major finding compared to consolidation and/or crazy paving pattern 10, [30] [31] [32] [33] . In the first set (COVID), we excluded 92 patients: 14 had severe respiratory artefacts, 78 had a non-predominant GGO pattern. In the second set (nCOVID) we excluded 280 patients: 32 had severe respiratory artefacts, 210 had a non-predominant GGO alteration and 38 were treated in another hospital and the final diagnosis was not available. None of the patients considered eligible for the study had a concomitant malignancy. The final study population was composed of 28 COVID and 30 nCOVID for a total of 58 patients (Fig. 1 ). CT protocol. Non-enhanced chest CT scans were performed in a supine position, during inspiratory breathhold, from the apex to the lung bases, on a 128-slice multi-detector CT scanner (Somatom Definition AS, Siemens Healthineers, Germany). The field of view ranged between 35 and 40 cm according to the body size. The electronic window values were amplitude (W) 1200-1600 HU and window or center level (L) between − 600 and − 750 HU. The main scan parameters were: tube voltage = 120 kVp, automatic tube current modulation (30-70 mAs), pitch = 0.9-1.5 mm (0.9, 1.2 and 1.5 mm for 6, 47 and 5 patients respectively), matrix = 512 × 512. The images were reconstructed with a slice thickness of 0.625-1.250 mm (0.625, 1.000, and 1.250 mm for 46, 8, and 4 patients respectively) with the same increment with a high spatial frequency reconstruction algorithm (B50, I50). A whole-volume semi-automated GGOs delineation was independently performed by two fourth year senior radiology residents (C.V. and M.V.) that were blinded from swabs results using an open-source medical image computing platform, 3DSlicer Version 4.8 (www. 3dsli cer. org) (Fig. 2a) . In detail, the GGOs were segmented using a "threshold-effect" tool and manually setting the threshold between − 1350 and − 700 HU 8, 34, 35 . If necessary, the segmentation was further manually corrected by each reader in order to exclude automated segmented pixels beyond the GGOs. Once the semiautomated segmentation of GGOs was concluded, the lungs were automatically extracted via Convolutional Neural Network (CNN) algorithms to create binary mask 36 . Then, a logical "and", between these masks and the segmentations obtained by the radiology residents, was performed (using "3dcalc") to exclude automated segmented pixels beyond the lungs, thus obtaining the final ROIs 37 . All the ROIs were verified by a senior radiologist with more than 10 years of experience in chest imaging (M.M.) to confrirm the correct position and correspondence with the underlying CT findings. The extraction of the radiomics features was conducted using PyRadiomics (https:// pyrad iomics. readt hedocs. io), a flexible open-source platform capable of extracting a large panel of engineered features from medical images; this radiomics quantification platform enables the standardization of both feature definitions and image processing 38 . To avoid data heterogeneity bias and minimize acquisition-related radiomics variability, HRCT images were subjected to imaging resampling (2 × 2 × 2 mm) 39 . For each ROI, ten built-in filters (Original, wavelet, Laplacian of Gaussian (LoG), square, square root, logarithm, exponential, Gradient, www.nature.com/scientificreports/ LBP2D, LBP3D) were applied and seven feature classes (first order statistics, shape descriptors, glcm, glrlm, ngtdm, gldm, glszm) were calculated, for a total of 1409 radiomics features. The reproducibility assessment of the features extracted from segmentations of all patients was performed. Machine learning approach: partial least square (PLS) regression. A machine learning (i.e. multivariate) approach was implemented to exploit radiomics features multidimensionality (Fig. 2b) . Two main approaches were implemented to improve and correctly assess the generalization performance of the machine learning model [40] [41] [42] . The first approach was to reduce the number of features by selecting only those that were highly repeatable (r > 0.95) between the two masks delineated by the senior radiology residents. The second approach was to implement a machine learning framework based on a linear regression analysis that employed a space dimension reduction procedure, namely the partial least square (PLS) regression 40, [43] [44] [45] . The PLS was used to differentiate COVID from nCOVID patients. Moreover, in this work, a leave-one-out nested cross-validation (nCV) was implemented to optimize the PLS number of components and to assess the PLS generalization performance 42, [46] [47] [48] . The β-weights of the PLS analysis were obtained by running the algorithm on the complete dataset with the optimal number of components delivered by the nCV analysis. They linked the original independent variables with the dependent variable thus depicting the importance and sign of the original variables in the prediction. Among β-weights, top 5% features were calculated. Those features included the 5% features with the largest β-weights in magnitude thus representing the features with the highest predictive capability. The machine learning analyses were implemented in Matlab. Statistical analysis. The inter-reader correlation of radiomics features was assessed using an across-subjects Person correlation coefficient. Only radiomics features with high correlation coefficient (above 0.95) were used within the machine learning model 49 . The COVID vs nCOVID classification performance was assessed through Receiver Operating Characteristic (ROC) analysis comparing the inferred (out-of-training-sample) with the true group. COVID patients were attributed to the "positive" group, whereas nCOVID patients were attributed to the "negative" group. The ROC analysis was also performed on random shuffled group labels to simulate the null hypothesis and evaluate its confidence interval (repeated 10 6 times). The ROC analysis delivered an Area Under the Curve (AUC), which could be transformed into a z-score for assessing its statistical significance by using the random shuffled group labels. The Youden's test was used to calculate the sensitivity and specificity of the ROC analysis. 5% null hypothesis probability threshold was chosen (p < 0.05). The statistical analysis was performed in MATLAB. Ethical statement. This study was approved by the local ethics committee. The study used only pre-existing medical data, therefore patient consent was waived. Study population. The majority of patients included in the study were male (n = 34, 59%), and the median age was 66 years (interquartile range 55-81). Out of the total patient population (n = 58), 28 (48%) were assigned to the COVID group, and 30 (52%) to the nCOVID group. The nCOVID group (n = 30) included four with Cytomegalovirus (CMV) pneumonia, two with pulmonary edema, five with Acute Respiratory Distress Syndrome (ARDS), eight with Organizing Pneumonia, three with Pneumocystis Jirovecii pneumonia, two with Influenza A pneumonia, two with Legionella pneumonia, three with alveolar hemorrhage and one with hypersensitivity pneumonia (Table 1) . A total of 1409 radiomics features were extracted. 153 of these features showed an inter-reader correlation of r > 0.95 and were used for further analysis. When employing radiomics features with an r > 0.95, i.e., 153 radiomics features, an AUC = 0.868 was obtained (z = 5.1, p = 4.2 × 10 -7 , Fig. 3) . A Youden's index of 0.68 was associated with a sensitivity and specificity of 93% and 75% respectively ( Table 2 ). The estimated optimal number of PLS components, evaluated within the nCV framework, was 7. The weights of the PLS (β-weights) are shown in (Fig. 4a,b) . Since a value of "1" was attributed to the COVID patients and a value of "0" was attributed to the nCOVID patients during the machine learning training, a positive weight suggests a higher feature value in COVID compared to nCOVID patients with an opposite behavior for a negative weight. Of the top 5% features, 5 (wavelet_LLH_glrlm_GrayLevelNonUniformity, wavelet_LHH_ glcm_DifferenceVariance, wavelet_LHH_glrlm_GrayLevelVariance, wavelet_HLH_glcm_DifferenceVariance, wavelet_HHL_glrlm_RunEntropy), were associated to glrlm and glcm texture matrices (second second order features), and 2 (wavelet_LLH_firstorder_Skewness, Ibp_2D_firstorder_10Percentile) were related to the image intensity distribution (first order features). All, except one, second order features had a negative weight, meaning that COVID-19 patients (labelled as 1 in the classification algorithm) tended to have a more homogeneous texture. Of the two first order features, one had positive weight (the skweness, larger value in COVID-19 patients) and one had a negative weight (the 10th percentile, smaller value in COVID-19 patients) indicating that COVID group, although having a distribution of image intensities with average equal values that nCOVID group, had a larger occurrence of low intensity pixels. Our results demonstrated that a machine learning signature based on radiomics features extracted from GGOs on CT images is an accurate method to early differentiate COVID-19 pneumonia from other acute non-COVID-19 lung diseases. These results confirm the promising role of radiomics in the diagnosis of COVID-19 pneumonia and are in line with the most recent literature on this topic [21] [22] [23] [24] www.nature.com/scientificreports/ Of note, none of the above-mentioned studies was specifically focused on GGOs and other previous studies investigating the differential diagnosis of GGOs were conducted in pre-COVID-19 era 50, 51 . In this regard, overcoming the limited specificity of GGOs on CT images assumes even more relevance in the pandemic scenario since they represent the most common CT findings in the early phase of COVID-19 pneumonia 10 . In fact, the early identification of the GGOs etiology could help to promptly adopt the appropriate management and reduce the burden on the emergency department. For instance, patients admitted to the hospital for suspected COVID-19 pneumonia are temporarily placed in dedicated COVID-19 rule-out units, and they may experience a delay in care or intervention 10 . In this scenario, chest CT is used as a surrogate for the early identification of COVID-19 pneumonia and may help the triage activity by identifying an alternative diagnosis and by improving the patient selection for intensive/non-intensive care in case of clinical worsening. Furthermore, the treatment of GGOs varies according to their etiology. For example, patients with organizing pneumonia are usually treated with corticosteroid therapy with the occasional addition of antibiotics 52 . On the other hand, corticosteroids are recommended only in patients with severe and critical COVID-19 infection 53 . Interestingly, five of the seven features included in the top 5% predictive features of our study were texture related, thus indicating that the lesion hetereogeneity may help the differential diagnosis of GGOs. These results Table 2 . Diagnostic performance of the radiomics-based machine learning signature including area under the curve (AUC), Youden's index, sensitivity, specificity and p-value. In the last two columns on the right, the 5% features with the largest β-weights in magnitude (top 5% features) and their β-weights. a Youden's test. b 5% features with the largest β-weights in magnitude in Partial Least Square analysis. 21 . Moreover, four out of five texure-related features included in our model revealed a higher homogeneity in GGOs of COVID-19 than in those of non-COVID-19 patients. We speculated that the higher homogeneity in COVID-19 pneumonia may reflect the degree of inflammatory infiltrate in the early stage of diffuse alveolar damage. In fact, GGOs are typically observed in the exudative phase of COVID-19 pneumonia, which is characterized by interstitial and alveolar oedema, hemorrhage, and hyaline membrane formation. With the progression of the disease, GGOs increase in density and heterogeneity, thus evolving in a more consolidative pattern or with a "crazy paving" pattern 16 . Although our results are promising, there are some limitations. First, our study included a relatively low number of patients. Nonetheless, our investigation was intended as a proof-of-concept study and, since our aim was focused on GGOs, our inclusion criteria were necessarily strict, considering only patients with predominant GGOs pattern. Moreover, GGOs are typically found in the acute phase of the disease, which may not correspond with the timing of CT and this may have further reduced the study population. Second, compared to the number of patients included in the study, we analyzed a large number of predictive features. In this regard, the PLS exploited the high collinearity of the different radiomics features, thus delivering a high prediction performance. Moreover, the cross-validation modality delivered an evaluation of the out-of-training sample performance using the sample numerosity available that is unbiased. Hence, we expect that by reducing the ratio between features and subject, the model prediction may further increase. Third, this is a retrospectively designed, single-center study. Further prospective and possibly multicentric studies are warranted to define a more standardized approach. A radiomics-based machine learning signature showed the potential to accurately differentiate GGOs due to COVID-19 pneumonia from those due to other acute lung diseases on HRCT scans. Most of the discriminant radiomics features were related to the texture analysis. After a careful prospective evaluation in larger multicentric studies, this approach may assist clinicians to adopt the appropriate management early, while improving the triage of patients. The datasets generated during and/or analyzed during the current study are not publicly available due to the clinical and confidential nature of the material but can be made available from the corresponding author on reasonable request. COVID-19: A review The epidemeology and pathogensis of coronavirus (COVID-19) outbreak COVID-19 diagnosis and management: A comprehensive review Relation between chest CT findings and clinical conditions of coronavirus disease (COVID-19) pneumonia: A multicenter study Radiological approaches to COVID-19 pneumonia COVID-19 pneumonia: A review of typical CT findings and differential diagnosis Fleischner society: Glossary of terms for thoracic imaging Normal lung quantification in usual interstitial pneumonia pattern: The impact of threshold-based volumetric CT analysis for the staging of idiopathic pulmonary fibrosis Radiological findings from 81 patients with COVID-19 pneumonia in Wuhan, China: A descriptive study Review of the chest CT differential diagnosis of ground-glass opacities in the COVID era Chest CT findings in asymptomatic cases with COVID-19: A systematic review and meta-analysis Chest CT in COVID-19: What the radiologist needs to know Systematic review and meta-analysis on the value of chest CT in the diagnosis of coronavirus disease (COVID-19): Sol scientiae, Illustra Nos Chest CT features of COVID-19 in Rome Crazy-paving" pattern at thin-section CT of the lungs: Radiologic-pathologic overview Multimodality imaging of COVID-19 pneumonia: From diagnosis to follow-up. A comprehensive review COVID-19 pneumonia: The great radiological mimicker Using artificial intelligence to detect COVID-19 and community-acquired pneumonia based on pulmonary CT: Evaluation of the diagnostic accuracy Texture analysis of imaging: What radiologists need to know Machine learning is the key to diagnose COVID-19: A proof-of-concept study A CT radiomics analysis of COVID-19-related ground-glass opacities and consolidation: Is it valuable in a differential diagnosis with other atypical pneumonias CT-based radiomics combined with signs: A valuable tool to help radiologist discriminate COVID-19 and influenza pneumonia Multi-classifier-based identification of COVID-19 from chest computed tomography using generalizable and interpretable radiomics features A deep learning integrated radiomics model for identification of coronavirus disease 2019 using computed tomography Radiomics with artificial intelligence: A practical guide for beginners Federated deep learning for detecting COVID-19 lung abnormalities in CT: A privacy-preserving multinational validation study Development and evaluation of an artificial intelligence system for COVID-19 diagnosis The study of automatic machine learning base on radiomics of non-focus area in the first chest CT of different clinical types of COVID-19 pneumonia Radiomics-based model for accurately distinguishing between severe acute respiratory syndrome associated coronavirus 2 (SARS-CoV-2) and influenza A infected pneumonia Ground-glass opacity at CT: The ABCs Fleischner Society: Glossary of terms for thoracic imaging Isolated diffuse ground-glass opacity in thoracic CT: Causes and clinical presentations Widespread ground-glass opacity of the lung in consecutive patients undergoing CT: Does lobular distribution assist diagnosis? Quantitative computed tomographic indexes in diffuse interstitial lung disease: Correlation with physiologic tests and computed tomography visual scores Automatic detection and quantification of ground-glass opacities on high-resolution CT using multiple neural networks: Comparison with a density mask Capsules for object segmentation AFNI: Software for analysis and visualization of functional magnetic resonance neuroimages Computational radiomics system to decode the radiographic phenotype Minimizing acquisition-related radiomics variability by image resampling and batch effect correction to allow for large-scale data analysis New perspectives in partial least squares and related methods Repeatability and reproducibility of radiomic features: A systematic review MRI-based clinical-radiomics model predicts tumor response before treatment in locally advanced rectal cancer The collinearity problem in linear regression. The partial least squares (PLS) approach to generalized inverses Fast optical signals in the sensorimotor cortex: General Linear Convolution Model applied to multiple source-detector distance-based data Electroencephalography-derived prognosis of functional recovery in acute stroke through machine learning approaches Algorithmic stability and sanity-check bounds for leave-one-out cross-validation Overfitting in linear feature extraction for classification of high-dimensional image data Radiomics performs comparable to morphologic assessment by expert radiologists for prediction of response to neoadjuvant chemoradiotherapy on baseline staging MRI in rectal cancer Radiomic analysis of pulmonary ground-glass opacity nodules for distinction of preinvasive lesions, invasive pulmonary adenocarcinoma and minimally invasive adenocarcinoma based on quantitative texture analysis of CT Use of a radiomics model to predict tumor invasiveness of pulmonary adenocarcinomas appearing as pulmonary ground-glass nodules Interstitial lung disease guideline: The British Thoracic Society in collaboration with the Thoracic Society of Australia and New Zealand and the Irish Thoracic Society The Lancet Infectious, D. Curing COVID-19 Texture feature-based machine learning classifier could assist in the diagnosis of COVID-19 Identification of common and severe COVID-19: The value of CT texture analysis and correlation with clinical characteristics Correspondence and requests for materials should be addressed to P.C.Reprints and permissions information is available at www.nature.com/reprints. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.