key: cord-0682656-drslgzdw authors: Aiello, Marco; Baldi, Dario; Esposito, Giuseppina; Valentino, Marika; Randon, Marco; Salvatore, Marco; Cavaliere, Carlo title: Evaluation of AI-Based Segmentation Tools for COVID-19 Lung Lesions on Conventional and Ultra-low Dose CT Scans date: 2022-04-06 journal: Dose Response DOI: 10.1177/15593258221082896 sha: a69b3b9fdb995eb860d217b348c532f3e4f85f2d doc_id: 682656 cord_uid: drslgzdw A reliable diagnosis and accurate monitoring are pivotal steps for treatment and prevention of COVID-19. Chest computed tomography (CT) has been considered a crucial diagnostic imaging technique for the injury assessment of the viral pneumonia. Furthermore, the automatization of the segmentation methods for lung alterations helps to speed up the diagnosis and lighten radiologists’ workload. Considering the assiduous pathology monitoring, ultra-low dose (ULD) chest CT protocols have been implemented to drastically reduce the radiation burden. Unfortunately, the available AI technologies have not been trained on ULD-CT data and validated and their applicability deserves careful evaluation. Therefore, this work aims to compare the results of available AI tools (BCUnet, CORADS AI, NVIDIA CLARA Train SDK and CT Pneumonia Analysis) on a dataset of 73 CT examinations acquired both with conventional dose (CD) and ULD protocols. COVID-19 volume percentage, resulting from each tool, was statistically compared. This study demonstrated high comparability of the results on CD-CT and ULD-CT data among the four AI tools, with high correlation between the results obtained on both protocols (R > .68, P < .001, for all AI tools). COVID-19 is an infectious disease caused by a newly discovered coronavirus (SARS-CoV-2) and it is an ongoing pandemic. It has spread since January 2020 and up to early March 2021 has infected 130, 509,866 people worldwide, with 45,919,941 confirmed cases only in Europe. 1 The clinical presentation of SARS-CoV-2 infection can range from asymptomatic to moderate or severe symptoms-such as fever, cough, weakness, loss of smell, and taste-respiratory disease and, less frequently, gastro-enteritis or neurological symptoms. The gold standard for COVID-19 detection is the Reverse-Transcription-Polymerase Chain Reaction (RT-PCR) test. Different studies reported that RT-PCR technique has a high false negative rate and low sensitivity, 2 due to its variability in lab execution. In the cases of patients with suspected coronavirus infection, with negative RT-PCR test or waiting for swab responses, chest CT was used to speed up diagnosis times and to isolate the patient from the rest of the community. CT diagnostic technique, as a non-invasive imaging approach, is able to detect some characteristic features in the lung lesions COVID-19 associated, for example, ground glass opacities, rounded opacities, and enlarged intra-infiltrate vessels. 3, 4 The chest X-ray has often proved pathology (59% of cases) in asymptomatic or paucisymptomatic subjects after 14 days of quarantine, even in the absence of a nasopharyngeal swab. 5 In summer 2020, multidisciplinary panels of experts in patient management published guidelines for support the use of chest CT for COVID-19 patients with worsening respiratory status, 6 as well as the WHO. 7 Over the past decades, CT has become an important imaging technique, but it is also a major contributor to individual and collective radiation dose. For patient protection, CT dose is recommended to be as low as reasonably achievable to meet clinical needs. 8 Technical progress including automated exposure control helps to optimize the relationship between image noise and radiation dose. Correspondingly, automated exposure control with tube current modulation has been developed for CT. 9,10 Moreover, adult scanning can be adjusted according to body weight (the smaller the bodies, the lesser the dose; instead, the larger the bodies, the higher the dose), resulting in excellent diagnostic scans. 11 All these last assertions have been confirmed, advocating the proper implementation, especially for chest scanning. [12] [13] [14] [15] [16] [17] The optimal image quality level for CT examinations stands for the level at which diagnostic images can reliably be produced using the lowest dose level and it should be tailored according to each individual patient and relevant cathegorized groups (e.g., pediatric, or obese patients). Thus, to reduce radiological risks and dose, ULD-CT protocols have been evaluated in recent decades. 18 To date, according to the authors' knowledge, very few studies have been published regarding dosimetry in ULD-CT protocols, and nothing that directly compares ULD-CT to both CD-CT and radiography. Schaal et al. estimated an average effective dose of .25 mSv based on exposure data from 55 patients, but lack of information about the size of the patients adds uncertainty to this estimate. 19 In emergent situations such as COVID-19 pandemic, the demand for performing CT scans may significantly increase due to the high rate of infected individuals. The sequential and multiple acquisitions of CT exams can significantly increase the cumulative radiation dose these patients may receive during their hospitalization and recovery. Replacing CD-CT with ULD-CT has been proposed as a method to decrease radiation exposure. In a retrospective study, 20 a low dose CT (LD-CT) combined with iterative reconstruction demonstrated sensitivity, specificity, positive predictive value, negative predictive value, and accuracy of about 90% in the diagnosis of COVID-19. Furthermore, ULD-CT protocols proved an additive diagnostic benefit in patients with concomitant bacterial pneumonia or an alternative diagnosis other than COVID-19. Recently, artificial intelligence (AI) techniques-in particular deep learning (DL) algorithms-have shown great potential in the medical imaging sector thanks to their high ability to extract both morphological and functional features. [21] [22] [23] In particular, such techniques have been applied to detect and differentiate bacterial and viral pneumonia in pediatric chest radiographs. 24, 25 Besides, a retrospective and multicenter study regarding the application of DL technique to differentiate coronavirus pneumonia from other lung diseases was recently published. 26 Harmon et al. show that DL algorithms, trained in a diverse multinational cohort, can achieve up to 90.8% accuracy, with 84% sensitivity and 93% specificity. 27 Wang et al. uncovered some DL and radiomic features that contribute to differentiation of COVID-19 from non-COVID-19 viral pneumonia. 28 As a consequence of DL techniques development to support the diagnosis of COVID-19 on CT scans, several tools based on AI models trained on image datasets acquired with CD-CT protocol have been made available to the clinical and research community. The aim of this work is to evaluate the applicability of these tools also in the domain of low-dose CT images (ULD). Patients 73 patients (41 female, age range: 52.91 ±16.40 (mean age ± standard deviation)) were informed and prospectively enrolled for the research protocol approved by the local ethical committee of IRCCS SDN (Comitato etico IRCCS Pascale, Naples, project identification code: 7_20). The protocol was adopted to face the pandemic situation. Inclusion criteria were the prescription of a chest CT for post-COVID-19 screening, suspect COVID-19, and generally pulmonary pathology. Anamnestic and clinical information were collected. Adult patients, with pulmonary disease or post-COVID-19 infection symptoms, who underwent baseline, unenhanced chest CT (according to clinical needs) on the third-generation dual source CT. Exclusion criteria were patients with a BMI greater than 30 and children and adolescents under 18 years. Patients lay in a head-first supine position during nonenhanced consecutive acquisition CT protocols (CD-CT followed by ULD-CT). All patients underwent LD chest CT by using a SOMATOM FORCE (Siemens Healthineers, Forchheim, Germany) slice .6 mm detector scanner. No contrast agent was admitted. The CD-CT acquisition parameters were as follows: tube voltage of reference 120 kVp, tube current of reference 61 mAs, and pitch factor of 1.2. The automatic tube current modulation (CareDose 4 D) system was used and the automatic tube voltage selection (Care kV) was activated on the "non-enhanced" setting. For ULD-CT acquisition, both the tubes worked at 100 kVp with .6 mm tin filter (100Sn kVp, for spectral shaping), with wide collimation (2 × 192 × .6 mm), a rotation time of .25 s, an ultra-long pitch (pitch = 3, Turbo Flash mode, Siemens Healthineers), with modulated mA at 70 mAs reference. Effective radiation dose was calculated by multiplying the dose-length product by .014 mSv/mGy x cm, as the constant k-value for thoracic imaging. For both protocols, raw data were reconstructed at 3 mm slice thickness by convolutional nucleus B157, Br54 (Br40, moderately smooth). To reduce the image noise, Advanced Modeled Iterative Reconstruction (ADMIRE) strength of 3 was used. All reconstructions were performed with a matrix size of 512 × 512 pixels. Figure 1 shows CD-CT and ULD-CT images of the same patient, where different noise levels can visually appreciate. For each subject, COVID-19 pneumonia lesion segmentation has been executed on imaging data acquired with both CD-CT and ULD-CT protocols. Four AI tools based on deep neural networks have been used: BCUnet, CORADS AI, CT Pneumonia Analysis, and NVIDIA CLARA Train SDK. All these tools have been validated only for research purposes and not for clinical use. The architecture and the design of each AI system are described in the following paragraphs. BCUnet is a deep neural network for the segmentation of pulmonary lesions in thoracic non-contrast CT images. The algorithm is based on a U-net architecture. 29 The model has been trained with a supervised DL technique. Each image included in the dataset is associated with a manual annotation of the lesion. The training dataset consists of 199 annotated CT images of COVID-19 patients, provided by the COVID-19 Lung CT Lesion Segmentation Grand Challenge-2020. 30 In particular, the innovative contribution that characterizes BCUnet is the introduction of an image masking technique and a data augmentation technique, which provides additional images for training the network by adding Gaussian noise and images deformed with an atlas-based method. The model was implemented in Python 3.6, using Tensorflow 2.3 and Keras as frameworks. The MobileNetV2 network weights have been made available by Tensorflow. The code for the CLAHE algorithm is contained in the scikit-image library. COVID-19 Reporting and Data AI System (CORADS AI) is a standardized chest CT scoring system that, after segmentation, automatically assigns scores from 1 to 5 that increase with the level of suspicion of COVID-19. 31 This algorithm was developed by the Diagnostic Image Analysis Group, Amsterdam University Medical Center, Fraunhofer MEVIS, and Thirona. For any detail, see https://grand-challenge.org/algorithms/coradsai/. In the following analysis, the used annotation is CORADS. Clara Training Framework is a package-level application of NVIDIA Clara Train Software Development Kit (SDK), a python-based SDK that allows developers to seek faster implementation of medical-specific DL solutions by leveraging optimized, ready-to-use and pre-trained built in-house NVIDIA models. These pre-trained models are packaged as medical model repositories (MMARs) and contain the scripts needed for model development activities. The collection includes pretrained models for CT lung segmentation, COVID-19 classification in chest studies, and a CT annotated lung model. In this work two CLARA models have been used:(i) lung segmentation (https://ngc.nvidia.com/catalog/models/nvidia:med: clara_train_covid19_ct_lung_seg) and (ii) COVID-19 CT lesion segmentation (https://ngc.nvidia.com/catalog/models/ nvidia:med:clara_pt_covid19_ct_lesion_segmentation). In the following analysis, the used annotation is CLARA. CT Pneumonia Analysis, syngo.via Frontier prototype, is an interactive platform for pneumonia analysis developed by Siemens Healthineers, 32 https://www.siemens-healthineers. com/medical-imaging/digital-transformation-of-radiology/aicovid-19-algorithm) to automatically identify and quantify hyperdense lung regions on lung CT scans, facilitating their analysis and evaluation. CT Pneumonia Analysis performs the automated analysis of pulmonary opacities on axial CT data (section thicknesses up to 5 mm), with the possibility of introducing, through the manual segmentation tool, any additions or changes to the automatically generated masks. In the following analysis the used annotation is Pneumonia. For each patient, COVID-19 volume percentage (CV%) has been calculated considering the COVID-19 lung lesion volume on the total lung volume, as obtained by each AI segmentation tool. The CV% has been estimated on both CD-CT and ULD-CT considering two different convolutional kernels for image reconstruction, that is, Bl57 and Br40, maintaining a 3 mm CT slice thickness and a denoise level of three. Under these conditions, first, a Shapiro test has been implemented to assert the normality data distribution. 33 Moreover, according to Shapiro results, a Kruskal-Wallis (KW) test 34 has been performed to assess the difference between the AI-based segmentation tools on CD-CT and ULD-CT, with a subsequent Dunn's post hoc test 35 to determine which tools differ. To graphically understand data trend and distribution relating to single protocols and single AI segmentation algorithms, boxplots have been realized, denoting the median, the 25th to 75th percentiles (boxes) and the minimum to the maximum outliers. A quantitative analysis has been evaluated to prove the correlation between CD and ULD-CT protocols in CV% parameter, applying the Spearman's correlation coefficient test, 36 figuring the relative heatmap. To strengthen the comparison, the pairwise Bland-Altman (BA) test 37 has been carried out to describe the agreement between the two quantitative protocol measurements, that is, CD-CT CV% and ULD-CT CV% values, with respect to AI tools. To quantify the aforementioned agreement, some constraints have been needed: the mean and the standard deviation (STD) of the differences between the two measurements (mean À1.96 std/ mean +1.96 std). A BA graphical approach has been used. For the sake of in-depth analysis, all the statistics have been repeated on data generated by BCUnet and CORADS tools implementing the lung lesion segmentation lobe by lobe. The lung lobe segmentation has been included because of the importance of knowing the location and distribution of COVID-19 lung disease, which can help in determining the most suitable treatment. 38 Under the same previous conditions, CV% has been calculated for each lung lobe, naming differently as LLL %, the left lower lobe COVID-19 volume percentage; LUL%, the left upper lobe COVID-19 volume percentage; RLL%, the right lower lobe COVID-19 volume percentage; RML%, the right medium lobe COVID-19 volume percentage; and RUL%, the right upper lobe COVID-19 volume percentage. These percentages were obtained for both protocols (CD-CT and ULD-CT). All the statistical analyses have been performed using Python libraries, in particular PyCompare module, SciPy library, Pingouin, and scikit_posthocs packages. The results have been considered statistically significant for P-value less than .05. The Shapiro test has been applied to check the normal tendency of data distribution, including all kernel reconstructions. It has been evaluated per each AI-based segmentation tool with respect to both protocols. Table 1 shows Shapiro test results in terms of W statistic and P-value. As it can be noticed in Table 1 , CORADS and Pneumonia data (both CD-CT and ULD-CT) and CLARA ULD-CT data do not follow a Gaussian distribution (P-value < .05). Since most AI segmentation algorithms have generated non-Gaussian data, all data have been considered non-parametric. The result of the KW test rejects the null hypothesis with a H statistic of 182.19 and P-value less than .001, meaning that the used algorithms differ. Regarding the Dunn's post hoc test, Table 2 and Table 3 show the obtained results relative to CD-CT and ULD-CT data, respectively. According to P-values in Table 2 , BCUnet differs from the other tools (P-value < .01 rejects the null hypothesis of no differences) meaning that CORADS, CLARA and Pneumonia tools generate similar CD-CT CV% values. Instead, in Table 3 it is highlighted that BCUnet and Pneumonia tools differ on ULD-CT CV% values estimation. Therefore, although the segmentation results have been generated by different AI algorithms, it has been demonstrated that these algorithms give comparable results, in terms of CV% values, regardless of the CT dose protocols. In fact, for a qualitative demonstration, Figure 2 shows that the resulting CV% values distributions of each AI tool are comparable for both dose protocols. The non-Gaussian nature of the data can be also observed, according to Shapiro results. Moreover, the overall trend of the boxplots matches the results of Dunn's test. Regarding the correlation analysis, Spearman correlation coefficients (R) have been carried out showing that all correlation coefficients are statistically significant. Figure 3 shows the heatmap of R values. The red-squared correlation coefficients refer to CD and ULD-CT CV% values obtained by the same AI-based segmentation tool, highlighting a strong correlation (R > .6), meaning that ULD-CT protocol can be used for a CV% estimation comparable to CD-CT protocol, using all AI algorithms. To strengthen this result, the pairwise (CD-CT and ULD-CT protocols) BA test has been applied on each AI-based segmentation tool. Figure 4 shows BA results considering CD and ULD-CT CV% values, averaged over all patients. The bias has been computed as the value determined by one CD-CT CV% minus the value determined by the corresponding ULD-CT. The green dots represent the paired difference between CD and ULD CV% values; the bold line is the mean of all differences. As the plots in Figure 4 illustrate, the mean is around zero per each tool, confirming that the bias (distance between zero line and mean line) is negligible. Moreover, except for some outliers, most green dots do not exceed the maximum allowed difference limits, according to which the differences within mean ± 1.96 SD are considered not relevant, demonstrating that the segmentation algorithms can work similarly if they use CD-CT data and ULD-CT ones. After reaching the previous results from CV% data calculated with AI segmentation algorithms on total lungs, these assays have been verified on data obtained with lung lobe lesion segmentation algorithms, using BCUnet and CORADS tools. Data under statistics were the lung lobe COVID-19 lesion volume percentage for each patient and each protocol, as explained in "Statistical analysis" section. Hence, data have been proved non-parametric according to the Shapiro test (P < .01), and also using lobe segmentation, BCUnet and COR-ADS tools result different according to the KW test. Since we implemented only two AI tools for lung lobe segmentation, Dunn's test has not been applied. Considering correlation analysis, LLL%, LUL%, RLL%, RLM%, and RUL% data show significant R values, between CD-CT and ULD-CT protocol, greater than .66 for BCUnet tool (red squares in Figure 5 ) and greater than .51 for CORADS tool (green squares in Figure 5 ). In Figure 6 , BA plots on LLL%, LUL%, RLL%, RLM%, and RUL% of BCUnet tool are presented. The paired measurements are contained in the allowed limits. OnlyRLM % BA plot shows more scattered data, always staying in the confidence interval limits. In Figure 7 , BA plots on LLL%, LUL%, RLL%, RLM%, and RUL% of CORADS tool are shown. As in the previous plots, a low bias is obtained and paired measurements are contained in the allowed limits, as explained in the captions of Figure 6 and Figure 7 . These last . BA plots generated using paired values, examined on LLL%, LUL%, RLL%, RLM%, and RUL% of BCUnet tool. The black line is the mean of differences between two measurements; the dashed lines represent the test limits (mean À1.96 std/mean +1.96 std) and the dots are the paired differences of the two measurements (both dose protocols) for each patient. results confirm that AI-based segmentation tools took out comparable outcomes when obtained on both CD-CT and ULD-CT data. AI-based tools supporting radiological evaluation can offer significant benefit in clinical practice. COVID-19 pandemic has led to the development and refinement of AI algorithms for disease diagnosis and monitoring, in particular for the automation of lesion segmentations on CT images. However, the reliability of AI-based techniques requires careful evaluation of the reproducibility with respect to the use of different acquisition techniques. Furthermore, lack of knowledge of COVID-19 disease felt the need to increase monitoring for fast diagnosis and reliable prevention, leading patients to greater radiological exposure. Thus, low dose protocols for CT scans have been implemented. 20 In this work, the agreement of AI-based COVID-19 lung lesions segmentation methods between chest CT images acquired following both CD-CT and ULD-CT protocols has been evaluated for the first time. Four different DL-based segmentation tools, trained on CD-CT protocol images, have been considered. The CV% parameter has been calculated on both dose protocols data, allowing such statistical inferences. First of all, the data showed on average a behavior far from a Gaussian distribution, thus implementing non-parametric tests (e.g., KW test). The AI tools have been pairwise compared to estimate their differences when applied on both dose protocols. As shown by the Spearman correlation coefficients (R > .6 and P-value < .05), a good agreement between CD and ULD-CT CV% values has resulted, confirmed by the BA test ( Figure 4 ). However, although the lack of diagnostic support tools trained on specific ULD-CT data, it does not represent a critical limitation, since suitability of AI tools designed on conventional datasets have been proved. Besides, the reliability of ULD protocols has been already demonstrated on comparable diagnostic performance evaluation for viral pneumonia detection on CT images, 39-41 but these results can consolidate the role of ULD-CT in clinical routine with the undeniable advantage of high dose reduction rate. This study should be considered as a preliminary investigation, since it requires further development for a comprehensive evaluation. Indeed, the application of retrospective noise reduction algorithms 42 should be evaluated to improve the results of conventional algorithms on ULD-CT data. In addition, the inclusion of ULD-CT data for AI model training should be assessed. Noteworthy, the patients' body mass index (BMI) may have a considerable effect on the signal to noise ratio of the resulting ULD-CT images, 43, 44 and consequently on the analyzed algorithms' results. The influence of BMI on the results agreement relative to both ULD-CT and CD-CT protocols deserves to be further investigated. Among the examinations considered, a limitation of the present study is an unrevealed quantifiable lesion load. In order to mitigate this limitation, two different segmentation Figure 7 . BA plots generated using paired values, examined on LLL%, LUL%, RLL%, RLM%, and RUL% of CORADS tool. The black line is the mean of differences between two measurements; the dashed lines represent the test limits (mean À1.96 std/mean +1.96 std) and the dots are the paired differences of the two measurements (both dose protocols) for each patient. tasks have been considered, that is, lung and lobes segmentation. Although it is a very specific work, our approach can also be applied to other anatomical sectors, analyzing different classification tasks. [45] [46] [47] [48] [49] [50] In conclusion, this work investigates the applicability of automatic segmentation techniques, trained on CD-CT protocol, to the images acquired in ULD mode. This preliminary study confirms the suitability of available AI-based segmentation tools on ULD-CT data, for supporting diagnosis and quantification of COVID-19 lung lesions. WHO dashboard Real-time RT-PCR in COVID-19 detection: issues affecting the results CT imaging features of 2019 novel coronavirus (2019-nCoV) Timely diagnosis and treatment shortens the time to resolution of coronavirus disease (COVID-19) pneumonia and lowers the highest and last CT scores from sequential chest CT Chest radiograph findings in asymptomatic and minimally symptomatic quarantined patients in codogno, Italy during COVID-19 pandemic The role of chest imaging in patient management during the COVID-19 pandemic: a multinational consensus statement from the fleischner society Use of chest imaging in COVID19 Estimated risks of radiation-induced fatal cancer from pediatric CT Dose reduction in computed tomography by attenuation-based on-line modulation of tube current: evaluation of six anatomical regions Dose reduction opportunities for CT scanners Radiation dose management Pulmonary nodules: experimental and clinical studies at low-dose CT Image quality and dose in computed tomography CT of the chest: minimal tube current required for good image quality with the least radiation dose Low-dose CT of the lungs: preliminary observations Pulmonary nodule detection: low-dose versus conventional CT Radiation effective doses to patients undergoing abdominal CT examinations Whole body low dose computed tomography using thirdgeneration dual-source multidetector with spectral shaping: protocol optimization and literature review. Dose-Response Diagnostic performance of ultra-low-dose computed tomography for detecting asbestos-related pleuropulmonary diseases: prospective study in a screening setting Accuracy and reproducibility of low-dose submillisievert chest CT for the diagnosis of COVID-19 Diagnostic accuracy of deep learning in medical imaging: a systematic review and meta-analysis An overview of deep learning in medical imaging focusing on MRI Six artificial intelligence paradigms for tissue characterisation and classification of non-COVID-19 pneumonia against COVID-19 pneumonia in computed tomography lungs Identifying medical diagnoses and treatable diseases by image-based deep learning Visualization and interpretation of convolutional neural network predictions in detecting pneumonia in pediatric chest radiographs Using artificial intelligence to detect COVID-19 and community-acquired pneumonia based on pulmonary CT: evaluation of the diagnostic accuracy Artificial intelligence for the detection of COVID-19 pneumonia on chest CT using multinational datasets Machine learning-based analysis of MR radiomics can help to improve the diagnostic performance of PI-RADS v2 in clinically relevant prostate cancer U-net: convolutional networks for biomedical image segmentation. Presented at the MICCAI Rapid artificial intelligence solutions in a pandemic -the COVID-19-20 lung CT lesion segmentation challenge Automated assessment of CO-RADS and chest CT severity scores in patients with suspected COVID-19 using artificial intelligence Automated quantification of CT patterns associated with COVID-19 from chest CT An analysis of variance test for normality (complete samples) Use of ranks in one-criterion variance analysis Multiple comparisons using rank sums CRC Standard Probability and Statistics Tables and Formulae Measurement in medicine: the analysis of method comparison studies Automatic pulmonary lobe segmentation using deep learning Diagnostic yield, safety, and advantages of ultra-low dose chest CT compared to chest radiography in early stage suspected SARS-CoV-2 pneumonia Ultra-low-dose chest CT performance for the detection of viral pneumonia patterns during the COVID-19 outbreak period: a monocentric experience Evaluation of Ultra Low Dose Chest CT Imaging for Covid 19 Diagnosis and Follow up CT noise-reduction methods for lower-dose scanning: strengths and weaknesses of iterative reconstruction algorithms and new techniques Low radiation dose implications in obese abdominal computed tomography imaging The impact of obesity on abdominal Ct radiation dose and image quality Deep learning algorithm for automated segmentation and volume measurement of the liver and spleen using portal venous phase computed tomography images Artificial intelligenceaided CT segmentation for body composition analysis: a validation study Pixel-level deep segmentation: artificial intelligence quantifies muscle on computed tomography for body morphometric analysis Deep learning for segmentation of 49 selected bones in CT scans: first step in automated PET/CT-based 3D quantification of skeletal metastases Body composition analysis of computed tomography scans in clinical populations: the role of deep learning CT-ORG, a new dataset for multiple organ segmentation in computed tomography The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. The authors disclosed receipt of the following financial support for the research, authorship, and publication of this article: This work was partially funded by the Italian Ministry of Health for the "Ricerca Corrente" project and by POR CAMPANIA FESR 2014 -2020 for the "Protocolli TC del torace a bassissima dose e tecniche di intelligenza artificiale per la diagnosi precoce e quantificazione della malattia da COVID-19" project. Marco Aiello  https://orcid.org/0000-0002-3676-0664 Dario Baldi  https://orcid.org/0000-0001-7464-4499 Carlo Cavaliere  https://orcid.org/0000-0002-3297-2213