key: cord-0887604-sz5bf2dl authors: Ketchanji Mougang, Yolande; Di Zazzo, Lorena; Minieri, Marilena; Capuano, Rosamaria; Catini, Alexandro; Legramente, Jacopo Maria; Paolesse, Roberto; Bernardini, Sergio; Di Natale, Corrado title: Sensor Array and Gas Chromatographic Detection of the Blood Serum Volatolomic Signature of COVID-19 date: 2021-07-10 journal: iScience DOI: 10.1016/j.isci.2021.102851 sha: 5e28b6318743663a6b043858cde41bd3290083ed doc_id: 887604 cord_uid: sz5bf2dl Volatolomics is gaining consideration as a viable approach to diagnose several diseases, and it also shows promising results to discriminate COVID-19 patients via breath analysis. This paper extends the study of the relationship between volatile compounds (VOCs) and COVID-19 to blood serum. Blood samples were collected from subjects recruited at the emergency department of a large public hospital. The volatile compounds (VOCs) were analyzed with a Gas Chromatography Mass Spectrometer (GC/MS). GC/MS data shows that in more than 100 different VOCs, the pattern of abundances of 17 compounds identifies COVID-19 from non-COVID with an accuracy of 89% (sensitivity 94% and specificity 83%). GC/MS analysis was complemented by an array of gas sensors whose data achieved an accuracy of 89% (sensitivity 94% and specificity 80%). Since late 2019, COVID-19 spread rapidly worldwide until reaching the status of a pandemic disease (Wu et al., 2020; Xu et al., 2020) . The diagnosis of COVID-19 is based on the direct detection in patients' samples of its etiological agent known as SARS-COV-2. Reverse transcription real-time polymerase chain reaction assays performed on nasal-pharyngeal swabs are, to date, the gold standard J o u r n a l P r e -p r o o f for diagnosis (Udugama et al., 2020) . These tests are carried out on a massive scale daily worldwide. In the European Union, from March 2020 to February 2021 about 360 million tests were performed (www.ecdc.europa.eu/en/publications-data/covid-19-testing) . Molecular tests require specialized staff, they are costly and time consuming. This last feature is in conflict with the needs in emergency rooms to promptly isolate COVID-19 patients. Alternative rapid tests based on the detection of viral proteins (antigens) have been quickly developed by several companies in order to offer a faster and cheaper detection of the SARS-COV-2 infection. These tests are currently in use in different countries (https://ec.europa.eu/health/sites/health/files/preparedness_response/docs/covid-19_rat_common-list_en.pdf). Nevertheless, antigens rapid tests are usually less sensitive than molecular tests being reliable only at high viral loads (Ciotti et al., 2021) . Moreover the performance variability of different tests produced by different companies is very high. A strategy to further reduce the complexity of tests and to improve their massive diffusion is the incorporation of SARS-COV-2 molecular receptors in engineered biosensors (Mattioli et al., 2020) . Alternative approaches could be oriented rather than to the direct detection of SARS-COV-2 to the detection of the evidence produced by COVID-19 (Lin et al., 2020) . This is the approach of metabolomics which considers the systematic study of the chemical fingerprints, the metabolome, that specific processes leave in the organism (Wishart, 2016) . analysis of breath. As usually happens in volatolomics, these studies evidenced that it is not a single compound that can be univocally related to COVID infection but rather a pattern of volatile compounds. The feasibility of sensors approach was investigated with an array of nanostructured sensors that was used to analyze the breath of subjects in two chinese hospitals (Shan et al., 2020) . The results of this study show that sensors can discriminate COVID-19 respect to healthy subjects and to subjects affected by other lung pathologies. Studies on breath analysis carried out with different technologies indicate that volatolomics offer an alternative route for COVID-19 detection. On the other hand, breath is also the major vehicle of propagation of SARS-COV-2, so breath analysis raises safety issues for medical operators and for the analytical equipment. This concern creates interest in the diagnostic properties of other human samples (e.g. urine and serum) that are accessible for volatolomic studies. Serum is particularly interesting. First of all because proteomic, metabolomics and lipidomic profiles of serum have been found to be strongly affected by COVID-19 (Bruzzone et al., 2020; Shen et al., 2020; Shi et al., 2021) . Additionally, serum samples are always available since blood analysis is part of routine admission to hospitals and emergency departments. Blood serum volatolome is not frequently studied. In most metabolomics studies, the target metabolites are not volatile and they need to be derivatized in order to become volatile and to be detected with gas analyzers (Dunn et al., 2011) . The analysis of volatile compounds released by serum without the addition of derivatizating agents has been rarely considered. For instance, it has been investigated to search for biomarkers of acute myocardial infarction (Ali et al., 2016) . The above mentioned reasons prompted us to investigate the relationship between the COVID-19 and the volatolome of blood serum. The study was carried out with a gas chromatography mass spectrometer (GC/MS) and a gas sensor array. In this paper, the volatile compounds directly released by the serum of 97 patients in an emergency department have been analyzed. The classification of GC/MS and sensor array data show that COVID-19 can be identified with accuracies of about 90%. 97 samples of serum, one per patient, were analyzed by GC/MS and gas sensor array as described in the transparent methods section. Table 1 shows the demographic data and major comorbidities. The measurement setup was adapted from previous literature studies about volatolome analysis of serum (Ali et al., 2016) . Figure 1 shows the observational design and the data analysis flow followed for both GC/MS and gas sensor array data. discriminate COVID-19 patients from the others. Chromatograms are shown in Figure S1 in the Supplemental Information. Peaks abundance extend for a range of about 3 orders of magnitude, then neither the presence of all peaks nor the differences between COVID-19 and controls are easily appreciable in figure S1. To better evaluate the discrimination capability of GC/MS data, the Kruskal-Wallis rank test was applied to the abundance of each peak detected by GC/MS. Figure 2 shows the probability of null-hypothesis (p-value) of all detected VOCs. 17 VOCs, listed in Table 2 , show a p-value less than 0.05. These VOCs have been putatively identified by comparing the mass fragmentation spectra with those in libraries. They are aromatics, methylated alkanes, alkenes, and alcohols which are common elements of human volatolome (Amann et al., 2014) and whose concentration may be altered by cancer and other diseases (Einoch Amor et al., 2019) . The presence of ketones among the selected variables is corroborated by a preliminary study that found an excess of ketones in urines of COVID-19 patients . Ketones in general are supposed to be linked to protein metabolism and to an increased rate of fatty acid oxidation while a possible source of alcohols is the metabolism of hydrocarbons (Haick et al., 2014) . Some of these compounds have been found in human breath. In particular, toluene, heptane, 2,2,4,6,6pentamethyl, benzene, 1,3,5-triethyl, octane, 3,6-dimethyl- (Phillips et al., 2013) (Phillips et al., 1999) and 4-heptanone (Mochalski et al., 2013) . Ethanol 2-butoxy-, besides to be a component of breath, is also correlated with gastric cancer (Xu et al., 2013) . The correspondence of compounds in serum and breath supports the hypothesis of VOCs transfer from blood to breath at the air/blood interface in the lung (Haick et al., 2014) . Furthermore, it suggests that these compounds could also act as COVID-19 biomarkers in breath. The Kruskal-Wallis rank test and correlation tests were performed on GC/MS peaks to exclude the influence of sex, age and hypertension. Variance analysis was performed on qualitative variables (sex and hypertension), and Spearmann correlation coefficients were calculated respect to age (results are shown in figures S2, S3, and S4 in the Supplemental Information). The p-value of Kruskal-Wallis rank test was larger than 0.7 for both sex and hypertension hypotheses and the Spearmann correlation coefficient with age was less than ±0.4. The results of rank-tests and Spearmann correlation analysis for the 17 VOCs are listed in table 2. The results of these analyses suggests that GC/MS data are not affected by sex, age and hypertension condition. Other conditions are numerically marginal and they do not allow for a reliable statistical analysis. The abundance profiles of the set of compounds in table 2 have been further investigated with multivariate analysis. It is important to note that the reduction of VOCs from 184 to 17 improves the reliability of multivariate data analysis (Hendriks et al., 2011) . Principal Component Analysis (PCA) offers a simple method to visualize the relationship between samples and to infer how each single variable, the peaks in case of GC/MS data, contributes to the multivariate data (Jolliffee, 2002) . The abundance of the seventeen GC/MS peaks were standardized (zero mean and unitary variance) before calculating the analysis. PCA results are shown in Figure 3 . The scores plot of the first two principal components ( Figure 3A ) explains less than 48% of the total variance of the data. In figure 3A the group of non-COVID subjects has been labelled according to the pathology: non-inflammatory pathologies, bacterial pneumonia and other inflammatory pathologies. No evident separation between these three groups is observed. In particular, data of bacterial pneumonia do not overlap with those of COVID-19. Due to the exiguity of the explained total variance, Figure 3A is only indicative of the whole distribution of data in the multivariate space. Nonetheless, the first two principal components show that COVID-19 samples tend to be segregated respect to the non-COVID. The Kruskal Wallis rank test applied to the first principal component returns p<10 -6 . Except the separation between COVID-19 and non-COVID samples, the scores plot do not show separations between sex, age and hypertension condition (see Figure S5 in the Supplemental Information). The contribution of peaks to the principal components is described by the loadings. In figure 3B scores and loadings are simultaneously plotted in order to compare the relationship between peaks and samples. Peaks are labelled according the numbers in table 2. Peaks 2, 3, 6, 7, 9 13, 14, 16, and 17 contribute more to COVID-19 while the others are oriented towards the controls. The compounds oriented with COVID-19 include all ketones and alcohols while aromatics and alkanes point towards the non-COVID samples. A similar analysis was carried out on sensor array data. Two samples were wrongly prepared, then 95 samples were measured with the sensor array. Before each sample, to reduce the influence of background air, sensors were exposed to the air from an empty vial (see Figure S6 in Supplemental Information). The response of sensors was calculated as the difference between the frequency shift to serum sample and empty vial (see Figure S7 in Supplemental Information). The capability of each sensor to capture the differences between the group of COVID-19 respect to non-COVID samples was tested with a Kruskal-Wallis rank test. Figure 4 shows the box-plots of sensors distributions. All sensors show a p-value smaller than 0.01. Most of the sensors show a larger response to non-COVID samples. In principle, the response of sensors should be predicted from the GC/MS data. However, this is practically impossible because the sensitivity of sensors to most of the compounds is unknown, and the abundances measured by GC/MS depends on the affinity of each compounds with the SPME sampler. However, from a qualitative point the response of sensor can be compared with the total amount of volatile compounds in the sample. This quantity can be estimated by the total abundances of all VOCs detected in each sample. In agreement to sensors data, the distribution of the total abundance is slightly larger in non-COVID samples (see figure S8 in Supplemental Information). Variance analysis and correlation tests were performed on sensor's data to verify the lack of sensitivity of data respect to sex, age, and hypertension (results are shown in figures S9, S10, and S11 in Supplemental Information). The p-value of the Kruskal-Wallis rank test was larger than 0.3 for both sex and hypertension hypotheses while the Spearmann correlation coefficient with age was less than ±0.4. These results suggest that sex, age and hypertension do not affect sensors data. To explore the multivariate distribution of sensors data the PCA was calculated. Figure 5 shows the scores plot of the first two principal components of sensor array data. The PCA was calculated on standardized data. The first two principal components account for more than 94% of the total variance of data. Respect to GC/MS data, sensors data are more correlated. This is a typical situation with sensor arrays and it is a consequence of the combinatorial selectivity principle (Hierlemann and Gutierrez-Osuna, 2008) . As expected from the box-plots in Figure 4 , the two groups are distinct but also rather overlapped. The three sub-groups of non-COVID samples have been highlighted in the plot, no dependence between sensors data and the condition of non-COVID samples appears in Figure 4 . Sensors' data confirm that non-COVID pneumonia are different from those of COVID-19. Eventually, the major discriminant among the data is the difference between COVID-19 and non-COVID. PCA is a powerful and simple algorithm to visualize multivariate data. Nevertheless, it is calculated by an unsupervised algorithm aimed at maximizing the variance of data. A more efficient indication of the diagnostic performance of multivariate data to capture the differences between groups is obtained by a supervised classification. For this aim GC/MS sensor array data were analyzed with Partial Least Squares Discriminant Analysis (PLS-DA) (Barker and Rayens, 2003) . PCA shows that both GC/MS and sensors data are non-sensitive respect to the differences between the three sub groups of non-COVID subjects; thus, all non-COVID subjects were joined in a unique group of controls and PLS-DA was calculated in order to separate COVID-19 from non-COVID samples. PLS-DA decomposes the multivariate data in a number of latent variables that, similarly to PCA scores, can be displayed to show the separation between groups. To be reliable, PLS-DA models need to be properly validated. Thus, the datasets have been split in two portions one used to train the model and the other to test. Random splitting of data in two groups can result in favourable conditions that could lead to over-optimistic conclusions. To avoid this drawback, PLS-DA was calculated 100 times, and each time a different partition of data was used to train and to test the model. At each step a PLS-DA model was trained by a cross-validation procedure of training data, and applied to the test data. The area under the curve (AUROC) was calculated as an indicator of the goodness of the model (Szymańska et al., 2012) . Eventually, the model corresponding to the average AUROC was considered as J o u r n a l P r e -p r o o f representative for the classification. The confidence intervals of the areas under the ROC, specificity and selectivity are calculated from the distribution of results obtained in the 100 random partitions in training and test sets. In the case of GC/MS data, the large number of variables renders the application of classifiers unreliable, in particular PLS-DA. To this aim, for each training and test partition, a Kruskal-Wallis rank test was applied to the training data set and the variables returning the null-hypothesis probability (pvalue) less than 0.05 were selected for the PLS-DA calculation. The dataset of VOCs corresponding to the average AUROC contains the same compounds listed in table 2. In practice, with these data Kruskal Wallis rank test gives similar results when calculated with 70% of samples. The classification error of the PLS-DA model corresponding to the average AUROC is minimized by three latent variables. The three latent variables explain about 30% of the total variance of the abundance of the selected 17 peaks. Thus, 70% of the data variance does not contribute to COVID-19 identification; rather, it most likely describes the individual peculiarity of each subject. This is an indication of the complexity of the metabolic profile which only in a limited portion is affected by COVID-19. Figure S12 in the Supplemental Information shows the statistical distribution of data in each latent variable and the projection in the planes defined by the three latent variables. The results of the classifier are summarized in the confusion matrices in Table 3 . The total accuracy is 91% in training and 89% in test. The sensitivity of the classifier is 94% ±1% and 94%±2% in training and test respectively, while the specificity is 87%±2% and 83%±3% Figure 5 shows the ROC curve. The areas under the ROC are 0.94±0.02 and 0.94±0.03 in training and test. The same procedure of calculation was applied to sensor's data. Also in this case, the optimal PLS-DA model was made by three latent variables. The total variance explained by the three latent variables is larger than 93%. This difference with respect to GC/MS is due to the larger correlation between the sensors. As a consequence, the dominant difference between samples (COVID-19 vs. non-COVID) tends to incorporate the slight differences due to the peculiar behaviour of each subject. Figure S13 in the Supplemental Information shows the box plots of the latent variables and the projection of data in the planes defined by the three latent variables. In table 4 the confusion matrices of the sensors data classifier are shown. The accuracy of the PLS-DA model of sensor array data is about 91% and 89% in training and test respectively. Figure 7 shows the ROCs in training and test. The areas under the ROCs are 0.92±0.02 in training and 0.91±0.04 in test. The sensitivity of the classifier is 94%±2% in training and 94%±3% in test, and the specificity is 87%±2% in training and 80%±4% in test. The class of controls were made of patients with different conditions, in particular 14 of the non-COVID patients were affected by inflammatory pathologies. Figures 3 and 5 show that subjects with inflammatory pathologies in both GC/MS and sensors data tend to be distributed towards the group of non-COVID. To verify the assumption that inflammatory pathologies are not a confounding factor in J o u r n a l P r e -p r o o f COVID-9 identification, PLS-DA has been calculated to evaluate the discrimination of COVID-19 respect to inflammations. Data are numerically limited to perform a full training and test, so the PLS-DA models have only been cross-validated with a leave-one-out procedure to estimate the classification performance (model details in figures S14 and S15 in supplemental information). Results show that the accuracy of identification is 91% and 93% with GC/MS and sensor array respectively. These values are similar to those obtained with the full set of controls. Thus, although limited by the lack of a proper test, this result suggests that inflammatory conditions should play a limited confounding factor in the identification of COVID-19. Before this study, the serum VOCs involved in the discrimination of COVID-19 were unknown; thus, a general purpose sensor array made of porphyrinoids (porphyrins and corroles) coated quartz microbalances has been used (see transparent methods section for a description of sensors and sensitive molecules). The discrimination of COVID-19 samples might be explained considering that the interactions between the compounds in table 1 and sensors are expected to be driven by Van Der Waals forces and hydrogen bonds, and porphyrinoids coated quartz microbalances are known to be sensitive to these interactions (Paolesse et al., 2017) . Nonetheless, the identification of target volatile compounds will drive the further design of porphyrinoids with improved selectivity towards COVID-19 related compounds. The achieved classification performances are comparable with those obtained analyzing the volatile compounds in breath with different instruments such as proton transfer reaction (Grassin-Delyle et al., 2021) , gas chromatography coupled with an ion mass detector (Ruszkiewicz et al., 2020) and sensor array (Shan et al., 2020) (see table S2 in Supplementary Information) . This result points out the diagnostic properties of serum suggesting that it is a valid alternative to breath in several practical situation where breath analysis may be not reliable for instance because of a lack of preparation of patients or for safety reasons as in the case of COVID-19. Further studies are under development to extend the sample population, and to include comorbidities and additional variables to the classification models. Additionally, the relationship between different stages of COVID-19 and volatile compounds should also be investigated to ascertain not only the diagnostics but also the prognostic properties of volatolomics. As a final remark, it is important to consider that a full comprehension of the relationship between COVID-19 and volatile compounds, either in breath or in serum, is still not available. It will be particularly important to investigate the uniqueness of the relationship between VOCs patterns and COVID-19 in order to improve the predictions of volatolomics based method towards the optimal goal of 100% sensitivity and specificity. This paper provides a proof-of-concept demonstration that the analysis of the volatile compounds released by blood serum can be used for COVID-19 diagnosis. There were some limitations in the present study. The first limitation is concerned with the fact that this pilot study was performed in one location, using one instrumentation, and for a given time interval. Environmental influences, repeatability and longtime performance of the adopted instrumentation need to be verified in multicentric studies. The second is concerned with the selection of patients. All subjects included in the study were recruited at the emergency department of a large hospital in Rome, Italy. In order to limit any influence of the stability of sensors it was decided to analyze samples from patients admitted in a range of two days. The group of COVID-19 patients show more co-morbidities with respect to non-COVID patients and the group of non-COVID patients included subjects with different pathologies, some of them with symptoms compatible with COVID-19. GC/MS and sensor array indicate that the largest separation among the data is due to the COVID-19 pathology; however, further studies with larger sample sizes are necessary to elucidate if and how co-morbidities influence the serum volatolome. A third limitation is related to the sampling of volatile compounds for GC/MS. Sampling for GC/MS was made with a Solid Phase Micro-Extraction (SPME) sampler. SPMEs of different composition are available and each provides a different profile of volatile compounds. Here the same SPME fibre and measurement conditions adopted in a previous serum volatolome investigation was used (Ali et al., 2016) .. It was a wide range fibre aimed at compounds with a molecular weight in the range 40-275 suitable for untargeted analysis. Comparison between different SPMEs will be necessary for a thorough appraisal of the volatolome composition. A fourth limitation concerns the sensor array. This was assembled with sensors used in previous in-vivo and in-vitro volatolomics studies. A optimal detection of COVID-19 will require the development of sensors that can maximize the sensitivity towards those compounds that can be associated with the disease. Finally, a fifth limitation comes from the analysis of data. It is known that PLS-DA models tend to overfit when the number of variables exceeds the number of samples. For this reason the number of variables in GC/MS was reduced by selecting those with p<0.05. However, with a larger sample the whole set of compounds could have been used for classification. This would lead to an optimal definition of the set of compounds relevant to the identification of COVID-19. The overcoming of these limitations is expected to lead to the definition of a procedure for COVID-19 diagnosis based on the volatolomics analysis of blood serum. Furthermore, the detection of specific volatile compounds may shed additional light on the organisms response to COVID-19. Sensor systems Data-processing strategies for metabolomics studies Higher-order chemical sensing Principal Component Analysis The scent of COVID-19: Viral (semi-)volatiles as fast diagnostic biomarkers? The Volatilomic Footprints of Human HGC-27 and CLS-145 COVID-19 infection may cause ketosis and ketoacidosis Progress in understanding COVID-19: insights from the omics approach On the Challenges for the Diagnosis of SARS-CoV-2 Based on a Review of Current Methodologies Near real-time VOCs analysis using an aspiration ion mobility spectrometer Targeting LOX-1 inhibits colorectal cancer metastasis in an animal model Gas sensors based on mass-sensitive transducers part 1: transducers and receptors-basic understanding Novel routes to substituted 5,10,15-triarylcorroles Porphyrinoids for Chemical Sensor Applications Variation in volatile organic compounds in the breath of normal humans Detection of an Extended Human Volatome with Comprehensive Two-Dimensional Gas Chromatography Time-of-Flight Mass Spectrometry Diagnosis of COVID-19 by analysis of breath with gas chromatography-ion mobility spectrometry -a feasibility study Profiling Single Cancer Cells with Volatolomics Approach Multiplexed Nanomaterial-Based Sensor Array for Detection of COVID-19 in Exhaled Breath Proteomic and Metabolomic Characterization of COVID-19 Patient Sera The serum metabolome of COVID-19 patients is distinctive and predictive Double-check: Validation of diagnostic statistics for PLS-DA models in metabolomics studies Chemometrics in metabonomics Diagnosing COVID-19: The Disease and Tools for Detection Emerging applications of metabolomics in drug discovery and precision medicine A new coronavirus associated with human respiratory disease in China A nanomaterial-based breath test for distinguishing gastric cancer from benign J o u r n a l P r e -p r o o f gastric conditions Pathological findings of COVID-19 associated with acute respiratory distress syndrome Diagnosis of pulmonary tuberculosis and assessment of treatment response through analyses of volatile compound patterns in exhaled breath samples