key: cord-0902820-ymhwpdss authors: Lodge, Samantha; Nitschke, Philipp; Loo, Ruey Leng; Kimhofer, Torben; Bong, Sze-How; Richards, Toby; Begum, Sofina; Spraul, Manfred; Schaefer, Hartmut; Lindon, John C.; Holmes, Elaine; Nicholson, Jeremy K. title: Low Volume in Vitro Diagnostic Proton NMR Spectroscopy of Human Blood Plasma for Lipoprotein and Metabolite Analysis: Application to SARS-CoV-2 Biomarkers date: 2021-01-23 journal: J Proteome Res DOI: 10.1021/acs.jproteome.0c00815 sha: 62161c050f26ba329359accc9aae43d5f2edc8a5 doc_id: 902820 cord_uid: ymhwpdss [Image: see text] The utility of low sample volume in vitro diagnostic (IVDr) proton nuclear magnetic resonance ((1)H NMR) spectroscopic experiments on blood plasma for information recovery from limited availability or high value samples was exemplified using plasma from patients with SARS-CoV-2 infection and normal controls. (1)H NMR spectra were obtained using solvent-suppressed 1D, spin–echo (CPMG), and 2-dimensional J-resolved (JRES) spectroscopy using both 3 mm outer diameter SampleJet NMR tubes (100 μL plasma) and 5 mm SampleJet NMR tubes (300 μL plasma) under in vitro diagnostic conditions. We noted near identical diagnostic models in both standard and low volume IVDr lipoprotein analysis (measuring 112 lipoprotein parameters) with a comparison of the two tubes yielding R(2) values ranging between 0.82 and 0.99 for the 40 paired lipoprotein parameters samples. Lipoprotein measurements for the 3 mm tubes were achieved without time penalty over the 5 mm tubes as defined by biomarker recovery for SARS-CoV-2. Overall, biomarker pattern recovery for the lipoproteins was extremely similar, but there were some small positive offsets in the linear equations for several variables due to small shimming artifacts, but there was minimal degradation of the biological information. For the standard untargeted 1D, CPMG, and JRES NMR experiments on the same samples, the reduced signal-to-noise was more constraining and required greater scanning times to achieve similar differential diagnostic performance (15 min per sample per experiment for 3 mm 1D and CPMG, compared to 4 min for the 5 mm tubes). We conclude that the 3 mm IVDr method is fit-for-purpose for quantitative lipoprotein measurements, allowing the preparation of smaller volumes for high value or limited volume samples that is common in clinical studies. If there are no analytical time constraints, the lower volume experiments are equally informative for untargeted profiling. High field proton NMR spectroscopy of biofluids coupled with pattern recognition methods have been applied extensively to problems in metabolic biochemistry, toxicology, and experimental diagnostics for more than three decades. 1−4 More recently, data analytic procedures such as curve and regression fitting have been used to extract quantitative lipoprotein and metabolic data from 1-dimensional blood plasma and urine spectra. 5−7 The mainstay of most biofluid NMR studies is the standard 5 mm outer diameter liquid state NMR tubes (with 5 mm NMR probes) used in the vast majority of all highresolution chemical and biochemical experiments. A range of tube sizes is also widely available, for example 1 mm to 10 mm for high resolution studies and 15 mm 25 mm tubes for widebore NMR applications. Each tube size has been optimized for a particular sample volume range and there are sensitivity consequences for using smaller sample volumes at the same sample concentration because the absolute analyte mass in the probe is reduced proportionately. Of note, the signal-to-noise ratio (S/N) in an NMR experiment is proportional to the absolute analyte mass in the receiver coil; halving this will require a quadruple number of scans to obtain the same S/N in an identical probe with identical filling factor (the % of the receiver coil containing the analytes). 8 This can result in severe spectral acquisition time penalties for low volume samples or compromised S/N ratios with weaker diagnostic performance. It is rare to have volume limitations for urine samples even in small animal studies. 6 However, blood plasma or serum samples are commonly available in limited amounts, such as those on small animals or infants, where it would be unethical to withdraw large amounts of blood, or in situations where the samples that are available are needed for multiple parallel metabolic studies involving multiple technologies. The latter applies to systemic metabolic studies on COVID-19, where attempts to uncover the complex metabolic sequelae via multiple orthogonal technologies often applied on the same samples, 9 leading to analytical priority decisions that may reduce the number of analytical options per sample. 1 H NMR spectroscopy of biofluids for in vitro diagnostics in standard 5 mm tubes requires ca. 0.3 mL of sample. 10, 11 The standard IVDr NMR methods for blood plasma lipoprotein measurements utilize 5 mm tubes with a minimal sample requirement of 300 μL (plus 300 μL buffer) as determined by a validated IVDr protocol, 12 while mass spectrometry requires typically 10−30 μL per assay. 13−15 Any reduction in NMR preparative volumes will therefore provide more opportunities for complementary analytical coverage of the same sample using additional spectroscopic assays. Here the use of 100 μL plasma in 3 mm SampleJet tubes was investigated using a modified IVDr method and longer scanning times and compared with standard IVDr methods in 5 mm SampleJet tubes using a cohort of control, SARS-CoV-2 positive patients, and SARS-CoV-2 negative patients. Blood plasma and serum samples were collected from a cohort of adult individuals in a study initiated at Fiona Stanley Hospital by the Covid Research Response team (https://research-au.net/ covid-research-response/) as part of the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC)/ World Health Organisation (WHO) pandemic trail framework (SMHS Research Governance Office PRN:3976 and Murdoch University Ethics no. 2020/052). Three groups of participants were recruited: (i) patients who presented with SARS-CoV-2 disease symptoms and subsequently tested positive for SARS-CoV-2 infection from upper and/or lower respiratory tract swabs by RT-PCR (n = 9 participants); (ii) blood samples from patients with SARS-CoV-2 disease symptoms and who had tested negative by RT-PCR (n = 27 participants); and (iii) healthy controls with no respiratory symptoms (n = 4 participants) who were enrolled as volunteers, according to Murdoch University Ethics no. 2020/053. The demographic data are shown in Table S1 . Plasma samples were centrifuged and processed according to a standard protocol. 10, 11 Prior to NMR analysis, frozen plasma samples were thawed at 20°C for 30 min and centrifuged for 10 min at 13 000g at 4°C. Plasma samples were prepared using both 5 mm SampleJet NMR tubes, following the recommended procedures for in vitro analytical and diagnostics procedures, 12 and 3 mm SampleJet NMR tubes. For the standard sample preparation for 5 mm NMR tubes, each aliquot of 300 μL of plasma was mixed with 300 μL phosphate buffer (75 mM Na 2 HPO 4 , 2 mM NaN 3 , 4.6 mM sodium trimethylsilyl propionate-[2,2,3,3− 2 H 4 ] (TSP) in 80% D 2 O, pH 7.4 ± 0.1), 11 mixed and spun for 5 s to ensure all liquid is at the bottom of the tube and transferred to a 5 mm SampleJet tube. For the sample preparation using 3 mm SampleJet NMR tubes, each aliquot of 100 μL of plasma was mixed with 100 μL phosphate buffer, mixed and spun for 5 s and 180 μL was transferred to a 3 mm tube for analyses. The 3 mm SampleJet NMR tubes require less than a third of the sample volume compared to the 5 mm SampleJet NMR tubes. NMR spectroscopic analyses were performed on a 600 MHz Bruker Avance III HD spectrometer equipped with a 5 mm BBI probe and fitted with the Bruker SampleJet robot cooling system set to 5°C. A full quantitative calibration was completed prior to the analysis using a previously described protocol. 11 All experiments were completed using the Bruker in vitro diagnostics research (IVDr) methods. 16 For samples prepared in both 3 mm SampleJet and 5 mm SampleJet NMR tubes, three experiments were carried out in automation mode, amounting to a total of 12.5 min acquisition time per sample: standard 1D experiment with solvent presaturation with gradient (32 scans, 96K data points, spectral width of 30 ppm), a Carr−Purcell− Meiboom−Gill (CPMG) spin−echo experiment (32 scans, 72K data points, spectral width of 20 ppm) and a 2-dimensional Jresolved experiment (40 t 1 increments with 2 scans each). In addition, for the samples prepared in both 3 mm SampleJet NMR tubes, three further experiments were completed using higher scan option for each experiment amounting to a total acquisition time of 38 min per sample: standard 1D experiment with solvent presaturation (128 scans, 96K data points, spectral width of 30 ppm), a Carr−Purcell−Meiboom−Gill (CPMG) spin−echo experiment (128 scans, 72K data points, spectral width of 20 ppm) and a 2-dimensional J-resolved experiment (40 t 1 increments with 4 scans each). Data were processed in automation using Bruker Topspin 3.6.2 and ICON NMR to achieve phasing and baseline correction. 17 A total of 112 lipoprotein parameters for each sample were generated using the Bruker IVDr Lipoprotein Subclass Analysis (B.I.-LISA) method by mathematically interrogating and quantifying the −(CH 2 ) n at δ = 1.25 and −CH 3 at δ = 0.80 peaks of the 1D spectrum after normalization to the Bruker QuantRef manager within Topspin using a PLS-2 regression model. 12 The lipoprotein data describe chemical components of cholesterol, free cholesterol, phospholipids, triglycerides, Apolipoproteins A1/A2/B100 and the B100/A1 ratio in different density classes: high-density lipoprotein (HDL, density Table S2 . The 1D and CPMG NMR spectral data sets were referenced to glucose at δ = 5.25 using open source MetaboM8 software packages available from GitHub (github.com/tkimhofer/ metabom8, version 0.1) 18 and corrected for baseline offset. Spectral regions corresponding to the residual water resonance signal (δ = 4.60−4.85) and those predominantly noise (δ < 0.25 and δ > 9.5) were excluded from analyses. The remaining spectral segments were normalized using a probabilistic quotient method, 19 mean-centered and scaled to unit-variance prior to multivariate modeling. Principal component analysis (PCA) was used to describe the underlying composition in quantitative lipoprotein, 1D and CMPG spectral data sets and to identify the clustering of healthy controls, SARS-CoV-2 positive, and SARS-CoV-2 negative individuals. One of the real-world problems in studying complex emergent diseases such as SARS-CoV-2 is that patient sampling is often sparse or incomplete and multiple subsamples are needed to cover a large number of biochemical tests and different mass spectrometric assays. This problem is also applicable where there are limited sample volumes, including studies on children and babies. Here the use of reduced sample volumes was investigated using 3 mm SampleJet NMR tubes (100 μL of plasma required) compared against the standard 5 mm SampleJet tube method (300 μL) for blood plasma with the standard lipoprotein IVDr quantification employing three different study cohortsSARS-CoV-2 positive, SARS-CoV-2 negative, and healthy control samples. IVDr methods extracted 112 lipoprotein parameters from the 1D spectra of the same samples either in 3 mm or 5 mm tubes, both obtained using 32 scans (4 min experiment). The overall PCA mapping of a mixture of the different patient and sample tubes with the different volumes are shown in Figure 1 with identical biological samples connected by dotted lines. The overall statistical distribution of the samples was maintained, i.e., multivariate biological variation exceeded the analytical variation and that thus the outputs of the two analytical procedures were substantially equivalent. However, some minor discrepancies for some samples were observed due to differential field inhomogeneities (shimming artifacts) that slightly influence the quantitative output of the IVDr fitting algorithms for some parameters. The PCA loading plots for the samples prepared in 5 mm and 3 mm SampleJet NMR tubes are nearly identical ( Figures 1B and 1C) , indicating equivalent biomarker information recovery. A detailed biological description of the meaning of these parameters in relation to the complex biochemical of SARS-CoV-2 positivity is reported elsewhere. 9 OPLS-DA was applied to the 5 mm and 3 mm lipoprotein data sets to compare the healthy controls and the SARS-CoV-2 positive patients. The different size tubes both yielded an AUROC of 1.0 with the coefficient plots yielding the same lipoprotein parameters driving the SARS-CoV-2 signature ( Figure S1 ). The quantitative relationships between the different sample volumes is illustrated in Figure 2 for the main six lipoprotein fractions. In all cases, there was a high degree of linearity with exceptionally high coefficients of determination (R 2 in the range 0.82 to 0.99). However, there were some small positive offsets in the linear equations for several metabolites in certain compartments, e.g., LDL cholesterol and total triglycerides, although these need to be considered in relation to the overall magnitude of the lipoprotein concentration ( Figure 2 ). Individual linear equations for all 112 lipoproteins are listed in Table 1 , with all but seven lipoprotein parameters (cholesterol LDL-5, free cholesterol LDL-6, free cholesterol LDL-5, phospholipids LDL-5, Apo-B LDL-5, triglycerides HDL-1, and triglycerides HDL-2) showing R 2 values of <0.9, indicating the 3 mm tubes can be used with confidence for all the measured IVDr lipoprotein parameters. The R 2 values and the calculated linear equations relating the variables for 3 mm and 5 mm tubes are shown in Table 1 . Examination of the 1D and CPMG spectral data sets indicated that the signal-to-noise ratio of the 3 mm SampleJet NMR tubes was 2.8 times lower than that of the 5 mm, when 32 scans were used, which is due to the filling-effect of the 3 mm tubes in the 5 mm NMR probe volume. In particular, the metabolite signals at the aromatic regions were largely not detectable for the 3 mm NMR tubes with 32 scans ( Figure S2 ). However, we expect that improved performance with a dedicated 3 mm NMR probe will be achievable. It was found that at least 128 scans for the 3 mm SampleJet NMR tubes was required in order for the aromatic peaks and some of the lower concentration metabolites to have sufficient signal-to-noise to be comparable to the standard 5 mm IVDr 32 scan method. For the JRES spectral data sets at least 4 scans are required in order to gain sufficient peak information ( Figure S3 ). PCA models were constructed for the 1D and CPMG spectral data sets separately, using healthy controls, SARS-CoV-2 negative, and the SARS-Cov-2 positive patients. Both the standard 32 scans (5 mm NMR tubes) and 128 scans (3 mm NMR tubes) showed highly comparable results for the 1D Figure 4 ) spectral data sets, but this was achieved at the cost of an approximately 4-fold increase in the total experimental time for each sample in the 3 mm NMR tubes. When the PCA model was constructed for the CPMG spectral data set using 32 scans in the 3 mm NMR tubes, it was found that the PCA scores plot show clustering patterns that were similar to those seen in both the 5 mm tubes (32 scans) and the 3 mm (128 scans) SampleJet NMR tubes ( Figure S4A ). However, the loading plots clearly show that the clustering was dominated by the aliphatic molecules in the 3 mm NMR tubes with 32 scans (Figures S4B and S4C) . The difference from the aromatic regions among the SARS-CoV-2 positive patients compared to the healthy and SARS-CoV-2 negative patients were not recovered using the 3 mm NMR tube with 32 scans when compared to the 5 mm NMR tubes using 32 scans ( Figure S4D and S4E). This finding confirms that higher scan numbers are required for the 3 mm SampleJet NMR tubes in order to get the same level of metabolite signals information recovery from the healthy controls, SARS-CoV-2 positive and SARS-CoV-2 negative patients. We conclude that all NMR experiments applied in this test series give substantially equivalent, differential diagnostic information for SARS-CoV-19 positive patient samples. The IVDr lipoprotein analysis is highly efficient and results in excellent quantitative agreement between the different sample volumes with no time penalty. However, the untargeted analysis is more sensitive to the reduced S/N in the smaller volume NMR tubes and therefore requires greater spectral accumulation time to recover similar metabolic biomarkers. We recommend that for clinical studies involving small numbers of samples and with limited sample volume, it would be feasible to adopt untargeted analysis using 3 mm NMR tubes with a higher scan number. Figure S1 : OPLS-DA of the lipoprotein data set in the 5 mm (ns = 32) and 3 mm (ns = 32) SampleJet NMR tubes to compare healthy controls vs the SARS-CoV-2 positive patients; Figure S2 : Overlay of a CPMG spectra of a SARS-CoV-2 positive patient analyzed in 5 mm SampleJet NMR tubes with 32 scans, and 3 mm NMR tube with 32 scans; Figure S3 : JRES spectral overlays of (A) 5 mm SampleJet NMR tube with 2 scans (blue) and SampleJet 3 mm NMR tube with 2 scans; (B) 5 mm SampleJet NMR tube with 2 scans (blue) and 3 mm SampleJet NMR tube with 4 scans; Figure S4 : PCA scores and loading plots of CPMG spectral data for 3 mm SampleJet NMR analyses using 32 scans and spectral overlays with principal component 2 loadings plots (PDF) High resolution H NMR studies of vertebrate blood and plasma Metabolic profiling, metabolomic and metabonomic procedures for NMR spectroscopy of urine, plasma, serum and tissue extracts High resolution proton magnetic resonance spectroscopy of biological fluids Two-dimensional proton nuclear magnetic resonance "maps" of acetaminophen metabolites in human urine The association of fish consumption and its urinary metabolites with cardiovascular risk factors: the International Study of Macro-/ Micronutrients and Blood Pressure (INTERMAP) The Consortium for Metabonomic Toxicology (COMET): aims, activities and achievements Serum metabolic signatures of coronary and carotid atherosclerosis and subsequent cardiovascular disease Microflow NMR: concepts and capabilities Integrative Modelling of Quantitative Plasma Lipoprotein, Metabolic and Amino Acid Data Reveals a Multi-organ Pathological Signature of SARS-CoV-2 Infection Quantitative In-Vitro Diagnostic NMR Spectroscopy for Lipoprotein and Metabolite Measurements in Plasma and Serum: Recommendations for Analytical Artefact Minimization with Special Reference to COVID-19/SARS-CoV-2 Samples Precision high-throughput proton NMR spectroscopy of human urine, serum, and plasma for large-scale metabolic phenotyping Quantitative Lipoprotein Subclass and Low Molecular Weight Metabolite Analysis in Human Serum and Plasma by (1)H NMR Spectroscopy in a Multilaboratory Trial High-Speed Quantitative UPLC-MS Analysis of Multiple Amines in Human Plasma and Serum via Precolumn Derivatization with 6-Aminoquinolyl-Nhydroxysuccinimidyl Carbamate: Application to Acetaminophen-Induced Liver Failure Ultrahigh-Performance Liquid Chromatography Tandem Mass Spectrometry with Electrospray Ionization Quantification of Tryptophan Metabolites and Markers of Gut Health in Serum and Plasma-Application to Clinical and Epidemiology Cohorts High-Throughput Plasma Lipidomics: Detailed Mapping of the Associations with Cardiometabolic Risk Factors Toward Reliable Lipoprotein Particle Predictions from NMR Spectra of Human Blood: An Interlaboratory Ring Test 750 MHz 1 H and 1 H-13 C NMR spectroscopy of human blood plasma Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures