key: cord-0826642-6r40e19s authors: Gegner, Hagen M.; Mechtel, Nils; Heidenreich, Elena; Wirth, Angela; Cortizo, Fabiola Garcia; Bennewitz, Katrin; Fleming, Thomas; Andresen, Carolin; Freichel, Marc; Teleman, Aurelio; Kroll, Jens; Hell, Rüdiger; Poschet, Gernot title: Deep metabolic profiling assessment of tissue extraction protocols for three model organisms date: 2021-12-19 journal: bioRxiv DOI: 10.1101/2021.12.16.472947 sha: 608af7bafefa482aad4acb01ee9fe4e7c3b26575 doc_id: 826642 cord_uid: 6r40e19s Metabolic profiling harbors the potential to better understand various disease entities such as cancer, diabetes, Alzheimer’s, Parkinson’s disease or COVID-19. Deciphering these intricate pathways in human studies requires large sample sizes as a means of reducing variability. While such broad human studies have discovered new associations between a given disease and certain affected metabolites, i.e. biomarkers, they often provide limited functional insights. To design more standardized experiments, reduce variability in the measurements and better resolve the functional component of such dynamic metabolic profiles, model organisms are frequently used. Standardized rearing conditions and uniform sampling strategies are prerequisites towards a successful metabolomic study. However, further aspects such as the choice of extraction protocol and analytical technique can influence the outcome drastically. Here, we employed a highly standardized metabolic profiling assay analyzing 630 metabolites across three commonly used model organisms (Drosophila, mouse and Zebrafish) to find the optimal extraction protocols for various matrices. Focusing on parameters such as metabolite coverage, metabolite yield and variance between replicates we compared seven extraction protocols. We found that the application of a combination of 75% ethanol and methyl tertiary-butyl ether (MTBE), while not producing the broadest coverage and highest yields, was the most reproducible extraction protocol. We were able to determine up to 530 metabolites in mouse kidney samples, 509 in mouse liver, 422 in Zebrafish and 388 in Drosophila and discovered a core overlap of 261 metabolites in these four matrices. To enable other scientists to search for the most suitable extraction protocol in their experimental context and interact with this comprehensive data, we have integrated our data set in the open-source shiny app “MetaboExtract”. This will enable scientists to search for their metabolite or metabolite class of interest, compare it across the different tested extraction protocols and sample types as well as find reference concentrations. Metabolomics, defined as the separation and subsequent measurement of small molecules in either qualitative or quantitative way, enables the generation of metabolic profiles of any sample of interest. While genomics and transcriptomics are analyzed within the framework of a single organism and understood by the blueprint of genes or transcripts of the respective species, metabolomics encompasses all compounds that may be metabolized by an organism, its microbiome or that are introduced by the environment ('exposome') at a given time. Therefore, the metabolome incorporates the environmental influence as well as interactions with other organisms (Johnson et al., 2012) . It can serve as a bridge between the organism, its interactions and any disease, e.g. between diet, the gut microbiome and metabolic disease (Pallister et al., 2017) . While the complexity and dynamic nature of the metabolome is daunting from an analytical perspective, metabolomics harbors the potential to better understand as well as diagnose various disease entities such as diabetes (Arneth et al., 2019) , kidney disease (Abbiss et al., 2019) , Parkinson´s (Shao & Le, 2019) , Alzheimer´s disease (Wilkins & Trushina, 2018) and most recently, COVID-19 (Sindelar et al., 2021) . The potential to understand the metabolic signatures of any given disease entity is tremendous, however, deciphering the intricate underpinnings of those in a human study requires costly, as well as time and work extensive population-wide association studies with several hundred participants per group (Nicholson et al., 2008) . These broad studies may be successful in the discovery of new associations between a respective disease and the measured metabolites, i.e., biomarkers, but they are limited in their mechanistic explanations despite all efforts. While the dynamic nature of the metabolome provides incredibly powerful insights, it also highlights the challenges of metabolome analyses -its variability and associated pitfalls. Variation and noise that are biologically inherent or are introduced at some point to the sample are complicating metabolic analyses, impairing the quality of the findings, limiting their validity and may even overshadow the effect size of the research question itself. Factors that introduce such variability range from intrinsic ones derived from the study organism, e.g. age or sex (Bell et al., 2021; Brennan & Gibbons, 2020) , to extrinsic factors, e.g. diet, lifestyle or medication (Adamski, 2016; Mellert et al., 2011) . Additionally, other factors, such as pre-analytical ones during sample collection (Lippi et al., 2020; Yin et al., 2015) , or analytical factors deriving from the sample preparation, the extraction protocols or analytical approach used to conduct the measurement (Erben et al., 2021; Lin et al., 2007) are also influential and need to be accounted for. Model organisms that are reared under controlled laboratory conditions and manipulated genetically to analyze a certain genotype address several of the challenges mentioned above. Combining the standardized rearing conditions and stringent sampling protocols with the already extensive knowledge accumulated from other "-omics" in models such as mice, Drosophila or Zebrafish enhances the explanatory power of metabolomic studies tremendously while simultaneously reducing the number of samples needed to generate meaningful results. To ensure that the analytical phase, i.e. the extraction and measurement of metabolites, does not introduce biases and variability, an in-depth evaluation of such aspects is necessary. Standardized metabolomic analyses are commercially available by companies such as Metabolon (www.metabolon.com) or Biocrates (www.biocrates.com). The latter has developed standardized and robust LC-MS/MS based kits which enable the absolute quantification of specific compound classes or more broadly, up to 630 metabolites in the case of the MxP Quant 500 kit (Biocrates). Within these 630 metabolites, the MxP Quant 500 kit covers 14 small molecule and 9 different lipid classes. Due to its standardized nature and compatibility with a multitude of LC-MS/MS platforms, data generated via such a kit-based approach enables inter-and intra-laboratory comparability (Siskos et al., 2017) , as well as its integration from different experiments. Although these kits were initially developed for human biofluids, i.e. plasma and serum, they may be used for tissue samples and other sample types such as cultured cells (Andresen et al., 2021) or supernatants likewise. However, there is no consensus on the optimal metabolite extraction procedure amongst the metabolomic community for the investigation of polar and non-polar metabolites covering that many chemical classes within one analysis. In this study, this open question was addressed using the highly standardized targeted metabolomics kit (Biocrates MxP Quant 500) to evaluate seven extraction protocols designed to extract both, polar and non-polar metabolites, differing in their solvent composition and extraction mode (mono/bi-phasic) as well as handling complexity (Figure 1.) . We compared the metabolite coverage, concentration yields and robustness, i.e. the coefficient of variance (CV%) across three commonly used model organisms (mice, Zebrafish and Drosophila), focusing on either whole organisms as sample type (Drosophila) or specific organs (liver and kidney) of the respective model organism. Lastly, we integrated our data in the Shiny app "MetaboExtract" (Andresen et al., 2021) to provide a useful source of metabolite concentrations across model organisms and enable other scientists to search for an optimal extraction procedure for their metabolite or metabolite class of interest. Chemicals were bought from Sigma-Aldrich (Germany). All solvents used for sample extractions and LC-or FIA-MS/MS analyses were of UHPLC-MS quality. Nine-week-old C57Bl6N wildtype mice (Charles River, Germany) were anesthetized with isoflurane and blood was taken to generate EDTA-Plasma. Without regaining consciousness mice were killed by cervical dislocation. Livers and kidneys were excised rapidly and shortly rinsed in ice-cold 0.9 % NaCl. Excess liquid was removed before whole organ weight was determined for later normalization and tissue was snap-frozen in liquid nitrogen. All procedures were approved by the Animal Care and Use Committee at the Regierungspräsidium Karlsruhe, W1118 flies were acquired from Bloomington Drosophila Stock Center. For all metabolic measurements, flies were grown under controlled conditions: Flies were allowed to lay eggs on apple plates for 12 hours. First instar larvae hatching within a 4-or 6-hours window were picked and seeded at a density of 60 animals per vial. Adult flies of all genotypes enclosing within a 24-hour time-window were separated by gender in groups of 30 flies and aged for 10 days. Flies were grown and maintained on food consisting of the following ingredients for 30 liters of food: 480g agar, 660g sugar syrup, 2400g malt, 2400g corn meal, 300g soymeal, 540g yeast, 72g nipagin, 187mL propionic acid and 18.7 mL phosphoric acid. At sample collection, flies were pooled and snap-frozen for metabolic profiling. Adult Zebrafish were kept under a 13-hour light / 11-hour dark cycle and feeding of Zebrafish took places twice a day, freshly hatched Artemia salina in the morning and fish flake food in the afternoon. All experimental interventions on animals were approved by the local government authority, Regierungspräsidium Karlsruhe and by Medical Faculty Mannheim (I-19/02) and carried out in accordance with the approved guidelines. Age of adult male Zebrafish was 9 months and both sexes were included. Zebrafish were sacrificed in ice water and livers were immediately dissected and frozen in liquid nitrogen and subsequently stored at -80°C. 7 mg to 10 mg of livers were used for further analysis. To ensure sufficient input material across the model organisms we used 30 pooled flies (w1118 flies), 7-10 mg of Zebrafish liver or 20-22 mg of mouse (C57Bl6N) liver and kidney pooled from three individuals respectively. All tissue samples were pulverized using a ball mill (MM400, Retsch) with precooled beakers and stainless-steel balls for 30 seconds at the highest frequency (30 Hz). The exact weight was determined for normalization of all measurements. Here we evaluated six different extraction protocols that are described in Figure 1 . We developed these protocols based on own preliminary experience and scanning of general metabolomics literature. The protocol "MeOH/MTBE", noted with an asterisk, was applied twice for mouse samples only. We are including this variation as an additional method (2xMeOH/MTBE) when we are referring to the seven extraction protocols. Briefly, pulverized and frozen samples were extracted using the indicated solvents and subsequent steps of the respective protocol ( Fig.1.) . After a final centrifugation step the solvent extract of the protocols 100IPA, IPA/ACN and MeOH/ACN were transferred into a new 1.5ml tube (Eppendorf) and snap-frozen until kit preparation. The remaining protocols were dried using an Eppendorf Concentrator Plus set to no heat, stored at -80°C and reconstituted in 120 µl isopropanol (60 µl of 100% isopropanol, followed by 60 µl of 30% isopropanol in water) before the measurement. After conducting the described seven extraction protocols, tissue extracts were processed following the manufacturer's protocol of the MxP® Quant 500 kit (Biocrates). Briefly, 10 µl of the samples or blanks were pipetted on the 96 well-plate based kit containing calibrators and internal standards using an automated liquid handling station (epMotion 5075, Eppendorf) and subsequently dried under a nitrogen stream using a positive pressure manifold (Waters). Afterwards, 50 µl phenyl isothiocyanate 5% (PITC) was added to each well to derivatize amino acids and biogenic amines. After 1 h incubation time at room temperature, the plate was dried Validation and filtering Data validation and quantification was performed using MetIDQ (Oxygen-DB110-3005). Here, metabolites were further categorized based on their quantitation ranges. Additional filtering per metabolite was based on the limit of detection (LOD), limit of quantification (LOQ) and concentration within the quantitative range (valid). To remove metabolites that were not present in any model organism and sample type, we considered a metabolite as detectable when at least 2 out of 3 replicates within a tested protocol were above LOD (see Figure 2. ). These metabolites are also visualized as Venn diagrams in Figure 5 for the extraction protocol To find the optimal protocol per model organism and sample type, we analyzed the yield per metabolite achieved across the extraction protocols. For this comparison, missing values and zero values were imputed per metabolite with 20% of the minimal positive value of a given metabolite. Subsequently, to perform statistical analyses, the data was log2-transformed. We then performed an ANOVA per metabolite considering all metabolites that were detectable with at least a single method. Extraction protocols were used as categorical variables and concentration as dependent variables. A Tukey post-hoc test (alpha = 0.05) was used to determine the extraction protocols with the highest median yield as well as non-significantly (p-adjusted > 0.05) lower yields. These extraction protocols were considered optimal, counted and depicted in Figure 3 . Conversely, metabolites that were significantly better extracted in a single extraction protocol are depicted in supplementary Figure S4 . Data can be explored and downloaded using the Shiny app "MetaboExtract" which is available at http://www.metaboextract.shiny.dkfz.de. The underlying code is also available at https://github.com/andresenc/MetaboExtract (Andresen et al., 2021) . Figure 2 and 3 as well as S3 were extracted from the Shiny app. The aim of this study was the comparison of seven extraction protocols (Figure 1 ) across three model organisms to determine the optimal extraction procedure with regards to metabolite coverage, yield and robustness (CV%). In total, we analyzed 630 metabolites, however, after filtering for low concentrated metabolites below the limit of detection (LOD), we continued the analyses using this processed data. Figure S1 . shows the ratio of LOD, LOQ and valid measurements per extraction protocol across all sample types and model organisms. The metabolic profiling kit (Biocrates MxP Quant 500) used for this study quantifies polar as well as non-polar metabolites across 14 small molecule and 9 different lipid classes. Therefore, an extraction procedure is required that enables solubilization ranging from very polar metabolites (e.g. carbohydrates and amino acids) to very non-polar metabolites such as triacylglycerols (Figure 2. ). While the maximum coverage between the different model organisms is expected to be variable, the general performance of the respective protocol remained similar. While IPA, a rapid and easy to perform single solvent extraction protocol, produced fair coverage, the remaining protocols, both containing acetonitrile, achieved the lowest coverage regardless of the sample type or organism. Comparison of the different metabolite classes reveal that these monophasic extraction protocols failed to extract several lipids, such as diand triacylglycerols as well as ceramides or cholesterol esters. Although the coverage of a given extraction is essential, the yield, as the calculated concentration per metabolite, needs to be maximized and loss during the protocol should be avoided. To scrutinize the extraction protocols regarding this criterion we performed an ANOVA (see material and method part), counting the metabolites that reached the highest or a non-significantly lower concentration in a given extraction protocol per model organism (Fig. 3) . Therefore, lower counts in Figure 3 . indicate that other protocols did extract significantly higher concentrations. Here, a similar pattern compared to the metabolite coverage (Figure 2 Overall, the 75EtOH/MTBE protocol resulted in the best coverage and highest metabolite concentrations of the evaluated protocols while also showing excellent to acceptable levels of variance between the measurements. As a next step, we used this extraction protocol as an "MetaboExtract" is an interactive resource to explore metabolite extractions and baseline concentrations of model organisms The presented data provides an attempt to inform about the optimal extraction protocol for a standardized profiling assay. However, it harbors further information such as baseline concentration for sample types and whole model organisms. To access this information researchers may explore the data set via the easy-to-use interactive R/shiny app "MetaboExtract" (Andresen et al., 2021) . There, the already present metabolite data on human tissue and cells was expanded by our data set focused on model organisms. Since all data was generated using the standardized MxP Quant 500 assay it is highly comparable. Organisms, tissues, extraction methods and classes of metabolites may be (de)selected to focus on the data of interest that are then provided in comprehensive and interactive visualizations. The data presented in Figure 2 and in supplementary Figure S4 were generated using MetaboExtract. Its standardized nature provides the potential for further expansion via additional MxP Quant 500 assay measurements. Model organisms enable standardized laboratory-controlled handling, sampling and experiments. This level of standardization in the pre-analytical phase benefits metabolic profiling due to the dynamic nature of the metabolome, rapid turnover rates of metabolites and the influence of the environment (Edison et al., 2016; Saoi & Britz-mckibbin, 2021) . Here, we focused on the extraction and processing of diverse sample types, one of the most important aspects in the analytical phase requiring strict standardization for reproducibility of data. To this end, we used a targeted metabolic profiling approach quantifying up to 630 metabolites across three commonly used model organisms (Drosophila, mouse and Zebrafish) to find a robust, easy-to-use extraction protocol yielding a comprehensive coverage of the target metabolites. This targeted metabolic profiling assay (Biocrates MxP Quant 500) potentially quantifies polar as well as non-polar metabolites across 14 hydrophilic and 9 different lipid classes. Hence, it requires the extraction of a chemically diverse range of metabolites. Besides broad metabolite coverage and high yields in concentrations with little loss during the extraction, we evaluated the robustness of a given method as well as the practicability and effort of performing the protocols. In our study, biphasic extractions (MeOH/CHl3/H2O, 75EtOH/MTBE, (2x)MeOH/MTBE) resulted in superior coverage and concentrations across all model organisms and sample types. Here, the complementary phases, composed of an organic lipid-rich phase and an aqueous phase containing primary and secondary metabolites, were combined and dried in the final step of each protocol allowing for a greater coverage as compared to mono-phasic extractions. Chloroform based bi-phasic extractions by Bligh & Dyer (1959) have been dominantly used over the years due to the focus on the lipid fraction, however, MTBE (methyl tert-butyl ether, i.e. TBME) is more frequently used as a non-toxic and non-carcinogenic alternative to chloroform (Furse et al., 2015; Matyash et al., 2008) . Here, both strong hydrophobic solvents in combination with another organic solvent of lower hydrophobicity such as ethanol or methanol resulted in comparable metabolite coverage. Importantly, the chloroform-based extraction resulted in the highest yield as well as broadest coverage, however, substituting it with MTBE resulted in similar but less variable measurements ( However, the extraction protocol of choice depends highly on the target as well as the sample type. While our study aimed to find the most versatile protocol it also showed that other protocols extracted certain metabolite classes more efficiently than the broader biphasic extractions. For instance, MeOH/ACN/H2O+FA was superior in the extraction of amino acids in Zebrafish and of glycerophospholipids in Drosophila as compared to the other protocols (supplementary Figure S4) . Hence, the most suitable extraction protocol highly depends on the type of analysis, i.e., quantitative or qualitative assessment, the coverage needed, i.e., a small set of targets within one metabolite class or a broad screening, as well as the sample type. To find the most adequate protocol for a given scenario we included this data in a publicly available shiny app -"MetaboExtract" (Andresen et al., 2021) . This open access resource is expandable and makes use of the comparability of standardized assays such as MxP Quant 500. MetaboExtract enables users to review and explore standardized extractions and infer baseline concentrations of metabolites across a variety of sample types and organisms. There, users can search for a metabolite or metabolite class of interest, review or compare measured concentrations following a variety of mono-or biphasic extraction protocols across human cells, cell lines and tissue and, now, model organisms. The custom R scripts (R version 4.0.4) that were used for analysis and visualization are accessible at https://github.com/nilsmechtel. The underlying code for "MetaboExtract" is available at https://github.com/andresenc/MetaboExtract (Andresen et al., 2021) . Metabolomics approaches for the diagnosis and understanding of kidney diseases Key elements of metabolomics in the study of biomarkers of diabetes Comparison of extraction methods for intracellular metabolomics Metabolomics of type 1 and type 2 diabetes Sex differences in systemic metabolites at four life stages: cohort study with repeated metabolomics Sex matters: a focus on the impact of biological sex on metabolomic profiles and dietary interventions Comparison of simple monophasic versus classical biphasic extraction protocols for comprehensive UHPLC-MS/MS lipidomic analysis of Hela cells The time is right to focus on model organism metabolomes. Metabolites Evaluation of different stool extraction methods for metabolomics measurements in human faecal samples Isolation of lipids from biological samples Cross-Laboratory Standardization of Preclinical Lipidomics Using Differential Mobility Spectrometry and Multiple Reaction Monitoring Xenobiotic metabolomics: Major impact on the metabolome Evaluation of metabolite extraction strategies from tissue samples using NMR metabolomics PREDICT: A checklist for preventing preanalytical diagnostic errors in clinical trials Lipid extraction by methyl-terf-butyl ether for high-throughput lipidomics Nutritional impact on the plasma metabolome of rats The metabolome-wide association study: A new look at human disease risk factors Untangling the relationship between diet and visceral fat mass through blood metabolomics and gut microbiome profiling New Advances in Tissue Metabolomics : A Review Recent advances and perspectives of metabolomics-based investigations in Parkinson's disease Longitudinal metabolomics of human plasma reveals prognostic markers of COVID-19 disease severity Interlaboratory Reproducibility of a Targeted Metabolomics Platform for Analysis of Human Serum and Plasma Application of metabolomics in Alzheimer's disease Effects of pre-analytical processes on blood samples used in metabolomics studies The authors gratefully acknowledge the data storage service SDS@hd supported by the The authors have no conflicts of interest to declare. All co-authors have seen and agree with the contents of the manuscript and there is no financial interest to report.