key: cord-0987828-xclctext authors: Picó, Yolanda; Barceló, Damià title: Mass Spectrometry in Wastewater-Based Epidemiology for the Determination of Small and Large Molecules as Biomarkers of Exposure: Toward a Global View of Environment and Human Health under the COVID-19 Outbreak date: 2021-11-08 journal: ACS Omega DOI: 10.1021/acsomega.1c04362 sha: 882b8d616463ea1b0d943d0cafc5117b10285e96 doc_id: 987828 cord_uid: xclctext [Image: see text] Wastewater-based epidemiology (WBE) estimates collective consumption or exposure to chemicals or pathogens by monitoring the substances excreted in the population’s wastewater. Advances in mass spectrometry (MS) and the application of some clinical diagnostic tools and proteomics to wastewater fingerprinting have been linked to the discovery of new biomarkers and indicators of population health and are broadening the scope of WBE that nowadays cover not only small molecule biomarkers but also genetic biomarkers, large molecules, viruses, infection diseases, resistance, etc. This mini-review highlights recent WBE advances using MS and how this progress can create a fingerprint of a city’s health hazards, habits, and lifestyle, which is gaining in public health emphasis. Wastewater-based epidemiology (WBE) has already shown to be an innovative and promising tool for the estimation of some unhealthy or poorly confessable habits at the community level, such as drug, alcohol, and tobacco use. 1 Figure 1 outlines the classical workflow to estimate illicit drug consumption. Drugs or their human metabolites are determined by liquid chromatography−mass spectrometry (LC−MS). Their concentration is combined with a few additional pieces of information to estimate consumption. In fact, the WBE evolution has been closely linked to instrumental development, especially of mass spectrometry (MS). 2 Growing research attention has been paid to WBE that could now be regarded as a means of measuring a community's unique, collective health status, akin to a profile, signature, or fingerprint of overall health. 3 It overcomes some of the limitations of conventional biomonitoring, which can only collect data from individuals and involve high costs and considerable resources and cannot provide results in near real-time. Given this, WBE has been developed steadily over the last 15 years. 4 Most of the research efforts of WBE are concentrated in Europe. Originally, determining the population-wide use or misuse of illicit or therapeutic drugs, pharmaceuticals and personal care products, tobacco and alcohol use, pesticide exposure, 5 genetic biomarkers, 6 etc. was the main target of WBE. Later, this concept gradually involved a broad array of other types of substances in wastewater, including virus particles. 7 This evolution has led to the introduction of new analytical techniques, as, in the case of biological material, the analysis of DNA and RNA fragments by polymerase chain reaction (PCR) that is in many aspects irreplaceable. Recently and as a most relevant example due to the wide literature on the topic, COVID-19 has been detected in sewage, which supports the idea of using WBE to reveal the scale of COVID-19 outbreak. 8−12 Conventional reverse transcription (RT)-PCR and upgraded PCR devices, like digital (d)PCR or droplet digital (dd)PCR, have been reported not only for DNA but also for the RNA, as well. The main advantage of using dPCR or ddPCR is the increasing sensitivity of at least 1 order of magnitude compared to that with PCR. 9,10 Although PCR has become so prevalent for the determination of viruses, the complexity of wastewater means that it still suffers from many unresolved problems, such as proper extraction, cleanup, detection, and quantification of the genetic material. MS may be a good alternative to provide complementary information and help to overcome the problems. With regard to the application of WBE for large molecules, the literature is rather scarce. The potential relevance for WBE of the human proteins present in sewage waters was already stated by Rice and Kasprzyk-Horden. 13 This approach appeared as a new paradigm for public health assessment, although no experimental work was performed to demonstrate with a case study the feasibility of WBE for large molecules. Only recently, our work 14,15 demonstrated that valuable information on large molecules can be achieved by analyzing wastewater treatment plant (WWTP) influent and effluent. In this mini-review, we bring together the recent progress of WBE using MS targeted to small molecules, where most of the literature has been already published, and to large molecules, with very few papers reporting proteins and peptides as biomarkers of exposure. Ideas for future monitoring of viral proteins such as SARS-CoV-2 in WBE are outlined by incorporating the experience from LC−MS in clinical studies to determine proteins and peptides. We believe that WBE MS is a useful approach rather complementary to PCR methodologies by reporting more accurate information on the virus-specific peptides. The main challenge of WBE to provide a fingerprint of different aspects of health, habits, well-being, and lifestyle of a community is the selection of specific biomarkers of human metabolism, excreted in sufficient and stable quantities in wastewater. Normally, the identification of these biomarkers is based on an in-depth search of human biomonitoring (HBM) reports that identify candidates. Several interesting reviews compile extensive lists of many of these potential biomarkers for WBE. 5,16−21 Likewise, liquid chromatography−tandem mass spectrometry (LC−MS/MS) using triple quadrupole (QqQ) or linear quadrupole ion trap (QTRAP) has emerged as a powerful technique for the simultaneous detection of multiple biomarkers in wastewater, providing critical information about their occurrence. In most of these studies, LC−MS/MS is used as targeted analysis, in which the biomarkers are selected "a priori" and other related products cannot be detected because only specific transitions are monitored. The use of this type of analysis ensures maximum sensitivity, which is a crucial aspect in these analyses. In any case, MS-based methods can successfully detect in wastewater biomarkers listed in Table 1 . However, biomarkers selected through the study of the HBM are broader, and it is not clear whether several of these biomarkers are suitable for WBE or not. The final use of these biomarkers in wastewater will depend on their prevalence in wastewater (they must be detected) and their stability (their excretion cannot be evaluated if they are not stable). More information and numerous "in situ" studies are therefore required. Table 1 reflects how in many cases substances used are not exclusive of humans (parent compounds and/or degradation products that can come from other sources). In those cases, exposure is more difficult to assess, being still an ongoing study. As an interesting example, Rousis 22 proposed the urinary factor to deal with this problem, although this factor still needs further improvement to its application. It is also important to note that the choice of MS instrument depends on the purpose of the analysis. For example, a LC− QqQ−MS/MS is often the choice for quantitative analysis of most of the small molecules for WBE as mentioned above. However, high-resolution mass spectrometry (HRMS), using quadrupole time-of-flight (QqTOF) or Orbitrap, could be a best option to directly identify human biomarkers in wastewater opening a new avenue in WBE. Several biomarkers have also been identified due to intense research on these HRMS systems. One difficulty is that wastewater contains thousands of compounds, and the biomarkers of interest may not be in the majority. The simplest solution is to use what is known as suspected screening or wide screening, which involves building a database with those compounds recognized in the literature or in other databases facilitating the identification of compounds. 21, 23, 24 Few examples of nontarget analysis have also been reported in succeeding with the identification of unknown compounds due to the possibility of sharing the most probable empirical formula and the fragmentation pattern to several Internet available databases to recognize the unknown compound recorded by MS for identifying if it is a potential biomarker. 25, 26 The use of in silico tools (mostly retention time, fragmentation, biocatalysis, biodegradation, metabolism, or transformation products prediction models) has also propelled the discovery of biomarkers from the raw MS data. 23, 25 Interestingly, the fate of new psychoactive substances (NPS) and their transformation products in WWTPs is not wellunderstood. Kinyua et al. 24 set up batch reactors seeded with activated sludge to evaluate biotic, abiotic, and sorption losses of p-methoxymethylamphetamine (PMMA) and dihydromephedrone (DHM) and identify transformation products using some of these simulation models. These tools confirm that main reactions were O-and N-demethylation, oxidation, and hydroxylation. Figure 2 illustrates the identification of several DHM transformation products and their fragmentation pattern. Evaluation of wastewater profiles or fingerprinting by MS has also been demonstrated to be useful in biomarker identification. This approach usually requires the application of bioinformatics that support clustering classification to organize ionic signals with common characteristics into different groups, the estimation of ionic signal intensity in a space, and the application of complex multivariant statistics and development of predictive models. This format is still in its infancy regarding WBE. Although not yet widely applied within the field of WBE, the use of HRMS offers a range of new possibilities to further expand the detection of small molecules that act as useful biomarkers in wastewater. 24, 27 The identification of proteins in urban wastewater have been proposed as a logical continuation of the intense search for biomarkers that started recently. Proteins can be an excellent indicator of prevalent diseases in a population. However, the transition from theory to practice is taking a long time. Proteins together with biomarker composition in sewage can reflect the surrounding human activities (e.g., industry, agriculture, and pest control) and the health status of populations based on WBE. Biomarkers might be found in the soluble part of water, but proteins could be expected in particulate matter because many proteins travel in water media by attaching on the surface of particles and bacteria. MS approaches based on LC−Orbitrap-MS and matrix-assisted laser desorption/ionization (MALDI)-TOF could be feasible for rapid detection and routine monitoring of protein characterization. 14, 15 This was the first step toward the characterization of human protein biomarkers in sewage waters. Carrascal et al. 14 demonstrated the presence in urban sewage waters of proteins from diverse origins, including human proteins such as uromodulin, α-amylase, and S100A8, which have been proposed as health markers associated with renal function and stressrelated changes. 28 However, one of the difficulties that is needed to overcome is the high prevalence of bacterial proteins in front of the others. Due to the large number of data recorded, the bioinformatic treatment to recognize the complex pattern of the omics already mentioned for small molecules becomes crucial. Interestingly, Perez-Lopez et al. 15 applied a recent metabolomics methodology named the regions of interest-multivariate curve resolution (ROIMCR) to proteomics. This methodology combines the searching of the regions of interest (ROI) and the multivariate curve resolution-alternating least squares (MCR-ALS) for the resolution of the constituents of the analyzed samples. Figure 3 displays the workflow of the ROIMCR methodology. Both papers 14,15 provide additional information on the samples related to human peptide biomarkers to a variety of characteristic peptides of biological species such as rat, mouse, cattle, chicken, as well as to many bacteria. To the best of our knowledge, these are the first nontarget proteomic studies with the identification of a variety of proteins at the different stages of the WWTP. It is well-known that WWTPs contain a comprehensive list of large molecules, e.g., proteins that are biomarkers of exposure. Among proteins of interest, it has been recently pointed out that the detection of viral proteins will be a valuable tool to follow pandemic outbreaks, e.g., COVID-19, but at present, the MS detection limits are not sensitive enough to compete with PCR tools. 4.1. SARS-CoV-2 Protein Structure. SARS-CoV-2 proteins can be categorized into two major groups: structural and nonstructural proteins. Although nonstructural proteins (e.g., enzymes and transcription factors needed for replication) are encoded by the virus, they can only be found in the infected host cells. These proteins are neither incorporated into the virion nor highly expressed and, thus, are less likely to be detected. The four structural proteins present in the virion particle are known as spike (S), envelope (E), membrane (M), and nucleocapsid (N). The N protein encapsulates and protects the RNA genome, and the S, E, and M constitute the viral envelope. 29 Previous data on SARS-CoV, with a structure similar to that of SARS-CoV-2, indicate that the number of copies of E, M, and N proteins is much higher than that of S, but E and M are relatively short proteins, tightly bound to the membrane, and therefore difficult to extract and detect. 29 These proteins could be found in wastewater. In this section of MS in the area of COVID-19, we show the possibility of using WBE MS for SARS-CoV-2 determination. In this respect, it is worth indicating that, although MS has been already used to determine viral proteins and peptides such as SARS-CoV-2 and other virus in clinical studies, its use in WBE is still pending. There All times and figures are approximate only and depend on specific protocols and equipment employed. b According to real-time RT-PCR detection of SARS-CoV-2 protocol, Institute Pasteur, Paris (https://www.who.int/docs/default-source/coronaviruse/real-time-rt-pcr-assays-for-thedetection-of-sars-cov-2-institut-pasteur-paris.pdf). c Improved using immobilized enzyme digestion to 1−2 h. d Improved by 1−2 orders of magnitude with selected ion monitoring. complexity of the wastewater samples to be analyzed are the main constraints to its implementation. An excellent review paper recently published 30 explored many and varied MS responses to SARS-CoV-2. RT-PCR is the most prevalent molecular-based surveillance approach. However, it has several drawbacks such as the high number of false positives that can be palliated by the complementary information provided by MS. Table 2 compares RNA detection by RT-PCR versus DNA amplicon detection by MS, viral peptide detection by MALDI-TOF, and viral peptide sequencing by LC−ESI−MS/MS. The detection threshold for RNA by RT-PCR requires no more than 10 copies, 100 copies for DNA-MS, and 100000 copies for MALDI-MS and LC-ESI-MS/MS. This illustrates well the different sensitivities of different techniques and the need to apply PCR to WBE of viruses. In this field, the improvements already envisaged in MS, such as single cell MS, ion mobility MS, imaging MS, or the combination of MS with nano-or micro-LC, as well as the constantly increasing sensitivity of the new instruments, may soon transform utopia into reality. In this sense, Gouveai et al. 31 already reported a proof-ofconcept study by testing the potential of LC−MS/MS-based methods for identifying SARS-CoV-2 peptides in nasopharyngeal swabs. With a 20 min MS acquisition window, several virus-specific peptides that allowed proteotyping the virus were identified and quantified in simulated and clinical swabs from COVID-19 patients. The peptides ADETQALPQR (and its variant forms) and GFYAQGSR from the nucleocapsid protein were detected. These peptides are of utmost interest to develop quick and robust targeted assays for identifying the virus. To confirm this, the heatmap in Figure 4 displays the MS1 peak areas, the number of peptide-to-spectrum matches (PSMs) attributed to each peptide in each sample, the number of viral peptides identified in each sample, and the result from the PCR testing performed on the same sample. The authors recognized that it in a second step a greater number of clinical specimens must be tested to validate the usefulness and limits of detection of these peptides. Another point that is relevant for the method's validation is to calculate the percentage of confirmation rate of positive PCR results with these identified peptides that can be used as tracers for SARS-CoV-2 unequivocal identification. Would it be possible to use a similar approach soon for WBE? It would be excellent! In other study, Sanda et al. 32 reported the analysis of the sitespecific glycoforms with focus on the resolution of structural motifs of the identified O-and N-glycopeptides by LC−MS/MS with higher energy collisional dissociation fragmentation and modulated normalized collision. This study on the SARS-CoV S protein identified nine occupied O-glycopeptides and 17 Nglycopeptides in the S proteins produced in human embryonic kidney (HEK) 293 cells. Although LC−ESI−MS/MS is the method of choice for peptide quantitation. MALDI-TOF MS-based peptidome analysis can be applied as a fast, high-throughput, low-cost, and easy-to-use technique. This approach has been successfully used in clinical applications, including microorganism identification. Yan et al. 33 collected serum samples of 146 COVID-19 patients with mild, typical, severe, and critical symptoms and 152 control individuals, including non-COVID-19 patients with fever/cough symptoms, tuberculosis patients, and healthy controls, and analyzed them by MALDI-TOF-MS. The results outlined 25 MS peaks showing statistically significant differences between COVID-19 patients and control individuals. Using various machine-learning methods, a model was constructed with the 25 peaks showing high accuracy (99%) for the identification of COVID-19 patients with a sensitivity of 98% and specificity of 100% on a test cohort (100 samples) independent from the samples used for model generation and feature selection. This accurate and rapid method provides a powerful tool for high-throughput screening and surveillance of COVID-19. These few examples of clinical applications highlight the interest in developing MS techniques. This pioneering work can serve as a basis for new developments to apply MS to WBE and to obtain the additional information that MS can offer as a complement to PCR data. Currently, WBE is involved in going one step further by increasing biomarker fingerprinting of the characteristics and health of populations. The identification of small molecules used as biomarkers is linked to the search of HBM studies and, on the other hand, to the advances in MS. In the latter, the introduction of HRMS and the improvement of raw data processing, identification systems, and applied bioinformatics has opened an important door to the discovery of new useful biomarkers directly in wastewater. However, this aspect of WBE is not yet resolved, and the lack of human-specific metabolites for some compounds and additional sources of transformation products needs to be addressed. Solving these problems can be somewhat time-consuming due to the complexity of wastewater. This leads us to ask a question that can be applied to the rest of the conclusions: Shall we need to increase/cleanup/sample preparation of the WWTP samples? There are many pros and cons to any of the answers that can be given and should be carefully evaluated on a case-by-case basis. Recently, there is an emerging interest in complementing the metabolite profile of biomarkers determined in wastewater with proteomics, considering that proteins can be a good indicator of prevalent diseases in the population. Although the studies are still scarce, WBE of large molecules using MS has a lot of possibilities. In this field, the use of HRMS with nontarget approaches and bioinformatics data processing are sublimated. Problems in the application of proteomics to wastewater remain, such as the prevalence of the bacterial proteome versus human proteome, which makes the identification of the latter difficult. Nevertheless, despite the limited number of studies that exist, several proteins of human origin have been identified that may be related to renal function and stress. This is the first step. It is to be hoped that soon these studies will multiply and will be able to gradually eliminate their difficulties. It can be safely predicted that the role of intelligent bioinformatics systems will be very important. Common proteins and peptides related to humans will be soon identified, and hopefully soon proteins and peptides from SARS-CoV-2 and other viruses will also play an important role in the correct identification. Virus identification is the only field of WBE where MS is not prevalent. However, although aspects such as sensitivity still need to be resolved, in clinical analysis, the identification of these proteins is a reality of great help in diagnosis. There are several questions raised from the previous discussion: Should we be able to find the specific peptides ADET-QALPQR (and its variant forms) and GFYAQGSR from the nucleocapsid protein responsible of SARS-CoV-2 infection in WBE? This would always help to improve identification and why not? The future is always going a step forward, and this would be of help. Do we need a more sensitive LC−MS/MS for SARS peptide specific detection in WWTP? Clearly yes, but the advances in MS do not stop. There are new emerging techniques in MS instrumentation, and sample handling strategies are quickly making comprehensive omics analyses of single cells feasible; ion mobility mass spectrometry and improved workflows in nontarget analysis are enhancing sensitivity to unexpected extremes. Although MS advances augur well for the future with great expectations, one thing is crystal clear: LC−MS will be complementary to PCR infection, and the confirmation results obtained in WBE will need to be compared with those of PCR. Performance-and imageenhancing drug use in the community: use prevalence, user demographics and the potential role of wastewater-based epidemiology Wastewater-based epidemiology: current status and future prospects Monitoring wastewater for assessing community health: Sewage Chemical-Information Mining (SCIM) Wastewater surveillance for population-wide Covid-19: The present and future Wastewater-based epidemiology, a tool to bridge biomarkers of exposure, contaminants, and human health Monitoring Genetic Population Biomarkers for Wastewater-Based Epidemiology Future perspectives of wastewater-based epidemiology: Monitoring infectious disease spread and resistance to the community level Coronavirus in water media: Analysis, fate, disinfection and epidemiological applications Antidrug resistance in the Indian ambient waters of Ahmedabad during the COVID-19 pandemic Wastewater-Based Epidemiology to monitor COVID-19 outbreak: Present and future diagnostic methods to be in your radar Technical framework for wastewater-based epidemiology of SARS-CoV-2 Analytical methodologies for the detection of SARS-CoV-2 in wastewater: Protocols and future perspectives A new paradigm in public health assessment: Water fingerprinting for protein markers of public health using mass spectrometry Non-target protein analysis of samples from wastewater treatment plants using the regions of interest-multivariate curve resolution (ROIMCR) chemometrics method Measuring biomarkers in wastewater as a new source of epidemiological information: Current state and future perspectives Wastewater-based epidemiology biomarkers: Past, present and future Wastewater-based epidemiology approach to assess population exposure to pesticides: a review of a pesticide pharmacokinetic dataset Wastewater-based epidemiology to assess human exposure to personal care and household products−a review of biomarkers, analytical methods, and applications. Trends Environ Wastewater-Based Epidemiology as a Novel Biomonitoring Tool to Evaluate Human Exposure to Pollutants Liquid chromatography−mass spectrometry as a tool for wastewater-based epidemiology: Assessing new psychoactive substances and other human biomarkers Wastewater-based epidemiology as a novel tool to evaluate human exposure to pesticides: Triazines and organophosphates as case studies Analytical method for the simultaneous determination of a broad range of opioids in influent wastewater: Optimization, validation and applicability to monitor consumption patterns Investigation of biotransformation products of p-methoxymethylamphetamine and dihydromephedrone in wastewater by high-resolution mass spectrometry Analysis of psychoactive substances in water by information dependent acquisition on a hybrid quadrupole time-of-flight mass spectrometer Investigating in-sewer transformation products formed from synthetic cathinones and phenethylamines using liquid chromatography coupled to quadrupole time-of-flight mass spectrometry Wide-scope target and suspect screening methodologies to investigate the occurrence of new psychoactive substances in influent wastewater from Athens Mass-Spectrometric Detection of SARS-CoV-2 Virus in Scrapings of the Epithelium of the Nasopharynx of Infected Patients via Nucleocapsid N Mass spectrometry analytical responses to the SARS-CoV2 coronavirus in review Proteotyping SARS-CoV-2 Virus from Nasopharyngeal Swabs: A Proof-of-Concept Focused on a 3 min Mass Spectrometry Window N-and O-Glycosylation of the SARS-CoV-2 Spike Protein Rapid Detection of COVID-19 Using MALDI-TOF-Based Serum Peptidome Profiling Complete contact information is available at: https://pubs.acs.org/10.1021/acsomega.1c04362 The authors declare no competing financial interest. Yolanda PicóYolanda Picóis Full Professor of Nutrition and Food Chemistry and Researcher at the Desertification Research Centre (CIDE) at the University of Valencia. She is the coordinator of the Food Safety and Environmental Research Group (SAMA-UV). Her scientific trajectory accumulates more than 30 years of experience in the fields of Environmental and Food Chemistry, developing analytical methods to contribute significantly to the improvement of the environment and determining the impact of climate change and human development in the presence of traditional and emerging contaminants. Along her career, she has published more than 300 articles included in SCI and more than 20 book chapters and is editor of five international books. ■ REFERENCES