key: cord-0920323-p819hlzc authors: Lu, Tian; Wang, Yingrui; Guo, Tiannan title: Multi-omics in COVID-19: Seeing the unseen but overlooked in the clinic date: 2022-03-15 journal: Cell Rep Med DOI: 10.1016/j.xcrm.2022.100580 sha: b5c8cf92c6bc4f9ff21fe9d3d3509292d3bc3e7d doc_id: 920323 cord_uid: p819hlzc COVID-19 is an ongoing pandemic of global concern and is unlikely to disappear. This commentary discusses how multi-omics technologies have helped uncover the molecular processes and dynamics underlying COVID-19 initiation, progression, and transmission, and how lack of standardization has limited their application in clinical settings. As of February 12, 2022, more than 200 countries and territories have reported over 405 million confirmed cases of COVID-19, and 5.8 million recorded deaths. While vaccinations have not stopped the spread of SARS-CoV-2, they have reduced the risk of serious disease and death in adults and children. Investigations deploying multi-omics technologies reveal the underlying molecular structure of the pathogen and molecular host responses to the virus and vaccines (Figure 1 and 2). Our understanding of SARS-CoV-2, as well as host responses to the virus, have been greatly deepened by multi-omics technologies including, but not limited to, next-generation sequencing (NGS) and proteomic and metabolomic approaches ( Figure 2) . Indeed, the virus causing COVID-19 was first identified by metagenomic RNA sequencing. Based on the RNA sequence of the virus genome, PCR-based assays were developed and rapidly applied in the clinic to replace initial temperature tests to diagnose COVID-19. Multiple mutations in the SARS-CoV-2 sequence lead to the change of pathogenicity, infectivity, transmissibility, and/or antigenicity. 1 E484K, for example, was identified as an escape mutation that could reduce antibody-mediated neutralization. 1 The discovery of the Delta and Omicron variants relied on the prompt application of NGS techniques, which also provided crucial information to track the spread of these variants across different geographic areas. A study from Zimbabwe of the genomic epidemiology of the SARS-CoV-2 variants found that 60% of cases were imported, indicating that human movement is a key factor in its transmission, further supporting the importance of quarantine and restriction of human movements. 2 Rational development of COVID-19 vaccines also relies on mass-spectrometry (MS)-based characterization of the spike (S) glycoprotein of the virus, whose conformational dynamics is key for vaccine design. The viral S protein interacts with ACE2 on host cells, and antibodies that interfere with this by targeting the S protein can potentially neutralize the virus. Watanabe et al. expressed recombinant spike trimers and determined the glycan compositions for 22 N-linked glycan sites by MS (e.g., principally oligomannose-type on N234 and N709), which not only deepens our understanding of the S protein as a vaccine target, it also provides a benchmark to assess the quality of immunogens in the development of vaccines and therapeutic antibodies. 3 Host responses to the pathogen Despite the progress we have made in understanding this virus, we still struggle to predict which patients will develop clinically severe COVID-19 illness. Should there be a precise and practical means to stratify patients according to disease severity and to identify the majority of patients who will survive SARS-CoV-2 infection, many concerns of the global pandemic could be relieved. Further, are there differences in immune responses to the emerging variants of concern? How do different vaccines and multiple vaccine doses impact anti-SARS-CoV-2 immunity? Addressing these and many other crucial questions requires a comprehensive understanding of host responses to this pathogen and to vaccines. Most multi-omics studies are focused on elucidating host responses to the pathogen in various organs and clinical specimens that are otherwise hidden using conventional approaches. COVID-19 host response studies were initially limited to the enumeration of clinical symptoms such as fever and cough and further extended to chest CT and circulating protein biomarkers such as CRP and SAA1, which had been used for other infectious diseases and empirically borrowed to monitor clinical progression of COVID-19. Multi-omics studies have greatly expanded our views on circulating molecular changes, but these are practically undetectable with conventional analytical methodologies ( Figure 2 ). For instance, proteomic and metabolomic characterization of COVID-19 sera were effectively applied to measure about 2000 circulating molecules and identified 93 proteins and 204 metabolites that were significantly and specifically dysregulated in severe cases. 4 This systematic investigation highlighted the critical roles of platelet degranulation, macrophage function, and the complement system. Most of the key molecular changes have been confirmed in other multi-omics studies of COVID-19 from multiple countries, further validating the technical reliability of the proteomic and metabolomic techniques and the potential value of these findings in clinical decision-making. Remarkably, proteins and metabolites can be readily measured by mass spectrometry also in urine, a non-invasive clinical biospecimen. Close to 4000 proteins were detected in urine, while only about 1500 proteins were detected in serum using the same MS method due to the presence of high-abundance proteins in serum. Notably, 80% of the thus measured serum proteins were detectable in urine, while only 31% of urinary proteins were detected in serum. 5 But why do we need to measure these molecules? How can these findings contribute to the management of COVID-19? Multiple omics studies have shown that in properly designed omics experiments, machine learning can be used to identify biomarkers that can be employed to classify and monitor disease progression; for example, a random forest model based on 22 proteins and 7 metabolites expressed in sera could reliably indicate the severity of COVID-19, and to a certain extent, predict the disease prognosis. 4 Urinary protein-derived models performed as well as those from sera and could be used to monitor COVID-19 disease progression. 5 In addition to circulating biomarkers of host responses, local responses in multiple solid organs are also crucial. Histopathological examination of (sectioned) tissues provides the most relevant information at macroscopic and microscopic resolution and is widely accepted as the ''gold standard'' for disease diagnosis. However, histopathology of COVID-19 tissue specimens are not distinctive but rather highly similar to those described in SARS-CoV and Middle East respiratory syndrome coronavirus (MERS-CoV) patients. 6 MS-based proteomics now allows characterization of over 10,000 proteins across multi-organ autopsy samples and the identification of multiple dysregulated proteins involved in coagulation, angiogenesis, fibrosis, and fatty acid metabolism. 7 Without in-depth proteomics techniques, it will take much more effort and time, following the hypothesis-driven research paradigm, to uncover the potential link between upregulation of Cathepsin L in the lung and COVID-19-related mortality or the association between decreased INSL3 protein and impaired Leydig cells in the testis. 7 Proteomics allows us to ''see'' molecular changes underlying the morphological observations. Tissue specimens, in particular lung tissues, are potentially infectious and must be inactivated and sterilized, usually by formalin fix-ation, before analytical analysis, which makes transcriptomic analysis technically challenging due to the instability of mRNAs. In contrast, proteins could be effectively retrieved from formalin-fixed tissues for MS-analysis. 8 Omics technologies also enable the identification of target cells of the virus, which is hardly achievable with conventional techniques. Single-cell RNA sequencing (scRNA-seq) analysis of nasopharyngeal swabs showed ciliated cell losses with secretory, deuterosomal expansion, and increase of macrophages during COVID-19. 9 No alternative technology could directly provide such insights in clinical specimens. scRNA-seq also deconvolutes mixed cells. For example, lymphopenia, the substantial decrease of lymphocytes circulating in the blood, is a common symptom in patients with COVID-19. Various lymphocytes play diverse roles in anti-virus immunity. scRNA seq analysis of the lymphocytes uncovered three patterns of lymphocyte responses associated with different clinical outcomes. 10 Host responses to vaccines Multi-omics studies could also provide potentially invaluable guidance for the use of vaccines, but unfortunately, such data are still inadequate. Over 10 billion vaccine doses have been administered globally, including mRNA vaccines, inactivated viral particles, and adenoviralbased vaccines, among others. While all have provided some protection against SARS-CoV-2, the mRNA vaccines elicit a higher titer of neutralizing antibodies than the others. 11 However, this protection is mediated not only by the neutralizing antibodies, but also other immune effector mechanisms including T cells and innate immune cells. For example, mRNA vaccines showed 80% efficacy against symptomatic infection even after only one dose in the absence of detectable neutralizing antibodies. 11 Moreover, up to 6% of recipients are seronegative after the second injection. 11 According to the US Centers for Disease Control and Prevention (www.cdc.gov), as of February 12, 2022, more than 0.004% of recipients experienced severe side effects, including anaphylaxis, thrombosis with thrombocytopenia syndrome, The elephant refers to a patient with COVID-19, while the magnifiers are conventional technologies for examining limited number of molecules. Multi-omics technologies, as indicated as the satellite, comprehensively profile molecular modulations, most of which could not be seen by the magnifiers. Guillain-Barré Syndrome, and even death. Circulating neutralizing antibodies decline over time. Breakthrough infections occur in vaccinated individuals, especially from emerging and immune escape variants such as Omicron. 12 Customized vaccination against specific variants has not been realized to date, although its importance in preventing COVID-19 is undeniable. A controversial but pragmatic policy to address the uncertainties in protective immunity is to make booster vaccination mandatory. All these are limitations of the current vaccination strategies that are being implemented for billions of individuals in many countries. Timely monitoring of serological host responses is theoretically informative for epidemiological tracking in a population and could potentially offer useful information to guide vaccine dosage and dose spacing. However, even circulating neutralizing antibody tests have not been widely adopted to guide clinical decision making, probably because of the limited practicality of mass blood sampling and the absence of universal standards. Monitoring urinary proteins might be a more practical means to evaluate immunity against the virus; however, no published data are available yet. Why are proteomics and metabolomics largely overlooked in the clinic? While close to 10,000 COVID-19-related omics papers have been published (PubMed search as of February 12, 2022), no findings from omics studies except for PCR and genomic analysis of the virus have been successfully translated for clinical management of COVID-19. One possible reason is that multi-omics technologies, mainly proteomics and metabolomics, are not mature, reproducible, and sufficiently robust to provide clinically valid results. However, recent proteomics data collated from many laboratories globally show dysregulated molecules to be surprisingly consistent. In contrast, false negatives are notoriously frequent in PCR tests of SARS-CoV-2. A recent study of 95,919 patients reported a false-negative rate (FNR) and sensitivity of 9.3% (95% CI 1.5%-17.0%) and 90.7% (95% CI 82.6%-98.9%), respectively. 13 Some may argue that proteins are too vulnerable to be robust biomarkers in real-world applications. However, data have shown that proteins as measured by mass spectrometry are more stable than transcripts measured by NGS. 14 Is it because the measurement of proteins and metabolites is too technically and analytically challenging, and expensive? Indeed, the acquisition of proteomics and metabolomics data involves multiple steps and requires special expertise and expensive instruments, such as high-resolution mass spectrometers for proteomics. However, these requirements should not be insurmountable given that much more sophisticated and expensive techniques such as positron emission tomography (PET) are already widely used clinically. Moreover, the cost of measuring a protein by mass spectrometry has dropped from about $3 in 2006 to less than $0.1 in 2020. 15 The major hurdle delaying clinical applications of proteomics and metabolomics is probably the lack of standardization. Few research laboratories in the rapidly developing field of proteomics use consensus standard operating protocols (SOPs), while the situation in metabolomics is even more confusing because even raw data files generated in different laboratories are rarely shared due to commercial considerations. Although commercialization of metabolomics preceded that of proteomics, most metabolomic platforms developed closed proprietary resources (e.g., compound libraries for molecular Commentary measurement) and techniques and do not share their raw data even for scientific communications. It is unlikely that this daunting situation can be easily reversed to achieve the necessary transparency and standardization. In contrast, the proteomics community was founded on an ethos of scientific transparency and open sharing. We are optimistic that several international consensus SOPs for clinical proteomics will be developed in the near future through concerted efforts of proteomics scientists, clinical practitioners, and other stakeholders. The ability to interrogate biomedical specimens on diverse omic platforms poses new challenges of integrating multi-omics data. Genomic data are mostly qualitative, while the expression data of transcripts, proteins, and metabolites are quantitative. Therefore, it is inherently not straightforward to integrate them by algorithms. Furthermore, unlike the genome which is basically identical among different tissue specimens of an individual, proteomes and metabolomes are tissue-specific and context-dependent, reflecting the actual pathophysiological state and complexity of biology and diseases, thus posing great challenges to identify the critical molecular changes over space and time. Besides these technical limitations, collection of potentially infectious samples during the pandemic poses additional challenges. The most common samples used in COVID-19 studies are blood or bronchoalveolar lavage fluid samples, while data from non-invasive samples like urine and feces have gradually proved to be useful in reflecting host responses. However, there remains a lack of standard operating procedures (SOPs) for hospitals to safely collect and competently pretreat these samples for multi-omics research, which can be a source of bias within a cohort and across different cohorts in separate studies. Biosafety practices restrict access to COVID-19 samples to minimize the risk of accidental virus transmission but also limit comprehensive investigation of these precious specimens to understand and control COVID-19. Had proper and technically feasible SOPs been established for processing these potentially infectious specimens, our understanding of COVID-19 would have progressed much faster. COVID-19 is unlikely to disappear, nor will it be the cause of the last pandemic. Therefore, there is no place for complacency. The global community must act now to be better prepared for the future. SARS-CoV-2 variants, spike mutations and immune escape Genomic epidemiology and the role of international and regional travel in the SARS-CoV-2 epidemic in Zimbabwe: a retrospective study of routinely collected surveillance data Site-specific glycan analysis of the SARS-CoV-2 spike Proteomic and Metabolomic Characterization of COVID-19 Proteomic and metabolomic profiling of urine uncovers immune responses in patients with COVID-19 Histopathological observations in COVID-19: a systematic review Multi-organ proteomic landscape of COVID-19 autopsies High-throughput proteomic analysis of FFPE tissue samples facilitates tumor stratification Impaired local intrinsic immunity to SARS-CoV-2 infection in severe COVID-19 Deep immune profiling of COVID-19 patients reveals distinct immunotypes with therapeutic implications Immunological mechanisms of vaccine-induced protection against COVID-19 in humans Breakthrough infections with SARS-CoV-2 omicron despite mRNA vaccine booster dose False negative rate of COVID-19 PCR testing: a discordant testing analysis Comparative analysis of mRNA and protein degradation in prostate tissues indicates high stability of proteins High-throughput proteomics and AI for cancer biomarker discovery T.G. is a shareholder of Westlake Omics, Inc., and a member of the advisory board of the journal. The remaining authors declare no competing interests.