key: cord-355872-z6vsjmxn authors: Colón-López, Daisy D.; Stefan, Christopher P.; Koehler, Jeffrey W. title: Emerging viral infections date: 2019-08-15 journal: Genomic and Precision Medicine DOI: 10.1016/b978-0-12-801496-7.00010-1 sha: doc_id: 355872 cord_uid: z6vsjmxn The emergence of viral infections is driven by multiple factors including changes in human behavior, population growth, reservoir host distribution, viral diversity and environmental changes. Effective surveillance methods, diagnostic assays and containment measures are pivotal to preventing widespread outbreak of a new viral infection. However, the limited understanding of some emerging viruses poses numerous challenges for effective intervention. In this chapter we discuss various genomics-based methods and strategies to overcome these inherent challenges of emerging and re-emerging viral infections with a focus on current viral threats. We also provide an outlook on the use of genomic tools in personalized medicine and potential solutions to current and foreseeable challenges. the Mastomys natalensis rat, are increased in West Africa during the dry seasons when the rats are more likely to be inside individuals' houses [21] . Fluctuations in climate resulting in increasing crop yields, and the associated rat population, followed by a severe drought can lead to increased cases of Lassa fever. Furthermore, climactic changes are altering vector-borne infectious disease risks due to expanded vector ranges. For example, there is an increased risk of Lyme disease in Canada due to tick range expansion [22] , and climate change is expanding the mosquito range, and the associated infectious disease risk, of Aedes albopictus in the United States [23] . RNA viruses represent a significant source of viral outbreaks due partly to the higher mutation rate and error prone nature of RNA replication itself [24] . Newly introduced mutations into a viral genome can confer new pathogenic characteristics such as enabling transmission to new species or increased virulence. For instance, mutations within the severe acute respiratory syndrome coronavirus (SARS-CoV) genome allowed transmission from bats to humans [24, 25] . The better understanding about the viral agents responsible of causing outbreaks in humans and the underlying factors driving emergence, the better the chances of preparedness and effective intervention for prevention of a new global epidemic ( Figs. 1 and 2) . This chapter provides an overview of the genomics and precision medicine methodologies relevant to the topic of emerging and re-emerging viral threats. The chapter is divided up into two main sections: viral genomics and host genomics, and include examples of their applications in pathogen detection, population susceptibility, diagnostics, and outbreak response. The seminal investigations on the transmission of yellow fever virus by Walter Reed and colleagues in 1901 [27, 28] ushered in an era of discovery and classification of a wide variety of viral pathogens including smallpox virus, poliovirus, and measles virus. Technological advancements including viral cell culture, plaque assays, enzyme-linked immunosorbent assays, and monoclonal antibodies pushed human virology forward. A second wave of viral discovery and characterization is currently ongoing, harvesting the power of next-generation sequencing to identify unknown viruses in acute febrile illness patients [29] [30] [31] . Increasingly, efforts are underway to identify the next emerging pathogen by sequencing and identifying the viruses circulating at the human/animal/pathogen interface [32, 33] . While detection of viruses in wildlife does not predict transmission, identifying mutations or gene acquisitions through surveillance and continued discovery will provide valuable insight in the future. A major hindrance for precision medicine toward emerging zoonotic pathogens is the limited global characterization of potential threats. Taxonomic placement of viruses is problematic due to the lack of universal genes, such as the bacterial 16S RNA gene, limiting classification to viruses with defined biological characteristics [34] . In fact a mere 5000 total species are currently classified by the International Committee on Taxonomy of Viruses (ICTV) which pales in comparison to the estimated 320,000 viral species infecting mammals alone [35] . Metagenomic sequencing has highlighted this significant gap in classified vs unknown viromes, and while biological characterization will not also always be defined, knowledge can be gained from genomic sequences including evolutionary relationships and genome structure [34] . Continuing to gather viral genomic information from environmental, plant, and animal samples will continue to fill gaps in our knowledge base and further prepare us for detecting and predicting emerging pathogens. Metagenomic sequencing has been used for the discovery of previously unclassified human pathogens. In 2008 Lujo virus was discovered in South Africa Sequence diversity of Lassa virus complicates molecular diagnostic tool development. A phylogenetic tree (A) was generated using and alignment of full length and near full length Lassa virus S segment nucleic acid sequences available in GenBank. Diversity of Lassa virus is linked to geographic location as shown here for viruses sequenced from Nigeria, Sierra Leone, Guinea, Mali, and Togo. Selected sequences from each country [starred in (A)] were aligned in (B). The forward and reverse primers (red arrows) and the probe (green arrow) from a previously published Lassa virus Josiah real-time RT-PCR assay [26] are indicated. The primers and probe are exact matches to the Lassa virus Josiah sequence; however, contain mismatches within other Lassa virus isolates, including ones found in Sierra Leone. [31] . Five patients with undiagnosed hemorrhagic fever were discovered after air transfer of a critically ill index case. The disease resulted in a case fatality rate of 80%. Unbiased pyrosequencing of RNA extracts from human samples identified approximately 50% of an arenavirus genome which was thereafter gap filled using primers designed off the sequenced genome. Phylogenetic analysis confirmed the identification of a novel Old World arenavirus [31] . Similarly, deep sequencing identified Bas-Congo virus, of a novel rhabdovirus, in three human cases in the Democratic Republic of Congo [30] . This virus was associated with high fever and rapid death and was found to have only 34% amino-acid identity with other rhabdoviruses. The discovery and characterization of these viruses has allowed novel diagnostics to be developed, including real-time PCR, leading to a better preparedness for future outbreaks. Sequencing human samples from outbreaks has resulted in the discovery of several viruses; however, 70% of emerging human pathogens result from wildlife [36] . Analysis of samples from non-human hosts including arthropods, bats, rodents, and domestic animals offers potential insight into the transmission, evolution, and treatment of these pathogens [37] . Deep sequencing on more than 220 invertebrate species resulted in the discovery of 1445 phylogenetically distinct viromes filling in phylogenetic and evolutionary gaps [38] . Characterization of bacterial and viral relationships in mosquito arthropods demonstrated a symbiotic relationship between the bacterium and host, limiting dengue virus infection and potentially revealing new antiviral strategies [39, 40] . The discovery of Middle-East Respiratory Syndrome-Coronavirus (MERS-CoV) in camels demonstrated these animals as a potential reservoir and host for transmission [41, 42] . While detection of viruses in wildlife does not predict transmission, identifying mutations or gene acquisitions through surveillance and continued discovery will provide valuable insight in the future. In addition to pathogen discovery, metagenomic sequencing is increasingly used to monitor viral genomic changes as an outbreak is occurring [43] [44] [45] [46] . The Ebola virus outbreak in West Africa resulted in 26,648 cases and 11,017 documented deaths, and genomic sequencing was applied in near real-time to provide information to aid in containing the outbreak [44, 45] . Sequencing results early in the outbreak provided valuable insight into origination and transmission routes demonstrating the outbreak started from a single introduction in Guinea in December 2013 and was sustained by human to human transmissions [2] . Sequencing provided molecular evidence that Ebola virus was transmissible by sexual intercourse leading to changes in CDC recommendations for survivors [45, 47] and establishing programs to support national testing of semen and other body fluids in male survivors [48] . Rapid outbreak sequencing allowed for the identification of transmission chains in sporadic clusters following the outbreak [46] , further adding insight to end the epidemic. Sequencing data from Sierra Leone early in the outbreak also found increased Ebola virus diversity that could have an impact on sequence-based therapeutics, vaccines, and diagnostic assays being fielded at the time [44] . Having such rapid sequencing information during an outbreak would inform responders about the potential efficacy of diagnostics and sequence-based countermeasures, either reassuring decision makers or informing them of the need to modify strategies based on the findings. While sequencing has found a niche in the discovery and epidemiologic tracing of emerging infections, its roll for diagnostics is still in its infancy. Emerging pathogens often occur in austere environments or countries with limited infrastructure not conducive to sequencing technologies. Sequencing still has significant hurdles in decreasing the technical and mechanical requirements for routine clinical use. However, next-generation sequencing offers limitless potential due the necessity for little to no prior knowledge in sample composition. Newer technologies such as nanopore sequencing are looking to minimize these hurdles by creating smaller foot-prints and near real-time sequence analysis with minimal sample preparation [49] ; however, difficult paths to their acceptance beyond laboratory derived tests to full regulatory approval remain. Knowledge of viral genomic sequences in designing rapid point of care molecular diagnostics such as PCR is invaluable. The last decade has seen a significant increase in the use of PCR as a primary diagnostic due to its speed, sensitivity, and cost. However, due to the lack of available genomic sequences and high genetic variation within certain viral species, finding a conserved target can be difficult. The advent of multiplex PCR has allowed more targets to be captured in a single assay, lowering costs and increasing throughput. For example, this technique has been used to subtype influenza A and B viruses [50] . Other diagnostic devices such as the BioFire FilmArray can run multiple PCRs at once on a single device allowing the user to select between different panels [51] [52] [53] [54] [55] . Whether singleplex or multiplex, PCR remains the most rapid and sensitive method for the detection of viral genomes directly from clinical matrices. Coordinating with detection, viral genome sequences are increasingly being used to predict and guide antiviral therapy. During the Ebola virus outbreak, sequence analysis of the viral genome over time demonstrated changes which could make the pathogen resistant to therapeutics such as siRNAs, phosphorodiamidate morpholino oligomers (PMOs), and antibodies [56] . The discovery of the CRISPR/Cas9 system and its ability to destroy dsDNA has brought to light its potential in mutating essential sites or removing proviral DNA from infected cells during and HIV infection CRISPR [57] . Similarly studies involving herpesviruses, large dsDNA viruses, have demonstrated the ability to destroy latent herpesvirus from infected cells [58, 59] . These methods require prior knowledge of the viral genome promising a future where antiviral therapies are targeted specifically toward an individual's own specific infection. Emerging and re-emerging viral diseases pose unique challenges to the use of omics and precision medicine tools. How does one predict when and where a pathogen will jump a species barrier or emerge into a new population that could rapidly spread into new geographic regions? To further complicate the scenario, a new human viral infection may not present with obvious signs of an infection. Such a virus may get introduced and transmitted across populations while remaining asymptomatic or unrecognized for long periods of times, like the hepatitis C virus (HCV) and HIV [10] , for instance. Viruses rely on their small-sized genomes to encode enough information to hijack the host's cellular machinery for target-cell recognition and entry, genome replication, protein synthesis, viral participle assembly, and propagation. Evolutionarily, hosts evolved conserved mechanisms to generate an immune response to detect and limit an infection and to prevent re-infection in the future. The early innate immune response functions to slow the infection once the immune system recognizes there is an infection (extensively reviewed in [60] ) and guides the subsequent adaptive immune response to the pathogen. Successful control of these immunological processes depend on tight control of the host's immunological and inflammatory pathways by regulating gene transcription. These processes result in measurable changes in the relative expression levels of coding and non-coding RNA and protein, even when an infection is asymptomatic [61] [62] [63] [64] [65] or difficult to diagnose [66] . The development of low-cost and less-time consuming genomics had facilitated the study of pathway-specific transcript alterations during viral infections. Consequently, various transcriptomic-based assays have been developed to study, characterize, and identify signs of infection using transcript signatures. For example, Zaas and colleagues identified a unique immunological gene expression profile capable of differentiating viral from bacterial respiratory infections in humans [64, 65] , laying to foundations for appropriate antibiotic use without direct pathogen identification. Traditionally, diagnosing viral infections is made based on clinical symptoms, PCR and serological testing, and virus isolation. While the world of omics is expanding with new and improved platforms, host transcriptome-based assays are becoming more feasible and widely available. Those of more relevance to this chapter are targeted and agnostic next-generation sequencing (NGS) platforms for genome-wide association studies (GWAS) and transcriptomics as well as microarray-based assays for amplification-independent profiling of transcripts. GWAS have direct applications to precision medicine whereas host-based transcriptomic assays have shown to have multiple applications for clinical diagnosis, and pathogen identification and characterization. These technologies were not specifically developed for emerging or re-emerging viruses but can be leveraged for these purposed based on their applications. GWAS may provide insights into the genetic factors driving population susceptibility to viral infections [67, 68] . This information can be utilized to identify populations within geographic regions at higher or lower risk of being infected with a new pathogen. Higher-risk populations may include those with increased direct exposure to animal reservoirs of zoonotic pathogens or their arthropod vectors. Other population subgroups without defined topographical relationships can be determined as well. A few examples of these are demographic, age and, gender groups that can be classified based on genetic polymorphisms and other susceptibility markers. In this setting, GWAS can provide guidance in outbreak preparedness and intervention. The information provided by GWAS-based studies is not focused on the directed response to an infection but on the information already stored within the host's genome. On the other hand, transcriptomic approaches identifies the interactions between the pathogen and the host's genome by evaluating transcript levels, typically mRNAs. The most commonly used transcriptomic methods are RNA-seq [69, 70] and microarrays [65, 66] . RNA-seq is generally suitable for unbiased transcriptomic profiling and provides better understanding of global transcriptomic changes. This agnostic method is appropriate for identifying changes in the human transcriptome as a result of an emerging viral infection to show specific mechanisms of immune response evasion and other effects in the host's biology at the transcriptomic level. Conversely, targeted-approaches depend on previous knowledge of host-transcriptomics. Studies with human cohorts and animal models have developed gene signatures that discriminate between viral and bacterial respiratory infections [61] [62] [63] [64] [65] . Other groups have developed sepsis classifiers to guide treatment options in clinical settings [71, 72] . A goal of developing these disease classifiers is to use them to predict disease outcome prior to the onset of symptoms and to assist in decision making when a symptomatic disease with an unknown etiology is suspected. Effective and accurate discernment between a viral and a bacterial infection can make the difference in administering the appropriate therapeutics and potentially saving lives. When a specific viral species can't be identified, a gene classifier may be useful to determine potential disease outcomes such as lethality. As technology continues to advance, the linkages of genomics and precision medicine with emerging infectious diseases will continue to strengthen. Improvements with sequencing accuracy and speed are pushing pathogen-specific genomics to near real-time [73, 74] , allowing rapid access to critical information within a timeframe that can impact an outbreak response [44, 45] . Confidence in the results from sequencing is leading to clinically actionable diagnostic patient testing [29] , and it is reasonable to foresee rapid, pathogen agnostic diagnostic assays routinely utilized in the clinic. Population growth, expansion into rural geographic regions, and rapid travel has greatly expanded the pathogen/host/environment interactome, putting a greater number of people at risk for formerly exotic infectious diseases such as Ebola virus or even the next unknown, soon-to-emerge pathogen. Applying genomics and the host transcriptome to generalize the response to a viral or bacterial pathogen would allow for more appropriate antibiotic use without requiring direct pathogen detection. Besides the benefits to antibiotic stewardship in an era where antimicrobial resistance is expanding rapidly, such an application can greatly improve clinical care in the event of an emerging, highly virulent unknown unknown organism. Opinions, interpretations, conclusions, and recommendations are those of the author and are not necessarily endorsed by the U.S. Army. Report of an International Commission. Ebola haemorrhagic fever in Zaire Emergence of Zaire Ebola virus disease in Guinea Zika virus: a report on three cases of human infection during an epidemic of jaundice in Nigeria Zika virus, a cause of fever in Central Java, Indonesia Zika virus outbreak on Yap Island, federated states of Micronesia Zika virus, French polynesia, South pacific Zika virus outbreak, Bahia, Brazil Zika virus spreads to new areas-region of the Americas Chimpanzee reservoirs of pandemic and nonpandemic HIV-1 HIV epidemiology. The early spread and epidemic ignition of HIV-1 in human populations Centers for disease C, prevention. Importation and containment of Ebola virus disease-Senegal Ebola virus disease outbreak-Nigeria Laboratory response to 2014 Ebola virus outbreak in Mali Second Ebola patient is treated in UK Molecular characterization of the first Ebola virus isolated in Italy First secondary case of Ebola outside Africa: epidemiological characteristics and contact monitoring Second US nurse with Ebola had traveled by plane Ebola and compliance with infection prevention measures in Nigeria Transmission dynamics and control of Ebola virus disease outbreak in Nigeria Zika virus. I. Isolations and serological specificity Fluctuation of abundance and Lassa virus prevalence in Mastomys natalensis in Guinea, West Africa. Vector Borne Zoonotic Dis Major emerging vector-borne zoonotic diseases of public health importance in Canada Climate change and range expansion of the Asian tiger mosquito (Aedes albopictus) in Northeastern USA: implications for public health practitioners Emerging virus diseases: can we ever expect the unexpected? SARS-CoV and emergent coronaviruses: viral determinants of interspecies transmission Comprehensive panel of real-time TaqMan polymerase chain reaction assays for detection and absolute quantification of filoviruses, arenaviruses, and new world hantaviruses Yellow fever: 100 years of discovery The etiology of yellow fever: an additional note Actionable diagnosis of neuroleptospirosis by next-generation sequencing A novel rhabdovirus associated with acute hemorrhagic fever in central Africa Genetic detection and characterization of Lujo virus, a new hemorrhagic fever-associated arenavirus from southern Africa Author correction: the discovery of Bombali virus adds further support for bats as hosts of ebolaviruses A metagenomic viral discovery approach identifies potential zoonotic and novel mammalian viruses in Neoromicia bats within South Africa Consensus statement: virus taxonomy in the age of metagenomics A strategy to estimate unknown viral diversity in mammals Public health threat of new, reemerging, and neglected zoonoses in the industrialized world Viral metagenomics on animals as a tool for the detection of zoonoses prior to human infection? Redefining the invertebrate RNA virosphere A Wolbachia symbiont in Aedes aegypti limits infection with dengue, Chikungunya, and Plasmodium The endosymbiotic bacterium Wolbachia induces resistance to dengue virus in Aedes aegypti Middle East respiratory syndrome coronavirus in dromedary camels: an outbreak investigation Middle East respiratory syndrome coronavirus quasispecies that include homologues of human isolates revealed through whole-genome analysis and virus cultured from dromedary camels in Saudi Arabia Open-source genomic analysis of Shiga-toxin-producing E. coli O104:H4 Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak Molecular evidence of sexual transmission of Ebola virus Rapid outbreak sequencing of Ebola virus in Sierra Leone identifies transmission chains linked to sporadic cases Possible sexual transmission of Ebola virus-Liberia Implementation of a National semen testing and counseling program for male Ebola survivors-Liberia The sequence of sequencers: the history of sequencing DNA Multiplex PCR for typing and subtyping influenza and respiratory syncytial viruses Evaluation of the BioFire FilmArray respiratory panel and the GenMark eSensor respiratory viral panel on lower respiratory tract specimens Preliminary evaluation of BioFire FilmArray((R)) gastrointestinal panel for the detection of noroviruses and other enteric viruses from wastewater and shellfish Application of BioFire FilmArray blood culture identification panel for rapid identification of the causative agents of ventilator-associated pneumonia Cost justification of the Bio-Fire FilmArray meningitis/encephalitis panel versus standard of care for diagnosing meningitis in a community hospital Clinical evaluation of the BioFire FilmArray((R)) BioThreat-E test for the diagnosis of Ebola virus disease in Guinea Evaluation of the potential impact of Ebola virus genomic drift on the efficacy of sequence-based candidate therapeutics Antiviral goes viral: harnessing CRISPR/Cas9 to combat viruses in humans RNA-guided endonuclease provides a therapeutic strategy to cure latent herpesviridae infection CRISPR/Cas9-mediated genome editing of herpesviruses limits productive and latent infections Antiviral actions of interferons Differential evolution of peripheral cytokine levels in symptomatic and asymptomatic responses to experimental influenza virus challenge A genomic signature of influenza infection shows potential for presymptomatic detection, guiding early therapy, and monitoring clinical responses A host transcriptional signature for presymptomatic detection of infection in humans exposed to influenza H1N1 or H3N2 A host-based RT-PCR gene expression signature to identify acute respiratory viral infection Gene expression signatures diagnose influenza and other symptomatic respiratory viral infections in humans Detection of tuberculosis in HIV-infected and -uninfected African adults using whole blood RNA expression signatures: a case-control study Genome-wide association study identifies two risk loci for tuberculosis in Han Chinese APOBEC3G variants and protection against HIV-1 Infection in Burkina Faso A conserved transcriptional response to intranasal Ebola virus exposure in nonhuman primates prior to onset of fever Comparison of transcriptomic platforms for analysis of whole blood from Ebola-infected Cynomolgus Macaques A community approach to mortality prediction in sepsis via gene expression analysis Robust classification of bacterial and viral infections via integrated host gene expression diagnostics Nanopore sequencing as a rapidly deployable Ebola outbreak tool MinION as part of a biomedical rapidly deployable laboratory