key: cord-0004983-bnf3mvaf authors: Desselberger, U. title: Report on an ICTV-sponsored symposium on Virus Evolution date: 2005-01-13 journal: Arch Virol DOI: 10.1007/s00705-004-0466-9 sha: 38f60372949d05ec90734bcbfaa37e351bc99045 doc_id: 4983 cord_uid: bnf3mvaf nan genes etc. However, for viruses it should be noted that not all genes encoding proteins of similar functions (e.g. polymerases) stem from one single root. Viruses in the Retroviridae seem to be of very ancient origin as retroelements have been found in Eukarya, Bacteria and now also in Archaea; in the latter, there is evidence for at least four different lateral gene transfers [22] . Regarding the order of genes in a genome, patterns have often been maintained, but also often been rearranged. Phylogenetic relationships have been found useful for virus identification, work on origin, speed and mechanisms of evolution, taxonomy, and the elucidation of transmission pathways (e.g. the transmission of HIV from a source to a victim [16] ). A case is being made for the use of rankless taxonomy clades (these being monophyletic groups without grading) instead of hierarchical formal Linnean taxa for classification (see PhyloCode; www.ohiou.edu/phylocode). Such a tree-based, rankless system could be constructed independently of taxonomy. In the discussion, the positions of clades within accepted phylogenies and the role of the quasispecies concept in a phylogenetically based classification system were considered. Alexander E Gorbalenya (Leiden University Medical Center) spoke about "Using evolutionary models to learn about RNA viruses". Starting from the idea that evolutionary models lead to the generation of structure/function studies, which in turn may or may not verify the model, the concept was developed that biopolymer alignments represent evolutionary models. Proof of concept was explored using sequence data for members of the Flaviviridae, Nidovirales and Birnaviridae as examples. Citing data from Lindenbach and Rice [15] and the group of Tautz (e.g. [33] ), it was concluded that hepaciviruses (e.g. Hepatitis C virus) and pestiviruses (e.g. Bovine viral diarrhea virus) have more in common than was originally thought; however, the phylogenetic analysis of hepacivirus and pestivirus genomes is still a challenge to the taxonomy. The Nidovirales were established as an order that comprises the Coronaviridae, Arteriviridae and Roniviridae families. This conclusion was based on the finding that viruses in the Nidovirales share the mechanism of discontinuous transcription [30] and that they have motifs of their replicase enzymes in common [6, 32] . However, the taxonomy of the Coronaviridae is under further review [5] . For instance, it has recently been found that the cysteine proteinases of an invertebrate nidovirus and of members of the Potyviridae share unusual motifs [35] . Viruses in the Birnaviridae (carrying dsRNA genomes) and some (but not other) members of the Tetraviridae (carrying ssRNA genomes and infecting insects) share a unique arrangement of motifs in their replicases [6] . A particular folding model of the replicase of Infectious bursal disease virus, a member of the Birnaviridae, has recently been tested and verified by the group of E Mundt [34] . In the discussion, the relationship between coronaviruses and influenza C viruses (sharing neuraminate-O-acetyl esterase motifs and functions) was noted. Graham Hatfull (University of Pittsburgh, Pennsylvania) spoke about "Mycobacteriophage genomics and the origins of mosaicism". Given that there are an estimated 10 31 bacteriophages on earth (most of them in the sea), they represent an enormous genomic diversity and are also an excellent tool box with which to probe evolutionary theories. Approximately 250 tailed dsDNA bacteriophages have been completely sequenced, and 30 of those represent mycobacteriophages (of a genome size of approximately 2 Mbp). Partial genomic sequences of 14 of these phages (approximately 1 Mbp each) have been subjected to phylogenetic comparison and analyses [7, 8] . Their genes are closely packed and code for replication, integration, assembly and regulation functions. In the genomes, there is pervasive mosaicism, implying that horizontal exchange of genes has been an important component of their evolution. Over 80% of the genes are only seen in mycobacteriophages but there are also some non-phage genes (which probably were picked up from host genomes). In terms of evolution and classification, each phage genome is considered to be a unique assembly of individual modules (a module either being an individual gene or a set of genes). In order Report on ICTV 631 to arrive at its present resting place, each module has a different phylogenetic history. The models for the generation of mosaicism are targeted recombination and random illegitimate recombination, followed by selection ('Recombination reassorts genetic modules'). In order to conceptualize evolutionary relationships, the model of a three-dimensional web-like (or 'sweb') reconstruction of events was proposed. This would allocate unique 'sequence space' to each phage without ranks or preconceptions. In the discussion, the issues of the stability of mosaic genomes, the speed of recombination during evolution, the lack of a species concept, and the integration and reactivation of mosaic genomes were considered. Simon Wain-Hobson (Institut Pasteur, Paris) described and analyzed "The enormous multiplicity of HIV infection in vivo and the end of clonality". In addition to a minimum point mutation rate of 0.25/genome (increasing to 700/genome when nearing 'error catastrophe', see below), each HIV genome has undergone 3 recombination events on average, i.e. recombination creates much more diversity than point mutations [12, 14, 17] . In vitro, a single round of replication of HIV-1 in T lymphocytes generated on average 9 recombination events per virus [14] . HIV recombinants are frequently produced within individuals, and are even more frequently observed at epidemiological levels. SIV recombinants are discovered within 15 days of infection. A prerequisite of recombination is a multiply infected cell (either co-or superinfected); such cells have been found in HIV-infected patients [17] . Proviral sequences are randomly distributed on chromosomes; one chromosome can harbour several of them. There are also recombinant proviruses. There can be 600-700 proviral DNA copies per cell, and the amount of DNA in a cell can be increased threefold. One T cell can produce 500-4000 HIV particles that are spread preferentially by cell-to-cell contact. Within an individual, donor cells (mostly dendritic, i.e. antigen presenting cells) carry sequences different from those found in recipient cells (mostly T cells). Upon multiple infections the recombination rate increases and can reach the level of self-destruction ('error catastrophe', see below). Concomitantly, the ratio [number of virus particles (nvp)/pfu], already high for all members of the Retroviridae, increases further. In these circumstances, an accurate phylogeny cannot be constructed. John Coffin (Tufts University School of Medicine, Boston MA, and National Cancer Institute, Frederick MD) spoke about "Retrovirus evolution and drug resistance". For retroviruses, host-virus co-evolution has been known for some time. The formation of endogenous retroviruses (ERVs) as integrated proviral sequences leads to indefinite vertical transmission in the host. ERVs can be considered and analysed as representing fossil records of previous virus-host interactions [9] . In human germlines, HERV-K sequences are ubiquitous [10] . The history of retrovirus evolution in humans is long. ERVs represent 6-8% of the human genome. Strong parallels can be found in the phylogeny of ERV of primates and that of primates themselves to the extent that the time points of evolutionary events in ERVs and primates can be mutually determined. HERV-K sequences entered human hosts approximately 30 million years ago. Every human individual carries 30-50 different ERVs of which 13 are considered as 'old' and 24 as 'new'. The analysis of long terminal repeats (LTRs) of ERVs has allowed distinct waves of infection to be identified. Mutations have accumulated as the species evolved. However, the 5 and 3 ends of the LTRs have not always co-evolved (about 6/36 human proviruses have 'mismatched' LTRs). Approximately 50% of sequence changes are consistent with evolution by point mutations; other changes are due to multiple recombination events. HERVs are still active and can be reactivated. Using examples of the development of resistance of HIV to the action of the antiviral drug 3 -thiacytidine (3TC), mutations, selection, drift and linkage were recognized as genetic factors affecting the evolution of drug resistance. By using an ultrasensitive detection assay [20] , direct sequencing of HIV RNA from limiting dilutions and the application of mathematical methods, extensive recombination events and evidence for 632 U. Desselberger compensatory mutations were recognized as the main factors in the development of drug resistance. Esteban Domingo (Centro de Investigación en Sanidad Animal and Universidad Autónoma de Madrid) spoke about "Quasispecies dynamics and extinction of RNA viruses". After an introduction in which basic genetic terms were defined (mutation and mutation rate, hypermutation, recombination, reassortment, segmentation etc), the quasispecies concept was presented according to which any sample of an RNA virus represents a 'swarm' of closely related mutants. This composition allows the virus to adapt in a flexible way to changing environmental conditions. Parameters of adaptability are: the number of mutations per genome (1-100), the population size (up to 10 12 infectious particles/host organism), the genomic length (9.5 kb for HIV, 3-30 kb for other RNA viruses, i.e. relatively small for all RNA viruses), and the number of mutations needed to produce a phenotypic change (can be very small). Mutant spectra matter for the quasispecies of many RNA viruses (vesicular stomatitis virus, picornaviruses [poliovirus, foot-and-mouth disease virus (FMDV)], lymphocytic choriomeningitis virus (LCMV), bunyaviruses etc). Hypermutated (pre-extinction) RNA often interferes with the infectivity of clonal RNAs. Assignment of a quasispecies to a phenotype is indeterminate. Quasispecies have both deterministic and stochastic features [21, 27] . Under bottleneck conditions (e.g. plaque-to-plaque passage in cell culture), the quasispecies spectrum will become narrower, and the fitness of the quasispecies to survive will decrease, due to the operation of Muller's ratchet [3] . The fidelity of the transcriptase/replicase will go in parallel with the viability of a quasispecies distribution; with decreasing fidelity of these enzymes, the viral sequences will transgress via an error threshold to become random sequences. For FMDV, a constant rate of 0.25 mutations/genome/plaque transfer has been found during plaque-to-plaque passage. In the presence of a mutagen, viral extinction was frequently observed in vitro [2] . Ribavirin, a licensed antiviral drug, was shown to be a mutagen as well. Chronic infection of mice with LCMV was prevented (cured) by treatment of the animals with fluorouracil, a mutagen [28] . In the discussion, the influence of the ratio [nvp/pfu] on viability was considered. Marilyn Roossinck (Samuel Roberts Noble Foundation, Ardmore OK) asked "What determines the quasispecies population size? Lessons from plant viruses". Using examples from the Tobamovirus genus and the Bromoviridae family, it was shown that the mutation frequency depended on virus host interactions [25, 26] . For Brome mosaic virus, the control of diversity was located in RNA segment 2, encoding the polymerase protein, and RNA segment 3, encoding the cell-to-cell movement and coat proteins. Bottleneck conditions limited diversity: of the 15 (silent) mutants in a mixed inoculum, only 7 were found in the 8th leaf and only 5 in the 15th leaf from the site of inoculation. The transmission frequency differed for different mutants. Viruses with large host ranges were found to have large quasispecies 'swarms' (or 'clouds' [31] ). For further details see www.noble.org/virus evolution. Ann Palmenberg spoke about "RNA structure and comparative picornavirology". For RNA viruses, every viral base is to be regarded as a compromise forged by the totality of different selective pressures. Those are mainly: protein recognition, mRNA transcription, mRNA translation, protein structure, and RNA structure. Different nucleic acids occur in different structural forms: in vivo, DNA is usually in the B form, containing a wide major groove and rising by 3.4 A • /bp. Duplex regions of RNA occur in the A form, rising by 2.6 A • /bp and being more stable than DNAs. The base stacking of RNAs contributes hugely to their stability, and RNA folding is largely driven by base stacking [4] . Evolutionary co-variance of nucleotides is sometimes observed; compensatory changes may stabilize a stem, and such observations may help to confirm an RNA structure. The most stable forms of RNA or DNA contain a minimum of free energy [36, 37] . Using computer programmes developed by Zuker's group, the optimal folding of RNAs of relatively small size (picornaviruses) and large size (SARS coronavirus) has been accomplished [19, 24] . The question arose: how can one recognize if a calculated fold is real? After randomizing and refolding the RNA sequence of encephalomyocarditis virus (in silico), a highly stable form (∆G of −1720 Kcal/mol) was obtained that was indistinguishable in stability from, and in the fold of, the real RNA. Thus, in reality 'the optimal fold' should be considered a myth; there is no single optimal structure. By mathematical derivation the number of alternative partners with which each base of an RNA can interact (= P-num) can be obtained [36] and plotted against the sequence; troughs of the curve indicate regions of few alternative partners. The P-num derivative is a 'quantitative measure of the propensity of that base to become involved with the same or alternative pairing partners in a collection of suboptimal folds' [11, 19] ; it is thus a powerful parameter for locating wriggles in an RNA structure. 'To maintain RNA structure, evolution selects against better alternatives elsewhere in the genome'. The internal ribosome entry site (IRES) of picornaviral RNAs is a structural motif with a low P-num value. IRES structures are similar to those of tRNAs in that they exhibit no significant sequence similarity, yet fold into virtually identical structures. The picornaviral cis-acting replication elements (CREs), which display a CACAAA sequence to 3D polymerases, also have low P-num values, and again very different sequences adopt very similar 3-dimensional structures. Vice versa, nucleotide sequence similarity does not always conserve RNA structures. The aim of the talk was to show the significance of RNA structural considerations for the evolution of viruses. At the conclusion of the symposium, Andy Ball thanked all speakers and discussants. The symposium was attended by approximately 150 participants. Comments and suggestions on drafts of this report by Andrew Ball, Jean Cohen, Esteban Domingo, Mary Estes, Denis Fargette, Anne-Lise Haenni, Mike Mayo and Ann Palmenberg are gratefully acknowledged. Classification of papillomaviruses Quasispecies dynamics and RNA virus extinction Multiple molecular pathways for fitness recovery of an RNA virus debilitated by operation of Muller's ratchet Improved free-energy parameters for predictions of RNA duplex stability A comparative sequence analysis to revise the current taxonomy of the family Coronaviridae The palm subdomain-based active site is internally permutated in viral RNA dependent RNA polymerases of an ancient lineage Bacteriophages with tails: chasing their origins and evolution Bacteriophage genomics A novel, endogenous retrovirus-related element in the human genome resembles a DNA transposon: evidence for an evolutionary link Human endogenous retrovirus K solo-LTR formation and insertional polymorphism: implications for human and viral evolution Improved predictions of secondary structures of RNA Multiply infected spleen cells in HIV patients Imbroglios of viral taxonomy: genetic exchange and failings of phenetic approaches Dynamics of HIV-1 recombination in its natural target cells Flaviviridae: The viruses and their replication Molecular evidence of HIV-1 transmission in a criminal case The nonclonal and transitory nature of HIV in vivo Don't forget about viruses Topological organization of picornaviral genomes: Statistical prediction of RNA structural signals New real-time reverse transcriptase-initiated PCR assay with single-copy sensitivity for human immunodeficiency virus type 1 RNA in plasma Reproducible nonlinear population dynamics and critical points during replicative competitions of RNA virus quasispecies Retroids in Archaea: phylogeny and lateral origins The structure of a thermophilic archaeal virus shows a double stranded DNA viral capsid that spans all domains of life Generation of coronavirus spike deletion variants by high-frequency recombination at regions of predicted RNA secondary structure Evolutionary history of Cucumber mosaic virus deduced by phylogenetic analyses Plant RNA virus evolution Synchronous loss of quasispecies memory in parallel viral lineages: a deterministic feature of viral quasispecies Lethal mutagenesis of the prototypic arenavirus lymphocytic choriomeningitis virus (LCMV) New Haven CT 30. Sawicki SG, Sawicki DL (1998) A new model for coronavirus transcription Genetic diversity in RNA virus quasispecies is controlled by host-virus interaction Unique and conserved features of genome and proteome of SARS-coronavirus, an early split off from the coronavirus group 2 lineage Of statistics and genomes VP1 of infectious bursal disease virus is an RNA dependent RNA polymerase The 3C-like proteinase of an invertebrate nidovirus links coronavirus and potyvirus homologs On finding all suboptimal foldings of an RNA molecule Prediction of RNA secondary structure by energy minimization Parc d'Innovation, Boulevard Sébastian Brandt, 67400 Illkirch, France. -Redaktion: Sachsenplatz 4-6