key: cord-0976325-r94tu8fi
authors: Masters, Paul S.
title: The Molecular Biology of Coronaviruses
date: 2006-07-28
journal: Adv Virus Res
DOI: 10.1016/s0065-3527(06)66005-3
sha: f085aa777d230139f3ae371f7d28ec3070a6c254
doc_id: 976325
cord_uid: r94tu8fi

Coronaviruses are large, enveloped RNA viruses of both medical and veterinary importance. Interest in this viral family has intensified in the past few years as a result of the identification of a newly emerged coronavirus as the causative agent of severe acute respiratory syndrome (SARS). At the molecular level, coronaviruses employ a variety of unusual strategies to accomplish a complex program of gene expression. Coronavirus replication entails ribosome frameshifting during genome translation, the synthesis of both genomic and multiple subgenomic RNA species, and the assembly of progeny virions by a pathway that is unique among enveloped RNA viruses. Progress in the investigation of these processes has been enhanced by the development of reverse genetic systems, an advance that was heretofore obstructed by the enormous size of the coronavirus genome. This review summarizes both classical and contemporary discoveries in the study of the molecular biology of these infectious agents, with particular emphasis on the nature and recognition of viral receptors, viral RNA synthesis, and the molecular interactions governing virion assembly.

by the enormous size of the coronavirus genome. This review summarizes both classical and contemporary discoveries in the study of the molecular biology of these infectious agents, with particular emphasis on the nature and recognition of viral receptors, viral RNA synthesis, and the molecular interactions governing virion assembly.

Coronaviruses are a family of enveloped RNA viruses that are distributed widely among mammals and birds, causing principally respiratory or enteric diseases but in some cases neurologic illness or hepatitis (Lai and Holmes, 2001) . Individual coronaviruses usually infect their hosts in a species-specific manner, and infections can be acute or persistent. Infections are transmitted mainly via respiratory and fecal-oral routes. The most distinctive feature of this viral family is genome size: coronaviruses have the largest genomes among all RNA viruses, including those RNA viruses with segmented genomes. This expansive coding capacity seems to both provide and necessitate a wealth of gene-expression strategies, most of which are incompletely understood.

Two prior reviews with the same title as this one have appeared in the Advances in Virus Research series (Lai and Cavanagh, 1997; Sturman and Holmes, 1983) . The earlier of the two noted that the recognition of coronaviruses as a separate virus family occurred in the 1960s, in the wake of the discovery of several new human respiratory pathogens, certain of which, it was realized, appeared highly similar to the previously described avian infectious bronchitis virus (IBV) and mouse hepatitis virus (MHV) (Almeida and Tyrrell, 1967) . These latter viruses had a characteristic morphology in negative-stained electron microscopy, marked by a "fringe" of surface structures described as "spikes" (Berry et al., 1964) or "club-like" projections (Becker et al., 1967) . Such structures were less densely distributed and differently shaped than those of the myxoviruses. To some, the fringe resembled the solar corona, giving rise to the name that was ultimately assigned to the group (Almeida et al., 1968) . Almost four decades later, recognition of the same characteristic virion morphology alerted the world to the emergence of another new human respiratory pathogen: the coronavirus responsible for the devastating outbreak of severe acute respiratory syndrome (SARS) in (Ksiazek et al., 2003 Peiris et al., 2003) . The sudden appearance of SARS has stimulated a burst of new research to understand the basic replication mechanisms of members of this family of viral agents, as a means toward their control and prophylaxis. Thus, the time is right to again assess the state of our collective knowledge about the molecular biology of coronaviruses.

Owing to limitations imposed by both space and the expertise of the author, "molecular biology" will be considered here in the more narrow sense, that is, the molecular details of the cellular replication of coronaviruses. No attempt will be made to address matters of pathogenesis, viral immunology, or epidemiology. For greater depth and differences of emphasis in particular areas, as well as for historical perspectives, the reader is referred to the two excellent predecessors of this review (Lai and Cavanagh, 1997; Sturman and Holmes, 1983) and also to volumes edited by Siddell (1995) and Enjuanes (2005) .

Coronaviruses are currently classified as one of the two genera in the family Coronaviridae (Enjuanes et al., 2000b) . However, it is likely that the coronaviruses, as well as the other genus within the Coronaviridae, the toroviruses (Snijder and Horzinek, 1993) , will each be accorded the taxonomic status of family in the near future (González et al., 2003) . Therefore, throughout this review, the coronaviruses are referred to as a family. Both the coronaviruses and the toroviruses, in addition to two other families, the Arteriviridae (Snijder and Meulenberg, 1998) and the Roniviridae (Cowley et al., 2000; Dhar et al., 2004) , have been grouped together in the order Nidovirales. This higher level of organization recognizes a relatedness among these families that sets them apart from other nonsegmented positive-strand RNA viruses. The most salient features that all nidoviruses have in common are: gene expression through transcription of a set of multiple 3 0 -nested subgenomic RNAs; expression of the replicase polyprotein via ribosomal frameshifting; unique enzymatic activities among the replicase protein products; a virion membrane envelope; and a multispanning integral membrane protein in the virion. The first of these qualities provides the name for the order, which derives from the Latin nido for nest (Enjuanes et al., 2000a) . In contrast to their commonalities, however, nidovirus families differ from one another in distinct ways, most conspicuously in the numbers, types, and sizes of the structural proteins in their virions and in the morphologies of their nucleocapsids. A more detailed comparison of characteristics of these virus families has been given by Enjuanes et al. (2000b) and Lai and Cavanagh (1997) .

Members of the coronavirus family have been sorted into three groups (Table I) , which, it has been proposed, are sufficiently divergent to merit the taxonomic status of genera (González et al., 2003) . Classification into groups was originally based on antigenic relationships. However, such a criterion reflects the properties of a limited subset of viral proteins, and cases have arisen where clearly related viruses in group 1 were found not to be serologically cross-reactive (Sanchez et al., 1990) . Consequently, sequence comparisons of entire viral genomes (or of as much genomic sequence as is available) have come to be the basis for group classification . Almost all group 1 and group 2 viruses have mammalian hosts, with human coronaviruses falling into each of these groups. Viruses of group 3, by contrast, have been isolated solely from avian hosts. Most of the coronaviruses in Table I have been studied for decades, and, by the turn of the century, the scope of the family seemed to be fairly well-defined. Accordingly, it came as quite a shock, in 2003, when the causative agent of SARS was found to be a coronavirus (SARS-CoV). Equally astonishing have been the outcomes of renewed efforts, following the SARS epidemic, to detect previously unknown viruses; these investigations have led to the discovery of two more human respiratory coronaviruses, HCoV-NL63 (van der Hoek et al., 2004) and HCoV-HKU1 . Three distinct bat coronaviruses have also been isolated: two are members of group 1, and the third, in group 2, is a likely precursor of the human SARS-CoV Li et al., 2005c; Poon et al., 2005) . In addition, new IBV-like viruses have been found that infect geese, pigeons, and ducks (Jonassen et al., 2005) .

In almost all cases, the assignment of a coronavirus species to a given group has been unequivocal. Exceptionally, the classification of SARS-CoV has provoked considerable controversy. The original, unrooted, phylogenetic characterizations of the SARS-CoV genome sequence posited this virus to be roughly equidistant from each of the three previously established groups. It was thus proposed to be the first recognized member of a fourth group of coronaviruses (Marra et al., 2003; Rota et al., 2003) . However, a subsequently constructed phylogeny based on gene 1b, which contains the viral RNA-dependent RNA polymerase and which was rooted in the toroviruses as an outgroup, concluded that SARS-CoV is most closely related to the group 2 coronaviruses . In the same vein, it was noted that regions of gene 1a of SARS-CoV contain domains that are unique to the group 2 coronaviruses . Other analyses of a subset of structural gene sequences (Eickmann et al., 2003) and of RNA secondary structures in the 3 0 untranslated region (3 0 UTR) of the genome (Goebel et al., 2004b) also supported a group 2 assignment. By contrast, some authors have argued, based on bioinformatics methods, that the ancestor of SARS-CoV was derived from multiple recombination events among progenitors from all three groups (Rest and Mindell, 2003; Stanhope et al., 2004; Stavrinides and Guttman, 2004) . While these latter studies assume that historically there has been limitless opportunity for intergroup recombination, there is no well-documented example of recombination between extant coronaviruses of different groups. Moreover, it is not clear that intergroup recombination is even possible, owing to replicative incompatibilities among the three coronavirus groups (Goebel et al., 2004b) . Therefore, although SARS-CoV does indeed have unique features, the currently available evidence best supports the conclusion that it is more closely allied with the group 2 coronaviruses and that it has not sufficiently diverged to constitute a fourth group .

Coronaviruses are roughly spherical and moderately pleiomorphic (Fig. 1) . Virions have typically been reported to have average diameters of 80-120 nm, but extreme sizes as small as 50 nm and as large as 200 nm are occasionally given in the older literature (Oshiro, 1973; McIntosh, 1974) . The surface spikes or peplomers of these viruses, variously described as club-like, pear-shaped, or petal-shaped, project some 17-20 nm from the virion surface (McIntosh, 1974) , having a thin base that swells to a width of about 10 nm at the distal extremity (Sugiyama and Amano, 1981) . For some coronaviruses a second set of projections, 5-10-nm long, forms an undergrowth beneath the major spikes (Guy et al., 2000; Patel et al., 1982; Sugiyama and Amano, 1981) . These shorter structures are now known to be the hemagglutinin-esterase (HE) protein that is found in a subset of group 2 coronaviruses (Section III.G). At least some of the heterogeneity in coronavirus particle morphology can be attributed to the distorting effects of negative-staining procedures. Freeze-dried (Roseto et al., 1982) and cryo-electron microscopic (Risco et al., 1996) preparations of BCoV and TGEV, respectively, showed much more homogeneous populations of virions, with diameters 10-30 nm greater than virions in comparable samples prepared by negative staining. Extraordinary three-dimensional images have been obtained for SARS-CoV virions emerging from infected Vero cells (Ng et al., 2004) . These scanning electron micrographs and atomic force micrographs reveal knobby, rosette-like viral particles resembling tiny cauliflowers. It will be exciting to see future applications of advanced imaging techniques to the study of coronavirus structure.

The internal component of the coronavirus virion is obscure in electron micrographs of whole virions. In negative-stained images the core appears as an indistinct mass with a densely staining center, giving the virion a "punched-in" spherical appearance. Imaging of virions that have burst spontaneously, expelling their contents, or that have been treated with nonionic detergents has allowed visualization of the coronavirus core. Such analyses led to the attribution of another distinguishing characteristic to the coronavirus family: that its members possess helically symmetric nucleocapsids. Such nucleocapsid symmetry is the rule for negative-strand RNA viruses, but almost all positive-strand RNA animal viruses have icosahedral ribonucleoprotein capsids. However, although it is fairly well accepted that coronaviruses have helical nucleocapsids, there are surprisingly few published data that bear on this issue. Additionally, the reported results vary considerably with both the viral species and the method of preparation. The earliest study of nucleocapsids from spontaneously disrupted HCoV-229E virions found tangled, threadlike structures 8-9 nm in diameter; these were unraveled or clustered to various degrees and, in rare cases, retained some of the shape of the parent virion (Kennedy and Johnson-Lussenburg, 1975/76) . A subsequent analysis of spontaneously disrupted virions of HCoV-229E and MHV observed more clearly helical nucleocapsids, with diameters of 14-16 nm and hollow cores of 3-4 nm (Macnaughton et al., 1978) . The most highly resolved images of any coronavirus nucleocapsid were obtained with NP-40-disrupted HCoV-229E virions (Caul et al., 1979) . These preparations showed filamentous structures 9-11 or 11-13 nm in diameter, depending on the method of staining, with a 3-4-nm central canal. The coronavirus nucleocapsid was noted to be thinner in cross-section than those of paramyxoviruses and also to lack the sharply segmented "herringbone" appearance characteristic of paramyxovirus nucleocapsids. By contrast, in early studies, IBV and TGEV nucleocapsids were refractory to the techniques that had been successful with other viruses. Visualization of IBV nucleocapsids, which seemed to be very sensitive to degradation (Macnaughton et al., 1978) , was finally achieved by electron microscopy of viral samples prepared by carbon-platinum shadowing (Davies et al., 1981) . This revealed linear strands, some as long as 6-7 mm, which were only 1.5-nm thick, suggesting that they represented unwound helices. TGEV, on the other hand, was found to be more resistant to nonionic detergents. Treatment of virions of this species with NP-40 resulted in spherical subviral particles with no threadlike substructure visible (Garwes et al., 1976) . The TGEV core was later seen as a spherically symmetric, possibly icosahedral, superstructure that only dissociated further into a helical nucleocapsid following Triton X-100 treatment of virions (Risco et al., 1996) . Such a collection of incomplete and often discrepant results makes it clear that much further examination of the internal structure of coronavirus virions is warranted. It would substantially aid our understanding of coronavirus structure and assembly if we had available a detailed description of nucleocapsid shape, length, diameter, helical repeat distance, and protein:RNA stoichiometry.

There are three protein components of the viral envelope (Fig. 1) . The most prominent of these is the S glycoprotein (formerly called E2) (Cavanagh, 1995) , which mediates receptor attachment and viral and host cell membrane fusion (Collins et al., 1982) . The S protein is a very large, N-exo, C-endo transmembrane protein that assembles into trimers ( Delmas and Laude, 1990; S ong et al., 2004) to form the dist inctive surface spikes of coronaviruses (Fig. 2 ). S protein is inserted into the endoplasmic reticulum (ER) via a cleaved, amino-terminal signal peptide (Cavanagh et al., 1986b) . The ectodomain makes up most of the molecule, with only a small carboxy-terminal segment (of 71 or fewer of the total 1162-1452 residues) constituting the transmembrane domain and endodomain. Monomers of S protein, prior to glycosylation, are 128-160 kDa, but molecular masses of the glycosylated forms of FIG 2. The spike (S) protein. At the right is a linear map of the protein, denoting the amino-terminal S1 and the carboxy-terminal S2 portions of the molecule. The arrowhead marks the site of cleavage for those S proteins that become cleaved by cellular protease(s). The signal peptide and regions of mapped receptor-binding domains (RBDs) are shown in S1. The heptad repeat regions (HR1 and HR2), putative fusion peptide (F), transmembrane domain, and endodomain are indicated in S2. At the left is a model for the S protein trimer. full-length monomers fall in the range of 150-200 kDa. The S molecule is thus highly glycosylated, and this modification is exclusively N-linked (Holmes et al., 1981; Rottier et al., 1981) . S protein ectodomains have from 19 to 39 potential consensus glycosylation sites, but a comprehensive mapping of actual glycosylation has not yet been reported for any coronavirus. A mass spectrometric analysis of the SARS-CoV S protein has shown that at least 12 of the 23 candidate sites are glycosylated in this molecule (Krokhin et al., 2003) . For the TGEV S protein, it has been demonstrated that the early steps of glycosylation occur cotranslationally, but that terminal glycosylation is preceded by trimerization, which can be rate-limiting in S protein maturation (Delmas and Laude, 1990 ). In addition, glycosylation of TGEV S may assist monomer folding, given that tunicamycin inhibition of high-mannose transfer was found to also block trimerization.

The S protein ectodomain has between 30 and 50 cysteine residues, and within each coronavirus group the positions of cysteines are well conserved Eickmann et al., 2003) . However, as with glycosylation, a comprehensive mapping of disulfide linkages has not yet been achieved for any coronavirus S protein.

In most group 2 and all group 3 coronaviruses, the S protein is cleaved by a trypsin-like host protease into two polypeptides, S1 and S2, of roughly equal sizes. Even for uncleaved S proteins, that is, those of the group 1 coronaviruses and SARS-CoV, the designations S1 and S2 are used for the amino-terminal and carboxy-terminal halves of the S protein, respectively. Peptide sequencing has shown that cleavage occurs following the last residue in a highly basic motif: RRFRR in IBV S protein (Cavanagh et al., 1986b) , RRAHR in MHV strain A59 S protein , and KRRSRR in BCoV S protein . Similar cleavage sites are predicted from the sequences of other group 2 S proteins, except that of SARS-CoV. It has been noted that the S protein of MHV strain JHM has a cleavage motif (RRARR) more basic than that found in MHV strain A59 (RRAHR). An expression study has shown that this difference accounts for the almost total extent of cleavage of the JHM S protein that is seen in cell lines in which the A59 S protein undergoes only partial cleavage (Bos et al., 1995) .

The S1 domain is the most divergent region of the molecule, both across and within the three coronavirus groups. Even among strains and isolates of a single coronavirus species, the sequence of S1 can vary extensively (Gallagher et al., 1990; Parker et al., 1989; Wang et al., 1994) . By contrast, the most conserved part of the molecule across the three coronavirus groups is a region that encompasses the S2 portion of the ectodomain, plus the start of the transmembrane domain (de Groot et al., 1987 ). An early model for the coronavirus spike, which has held up well in light of subsequent work, proposed that the S1 domains of the S protein oligomer constitute the bulb portion of the spike. The stalk portion of the spike, on the other hand, was envisioned to be a coiled-coil structure, analogous to that in influenza HA protein, formed by association of heptad repeat regions of the S2 domains of monomers (de Groot et al., 1987) . The roles of these two regions of the S protein in the initiation of infection will be discussed (Section IV.A).

The M glycoprotein (formerly called E1) is the most abundant constituent of coronaviruses (Sturman, 1977; Sturman et al., 1980) and gives the virion envelope its shape. The preglycosylated M polypeptide ranges in size from 25 to 30 kDa (221-262 amino acids), but multiple higher-molecular-mass glycosylated forms are often observed by SDS-PAGE (Krijnse Locker et al., 1992a) . The M protein of MHV has also been noted to multimerize under standard conditions of SDS-PAGE (Sturman, 1977) .

M is a multispanning membrane protein with a small, aminoterminal domain located on the exterior of the virion, or, intracellularly, in the lumen of the ER (Fig. 3) . The ectodomain is followed by three transmembrane segments and then a large carboxy terminus comprising the major part of the molecule. This latter domain is situated in the interior of the virion or on the cytoplasmic face of intracellular membranes (Rottier, 1995) . M proteins within each coronavirus group are moderately well conserved, but they are quite divergent across the three groups. The region of M protein showing the most conservation among all coronaviruses is a segment of some 25 residues encompassing the end of the third transmembrane domain and the start of the endodomain; a portion of this segment even retains homology to its torovirus counterpart . The ectodomain, which is the least conserved part of the M molecule, is glycosylated. For most group 2 coronaviruses, glycosylation is O-linked, although two exceptions to this pattern are MHV strain 2 (Yamada et al., 2000) and SARS-CoV (Nal et al., 2005) , both of which have M proteins with N-linked carbohydrate. Group 1 and group 3 coronavirus M proteins, by contrast, exhibit N-linked glycosylation exclusively (Cavanagh and Davis, 1988; Garwes et al., 1984; Jacobs et al., 1986; . At the time of its discovery in the MHV M protein, O-linked glycosylation had not previously been seen to occur in a viral protein (Holmes et al., 1981) , and MHV M has since been used as a model to study the sites and mechanism of this type of posttranslational modification (de Haan et al., 1998b; Krijnse Locker et al., 1992a; Niemann et al., 1982) . Although the roles of M protein glycosylation are not fully understood, the glycosylation status of M can influence both organ tropism in vivo and the capacity of some coronaviruses to induce alpha interferon in vitro (Charley and Laude, 1988; de Haan et al., 2003a; Laude et al., 1992) .

The coronavirus M protein was the first polytopic viral membrane protein to be described Rottier et al., 1984) , and the atypical topology of the MHV and IBV M proteins was examined in considerable depth in cell-free translation and cellular expression studies. For both of these M proteins, the entire ectodomain was found to be protease sensitive. However, at the other end of the molecule, no more than 20-25 amino acids could be removed from the carboxy terminus by protease treatment (Cavanagh et al., 1986a; Mayer et al., 1988; Rottier et al., 1984 Rottier et al., , 1986 . This pattern suggested that almost all of the endodomain of M is tightly associated with the surface of the membrane or that it has an unusually compact structure that is refractory to proteolysis (Rottier, 1995) . Most M proteins do not possess a cleaved amino-terminal signal peptide (Cavanagh et al., 1986b; Rottier et al., 1984) , and for both IBV and MHV it was demonstrated that either the first or the third transmembrane domain alone is sufficient to function as the signal for insertion and anchoring of the protein in its native orientation in the membrane (Krijnse Locker et al., 1992b; Machamer and Rose, 1987; Mayer et al., 1988) . The M proteins of a subset of group 1 coronaviruses (TGEV, FIPV, and CCoV) each contain a cleavable amino-terminal signal sequence (Laude et al., 1987) , although this element may not be required for membrane insertion (Kapke et al., 1988; Vennema et al., 1991) . Another anomalous feature of at least one group 1 coronavirus, TGEV, is that roughly one-third of its M protein assumes a topology in which part of the endodomain constitutes a fourth transmembrane segment, thereby positioning the carboxy terminus of the molecule on the exterior of the virion (Risco et al., 1995) . This alternative configuration of M has yet to be demonstrated for other coronavirus family members.

The E protein (formerly called sM) is a small polypeptide, ranging from 8.4 to 12 kDa (76-109 amino acids), that is only a minor constituent of virions (Fig. 3) . Owing to its tiny size and limited quantity, E was recognized as a virion component much later than were the other structural proteins, first in IBV and then in TGEV (Godet et al., 1992) and MHV (Yu et al., 1994) . Its significance was also obscured by the fact that in some coronaviruses, the coding region for E protein occurs as the furthest-downstream open reading frame (ORF) in a bi-or tricistronic mRNA and must therefore be expressed by a nonstandard translational mechanism (Boursnell et al., 1985; Budzilowicz and Weiss, 1987; Leibowitz et al., 1988; Skinner et al., 1985; Thiel and Siddell, 1994) . E protein sequences are extremely divergent across the three coronavirus groups and in some cases, among members of a single group. Nevertheless, the same general architecture can be discerned in all E proteins: a short hydrophilic amino terminus (8-12 residues), followed by a large hydrophobic region (21-29 residues) containing two to four cysteines, and a then hydrophilic carboxy-terminal tail (39-76 residues), the latter constituting most of the molecule.

E is an integral membrane protein, as has been shown for both the MHV and IBV E proteins by the criterion of resistance to alkaline extraction (Corse and Machamer, 2000; Vennema et al., 1996) , and membrane insertion occurs without cleavage of a signal sequence . The E protein of IBV has been shown to be palmitoylated on one or both of its two cysteine residues (Corse and Machamer, 2002) , but it is not currently clear whether this modification is a general characteristic. One study of MHV E showed a gel mobility shift of E caused by hydroxylamine treatment, which cleaves thioester linkages (Yu et al., 1994) , but attempts to incorporate labeled palmitic acid into either the TGEV or MHV E protein have been unsuccessful (Godet et al., 1992; Raamsman et al., 2000) . The topology of E in the membrane is at least partially resolved. Although one early report suggested a C-exo, N-endo membrane orientation for the TGEV E protein (Godet et al., 1992) , more extensive investigations of the MHV and IBV E proteins both concluded that the carboxyterminal tail of the molecule is cytoplasmic (or, correspondingly, is situated in the interior of the virion) (Corse and Machamer, 2000; Raamsman et al., 2000) . Moreover, for IBV E, it was shown that the carboxy-terminal tail, in the absence of the membrane-bound domain, specifies targeting to the budding compartment (Corse and Machamer, 2002) . The status of the amino terminus is less clear, however. The IBV E protein amino terminus was inaccessible to antibodies at the cytoplasmic face of the Golgi membrane, suggesting that this end of the molecule is situated in the lumen (corresponding to the exterior of the virion) (Corse and Machamer, 2000) . Such a single transit, placing the termini of the protein on opposite faces of the membrane, would be consistent with prediction, by molecular dynamics simulations, that a broad set of E proteins occur as transmembrane oligomers (Torres et al., 2005) . Conflicting results were obtained with MHV E, though. Based on the cytoplasmic reactivity of an engineered amino-terminal epitope tag, it was proposed that the MHV E protein amino terminus is buried within the membrane near the cytoplasmic face (Maeda et al., 2001) . This result also accords with the finding that no part of the MHV E protein in purified virions is accessible to protease treatment . Such an orientation would mean that the hydrophobic domain of E protein forms a hairpin, looping back through the membrane. This topology agrees with the outcome of a biophysical analysis of the SARS-CoV E protein transmembrane domain (Arbely et al., 2004) . However, in the latter study it was asserted that the palindromic hairpin configuration of the transmembrane segment is unique to the SARS-CoV E protein, which begs the question of how the other coronavirus E proteins are situated in the membrane and why the E protein of SARS-CoV should differ.

The N protein, which ranges from 43 to 50 kDa, is the protein component of the helical nucleocapsid and is thought to bind the genomic RNA in a beads-on-a-string fashion (Laude and Masters, 1995) (Fig. 3 ).

Based on a comparison of sequences of multiple strains, it has been proposed that the MHV N protein is divided into three conserved domains, which are separated by two highly variable spacer regions (Parker and Masters, 1990) . Domains 1 and 2, which constitute most of the molecule, are rich in arginines and lysines, as is typical of many viral RNA-binding proteins. In contrast, the short, carboxy-terminal domain 3 has a net negative charge resulting from an excess of acidic over basic residues. While there is now considerable evidence to support the notion that domain 3 truly constitutes a separate domain (Hurst et al., 2005; Koetzner et al., 1992) , little is known about the structure of the other two putative domains. The overall features of the three-domain model appear to extend to N proteins of coronaviruses in groups 1 and 3, although the boundaries between domains appear to be less clearly defined for these latter N proteins. There is not a high degree of intergroup sequence homology among N proteins, with the exception of a strongly conserved stretch of 30 amino acids, near the junction of domains 1 and 2, which contains many aromatic hydrophobic residues (Laude and Masters, 1995) .

The main activity of N protein is to bind to the viral RNA. Unlike the helical nucleocapsids of nonsegmented negative-strand RNA viruses, coronavirus ribonucleoprotein complexes are quite sensitive to the action of ribonucleases (Macnaughton et al., 1978) . A significant portion of the stability of the nucleocapsid may derive from N-N monomer interactions (Narayanan et al., 2003b) . Both sequence-specific and nonspecific modes of RNA binding by N have been assayed in vitro (Chen et al., 2005; Masters, 1992; Molenkamp and Spaan, 1997; Nelson and Stohlman, 1993; Nelson et al., 2000; Robbins et al., 1986; Stohlman et al., 1988; Zhou et al., 1996) . Specific RNA substrates that have been identified for N protein include the positivesense transcription regulating sequence (Chen et al., 2005; Nelson et al., 2000; Stohlman et al., 1988) , regions of the 3 0 UTR (Zhou et al., 1996) and the N gene , and the genomic RNA packaging signal Molenkamp and Spaan, 1997) (Section IV.C). The RNA-binding capability of the MHV N protein has been mapped to domain 2 of this molecule (Masters, 1992; Nelson and Stohlman, 1993) . However, for IBV, two separate RNA-binding sites have been found to map, respectively, to amino-and carboxy-terminal fragments of N protein (Zhou and Collisson, 2000) , and RNA-binding activity has been reported for a fragment of the SARS-CoV N protein containing parts of domains 1 and 2 (Huang et al., 2004b) .

N is a phosphoprotein, as has been shown for MHV, IBV, BCoV, TGEV, and SARS-CoV (Calvo et al., 2005; King and Brian, 1982; Lomniczi and Morser, 1981; Stohlman and Lai, 1979; Zakhartchouk et al., 2005) . For MHV N, phosphorylation occurs exclusively on serine residues (Siddell et al., 1981; Stohlman and Lai, 1979 ), but in IBV N a phosphothreonine residue was also found (Chen et al., 2005) . Kinetic analysis has shown that MHV N protein acquires phosphates rapidly following its synthesis (Siddell et al., 1981; Stohlman et al., 1983) , and phosphorylation may lead to the association of N with intracellular membranes (Calvo et al., 2005; Stohlman et al., 1983) . Although some 15% of the amino acids of coronavirus N proteins are candidate phosphoacceptor serines and threonines, phosphorylation appears to be targeted to a small subset of residues. For MHV, this was concluded both from the degree of charge heterogeneity of N protein observed in two-dimensional gel electrophoresis and from the limited number of tryptic phosphopeptides of N that could be separated by HPLC Wilbur et al., 1986) . Mass spectrometry has been employed to map the sites of phosphorylation of the IBVand TGEV N proteins. For IBV N, this was accomplished by comparison of unphosphorylated N protein expressed in bacteria with phosphorylated N protein expressed in insect cells (Chen et al., 2005) . Four sites of phosphorylation were found, two each in domains 2 and 3: Ser190, Ser192, Thr378, and Ser379. For TGEV N, purified virions and multiple fractions from infected cells were analyzed (Calvo et al., 2005) . Here also, four sites of phosphorylation were found, one in domain 1 and three in domain 2: Ser9, Ser156, Ser254, and Ser256. In both of these analyses, the degree of sequence coverage achieved did not entirely rule out the possibility of additional, undetected phosphorylated residues in each of these N proteins.

The role of N protein phosphorylation is currently unresolved, but this modification has long been speculated to have regulatory significance. In vitro binding evidence has been presented that phosphorylated IBV N is better able to distinguish between viral and nonviral RNA substrates than is nonphosphorylated N (Chen et al., 2005) . Possibly related to this result is the early conclusion, inferred from the differential accessibilities of some monoclonal antibodies, that phosphorylation induces a conformational change in the MHV N protein (Stohlman et al., 1983) . It has also been found that only a subset of the intracellular phosphorylated forms of BCoV N protein are incorporated into virions, suggesting that phosphorylation is linked to virion assembly and maturation (Hogue, 1995) . The recent mapping of at least some of the N phosphorylation sites in some coronaviruses has now laid the groundwork for testing of the hypothetical functions of phosphorylation by reverse genetic methods.

A number of potential activities, other than its structural role in the virion, have been put forward for N protein. Based on the specific binding of N protein to the transcription-regulating sequence within the leader RNA, it has been proposed that N participates in viral transcription Choi et al., 2002; Stohlman et al., 1988) . However, an engineered HCoV-229E replicon RNA that was devoid of the N gene and all other structural protein genes retained the capability to synthesize subgenomic RNA (Thiel et al., 2001b) . Thus, if N protein does function in transcription, it must be in a modulatory, but not essential, capacity. Likewise, the binding of N protein to leader RNA has been implicated as a means for preferential translation of viral mRNAs (Tahara et al., 1994 (Tahara et al., , 1998 , although data supporting this attractive hypothesis are, as yet, incomplete. N protein has also been found to enhance the efficiency of replication of replicon or genomic RNA in reverse genetic systems in which infections are initiated from engineered viral RNA (Almazan et al., 2004; Schelle et al., 2005; Thiel et al., 2001a; Yount et al., 2002) . This may be indicative of a direct role of N in RNA replication, but it remains possible that the enhancement actually results from the sustained translation of a limiting replicase component.

Finally, it was shown that, in addition to its presence in the cytoplasm, IBV N protein localized to the nucleoli of about 10% of cells that were infected with IBV or were independently expressing N protein . This observation was extended to the N proteins of MHV and TGEV, suggesting that nucleolar localization is a general feature of all three coronavirus groups. Such localization was proposed to correlate with the arrest of cell division . Additionally, both MHV and IBV N proteins were found to bind to two nucleolar proteins, fibrillarin and nucleolin (Chen et al., 2002) . It must be noted, however, that nucleolar localization of N was not observed in TGEV-infected or SARS-CoV-infected cells by other groups of investigators (Calvo et al., 2005; Rowland et al., 2005) . All steps of coronavirus replication are thought to occur outside of the nucleus. For MHV, it was shown some time ago that viral replication could occur in enucleated cells or in cells treated with actinomycin D or -amanitin, host RNA polymerase inhibitors (Brayton et al., 1981; Wilhelmsen et al., 1981) . By contrast, other studies reported that similar conditions reduced the growth yield of IBV, HCoV-229E, or FCoV (Evans and Simpson, 1980; Kennedy and Johnson-Lussenburg, 1979; Lewis et al., 1992) . Even if coronavirus replication does not have an absolute dependence on the nucleus, the possibility remains that some viruses can alter host nuclear functions so as to create an environment more favorable for viral infection. Such a modification might be brought about through the nuclear trafficking of one or more viral components.

The genomes of coronaviruses are nonsegmented, single-stranded RNA molecules of positive sense, that is, the same sense as mRNA ( Fig. 4) (Lai and Stohlman, 1978; Lomniczi and Kennedy, 1977; Schochetman et al., 1977; Wege et al., 1978) . Structurally they resemble most eukaryotic mRNAs, in having both 5 0 caps (Lai and Stohlman, 1981) and 3 0 poly(A) tails (Lai and Stohlman, 1978; Lomniczi, 1977; Schochetman et al., 1977; Wege et al., 1978) . Unlike most eukaryotic mRNAs, coronavirus genomes are extremely large-nearly three times the size of alphavirus and flavivirus genomes and four times the size of picornavirus genomes. Indeed, at lengths ranging from 27.3 (HCoV-229E) to 31.3 kb (MHV), coronavirus genomes are among the largest mature RNA molecules known to biology. Again, unlike most eukaryotic mRNAs, coronavirus genomes contain multiple ORFs. The genes for the four canonical structural proteins discussed previously account for less than one-third of the coding capacity of the genome and are clustered at the 3 0 end. A single gene, which encodes the viral replicase, occupies the 5 0 -most two-thirds of the genome. The invariant gene order in all members of the coronavirus family is 5 0 -replicase-S-E-M-N-3 0 . However, engineered rearrangement of the gene order of MHV was found to be completely tolerated by the virus (de Haan et al., 2002b) . This implies that the native order, although it became fixed early in the evolution of the family, is not functionally essential. At the termini of the genome are a 5 0 UTR, ranging from 210 to 530 nucleotides, and a 3 0 UTR, ranging from 270 to 500 nucleotides. The noncoding regions between the ORFs are generally quite small; in some cases, there is a small overlap between adjacent ORFs. Additionally, one or a number of accessory genes are intercalated among the structural protein genes.

In common with almost all other positive-sense RNA viruses, the genomic RNA of coronaviruses is infectious when transfected into permissive host cells, as was originally shown for TGEV (Norman et al., 1968) , IBV (Lomniczi, 1977; Schochetman et al., 1977) , and MHV (Wege et al., 1978) . The genome has multiple functions during infection. It acts initially as an mRNA that is translated into the huge replicase polyprotein, the complete synthesis of which requires a ribosomal frameshifting event (Section V.C.1). The replicase is the only translation product derived from the genome; all downstream ORFs are expressed from subgenomic RNAs. The genome next serves as the template for replication and transcription (Section V). Finally, the genome plays a role in assembly, as progeny genomes are incorporated into progeny virions (Section IV.C).

Interspersed among the set of canonical genes, replicase, S, E, M, and N, all coronavirus genomes contain additional ORFs, in a wide range of configurations. As shown in Table II , these "extra" genes can fall in any of the genomic intervals among the canonical genes and can vary from as few as one (PEDV and HCoV-NL63) to as many as eight genes (SARS-CoV). In some cases, accessory genes can be entirely embedded in another ORF, as the internal (I) gene found within the N gene of many group 2 coronviruses (Fischer et al., 1997a; Lapps et al., 1987; Senanayake et al., 1992) , or they can be extensively overlapped with another gene, as the 3b gene of SARS-CoV. In addition, many accessory genes do not constitute the 5 0 -most ORF in the largest subgenomic RNA in which they appear, and they therefore must require nonstandard translation mechanisms for their expression . Intracellular expression has been demonstrated for a number of accessory proteins, but for many others it is at present merely speculative.

The coronavirus accessory genes were originally labeled nonstructural, but this is not entirely apt, since the products of some of them, the group 2 HE protein, the I protein (Fischer et al., 1997a) , and the SARS-CoV 3a protein, have been shown to be components of virions. Accessory genes were also previously called group-specific genes, but this appellation has become a misnomer in light of the diversity revealed by recently discovered coronaviruses. In general, accessory genes are numbered according to the subgenomic RNA in whose unique region they appear, but this nomenclature system is sometimes overridden by historical precedent. As a result, identically numbered genes in two different viruses, for example, the 5a genes of MHV and IBV, do not necessarily occupy the same genomic position. Likewise, two identically numbered genes, for example, the 3a genes of SARS-CoV and TGEV, do not necessarily have any sequence homology.

It is often speculated that the coronavirus accessory genes were horizontally acquired from cellular or heterologous viral sources, but only in two cases, the group 2 HE and 2a genes, is there good evidence for this proposal. HE, the most clear-cut example, is discussed later. A possible function for the 2a protein has been inferred from a 

* Accessory genes and proteins are listed only for coronaviruses for which a complete genomic sequence is available. The protein product is indicated in parentheses in cases where it has a different designation than the gene. Products of separate transcripts are separated by hyphens; the transcription of accessory genes may vary among different strains of the same virus species (O'Connor and Brian, 1999) . The canonical coronavirus genes are indicated in brackets; rep denotes replicase. bioinformatics analysis, which places it in a very large family of cellular and viral 2 0 ,3 0 -cyclic phosphodiesterases (Mazumder et al., 2002) . Besides its presence in some group 2 coronaviruses, this gene also appears in another family within the Nidovirales order, the toroviruses (Snijder et al., 1990) . Curiously, in the toroviruses, the 2a homolog is situated as a module within the replicase polyprotein, suggesting either that it was acquired independently or that there was nonhomologous recombination between ancestors of viruses within the two families . However, most accessory gene ORFs have no obvious homology to any other viral or cellular sequence in public databases. It is conceivable that many of them evolved in individual coronaviruses by the scavenging of ORFs from the virus's own genome, through duplication and subsequent mutation, as has been proposed for several of the accessory proteins of SARS-CoV (Inberg and Linial, 2004) . It is tempting to regard this as a possible origin for the SARS-CoV 3a protein, which has a topology and size remarkably similar to that of the M protein, although there is no sequence similarity between the two. Such a relationship would parallel that in the arteriviruses, another Nidovirales family, in which the major envelope glycoprotein is also a triple-spanning membrane protein and forms heterodimers with its M protein (Snijder and Meulenberg, 1998) .

It also needs to be considered that, although there is evidence that some accessory genes encode "luxury" functions for their respective viruses, other accessory genes may be genetic junk. Many isolates of IBV contain an extremely diverged segment of some 200 nucleotides between the N gene and the 3 0 UTR (Sapats et al., 1996) . This was long considered to be a hypervariable region of the 3 0 UTR, although it was shown to be dispensable for RNA synthesis (Dalton et al., 2001) . Intriguingly, coronavirus sequences closely related to IBV have been characterized in pigeons and geese. These sequences have one and two additional ORFs, respectively, between the N gene and the 3 0 UTR (Jonassen et al., 2005) . This finding suggests that the IBV hypervariable region and the PCoV ORF are degenerate remnants of a precursor retained in the GCoV sequence. The two GCoV ORFs, in turn, may be vestiges of one or more functional ancestral genes, or they may be derived from horizontally acquired sequences that there has been no selective pressure to eliminate. A similar situation probably pertains for the SARS-CoV 8a and 8b genes. Isolates of SARS-CoV from marketplace animals near the source of the epidemic were found to contain an additional 29 nucleotides absent from all but one previously reported human isolate, and this apparent insertion resulted in the fusion of ORFs 8a and 8b into a single ORF 8 . One scenario consistent with this observation is that loss of the 29-nt sequence was concomitant with the jump of the virus from animals to humans, although the functional significance of this loss, if any, is not yet clear.

In all cases examined, through natural or engineered mutants, accessory protein genes have been found to be nonessential for viral replication in tissue culture. This dispensability has been determined for the 2a and HE genes of MHV (de Haan et al., 2002a; Schwarz et al., 1990) , genes 4 and 5a of MHV (de Haan et al., 2002a; Weiss et al., 1993; Yokomori and Lai, 1991) , the I gene of MHV (Fischer et al., 1997a) , gene 7 of TGEV (Ortego et al., 2003) , genes 7a and 7b of FIPV (Haijema et al., 2003 , and genes 5a and 5b of IBV Youn et al., 2005) . Similarly, some accessory protein genes do not seem to play any role in infection of the natural host. For gene 4 (Ontiveros et al., 2001) and the I gene (Fischer et al., 1997a) of MHV, and for gene 7b of FIPV (Haijema et al., 2003) , selective knockout produced no detectable effect on pathogenesis in mice or cats, respectively. By contrast, disruption of gene 7 of TGEV greatly reduced viral replication in the lung and gut of infected piglets (Ortego et al., 2003) . In the same manner, viruses with knockouts of either the 3abc gene cluster or genes 7a and 7b in FIPV produced no clinical symptoms in cats at doses that were fatal with wild-type virus . The deletion of genes 2a and HE, or of genes 4 and 5a, in MHV completely abrogated the lethality of intracranial infection in mice (de Haan et al., 2002a) . Even a single point mutation in MHV ORF 2a, which had no effect in tissue culture, was found to greatly attenuate virulence in vivo (Sperry et al., 2005) . In a study that took the opposite approach to assessing accessory protein function, it was discovered that engineered insertion of gene 6 of SARS-CoV greatly enhanced the virulence of an attenuated variant of MHV (Pewe et al., 2005) .

The most extensively characterized accessory protein is HE (formerly called E3), which is a fourth constituent of the membrane envelope in many group 2 coronaviruses (Brian et al., 1995) . HE forms a second set of small spikes that appear as an understory among the tall S protein spikes. It was first identified as a hemagglutinin in HEV (Callebaut and Pensaert , 1980 ) and BCoV ( King and Brian , 1982 ; King et al. , 1985) . The HE monomer has an N-exo, C-endo transmembrane topology, with an amino-terminal signal peptide, a large ectodomain, a transmembrane anchor, and a very short, carboxy-terminal endodomain. Monomers of HE, prior to glycosylation are 48 kDa; this size increases to 65 kDa after addition and processing of oligosaccharide, which is exclusively N-linked (Hogue et al., 1989; Kienzle et al., 1990; Yokomori et al., 1989) . The mature protein is a homodimer that is stabilized by both intrachain and interchain disulfide bonds (Hogue et al., 1989) . The hemagglutinating property of HE raised the possibility that, in the viruses in which it appears, this protein may duplicate or replace the role that is assigned to the coronavirus S protein. However, it has been shown, through the construction of MHV-BCoV chimeric viruses, that the BCoV HE protein, in the absence of BCoV S protein, is not sufficient for initiation of infection in tissue culture (Popova and Zhang, 2002) .

The HE protein also contains an acetylesterase activity. This was originally discovered in BCoV and HCoV-OC43, where it was shown to be similar to the receptor-binding and receptor-destroying activity found in influenza C virus (Vlasak et al., 1988a, b) . The nature of the esterase enzyme has subsequently been comprehensively studied and compared among a number of group 2 coronaviruses (Klausegger et al., 1999; Regl et al., 1999; Smits et al., 2005) . HE proteins of BCoV, HCoV-OC43, ECoV, and MHV strain DVIM were found to be sialate-9-Oacetylesterases. By contrast, HE proteins of RCoV, and MHV strains S and JHM were found to be sialate-4-O-acetylesterases. Surprisingly, the coronavirus HE gene is clearly related to the influenza C virus HA1 gene (Luytjes et al., 1988) . Equally remarkably, toroviruses also possess a homolog of the HE gene but at a different genomic locus than where it appears in the group 2 coronaviruses (Cornelissen et al., 1997) . This may be evidence of genetic trafficking among pairs of ancestors of these three viruses, as was originally proposed (Luytjes et al., 1988; Snijder et al., 1991) . Alternatively, it may indicate that members of different virus families independently acquired the HE gene by horizontal transfer from cellular sources (Cornelissen et al., 1997) .

There are two ways in which HE could act in coronavirus replication. It could serve as a cofactor for S, assisting attachment of virus to host cells. Additionally, it could prevent aggregation of progeny virions and travel of virus through the extracellular mucosa (Cornelissen et al., 1997) . The role of HE protein in coronavirus infection has been systematically documented in a recent pair of elegant studies Lissenberg et al., 2005) . To evaluate the cost and benefit of the HE gene, three isogenic MHV mutants were engineered: HE þ , with an expressed and functional HE gene; HE 0 , with an expressed HE gene that was inactive, owing to active site point mutations; and HE À , which lacked HE expression because of an introduced frameshift. It was demonstrated that, following multiple passages, there was rapid loss of HE expression in the HE þ virus. Moreover, competition experiments showed a growth advantage for the HE À virus, but not the HE 0 virus. Consistent with this, examination of esterase-negative mutants arising from the HE þ virus showed that it was not loss of activity, but, rather, loss of the ability of HE to be incorporated into virions that correlated with the growth advantage of HE À viruses . By contrast, in infections of mice, it was found that the presence of HE (whether or not it was enzymatically active) dramatically enhanced neurovirulence, as measured by viral spread and lethality . These results imply that sialic acid-bearing coreceptors can function to influence the course of MHV infection. Thus, the HE protein is a burden in vitro but provides an advantage to the virus in vivo. The selection against HE in vitro provides a cautionary example that tissue culture adaptation of a virus can rapidly lead to selection of a variant that differs from the natural isolate.

Coronavirus infections are initiated by the binding of virions to cellular receptors (Fig. 5) . This sets off a series of events culminating in the deposition of the nucleocapsid into the cytoplasm, where the viral genome becomes available for translation. The positive-sense genome, which also serves as the first mRNA of infection, is translated into the enormous replicase polyprotein. The replicase then uses the genome as the template for the synthesis, via negative-strand intermediates, of both progeny genomes and a set of subgenomic mRNAs. The latter are translated into structural proteins and accessory proteins. The membrane-bound structural proteins, M, S, and E, are inserted into the ER, from where they transit to the endoplasmic reticulum-Golgi intermediate compartment (ERGIC) . Nucleocapsids are formed from the encapsidation of progeny genomes by N protein, and these coalesce with the membrane-bound components, forming virions by budding into the ERGIC. Finally, progeny virions are exported from infected cells by transport to the plasma membrane in smooth-walled vesicles, or Golgi sacs, that remain to be more clearly defined. During infection by some coronaviruses, but not others, a fraction of S protein that has not been assembled into virions ultimately reaches the plasma membrane. At the cell surface S protein can cause the fusion of an infected cell with adjacent, uninfected cells, leading to the formation of large, multinucleate syncytia. This enables the spread of infection independent of the action of extracellular virus, thereby providing some measure of escape from immune surveillance. Key aspects of the coronavirus replication cycle are discussed in more detail in the remainder of this section and in the next section (Section V).

The pairings of coronaviruses and their corresponding receptors are generally highly species specific, but the adaptation of SARS-CoV to the human population has reminded us that this allegiance is mutable. Well prior to the emergence of SARS, it was clearly documented that another coronavirus, BCoV, was capable of sporadic cross-species transmission (Saif, 2004) . Viruses very closely related to BCoV had been isolated from wild ruminants (Tsunemitsu et al., 1995) , domestic dogs (Erles et al., 2003) , and, in one case, a human child . Nevertheless, the interaction between S protein and receptor remains the principal, if not sole, determinant of coronavirus host species range and tissue tropism. At the cellular level, this has been demonstrated by manipulation of each of the interacting partners. First, expression of an identified receptor in nonpermissive cells, often of a heterologous species, invariably has rendered those cells permissive for the corresponding coronavirus (Delmas et al., 1992; Dveksler et al., 1991; Li et al., 2003 Li et al., , 2004 Mossel et al., 2005; Tresnan et al., 1996; Yeager et al., 1992) . Second, the engineered swapping of S protein ectodomains has been shown to change the in vitro host cell species specificity of MHV to that of FIPV (Kuo et al., 2000) or, conversely, of FIPV to that of MHV (Haijema et al., 2003) . Similarly, exchange of the relevant regions of S protein ectodomains was shown to transform a strictly respiratory isolate of TGEV into a more virulent, enterotropic strain (Sanchez et al., 1999) . Replacement of the S protein ectodomain of MHV strain A59 caused the virus to acquire the highly virulent neurotropism of MHV strain 4 (Phillips et al., 1999) or the highly virulent hepatotropism of MHV strain 2 (Navas et al., 2001) . Table III lists the known cellular receptors for coronaviruses of groups 1 and 2; to date no receptors have been identified for coronaviruses of group 3. Group 2 coronavirus receptors include the earliest and the most recent of the items in Table III . The MHV receptor (formerly MHVR1, now called mCEACAM1) is a member of the carcinoembryonic antigen (CEA) family, a group of proteins within the immunoglobulin (Ig) superfamily. CEACAM1 was the first receptor discovered for a coronavirus, and, indeed, it was one of the first receptors found for any virus (Williams et al., 1990 . Cloning of cDNA to the largest mRNA for this protein revealed that full-length CEA-CAM1 has four Ig-like domains (Dveksler et al., 1991) , but a number of two-and four-domain versions of the molecule were later found to be expressed in mouse cells. This diversity of MHV receptor isoforms was found to be generated by multiple alleles of the Ceacam1 gene as well as by the existence of multiple alternative splicing variants of its mRNA (Compton, 1994; Dveksler et al., 1993a,b; Ohtsuka and Taguchi, 1997; Ohtsuka et al., 1996; Yokomori and Lai, 1992) . The wide range of pathogenicity of MHV in mice is therefore thought to result from the interactions of S proteins of different virus strains with the tissue-specific spectra of receptor variants displayed in mice having different genetic backgrounds. A number of lines of evidence argue that CEACAM1 is the only biologically relevant receptor for MHV. This was initially suggested by an early experiment showing that in vivo administration of a monoclonal antibody to CEACAM1 greatly enhanced the frequency of survival of mice subsequently given a lethal challen ge of MHV (Smith et al. , 1991 ) . More definitively, it was demonstrated th at homozy gous Ceacam 1 knoc kout mice were totally resistant to infection by high doses of MHV (Hem mila et al. , 2004) . Thus, even thou gh CEA CAM2, the prod uct of the other murine Ceacam gene family membe r, can functi on as a weak MHV recep tor in tissu e culture ( Nedelle c et al. , 1994 ) , i t cannot be used as an alternat ive recep tor in vivo .

Initia l stud ies of the structural requirem ents for CEACAM1 function show ed that the molec ule must be glyco sylated in order to be functi onal as an MHV recep tor ( Pensiero et al. , 1992) . Moreo ver, th e amino-term inal Ig-like domain was found to be the part of the molecu le that is bo und bo th by MHV S protein and by the monocl ona l antibody origina lly used to identify the receptor ( Dveksler et al. , 1993b ) . The essen tial differenc e be tween high-affinit y and low-affinity S binding recep tor alleles has been mapp ed to a deter minan t as small as six amino acid residues on the amino-termina l domain ( Rao et al. , 1997; Wessner et al. , 1998 ) . Thes e criti cal residue s, it turns out, fall wit hin a prom inent, uniqu ely conv oluted loop in the recent ly . solved x-ray crystallographic structure for a two-Ig-domain isoform of CEACAM1 (Tan et al., 2002) . Notably, this loop was found to be topologically similar to protruding loops of the virus-binding domains of the receptors for rhinoviruses, HIV, and measles, all of which, like CEACAM1, are cell adhesion molecules. The CEACAM1 structure now provides the basis for beginning to understand the relative affinities of receptor variants for different S protein ligands.

Other group 2 coronaviruses use different receptors. The rat coronaviruses RCoV and SDAV, although closely related to MHV and able to grow in some of the same cell lines as does MHV, do not gain entry to cells via mCEACAM1. Anti-CEACAM1 monoclonal antibody, which totally blocks MHV infection, was shown to have no effect on infection by rat coronaviruses; moreover, expression of mCEACAM1 in nonpermissive BHK cells rendered them susceptible to MHV but not to rat coronaviruses (Gagneten et al., 1996) . BCoV is phylogenetically close to MHV, but the two viruses neither share common hosts nor are they supported by any of the same cell lines in tissue culture. To date, the only identified cell attachment factor for BCoV is 9-O-acetyl sialic acid (Schultze et al., 1991) , but it is not yet clear whether this moiety must be linked to specific proteins or glycolipids or whether there is also a specific cellular protein receptor for BCoV.

Not surprisingly, SARS-CoV, which is phylogenetically most distant from all other group 2 coronaviruses, uses a receptor wholly unrelated to CEACAMs. The SARS-CoV receptor, which was found in remarkably short order after the discovery of the virus, is angiotensin-converting enzyme 2 (ACE2). This was identified through the use of a SARS-CoV S1-IgG fusion protein to immunoprecipitate membrane proteins from Vero E6 cells, an African green monkey kidney cell line that is the best in vitro host for SARS-CoV . Binding of S1-IgG to Vero E6 cells was inhibited by soluble ACE2 protein but not by a related protein, ACE1. Expression of cloned cDNA for ACE2 was then shown to render nonpermissive cells susceptible to infection by SARS-CoV . ACE2 was also identified by expression cloning of an S1-binding activity, and it was shown to render cells infectable by a retroviral pseudotype carrying the SARS-CoV S protein (Wang et al., 2004) .

ACE2 is a zinc-binding carboxypeptidase that is involved in regulation of heart function. It is an N-exo, C-endo transmembrane glycoprotein with a broad tissue distribution. Active-site mutants of ACE2 showed no detectable defects in binding to SARS-CoV S protein or in promoting S protein-mediated syncytia formation , suggesting that ACE2 catalytic activity is not required for receptor function. This conclusion needs to be verified by direct SARS-CoV infection, however. Recently solved x-ray structures for ACE2 have revealed that a large conformational change is induced by the binding of an inhibitor in the active site of the enzyme (Towler et al., 2004) . Although this finding raised the possibility of a means to interfere with the initiation of infection, the inhibitor does not affect S protein binding or receptor function of ACE2 (Li et al., 2005a) .

Numerous cell lines from a range of species have been classified with respect to their permissivity or nonpermissivity to SARS-CoV Giroglou et al., 2004; Mossel et al., 2005) , thereby allowing inferences as to which species homologs of ACE2 could have some degree of SARS-CoV receptor activity. In direct tests of S1 binding, human ACE2 was shown to be a much better receptor than was mouse ACE2; the receptor activity of rat ACE2, however, was barely detectable above background . In all cases tested, nonpermissive cells were shown to be made permissive by expression of human ACE2 (Mossel et al., 2005) . The full picture of factors influencing SARS-CoV host and tissue tropism is still developing. Human CD209L (also called L-SIGN or DC-SIGNR), a lectin family member, has been found to act as a second receptor for SARS-CoV, but it has much lower efficiency than does ACE2 (Jeffers et al., 2004) . A related lectin, DC-SIGN, was identified as a coreceptor, since it was able to transfer the virus from dendritic cells to susceptible cells; DC-SIGN could not act as receptor on its own, however Yang et al., 2004) .

Many group 1 coronaviruses use the aminopeptidase N (APN) of their cognate species as a receptor (Table III) (Delmas et al., 1992; Tresnan et al., 1996; Yeager et al., 1992) . APN (also called CD13) is a cell-surface, zinc-binding protease that contributes to the digestion of small peptides in respiratory and enteric epithelia; it is also found in human neural tissue that is susceptible to HCoV-229E (Lachance et al., 1998) . The APN molecule is a homodimer; each monomer has a C-exo, N-endo membrane orientation and is heavily glycosylated. Competition experiments with monoclonal antibodies suggested that there is some overlap between the catalytic domain of hAPN and the binding site for HCoV-229E (Yeager et al., 1992) . However, neither the use of specific APN inhibitors, nor the mutational disruption of the catalytic site of pAPN, affected its TGEV receptor activity, indicating that the enzymatic activity of APN, per se, is not required for initiation of infection (Delmas et al., 1994a) . In general, the receptor activities of APN homologs are not interchangeable: hAPN cannot act as a receptor for TGEV (Delmas et al., 1994a) , and pAPN cannot act as a receptor for HCoV229E (Kolb et al., 1996) . Curiously, fAPN can serve as a receptor not only for FIPV but also for CCoV, TGEV, and HCoV-229E (Tresnan et al., 1996) . These contrasting properties have been used as the framework for dissecting the basis of species-specific or -nonspecific function, through the construction and analysis of chimeric receptors (Benbacer et al., 1997; Delmas et al., 1994a; Hegyi and Kolb, 1998; Kolb et al., 1996 Kolb et al., , 1997 . However, chimera construction has not revealed a single linear determinant for virus binding. Rather, two different regions of the molecule have been found to influence receptor activity with respect to a given coronavirus. A detailed study of one of these regions showed that the critical characteristic in chimeras that exclude HCoV-229E is a particular glycosylation site. HCoV-229E likely does not directly bind to this region of APN, but it is hindered from doing so in homologs that are glycosylated at this locus (Wentworth and Holmes, 2001) .

Not all group 1 coronaviruses use APN as a receptor, however. It has been proposed that one subset of FIPV strains uses a different receptor, since an antibody to fAPN blocked replication of type II strains of FIPV but not replication of type I strains of FIPV (Hohdatsu et al., 1998) . This conclusion is consistent with the observation that there is greater sequence divergence between type I FIPV S proteins and type II FIPV S proteins than there is between type II FIPV S proteins and the S proteins of CCoV or TGEV (Herrewegh et al., 1998; Motokawa et al., 1996) . Likewise, although it has been suggested that pAPN can facilitate cellular entry of PEDV (Oh et al., 2003) , the major receptor for PEDV probably differs from that for TGEV, since the two viruses are able to grow in mutually exclusive sets of cells lines derived from different species (Hofmann and Wyler, 1988) . The most outstanding exception to the generality of APN as a receptor for group 1 coronaviruses is the discovery that HCoV-NL63 cannot use hAPN to initiate infection; instead it is able to employ the same receptor as SARS-CoV, namely ACE2 (Hofmann et al., 2005) . This finding raises very interesting questions, one of which is why HCoV-NL63 causes a much milder respiratory disease than does SARS-CoV. Another is why two very different, zinc-binding, cell-surface peptidases, APN and ACE2, should serve as receptors for such a substantial number of coronaviruses. This situation can currently be ascribed to an amazing coincidence, but it may later be found to have deeper significance.

The more variable of the two portions of the spike molecule, S1, is the part that binds to the receptor. Binding leads to a conformational change that results in the more highly conserved portion of the spike molecule, S2, mediating fusion between virion and cell membranes. Just as different coronaviruses can bind to different receptors, coronaviruses also appear to use different regions of S1 with which to do so. Receptor-binding domains (RBDs) have so far been mapped in four S proteins (Fig. 2) . In the group 1 coronavirus TGEV, the RBD was localized to amino acids 579-655, a region highly conserved among the S proteins of TGEV, PRCoV, FIPV, FCoV, and CCoV (Godet et al., 1994) . For the more distantly related group 1 coronavirus HCoV-229E, the RBD was found to fall in an adjacent, nonoverlapping segment of S1, amino acids 417-547 (Bonavia et al., 2003) . By contrast, the RBD of MHV was localized to the amino terminus of the S molecule, amino acids 1-330 (Kubo et al., 1994; Suzuki and Taguchi, 1996; Taguchi, 1995) . Finally, the RBD of SARS-CoV was mapped to amino acids 270-510 or 303-537 by binding of S protein fragments to Vero cells Xiao et al., 2003) . These loci were contained within a domain shown to harbor the epitope for a neutralizing singlechain antibody fragment that blocked S1 association with the ACE2 receptor (Sui et al., 2004) . The SARS-CoV RBD was more finely delimited, to amino acids 318-510, by analysis of the binding to ACE2 of a large set of S1 constructs . Thus, on a linear map of S proteins aligned principally by their S2 domains, the MHV RBD falls near the amino end of S1, the SARS-CoV RBD is in the middle of S1, and the TGEV and HCoV-229E RBDs fall near the carboxyl end of S1. The complementarity of the MHV and TGEV RBD loci is further emphasized by the fact that substantial deletions are tolerated in TGEV S1 in the region that corresponds to the MHV RBD . Conversely, substantial deletions are tolerated in MHV S1 in the region that corresponds to the TGEV RBD (Parker et al., 1989; Rowe et al., 1997) .

For MHV, persistent infection in tissue culture was shown to lead to the selection of variant viruses with an extended host range (Baric et al., 1997 (Baric et al., , 1999 Schickli et al., 1997) . These viruses gained the ability to grow in cell lines from numerous species not permissive to wild-type MHV through an acquired recognition of receptors other than CEA-CAM1. Analysis and engineered reconstruction of one of these selected variants showed that a relatively small number of amino acid changes in the S protein RBD accounted for its extended host range (Schickli et al., 2004; Thackray and Holmes, 2004) . Comparison of the RBDs of various strains of MHV, of the extended host range mutant of MHV, and of other group 2 coronaviruses allowed the identification of five residues in the RBD that were uniquely conserved among MHV strains (Thackray et al., 2005) . Mutations in some of these residues were lethal or resulted in viruses that formed very small plaques; in particular, a tyrosine at position 162 of the RBD was proposed as a candidate element in a key interaction with the receptor.

A set of elegant studies with the SARS-CoV S protein and ACE2 has provided the most detailed image of RBD-receptor interactions yet available for any coronavirus. Aided by the x-ray structure of ACE2, Li et al. (2005a) used the rat ACE2 molecule, which has negligible receptor activity, as a scaffold to identify critical residues in human ACE2. Transfer of as few as four human ACE 2 residues to rat ACE2 enabled the latter to bind S protein almost as well as human ACE2 did. A similar approach was used to determine key S1 residue changes that allowed the interspecies jump of SARS-CoV. The S1 domains of two SARS-CoV isolates were compared in this analysis: one (TOR2) from the main 2002-2003 SARS outbreak, and one (GD) from the subsequent 2003-2004 outbreak; the latter outbreak was much less severe and did not include any human-to-human transmission. Both the TOR2 and GD viruses are thought to have been transmitted to humans from palm civets, the final intermediary host in the jump of SARS-CoV from an unknown natural reservoir. However, only the TOR2 virus efficiently adapted to humans. Correspondingly, it was found that the S1 domains of both the TOR2 and GD viruses bound to palm civet ACE2, but only TOR2 S1 bound to human ACE2 (Li et al., 2005a) . Binding experiments with numerous chimeric variants were used to chart precisely which of the multiple coordinated changes in both the S1 RBD and in the human and palm-civet ACE2 could account for differences in the mutual affinities of the two molecules. The basis for the results that were obtained was then deduced from the x-ray structure of human ACE2 in a complex with the SARS-CoV S protein RBD (Li et al., 2005b) . The RBD was found to bind to the aminoterminal, catalytic domain of ACE2, contacting the latter with a concave, 71-residue loop. Inspection of the interface of this contact revealed that an astonishingly small number of RBD amino acid changes were critical to the adaptation of the virus from one species homolog of ACE2 to another. A change as subtle as the gain of a methyl group (serine to threonine at residue 487 of the RBD) that fits into a hydrophobic pocket on the receptor could account for a 20-fold increase in affinity of S1 for human ACE2.

The binding of spike to its cellular receptor triggers a major conformational change in the S molecule. In some cases, induction of this conformational change may also require a shift to an acidic pH. Thus, some coronav iruses, such as MHV, fuse with the pla sma membr ane at the cell surf ace ( Sturma n et al., 1990; Weism iller et al. , 1990 ) , while others , such as TGEV ( Hanse n et al. , 1998 ) , HCoV-229E (Nomur a et al. , 2004 ) , and SAR S-CoV (Hof mann et al. , 2004; Simm ons et al. , 2004; Yang et al. , 2004 ) , appea r to enter the cell via receptor-mediated endocyto sis and then fuse wit h the membr ane s of acidifi ed endosomes . There may be a very fine bal ance betw een these two states. For MHV, it was found tha t as few as three amino acid chang es in a heptad repeat region in S2 could gover n the sw itch from plasm a membr ane fusion to strictly acid pH-dep endent fusion ( Gallagh er et al. , 1991; Na sh and Buchm eier, 1997 ) . For SAR S-CoV, protease treatme nt of cells at th e earlies t step s of in fection was found to allow th e viru s to en ter cells from the surfa ce, rather than throug h an e ndocytic pathw ay ( Matsu yama et al. , 2005) . Such treatm ent enh anced the in fectivity of the viru s by order s of mag nitude, and this enhan cement was recep tor depend ent. Althou gh S ARS-CoV S prot ein is not detectab ly cleav ed in virio ns or pseud ovi rions prod uced in tiss ue culture ( Simm ons et al. , 2004; Song et al. , 2004) , prot ease treatmen t may mimic the envi ronment result ing from an in flammat ory respon se in infected lungs.

Much of the charact erization of the recep tor-induced con formationa l chang e in S was initially carrie d out wit h the MHV S prot ein, for which it was found that the effects of recep tor binding could al so be elicite d by treatm ent of virions at mild alkaline pH ( Sturm an et al ., 1990 ) . Such treatm ent caused the diss ociation and relea se of th e cleaved S1 subunit and th e aggre gation of S2 subunits ; the accompan ying confor mational changes in S1 were monitored by differential access of a panel of monoclonal antibodies at neutral and alkaline pH (Weismiller et al., 1990) . Disulfide bond formation plays an important role in S protein folding, and disulfides in S1 may become rearranged during the conformational transitions of S1 following receptor binding (Lewicki and Gallagher, 2002; Opstelten et al., 1993; Sturman et al., 1990) . The S protein of the highly virulent MHV strain 4 (JHM) has been shown to exist in a particularly metastable configuration. This results in a hair-trigger spike so highly fusogenic that it can mediate fusion between infected cells and cells lacking receptors, thereby leading to more extensive neuropathogenesis than occurs with other MHV strains (Gallagher and Buchmeier, 2001; Gallagher et al., 1992; Krueger et al., 2001; Nash and Buchmeier, 1996) .

In the normal spike-receptor interaction, both the S1-binding and the S1-activation functions were found to reside in the amino-terminal Ig domain of CEACAM1 (Miura et al., 2004) . The role of the additional Ig domain(s) in the various CEACAM isoforms is apparently to give the virus access to the amino-terminal Ig domain. Similarly, although the RBD of the MHV S protein lies near the amino terminus of S1, portions of the molecule distal to this site can significantly influence the stability of the S1-receptor interaction (Gallagher, 1997) . The conformational change that separates S1 from the rest of the molecule, in turn, transmits a major change to S2. This secondary change has been monitored by the differential susceptibility of S2 to protease treatment before and after the binding of S1 to soluble receptor (Matsuyama and Taguchi, 2002) . Additionally, the same changes were shown to be caused, in the absence of receptor, by mild alkaline pH, which induced a fusogenic state in S2 that could be measured by a liposome flotation assay .

It has been realized that the coronavirus S protein is a type I viral fusion protein with functional similarities to the fusion proteins of phylogenetically distant RNA viruses such as influenza virus, HIV, and Ebola virus (Bosch et al., 2003) . Similar to its counterparts in other viruses, the coronavirus S2 domain contains two separated heptad repeats, HR1 and HR2, with a fusion peptide upstream of HR1 and the transmembrane domain immediately downstream of HR2 (Fig. 2) . Mutations in the MHV S protein HR1 and HR2 regions were shown to inhibit or abolish fusion (Luo and Weiss, 1998; Luo et al., 1999) . Unlike its counterparts, however, the coronavirus S protein does not require cleavage to be fusogenic, and it contains an internal fusion peptide, although the exact assignment of this domain is not agreed upon (Guillen et al., 2005; Sainz et al., 2005) . Even for MHV S and other cleaved S proteins, the fusion peptide is not the amino terminus of S2 created by cleavage (Luo and Weiss, 1998) , as is the case in other type I fusion proteins.

The receptor-mediated conformational change in S1 and the dissociation of S1 from S2 are thought to initiate a major rearrangement in the remaining S2 trimer. This rearrangement exposes a fusion peptide that interacts with the host cellular membrane, and it brings together the two heptad repeats in each monomer so as to form an antiparallel, six-helix "trimer-of-dimers" bundle. The result is the juxtaposition of the viral and cellular membranes in sufficient proximity to allow the mixing of their lipid bilayers and the delivery of the contents of the virion into the cytoplasm. The trimer of dimers is extremely stable, forming a rod-like, protease-resistant complex, the biophysical properties of which have been studied in depth for the S proteins of MHV (Bosch et al., 2003) and SARS-CoV (Bosch et al., 2003 Ingallinella et al., 2004; Liu et al., 2004; Tripet et al., 2004) by the use of model peptides. X-ray crystallographic structures have been solved for peptide complexes for both the MHV S protein (Xu et al., 2004a) and the SARS-CoV S protein (Duquerroy et al., 2005; Supekar et al., 2004; Xu et al., 2004b) . In the six-helix bundle, the three HR1 helices were found to form a central, coiled-coil core, and the three HR2 helices, in an antiparallel orientation, pack into the grooves between the HR1 monomers. There is no contact between the HR2 monomers, each of which associates with the HR1 grooves through hydrophobic interactions. The overall structures obtained for MHV S and SARS-CoV S are highly similar to each other and strongly resemble the structures of the fusion cores of influenza virus HA and HIV gp41. Noteworthy differences are that the coronavirus HR1 coiled-coil is two to three times larger than its counterparts in other viruses and that the much shorter coronavirus HR2 helices assume a unique conformation within the bundle. A major goal of these studies is the design of peptides that are able to inhibit formation of this complex in SARS-CoV infections.

In addition to the mechanisms of the conformational rearrangements of S1 and S2, other factors influence coronavirus fusion and entry, in ways that are not yet well understood. For two coronaviruses, the role of cholesterol in virus entry has been investigated. Cholesterol supplementation was found to augment MHV replication, while cholesterol depletion was inhibitory; these effects were shown to occur at the earliest stages of infection (Thorp and Gallagher, 2004) . Contrary to expectations, the basis for the action of cholesterol was not through clustering of CEACAM receptors into lipid rafts, either before or after the binding of virus to receptor (Choi et al., 2005; Thorp and Gallagher, 2004) . However, cell-bound virions did cluster into lipid rafts, suggesting that MHV S protein associates with some host factor other than CEACAM prior to entry (Choi et al., 2005) . For HCoV-229E, on the other hand, both virus and hAPN receptor were found to redistribute on the cell surface from an initially disperse pattern to clusters within caveolin-1-rich lipid rafts (Nomura et al., 2004) . Thus, the mechanism by which cholesterol assists infection may differ between coronaviruses that enter the cell via receptor-mediated endocytosis and those that fuse with the plasma membrane.

For those coronaviruses that bring about syncytia formation, cellcell fusion appears to have different requirements than virus-cell fusion. Studies with MHV have long noted a correlation between the degree of S protein cleavage and the amount of cell-cell fusion, both of which could be enhanced by trypsin treatment . The extent and kinetics of S protein cleavage were shown to vary among different cell lines, implicating the involvement of a cellular, rather than viral, protease (Frana et al., 1985) . Consistent with this, an MHV strain A59 mutant isolated from persistently infected glial cells was found to have an altered cleavage site, RRADR instead of the wild-type RRAHR (Gombold et al., 1993) ; this change caused an extreme delay, but not abrogation, of fusion of infected cells. Studies of expressed MHV S proteins with wild-type or mutated cleavage sites gave essentially the same results, showing that the fusion delay was strictly a property of mutant S protein (Bos et al., 1995; Stauber et al., 1993; Taguchi, 1993) . However, S protein was found not to be cleaved at all in MHV-infected primary glial cells or hepatocytes, indicating that cleavage was not a requirement for virus-cell fusion (Hingley et al., 1998) . It was demonstrated that furin or a furin-like protease is responsible for MHV S cleavage in tissue culture (de Haan et al., 2004) . Treatment of cells with a specific furin inhibitor blocked both cleavage and cell-cell fusion, but it had no effect on virus-cell fusion.

Another component of the MHV S protein that operates in cell-cell fusion is the cysteine-rich region of the endodomain, mutation of which delays or abrogates syncytia formation (Bos et al., 1995; Chang et al., 2000) . It is currently not known how this segment of the S molecule, which is on the opposite side of the membrane from the six-helix bundle, participates in the fusion process. The cysteine-rich region of the endodomain is a possible target for palmitoylation (Bos et al., 1995) , which is a known modification of MHV S (Niemann and Klenk, 1981) , but, as yet, a role for palmitoylation has not been established.

Once the full program of viral gene expression is underway, through transcription, translation, and genome replication, progeny viruses can begin to assemble. Coronavirus virion assembly occurs through a series of cooperative interactions that occur in the ER and the ERGIC among the canonical set of structural proteins, S, M, E, and N. The M protein is a party to most, if not all, of these interactions and has come to be recognized as the central organizer of the assembly process. Despite its dominant role, however, M protein alone is not sufficient for virion formation. Independent expression of M protein does not result in its assembly into virion-like structures. Under these circumstances, M was shown to traverse the secretory pathway as far as the trans-Golgi (Klumperman et al., 1994; Machamer and Rose, 1987; Machamer et al., 1990; Rottier and Rose, 1987; Swift and Machamer, 1991) , where it forms large, detergent-insoluble complexes (Krijnse Locker et al., 1995; Weisz et al., 1993) . By contrast, MHV, IBV, TGEV, and FIPV, representative species from each of the three coronavirus groups, were found to bud into a proximal compartment, the ERGIC (Klumperman et al., 1994; Krijnse Locker et al., 1994; Tooze et al., 1984 Tooze et al., , 1988 . These observations suggested that some factor, in addition to M, must determine the site of virion assembly and budding.

The identification of the unknown factor came from the development of virus-like particle (VLP) systems for coronaviruses. Such studies showed that, for MHV, coexpression of both M protein and the minor virion component, E protein, was necessary and sufficient for the formation of particles (Bos et al., 1996; Vennema et al., 1996) . The resulting VLPs were morphologically identical to virions (minus spikes) and were released from cells by a pathway similar to that used by virions. Notably, neither the S protein nor the nucleocapsid was found to be required for VLP formation. These results were subsequently generalized for coronaviruses from all three groups: BCoV and TGEV (Baudoux et al., 1998) , IBV Machamer, 2000, 2003) , and SARS-CoV (Mortola and Roy, 2004) . Currently, there is one known exception to this trend: in a separate study of SARS-CoV, M and N proteins were reported to be necessary and sufficient for VLP formation, whereas E protein was dispensable (Huang et al., 2004a) . This latter contradiction remains to be resolved. It may reflect a unique aspect of SARS-CoV virion assembly, or, alternatively, it may indicate that VLP requirements can vary with different expression systems.

Since VLPs contain very little E protein, it is assumed that lateral interactions between M protein monomers are the driving force for virion envelope formation. These interactions have been explored through examination of the ability of constructed M protein mutants to support or to interfere with VLP formation. A study that tested the structural requirements of the M protein found that mutations either in the ectodomain, or in any of the three transmembrane domains, or in the carboxy-terminal endodomain, could inhibit or abolish VLP formation (de Haan et al., 1998a) . In particular, the carboxy terminus of M was extremely sensitive to small deletions or even to point mutations of the final residue of the molecule. Construction of many of these latter mutations in the viral genome revealed a consistent set of effects on viral viability. Yet, virions were better able than VLPs to tolerate carboxy-terminal alterations in M protein, presumably because virions were stabilized by additional intermolecular interactions not present in VLPs. In experiments in which both wild-type and mutant M proteins were coexpressed with E protein, wild-type M protein was able to rescue low concentrations of assembly-defective mutant M proteins into VLPs (de Haan et al., 1998a) . This finding, coupled with results from coimmunoprecipitation analyses, provided the basis for further work, which concluded that monomers of M interact via multiple contacts throughout the molecule and particularly in the transmembrane domains .

That VLPs could be formed in the absence of S protein (Bos et al., 1996; Vennema et al., 1996) confirmed the much earlier discovery that treatment of MHV-infected cells with the glycosylation inhibitor tunicamycin led to the assembly and release of spikeless (and consequently, noninfectious) virions (Holmes et al., 1981; Rottier et al., 1981) . These findings were also consistent with the properties of certain classical temperature-sensitive mutants of MHV and IBV, which, owing to S gene lesions, failed to incorporate spikes into virions at the nonpermissive temperature Ricard et al., 1995; Shen et al., 2004) . Independently expressed MHV, FIPV, or IBV S proteins enter the default secretory pathway and ultimately reach the plasma membrane (Vennema et al., 1990) . In the presence of M protein, however, a major fraction of S is retained in intracellular membranes, as was shown by coimmunoprecipitation of S and M proteins from MHVinfected cells . Moreover, the interaction of M with S was demonstrated to be specific; complexes of M did not impede the progress of a heterologous glycoprotein (the VSV G protein) to the plasma membrane. Additionally, kinetic experiments revealed that the folding and oligomerization of S protein in the ER is rate limiting in the M-S interaction, in which nascent M protein immediately participates . Complexes of the M and S proteins were similarly observed in BCoV-infected cells, for which it was found that M also determines the selection of HE protein for incorporation into virions (Nguyen and Hogue, 1997) . The simplest picture to be drawn from all this evidence, then, is that S protein is entirely passive in assembly but becomes trapped by M protein upon passage through the ER. Nevertheless, there are indications that, in some cases, S cooperates in its own capture. By the criterion of acquisition of endo H resistance, independently expressed S protein was found to be transported to the cell surface with much slower kinetics than S protein that was incorporated into virions. This led to the proposal that free S protein harbors intracellular retention signals that become hidden during virion assembly (Vennema et al., 1990) . Such signals have been found in the (group 3) IBV S protein cytoplasmic endodomain, which contains both a dilysine motif that was shown to specify retention in the ERGIC and a tyrosine-based motif that causes retrieval by endocytosis from the plasma membrane (Lontok et al., 2004) . Additionally, a novel dibasic ERGIC retention signal was identified in the S protein endodomains of group 1 coronaviruses (TGEV, FIPV, and HCoV-229E) and SARS-CoV, but not other group 2 coronaviruses, such as MHV and BCoV.

Although the S protein is not required for VLP formation, it does become incorporated into VLPs if it is coexpressed with the M and E proteins (Bos et al., 1996; Vennema et al., 1996) . VLP manipulations thus made it possible to begin to dissect the molecular basis for the specific selection of S protein by M protein. As for M-M homotypic interactions, the sites within M protein that bind to S protein have not yet been pinpointed. On a broader scale, deletion mapping has indicated that the ectodomain of M protein and the carboxy-terminal 25 residues of the endodomain do not participate in interactions with S, even though both of these regions are critical for VLP formation (de Haan et al., 1999) . The residues of S protein that interact with M protein, on the other hand, have been much more precisely localized. This mapping began with the swapping of ectodomains between the very divergent S proteins of MHV and FIPV . This type of exchange showed that the incorporation of S protein into VLPs of a given species was determined by the presence of merely the transmembrane domain and endodomain of S protein from the same species. The source of the S ectodomain did not matter. The assembly competence of the 1324-residue MHV S protein or the 1452-residue FIPV S protein was therefore restricted to just the 61-amino-acid, carboxy-terminal region of each of these molecules. That the domainswitched S molecules were completely functional was demonstrated by the construction of an MHV mutant, designated fMHV, in which the ectodomain of the MHV S protein was replaced by that of the FIPV S protein (Kuo et al., 2000) . As predicted, this mutant gained the ability to grow in feline cells, while losing the ability to grow in mouse cells. The fMHV chimera provided the basis for powerful selections, based on host cell species restriction, that have been used with the reverse genetic system of targeted RNA recombination (Section VI) (Kuo and Masters, 2002; Masters, 1999; Masters and Rottier, 2005) . The converse construct, an FIPV mutant designated mFIPV, in which the ectodomain of the FIPV S protein was replaced by that of the MHV S protein, had properties exactly complementary to those of fMHV (Haijema et al., 2003) .

More detaile d dissect ion of the transmembr ane domain and endodomain of the MHV S protein has been carrie d out to furthe r localiz e th e deter minants of S incorp oration into virions Ye et al. , 2004 ) . In one study, th e S protei n transm embrane domain, or the endodom ain, or both, were swapped with the corre spond ing region(s ) of a heterol ogou s tran smembran e protein , whic h was expres sed as an ext ra viral gene prod uct (Ye et al., 2004) . Mutati ons were const ructed in this surr ogate virion structural prot ein, or, alterna tively, direc tly in th e S prot ein. From this work, the virion assembly property of S was found to map solely to the 38-residue endodomain, with a major role assigned to the charge-rich, carboxy-terminal region of the endodomain. Additionally, it was observed that the adjacent, membrane-proximal, cysteine-rich region of the endodomain was critical for cell-cell fusion during infection, consistent with results previously reported from investigations using S protein expression systems (Bos et al., 1995; Chang et al., 2000) . A second study, based on analysis of a progressive series of carboxyterminal truncations of the S protein in VLPs and in viral mutants, also mapped the virion assembly competence of S to the endodomain . In this work, however, the major role in assembly was attributed to the cysteine-rich region of the endodomain, and the overall size, rather than the sequence of the endodomain, was seen to be critical. Thus, the precise nature of the interaction between the S protein endodomain and the M protein remains to be resolved.

The interaction of the viral nucleocapsid with M protein was originally examined by the fractionation of purified MHV virions (Sturman et al., 1980) . At 4 C, M protein was separated from other components on density gradient centrifugation of NP-40-solubilized virion preparations, but M reassociated with the nucleocapsid when the temperature was elevated to 37 C. Further analysis suggested that, contrary to expectations, this temperature-dependent association was mediated by M binding to viral RNA, rather than to N protein. The notion of M protein as an RNA-binding protein has been revived in light of recent results on the mechanism of genome packaging (Section IV.C) (Narayanan et al., 2003a) .

For TGEV virions, the use of particular low-ionic-strength conditions of NP-40 treatment similarly resulted in the finding that a fraction of M protein was persistently integrated with subviral cores (Risco et al., 1996) . For assay of this association, in vitro-translated M protein was bound to immobilized nucleocapsid purified from virions (Escors et al., 2001) . Through the combined approaches of deletion mapping, inhibition by antibodies of defined specificity, and peptide competition, the M-nucleocapsid interaction was localized to a segment of 16 residues adjacent to the carboxy terminus of the 262-residue TGEV M protein.

Studies of MHV have taken genetic avenues to explore the N protein-M protein interaction. In one report, a viral mutant was constructed in which the carboxy-terminal two amino acids of the 228residue MHV M protein were deleted (Kuo and Masters, 2002) , a lesion previously known to abolish VLP formation (de Haan et al., 1998a) . The resulting highly impaired virus, designated MÁ2, formed tiny plaques and grew to maximal titers many orders of magnitude lower than those of the wild type. Multiple independent second-site revertants of the MÁ2 mutant were isolated and mapped to either the carboxy terminus of M or that of N. Reconstruction of some of these compensating mutations, in the presence of the original MÁ2 mutation, provided evidence for a structural interaction between the carboxy termini of the M and the N proteins. In a complementary analysis, a set of viral mutants were created containing all possible clustered charged-to-alanine mutations in the carboxy-terminal domain 3 of the N protein (Hurst et al., 2005) . One of the members of this set, designated N-CCA4, was extremely defective, having a phenotype similar to that of the MÁ2 mutant. Multiple independent second-site suppressors of N-CCA4 were found to map in the carboxy-terminal region of either the N or the M protein, thereby reciprocating the genetic cross-talk uncovered with the MÁ2 mutant. Additionally, it was shown that the transfer of N protein domain 3 to a heterologous protein allowed incorporation of that protein into MHV virions.

In contrast to the more overt structural roles of the M, S, and N proteins, the part played by E protein in assembly is enigmatic. On discovery of the essential nature of E in VLP formation, it was speculated that the low amount of E protein in virions and VLPs indicated a catalytic, rather than structural, function for this factor. E protein might serve to induce membrane curvature in the ERGIC, or it might act to pinch off the neck of the viral particle in the final stage of the budding process (Vennema et al., 1996) . In a search for evidence correlating the VLP findings to the situation in whole virions, a set of clustered charged-to-alanine mutations were constructed in the E gene of MHV. One of the resulting mutants was markedly thermolabile, and its assembled virions had striking morphologic defects, exhibiting pinched and elongated shapes that were rarely seen among wild-type virions (Fischer et al., 1998) . This phenotype clearly supported a critical role for E protein in virion assembly. Surprisingly, however, it was later found to be possible to entirely delete the E gene from the MHV genome, although the resulting ÁE mutant virus was only minimally viable compared to the wild type (Kuo and Masters, 2003) . This indicated that, for MHV, the E protein is important, but not absolutely essential, to virion assembly. By contrast, for TGEV, two independent reverse genetic studies showed that knockout of the E gene was lethal. Viable virus could be recovered only if E protein was provided in trans (Curtis et al., 2002; Ortego et al., 2002) . This discordance may point to basic morphogenic differences between group 2 coronaviruses (such as MHV) and group 1 coronaviruses (such as TGEV). Alternatively, it is possible that E protein has multiple activities, one of which is essential for group 1 coronaviruses but is largely dispensable for group 2 coronaviruses.

The information available about E protein at this time is not sufficiently complete to allow us to understand the function of this tiny molecule. One of the most intriguing questions is whether it is necessary for E protein to directly physically interact with M protein, or whether E acts at a distance. If E protein has multiple roles, then perhaps both of these possibilities are applicable. Direct interaction between the E and M proteins is implied by the observation that, at least in some cases, coexpression of E and M proteins from different species does not support VLP formation (Baudoux et al., 1998) . The demonstration that IBV E and M can be cross-linked to one another also has established that the two proteins are in close physical proximity in infected or transfected cells (Corse and Machamer, 2003) . Contrary to this, some data appear to argue that E acts independently of M. The individual expression of MHV or IBV E protein results in membrane vesicles that are exported from cells (Corse and Machamer, 2000; Maeda et al., 1999) . Additionally, it has been shown that the expression of MHV E protein alone leads to the formation of clusters of convoluted membranous structures highly similar to those seen in coronavirus-infected cells . This suggests that the E protein, without other viral proteins, acts to induce membrane curvature in the ERGIC. Some indirect evidence may also be taken to indicate that E does not directly contact other viral proteins. Multiple revertant searches with E gene mutants failed to identify any suppressor mutations that map in M or in any gene other than E (Fischer et al., 1998) . Similarly, none of the intergenic suppressors of the MÁ2 mutant mapped to the E gene (Kuo and Masters, 2002) . It has been found that the SARS-CoV E protein forms cation-selective ion channels in a model membrane system (Wilson et al., 2004) . Moreover, this channel-forming property was contained in the amino-terminal 40 residues of the 76-residue SARS-CoV E molecule. Such an activity made be the basis for an independent mode of action of E protein.

Although a variety of positive-and negative-strand viral RNA species are synthesized during the course of infection (Section V), coronaviruses selectively incorporate genomic (positive-strand) RNA into assembled virions. This may be accomplished with varying degrees of stringency by different members of the family. Sucrose gradientpurified virions of MHV have been found to exclusively contain genomic RNA (Makino et al., 1990) . By contrast, similarly purified virions of BCoV (Hofmann et al., 1990) , TGEV (Sethna et al., 1989 (Sethna et al., , 1991 , and IBV (Zhao et al., 1993) have been reported to contain significant quantities of subgenomic mRNA, in some cases in molar amounts exceeding those of the genomic RNA. However, in a study of TGEV, in which virions were extensively purified by an ELISA-based immunopurification procedure, a very high degree of selectivity for genomic RNA packaging was observed (Escors et al., 2003) .

In those viruses in which it has been mapped, the RNA element that specifies selective packaging falls, as would be expected, in a region of the genome that is not found in any of the subgenomic mRNAs. In MHV, the genomic packaging signal was localized through analysis of defective interfering (DI) RNAs. DI RNAs are extensively deleted variants of the genome that propagate as molecular parasites, using the replicative machinery of a helper virus. Some DI RNAs are packaged efficiently, while others have lost such a capability. Dissection of particular members of the former class revealed that a relatively small span of internal sequence could account for packaging competence (Makino et al., 1990; van der Most et al., 1991) . The exact boundaries of the MHV packaging signal are not precisely defined, but reports from different groups have converged on RNA segments of 180-190 nt, within a 220-nt region that is centered some 20.3 kb from the 5 0 end of the genome (Fosmire et al., 1992; Molenkamp and Spaan, 1997) . The MHV packaging element is thus embedded in the coding sequence of nsp15, at the distal end of the replicase gene. A core 69-nt RNA secondary structural element can act as a minimal signal for packaging (Fosmire et al., 1992; Woo et al., 1997) , but larger versions of the element, consisting of the core plus flanking sequences, function more efficiently Narayanan and Makino, 2001) . Even the larger versions of the element may not be entirely sufficient, however: some data suggest that other cis-acting sequences found in genomic, but not subgenomic, DI RNA contribute to the overall efficiency of packaging .

For the closely related group 2 coronavirus BCoV, the 190-nt genomic region homologous to the MHV packaging signal has been shown to have the same function as its MHV counterpart. Moreover, the MHV and BCoV packaging signals are able to act in a reciprocal fashion: a nonviral RNA containing the MHV packaging signal can be packaged by BCoV helper virus, and a nonviral RNA containing the BCoV packaging signal can be packaged by MHV helper virus . This functional homology does not appear to extend across group boundaries, though. For the group 1 coronavirus TGEV, the packaging signal was also shown to be retained in particular DI RNAs, which were found to be incorporated into defective virions that could be separated from helper virus by density gradient centrifugation (Mendez et al., 1996) . Surprisingly, dissection of the smallest packaged DI RNA revealed that the packaging signal for TGEV maps to the upstream end of the replicase gene, localizing in the region of 100-649 nt from the 5 0 end of the genome (Escors et al., 2003) . For the group 3 coronavirus IBV, a packaged DI RNA has been isolated and characterized (Penzes et al., 1994) , but mapping of the packaging element in this RNA has thus far been inconclusive, owing to the need to decouple requirements for replication from those for packaging (Dalton et al., 2001) . Nevertheless, it is clear that the IBV DI RNA does not harbor a region of the IBV genome homologous to the region that contains the packaging signal in MHV. Similarly, the IBV DI RNA may also lack the counterpart of the TGEV packaging signal. It will be interesting to see whether the packaging signals of viruses in the three coronavirus groups, once they are completely characterized, are found to retain structural similarities despite differences in sequence and location.

The mechanism by which packaging signals operate is not yet clear, and results with MHV have in fact taken an unanticipated turn. In this context, it is important to note the distinction between encapsidation and packaging, two terms that are often used interchangeably in the coronavirus literature. Encapsidation is the process of formation of the nucleocapsid, that is, the cooperative binding of N protein to viral RNA. Packaging is the incorporation of the nucleocapsid into virions. For enveloped viruses, the two processes are not necessarily the same. For example, for nonsegmented negative-strand viruses, both genomic and antigenomic RNA are encapsidated, but only genomic RNA is packaged. For coronaviruses, it was logical to assume that encapsidation is initiated by the N protein. Indeed, specific binding of MHV N protein to the packaging signal RNA has been demonstrated in vitro (Molenkamp and Spaan, 1997) . However, in vitro RNA binding experiments have also shown a specific interaction between the MHV N protein and the leader RNA, which is located at the 5 0 end of subgenomic and genomic RNA (Nelson et al., 2000; Stohlman et al., 1988) . It remains to be seen whether either of these sequencespecific modes of RNA binding represents a nucleation step ultimately leading to encapsidation by multiple monomers of N. The binding of N to leader RNA appears incongruent with the specificity of packaging, but it is consistent with the observation that anti-N antibodies coimmunoprecipitate both subgenomic and genomic RNA from cells infected with MHV or BCoV Narayanan et al., 2000) . A possible resolution of this paradox has come from findings that reveal a role for M protein in the selectivity of packaging. Antibodies to MHV M protein were shown to coimmunoprecipitate the fraction of N protein that is bound to genomic RNA, but not N protein that is bound to subgenomic RNA (Narayanan et al., 2000) . Furthermore, this specific M-N interaction is dependent on the presence of the MHV packaging signal (Narayanan and Makino, 2001) . Remarkably, recent work with coexpressed MHV proteins has attributed the direct selection of packaging signal RNA to the M protein. Thus, VLPs formed by M and E proteins, but devoid of N protein, were found to incorporate a heterologous RNA molecule only if it contained the MHV packaging signal (Narayanan et al., 2003a) . If this discovery turns out to generalize to all coronaviruses, then it will mean that M protein orchestrates every single interaction necessary for virion assembly.

Coronavirus RNA synthesis proceeds by a complex and incompletely understood mechanism, portions of which involve interactions between distant segments of the genome (Lai and Cavanagh, 1997; Lai and Holmes, 2001; . Following its translation into the replicase polyproteins, the genomic RNA (gRNA) next acts as the template for synthesis of negative-sense RNA species.

Further events produce a series of smaller, subgenomic RNAs (sgRNAs) of both polarities (Fig. 6) Sethna et al., 1989 Sethna et al., , 1991 . The positive-sense sgRNAs, each of which serves as the message for one of the ORFs downstream of the replicase ORF, have compositions equivalent to large genomic deletions. Positivesense sgRNAs contain a 70-100-nt leader RNA, which is identical to the 5 0 end of the genome, joined at a downstream site to a stretch of sequence (the body of the sgRNA), which is identical to the 3 0 end of the genome. Collectively, the sgRNAs are said to form a 3 0 -nested set. The 3 0 -nested set of sgRNAs, with or without a leader sequence, is a defining feature of the order Nidovirales (Enjuanes et al., 2000a; van Vliet et al., 2002) . The negative-sense sgRNAs, roughly a tenth to a hundredth as abundant as their positive-sense counterparts, each possess the complement of this arrangement, including a 5 0 oligo(U) tract of 9-26 residues and a 3 0 antileader (Sethna et al., 1991) .

Many advances in understanding the mechanism of coronavirus RNA synthesis were facilitated by the discovery and cloning of DI RNAs of MHV (Makino et al., 1985 van der Most et al., 1991) FIG 6. Coronavirus RNA synthesis. The nested set of positive-and negative-strand RNAs produced during replication and transcription are shown, using MHV as an example. The inset shows details of the arrangement of leader and body copies of the transcription-regulating sequence (TRS). and, subsequently, of other coronaviruses (Chang et al., 1994; Mendez et al., 1996; Penzes et al., 1994) . Because they are extensively deleted genomic variants that propagate by competing for the viral RNA synthesis machinery, DI RNAs have evolved to retain cis-acting sequence elements necessary for replication. Manipulations of naturally occurring and artificially constructed DI RNAs, which are studied by transfection into infected cells, enabled the mapping of elements from the genome that participate in replication and transcription (Brian and Baric, 2005) .

In studies of replication, deletion analyses of various cloned MHV DI RNAs have demonstrated that either 466, 474, or 859 nucleotides at the 5 0 end of the MHV genome are required to support replication (Kim et al., 1993; Lin and Lai, 1993; Luytjes et al., 1996) . The exact magnitude of this value appears to have been dependent on which MHV genomic regions were present in the individual DI RNA with which a particular analysis was begun. In the very closely related BCoV, 498 nucleotides at the 5 0 end of a naturally occurring DI RNA have been shown to suffice for replication (Chang et al., 1994) . For TGEV and IBV, the minimal 5 0 cis-acting replication signals have thus far been limited to 1348 and 544 nucleotides, respectively (Dalton et al., 2001; Izeta et al., 1999) . In all cases, this region extends well beyond the leader RNA and includes a portion of the 5 0 end of the replicase ORF. This means that coronavirus sgRNAs do not have a sufficient extent of 5 0 sequence to function as replicons, as was once proposed (Sethna et al., 1989) . Only in BCoV has the 5 0 cis-acting replication signal been further defined. Detailed dissections of this element, through structural probing and functional mutational analyses, have identified four stemloop structures essential for RNA replication (Chang et al., 1994 Raman and Brian, 2005; Raman et al., 2003) . For stems III and IV, secondary structure, rather than primary sequence, has been shown to be of functional importance; these structures were found to be conserved in the more closely related group 2 coronaviruses but not in SARS-CoV.

At the other end of the genome, deletion analyses found that the minimal stretch of the 3 0 terminus able to sustain MHV DI RNA replication falls between 436 and 462 nucleotides (Kim et al., 1993; Lin and Lai, 1993; . Notably, this range of sequence would include a portion of the adjacent N gene as well as the entire 301-nucleotide 3 0 UTR. By contrast, the minimal 3 0 cis-acting replication signals for TGEV and IBV were 492 and 338 nucleotides, respectively. DI RNAs containing such minimal elements were devoid of any part of the N gene (Dalton et al., 2001; Izeta et al., 1999) . Consistent with this latter finding, it was shown for engineered mutants of MHV that translocation of the N gene to an upstream genomic position had no effect on replication (Goebel et al., 2004a) . This argues strongly that no essential 3 0 cis-acting region is present in the N gene within the intact MHV genome. If any such region does exist, it must be able to act at a distance of nearly 1.5 kb. Given the requirement in MHV for the entire 3 0 UTR, it was somewhat paradoxical when further study showed that a minimum of 45-55 nucleotides at the 3 0 end of the genome, plus an indeterminate amount of poly(A) tail, sufficed to support negative-strand RNA synthesis . From this result it was concluded that the promoter for negative-strand initiation lies completely within the last 55 nucleotides of the genome and that the remainder of the 3 0 cis-acting element must be required for positive-strand RNA synthesis. Alternatively, the 3 0 -most 45-55 nucleotides of the genome may constitute the minimal region able to associate in trans with helper virus genome so as to allow initiation of negative-strand synthesis. A finer examination of the 3 0 poly(A) tail requirement found that, for both MHV and BCoV DI RNAs, no fewer than 5-10 A residues are necessary for replication, and there is a correlation between DI RNA replication competence and the ability to bind poly(A)-binding protein .

Further investigation of the 3 0 UTR in MHV and BCoV has produced a fairly complete picture of the RNA landscape of this region. At the upstream end of the 3 0 UTR, two functionally essential structures have been demonstrated by chemical and enzymatic probing and by genetic studies with both DI RNAs and constructed viral mutants. The first structure is a bulged stem-loop (Hsue and Masters, 1997; Hsue et al., 2000; Goebel et al., 2004a) ; the second is an adjacent RNA pseudoknot (Goebel et al., 2004a; Williams et al., 1999 ). An intriguing property of these upstream RNA elements is that they partially overlap, that is, the bulged stem-loop and the pseudoknot would not be able to fold up simultaneously. It has thus been proposed that they constitute components of a molecular switch that is operative at some stage of RNA synthesis, although a target of their putative regulation has not yet been identified (Goebel et al., 2004a) . Further downstream in the MHV genome is a complex RNA secondary structural element that takes up most of the remainder of the 3 0 UTR Liu et al., 2001) . Although this structure is only poorly conserved with the structure predicted for the corresponding region of the BCoV 3 0 UTR, mutations made in one stem that is highly conserved between the two viruses were found to be deleterious to DI RNA replication. Surprisingly, in the heart of this most divergent region of the 3 0 UTR is found an octanucleotide motif, 5 0 -GGAAGAGC-3 0 , that is absolutely conserved in the 3 0 UTRs of all coronaviruses in all three groups.

The presence of the 3 0 UTR stem-loop and pseudoknot appears to be a distinguishing feature of the group 2 coronaviruses. The group 1 coronaviruses all contain a highly conserved pseudoknot (Williams et al., 1999) , but no detectable counterpart of the bulged stem-loop in either upstream or downstream proximity to it. On the other hand, the group 3 coronaviruses have a highly conserved and functionally essential stem-loop (Dalton et al., 2001 ), but merely a poor candidate for the pseudoknot structure can be found nearby (Williams et al., 1999) . Only the group 2 coronaviruses have both elements, and, in all cases, the elements overlap in the same fashion. Despite sequence divergence among the 3 0 UTRs of group 2 coronaviruses, these genomic segments are functionally equivalent. The BCoV 3 0 UTR was found to be able to entirely replace the MHV 3 0 UTR (Hsue and Masters, 1997) . Moreover, it was demonstrated that replication of a BCoV DI RNA could be supported by any of a number of closely related group 2 helper viruses, including MHV (Wu et al., 2003) . More strikingly yet, the SARS-CoV 3 0 UTR was found to be able to entirely replace the MHV 3 0 UTR (Goebel et al., 2004b) . Thus, the replicase machinery of a group 2 coronavirus, MHV, is able to recognize and use the 3 0 cis-acting structures and sequences of other group 2 coronaviruses, BCoV and SARS-CoV. By contrast, the MHV 3 0 UTR cannot be replaced with either the group 1 TGEV 3 0 UTR or the group 3 IBV 3 0 UTR.

Numerous investigations have focused on the intriguing nature of coronavirus sgRNA transcription. The sites of leader-to-body fusion in the sgRNAs occur at loci in the genome that contain a short run of sequence that is identical, or nearly identical, to the 3 0 end of the leader RNA (Fig. 6) . These sites are called transcription-regulating sequences (TRSs); they have also been designated transcription-associated sequences (TASs) or intergenic sequences (IGs or IGSs). TRSs are fairly well conserved within each coronavirus group. The core consensus TRS is 5 0 -AACUAAAC-3 0 for group 1; 5 0 -AAUCUAAAC-3 0 for group 2 (except for SARS-CoV, for which it is 5 0 -AAACGAAC-3 0 ); and 5 0 -CUUAACAA-3 0 for group 3 (Thiel et al., 2003a; . Not every TRS in a given virus conforms exactly to the consensus sequence; a number of allowable variant bases are found in individual TRSs.

It was clear from very early studies that the sgRNAs are formed by a discontinuous, cotranscriptional process and that they are not produced by splicing of a full-length genomic precursor (Jacobs et al., 1981; . As for RNA replication, the first systematic means of addressing the mechanism of transcription came from the manipulation of engineered DI RNAs. The efficiency of fusion at a given TRS was at first thought to be mediated solely by base-pairing between the 3 0 end of the leader and the complement of the TRS. However, studies with DI RNAs containing authentic and mutated TRSs led many investigators to conclude that, beyond a minimum threshold of potential base pairing, other factors must predominate (Hiscox et al., 1995; Makino et al., 1991; van der Most et al., 1994) . DI RNA studies thus provided the first indication of the importance of the local sequence context of the TRS and the position of the TRS relative to the 3 0 end of the genome (Joo and Makino, 1995; Krishnan et al., 1996; Ozdarendeli et al., 2001; van Marle et al., 1995) .

The original conceptual framework for many studies was that of leader-primed transcription. In this model, sgRNAs were envisioned to be generated during positive-strand RNA synthesis. It was proposed that the polymerase pauses near the end of the leader sequence and detaches with the nascent free leader RNA. This step is followed by reattachment of the leader RNA to the complement of a TRS at an internal portion of the negative-strand template, from where the nascent RNA is then elongated (Lai, 1986) . A refinement of this idea was that leader-to-body fusion results from quasi-continuous synthesis across two distant portions of a looped-out template, which are brought together via protein-RNA and protein-protein interactions .

More recently, accumulated experimental results, while retaining the notion of a looped-out template, have been taken to support a mechanism in which the discontinuous step in sgRNA synthesis occurs during negative-strand RNA synthesis (Fig. 7) Sawicki, 1998, 2005) . In this model, the viral polymerase, starting from the 3 0 end of a genomic template, switches templates at an internal TRS and resumes synthesis at the homologous TRS sequence at the 3 0 end of the genomic leader RNA. The resulting negative-strand sgRNA, in association with positive-strand gRNA, then serves as the template for synthesis of multiple copies of the corresponding positive-strand sgRNA. This new view originated with the discovery of negativestrand sgRNAs (Sethna et al., 1989) and with the demonstration that free leader RNA could not be detected in infected cells (Chang et al., 1994) . Most Sawicki and Sawicki, 1990; Sawicki et al., 2001; Schaad and Baric, 1994) , although not all Mizutani et al., 2000) , subsequent biochemical work supported the contention that the negative-strand sgRNA species are kinetically competent to serve as templates for FIG 7. Model for discontinuous negative-strand transcription. Negative-strand sgRNAs are initiated at the 3 0 end of the gRNA template. Elongation proceeds as far as a body copy of a transcription-regulating sequence (TRS). A strand-switching event then occurs, pairing the newly transcribed negative-sense body TRS with the leader copy of the TRS, from which point transcription resumes. A complex of the (þ)gRNA and the (À)sgRNA then serves as the template for synthesis of multiple (þ)sgRNAs. positiv e-strand sgRN As. In additi on, some of the str ongest evidence for negat ive-stran d disc ontinuou s sgRN A synthes is came from landmar k stud ies using a full-le ngth infectio us cDN A of equin e arteri viru s, th e prot otype membe r of the close ly related arteri viru s family. This work made use of a robust system in which both the leader copy and one or multip le body copies of the TRS were singly or simul taneousl y mutate d in the geno me; RNA sy nthesis in th is system was able to be assayed in the initial pass age of infect ious RNA (Pas ternak et al. , 2001 (Pas ternak et al. , , 2003 (Pas ternak et al. , , 2004 van Marle et al. , 1999 ) . The arteri virus result s have been corroborated , in part, by exp erime nts enab led by the dev elopme nt of revers e genetic approaches for TGEV and MHV (Alonso et al., 2002; Curtis et al. , 2004; de Haa n et al. , 2002a ,b; Sola et al. , 2005; Zuni ga et al. , 2004) . At this time, there is a broad, but not universal, consensus that for coronaviruses, as well as for other nidoviruses, both replication and transcription initiate with negative-strand RNA synthesis. However, much further work needs to be done to elucidate the details of the template-switching step of discontinuous transcription. It will also be necessary to extend to the coronaviruses principles that have been more clearly established for the arteriviruses.

An important feature of coronavirus RNA synthesis is the high rate of homologous and nonhomologous RNA-RNA recombination that has been demonstrated to occur among selected and unselected markers during the course of infection. Although most experimental work in this area has been performed with MHV (Keck et al., , 1988a Makino et al., 1986 Makino et al., , 1987 , a high frequency of homologous recombination is clearly an attribute of the entire coronavirus family, given that it has been observed in other viruses in all three groups: TGEV (Sanchez et al., 1999) , FIPV (Haijema et al., 2003; Herrewegh et al., 1998) , BCV , and IBV (Cavanagh et al., 1992; Kottier et al., 1995; Kusters et al., 1990; Wang et al., 1993) . In addition, nonhomologous recombination was likely, in all three groups, to be the mechanism of acquisition of the various accessory protein genes.

RNA recombination is thought to result from a copy-choice mechanism, as originally described for poliovirus (Kirkegaard and Baltimore, 1986) . In this scheme, the viral polymerase, with its nascent RNA strand intact, detaches from one template and resumes elongation at the identical position, or a similar position, on another template. In MHV, recombination has been shown to take place along the entire length of the genome at an estimated frequency of 1% per 1.3 kb (almost 25% over the entire genome), the highest rate observed for any RNA virus . On a fine scale, the sites of recombination were seen to be random (Banner and Lai, 1991) , although strong selective pressures were able to create the appearance of local clustering of recombinational hot spots in one study (Banner et al., 1990) . Some results suggest that the rate of recombination increases across the entire MHV genome, from 5 0 to 3 0 end Baric, 1992, 1994) . This gradient may result from homologous recombination between genomic and subgenomic RNAs, since the latter would provide a source of donor and acceptor templates that would become more numerous as a function of proximity to the 3 0 end of the genome.

Most evidence supports a model for viral RNA recombination having three mechanistic requirements (Lai, 1992) . First, the RNA polymerase must pause during synthesis. This may be an intrinsic property of the enzyme, or it may result from the enzyme encountering a template secondary structure that exceeds a certain stability threshold. Second, a new template must be in physical proximity. Third, some property of the new template must allow the transfer of the nascent RNA strand and the resumption of RNA synthesis. Alternatively, strand transfer could result from a processive mechanism that does not require polymerase dissociation (Jarvis and Kirkegaard, 1991) . For poliovirus, classical experiments showed that RNA recombination occurs during negative-strand RNA synthesis (Kirkegaard and Baltimore, 1986) , most likely because positive-strand acceptor templates far outnumber negative strands (Jarvis and Kirkegaard, 1992) . The same is likely to be true for coronaviruses, since they, too, have a high ratio of positivestrand to negative-strand RNA Sawicki, 1986, 1990; Sethna et al., 1989) . Moreover, for MHV, most or all negative-strand RNA is found duplexed with positive-strand RNA Sawicki and Sawicki, 1986) . Thus, there may be a bias toward negative-strand recombination simply because positive-strand RNA is the most available (single-stranded) acceptor template. However, instances of coronavirus homologous recombination that occurred during positive-strand RNA synthesis have been documented (Liao and Lai, 1992) . Also, work with extremely defective MHV mutants has shown that sufficiently strong selective pressures can reveal unusual nonhomologous rearrangements, including recombination between negative-and positive-strand RNA, which are likely to be constantly occurring at a low frequency during viral RNA synthesis.

One form of nonhomologous recombination that occurs between genomic and subgenomic RNA has been hypothesized to result from the collapse of the transcription complex during negative-strand discontinuous transcription (Kuo and Masters, 2002) . Such a disruption, followed by resumption of replicative antigenome synthesis, would leave a partial copy of the leader sequence embedded at an internal point in the genome, near the junction between two genes. This type of recombinant was selected repeatedly in revertants of a severely impaired MHV M protein mutant. However, similar transcriptional collapse events may have been a significant factor in coronavirus evolution. Remnants of leader RNAs were found in the genomes of wild-type HCoV-OC43 (Mounir and Talbot, 1993) and in a mutant of MHV strain S . Most strikingly, the recently described HCoV-HKU1 genome contains two very significant segments of embedded leader sequence . Each of these leader remnants occurs at a site where there is an apparent deletion of an entire accessory gene, with respect to the genomic layouts of the closest relatives of this virus, MHV and BCoV.

The replicase complex that carries out the intricacies of viral RNA replication and transcription is encoded by the first gene of the coronavirus genome. This huge gene occupies roughly two-thirds of the genome and contains two ORFs, the complete expression of which is dependent on a programmed ribosomal frameshift. The discovery of coronavirus ribosomal frameshifting resulted from the completion of the sequence of IBV, the first member of the family for which an entire genomic sequence was obtained (Brierley et al., 1987) . This revealed a small (43 nt) overlap between ORF 1a (11.9 kb) and ORF 1b (8.1 kb), the latter in the À1 frame relative to the former; moreover, there was no sgRNA that could serve as the mRNA for ORF 1b. This arrangement was subsequently found to exist for all coronaviruses. Thus, ribosomal frameshifting, which had previously been seen only in retroviruses (Jacks et al., 1988) , was proposed as a mechanism for expression of ORF 1b. Programmed frameshifting was demonstrated for the IBV gene 1a/1b overlap region in reporter gene constructs in experiments using in vitro translation systems and, in some cases, cellular expression systems (Brierley et al., 1989) . In such systems, a frameshifting incidence of 25-30% was measured, representing an efficiency far greater than the 5% seen at the retroviral gag-pol junction. It should be noted, however, that the efficiency of in vivo frameshifting occurring in cells infected with IBV, or any other coronavirus,

has not yet been quantitated; nor is it known whether that value remains constant over the course of infection.

IBV ribosomal frameshifting was found to depend on two genomic RNA elements (Fig. 8) : a heptanucleotide "slippery sequence" (UUUAAAC) and a downstream, hairpin-type pseudoknot (Brierley et al., 1989) . In addition, the spacing between these elements is critical. It is thought that the pseudoknot impedes the progress of the elongating ribosome. With some fixed probability, the delay required for the ribosome to melt out this secondary structural element allows the simultaneous slippage of the P and A site tRNAs by one base in the À1 direction. Normal translational elongation then resumes. Studies of the kinetics of translation, using a model mRNA based on the IBV frameshifting region, support the idea of ribosomal pausing at the pseudoknot (Somogyi et al., 1993) . Moreover, mutational studies of IBV frameshifting (Brierley et al., 1989) and direct mass spectrometric analysis of the SARS-CoV frameshifted polypeptide product (Baranov et al., 2005) have confirmed both the locus of the slippage site and the occurrence of simultaneous slippage. The reason why coronaviruses employ ribosomal frameshifting as a gene expression strategy is less well established at this time. The explanation most commonly given is that, as for retroviruses, the frameshifting mechanism provides a fixed ratio of translation products, in the necessary proximity of one another, for assembly into a macromolecular complex. It could also be speculated that frameshifting forestalls expression of the enzymatic FIG 8. RNA elements required for ribosomal frameshifting. The expanded region shows RNA sequences and secondary structures that program the frameshift, using IBV as an example. products of ORF 1b until a platform and a cellular environment for them have been prepared by the products of ORF 1a.

The two genomic components required for ribosomal frameshifting have been investigated in considerable detail. Exhaustive mutagenesis of the slippery sequence showed that frameshifting could be facilitated by a number of heptameric sequences of the form XXXYYYN, where XXX and YYY are the postslippage P and A site codons, respectively (Brierley et al., 1992) . Hierarchies of preferred combinations of X, Y, and N were defined, and these indicated a major role for the strength of the A-site tRNA interaction. However, although some heptanucleotides showed a frameshifting efficiency nearly as high as that of the wild type, it must be noted that, to date, all known coronaviruses have been found to contain a slippery sequence of UUUAAAC (Brian and Baric, 2005; Plant et al., 2005) .

The second component, the pseudoknot, has similarly been examined through exhaustive mutagenesis (Brierley et al., 1991) . Although the involvement of a downstream RNA secondary structural element in ribosomal frameshifting was first recognized with retroviruses (Jacks et al., 1988) , the earliest demonstration that the requisite structure is a pseudoknot came from the study of IBV (Brierley et al., 1989) . This demonstration was initially by classic stem replacement mutagenesis, and, subsequently, by intensive modification of pseudoknot elements; all of the results of both types of studies supported the proposed structure. It was also revealed that the length of stem 1 is very important for frameshifting efficiency (Napthine et al., 1999) and that it is the structure, not the primary sequence, that is significant for both stems 1 and 2. Higher-order structure was also found to be critical: the pseudoknot could not be replaced by a single stem-loop of the same stability, containing the identical base pairs as the sum of the two pseudoknot stems (Brierley et al., 1991) .

The frameshifting signals of other coronaviruses have been found to generally conform to the rules defined for IBV, although additional complexities have emerged. With the completion of the genomic sequences of the group 1 coronaviruses HCoV-229E (Herold and Siddell, 1993) and TGEV (Eleouet et al., 1995) , an "elaborated" pseudoknot was proposed for members of this group, containing a third stem falling within an unusually large loop 2. It is currently unresolved whether the group 1 elaborated pseudoknot is the operative structure in frameshifting, as suggested by some mutational evidence (Herold and Siddell, 1993) . By contrast, loop 2 can be assigned as for the other coronaviruses, with the extra group 1-specific element providing an alternative, long-range kissing loop interaction between the upstream arm of pseudoknot stem 2 and the loop of a downstream stem-loop (Plant et al., 2005) . Analysis of the sequence of the frameshifting region of the SARS-CoV genome led to the prediction of a third stemloop within loop 2 of the pseudoknot (Ramos et al., 2004) . This third element is situated differently from the additional stem of the group 1 elaborated pseudoknot, but it is similar to the potential bulged stemloop that was earlier proposed to reside in loop 2 of the pseudoknot of the torovirus Berne virus (Snijder et al., 1990) . Further computational analysis has similarly found a possible third stem within loop 2 of the frameshifting pseudoknots of all coronaviruses, and the SARS-CoV stem 3 structure has been shown to be consistent with NMR data and nuclease mapping (Plant et al., 2005) . The role of stem 3 in ribosomal frameshifting is, as yet, unclear. Contrary to the previous results in the IBV system, mutagenesis studies suggest that both the primary sequence and the structures of the SARS-CoV stems 2 and 3 affect the efficiency of frameshifting (Baranov et al., 2005; Plant et al., 2005) . On the other hand, the complete deletion of stem 3 is not detrimental to frameshifting. This seeming discrepancy has led to the suggestion that stem 3 plays an as yet undiscovered regulatory role, perhaps in the switch from genome translation to replication (Plant et al., 2005) .

The end result of the ribosomal frameshifting-mediated translation of the replicase gene is the synthesis of two very large polyproteins, pp1a and pp1ab. These range from 440 to 500 kDa and from 740 to 810 kDa, respectively, and they are cotranslationally processed by two or three internally contained proteinase activities. The Herculean task of mapping all of the polyprotein processing events began at a time before investigators were even aware of the full sizes of coronavirus genomes Perlman, 1986, 1987; Soe et al., 1987) . Only relatively recently have replicase cleavage maps been completed for at least one representative from each coronavirus group Kanjanahaluethai et al., 2003; Lim and Liu, 1998; Liu et al., 1998; Lu and Denison, 1997; Pinon et al., 1997; Schiller et al., 1998; Xu et al., 2001; Ziebuhr and Siddell, 1999; Ziebuhr et al., 2001) . Knowledge gained from these efforts allowed the informed prediction Thiel et al., 2003a) and rapid experimental verification (Harcourt et al., 2004; Prentice et al., 2004b) of the processing pathway for the SARS-CoV replicase.

The final products of the autoproteolytic cleavage of pp1a and pp1ab are 16 nonstructural proteins, designated nsp1-nsp16 (Fig. 9 ). Nsp1-nsp11 are derived from pp1a, whereas nsp1-nsp10 and nsp12-nsp16 are derived from pp1ab. Thus, all products processed from pp1a are common to those processed from pp1ab, except for nsp11, which is an oligopeptide generated when ribosomal frameshifting does not occur. For IBV, which lacks a counterpart of nsp1, there are 15 final products of polyprotein cleavage. These are numbered beginning with nsp2, in order to maintain correspondence with their homologs in the other coronaviruses. Comparative layouts and processing schemes for the replicase genes of all three coronavirus groups can be found in the review by Ziebuhr (2005) and references therein. Detailed lists and schematics of cleavage sites, the proteinases responsible, and the resulting nsp products for HCoV-229E, MHV, and IBV can be found in Table 2 and Figure 2 of the review by Ziebuhr et al. (2000) . It should be noted that partial proteolytic products may also be significant in the processing scheme. The efficiency of cleavage at particular polyprotein sites may be regulated by both the exact primary sequence at the site and the site's accessibility to the proteinase (Ziebuhr, 2005; Ziebuhr et al., 2000) .

Elucidation of the precise roles of nsp1-nsp16 will be the next major undertaking. Functions for many domains of the coronavirus replicase were predicted by pioneering bioinformatics methods well before the term "bioinformatics" was invented (Gorbalenya et al., 1989; Lee et al., 1991) . While knowledge about many of the replicase proteins is still at a very early stage, substantial progress has been made for others. Research in this field is proceeding at an unprecedented pace for reasons of both opportunity and necessity. First, tools that were not previously available, most notably reverse genetics systems for the replicase gene, are now at the disposal of coronavirus researchers. Second, the replicase products present a wide array of promising targets for anti-SARS therapeutics. The information that is currently at hand points to a correspondence between the genomic order of the encode d activitie s of the replica se gene and the temp oral prog ram of infect ion. The prod ucts of pp1a appear to functi on to prep are the cell for in fection and to assemb le th e mach inery for RNA synthe sis. Then, the prod ucts that are uni que to pp1ab carry out the actua l catalysis of RNA replica tion and transcript ion.

The very first mature transl ation prod uct for MHV pp1a, nsp1, has been shown to play a role in cell cycle arres t. It ma y th us prep are a favorab le cellu lar en vironment for vir al replica tion (Chen and Mak ino, 2004; . The next clea vage prod uct, nsp2, div erges consid erably among different corona viruses , and no functi on for it has yet been pred icted or demons trated . Surpr isingly, deleti on of the complete nsp2 reg ion from the genome of MHV or SAR S-CoV was no t lethal. However, nsp2 deleti on mutants showed delaye d vir al growth kinet ics . Othe r early replic ase prod ucts are th e enzym es that carry out the proc essing of the poly proteins: papain-like prot einases, whic h are in nsp3 ( Bake r et al. , 1993 ) , and the main prot einase, whic h is in nsp5 (Lu et al. , 1995 ) . Mos t coronavir uses have two papain-like prot eina ses, designa ted PL1 pro and PL2 pro . By contrast, IBV and SARS-CoV have a single PL pro . PL1 pro and PL2 pro may have arisen by duplication, and in vitro, they appear to have some redundancy in their activities. However, for HCoV-229E, a genetic analysis showed that PL2 pro is essential, and the presence of both PL1 pro and PL2 pro was found to confer a clear advantage in viral fitness . In addition to the papain-like proteinases, nsp3 in many coronaviruses contains a domain that harbors ADPribose-1 00 -monophosphatase activity (Putics et al., 2005) . The construction of active-site mutants has shown that this activity is dispensable for replication of HCoV-229E in tissue culture. Although the cellular homolog of this enzyme plays a role in tRNA processing, the biological significance of the virally encoded activity is unknown. Nsp3 can also contain some variable domains. In HCoV-HKU1, as many as 14 tandem repeats of an acidic decapeptide are present in an amino-terminal segment of nsp3 (Woo et al., 2005 [note: nsp3 is misidentified as nsp1 in this reference]). In SARS-CoV, nsp3 contains a "SARS-unique" domain that is not found in any other coronavirus .

The coronavirus main proteinase, designated M pro , constitutes all of nsp5. This enzyme has also been called the 3C-like proteinase (3CL pro ), because of its resemblance to the 3C proteinases of picornaviruses. Crystal structures have been solved for M pro for HCoV-229E (Anand et al., 2002) , TGEV , and SARS-CoV (Yang et al., 2003) . These reveal that M pro is a dimer, each monomer of which has a three-domain structure, with an active site located in a cleft between the first and second domains in each monomer. At the carboxy terminus is an extra domain not found in the 3CL pro of other viral families. Multiple structures determined for the SARS-CoV M pro showed that the entire molecule undergoes major pH-dependent conformational changes, which have been proposed to regulate activity.

At the carboxy-terminal end of pp1a is a cluster of small proteins, nsp7-nsp10. The crystal structure of SARS-CoV nsp9 was solved independently by two groups (Egloff et al., 2004; Sutton et al., 2004) . In addition, prompted by features of the structure, investigators found that nsp9 has nonspecific RNA-binding activity. Biophysical evidence has also been presented for an interaction between nsp9 and nsp8 (Sutton et al., 2004) . Therefore, although nsp9 was found to occur as a dimer in the crystals, its natural binding partner may be nsp8. A solution structure for SARS-CoV nsp7 was determined by NMR; this structure showed potential protein-protein interaction surfaces for this small polypeptide (Peti et al., 2005) . Moreover, a cocrystal structure of SARS-CoV nsp7 with nsp8 revealed a complex of eight monomers of each protein forming a hollow cylindrical structure. This hexadecameric assembly was proposed to be able to encircle an RNA template, possibly acting as a processivity factor for the RNA polymerase (Zhai et al., 2005) . Thus, a picture of a putative complex of all four of the nsp7-nsp10 polypeptides is being gradually pieced together, but, as yet, there is a paucity of functional data to complement this wealth of structural information.

Transmembrane domains in nsp3, nsp4, and nsp6 anchor the replicase complex to intracellular membranes, and these proteins may be involved in the remodeling of the latter, to form double-membrane compartments that are dedicated to viral RNA synthesis (Bi et al., 1999; Gosert et al., 2002; Prentice et al., 2004a; Shi et al., 1999; van der Meer et al., 1999) . These double-membrane vesicles, which colocalize with nascent viral RNA, are distinct from the sites of virion assembly and budding. Coronavirus RNA synthesis may thus take place in structures that are similar to the autophagosomal RNA synthesis compartments that have been characterized in picornavirusinfected cells (Jackson et al., 2005) . The nsp7-nsp10 products localize in discrete perinuclear and cytoplasmic foci in infected cells (Bost et al., 2000) , in a membrane-associated complex that also includes nsp2. This complex colocalizes with N protein and the viral helicase (nsp13) early in infection. However, late in infection, N protein and the helicase segregate into biochemically distinct membranes in the ERGIC that also contain M protein, suggesting a role for the helicase in genome encapsidation or packaging (Bost et al., 2001; Sims et al., 2000) .

The postribosomal frameshift products of the replicase, nsp12-nsp16, contain the actual enzymes of RNA replication and transcription. The coronavirus RNA-dependent RNA polymerase (RdRp) is contained within nsp12, the first part of pp1ab synthesized after frameshifting. This protein has the fingers, palm, and thumb domains common to a number of viral RdRps and reverse transcriptases. In addition, the RdRp contains a very large, amino-terminal domain that is unique to the coronaviruses. For MHV, the ability of the RdRp to associate with intracellular membranes was mapped to a 38-amino acid segment of the unique domain (Brockway et al., 2003) . Membrane association of expressed RdRp also depended on MHV infection, indicating that other viral components are required for this targeting. In addition, the RdRp was shown to form intermolecular associations with M pro , nsp8, and nsp9. For the SARS-CoV RdRp, preliminary biochemical characterization of the bacterially expressed enzyme suggests that the coronavirus-unique domain is essential for activity (Cheng et al., 2005) .

Nsp13 contains multiple activities that have been extensively characterized for HCoV-229E and SARS-CoV (Ivanov and Ziebuhr, 2004; Ivanov et al., 2004a; Seybert et al., 2000) . This protein is a helicase with a highly processive duplex unwinding activity for both DNA and RNA substrates. The nsp13 helicase unwinds with 5 0 -3 0 polarity, suggesting that it has a role in preparing the template for the RdRp. Nsp13 also has RNA-dependent NTPase and dNTPase activities, which probably provide the energy for its translocation along RNA templates. In addition, nsp13 is a RNA 5 0 -triphosphatase, making it a candidate to carry out the initial step of RNA capping.

Nsp14 and nsp15 have each been assigned ribonucleolytic functions. Such activities would, at first glance, seem to be out of place in an RNA virus. Nsp14 has been predicted to be an exonuclease (designated ExoN), which, it is speculated, could be involved in an RNA processing step integral to coronavirus transcription . This activity has not yet been demonstrated, but a point mutation in nsp14 of MHV was shown to be markedly attenuating in the mouse host (Sperry et al., 2005) . Nsp15 is an endoribonuclease, designated NendoU, that is found only in the nidoviruses . This enzyme, from HCoV-229E and SARS-CoV, has been shown to hydrolyze both single-and double-stranded RNA, with a specificity for cleavage immediately upstream and downstream of uridylate residues (Bhardwaj et al., 2004; Ivanov et al., 2004b) . NendoU exhibited optimal activity with manganese ion, rather than magnesium ion, and it was essentially inactive with 2 0 -O-ribose-methylated RNA substrates (Ivanov et al., 2004b) . Mutation of the active site of nsp15 of HCoV-229E was found to be lethal.

Finally, nsp16, the carboxy-terminal product of pp1ab, has been predicted to contain 2 0 -O-methyltransferase activity von Grotthuss et al., 2003 [note: nsp16 is misidentified as nsp13 in this reference]). Such an activity, which has not yet been demonstrated, would have a most obvious role in RNA capping. However, the possibility has been raised that 2 0 -O-methylation serves to protect a segment of duplex RNA from the NendoU activity of nsp15 in one stage of discontinuous negative-strand RNA synthesis (Ivanov et al., 2004b) . Relevant to RNA capping, it must be noted that if coronaviruses possess their own guanylyltransferase or cap 7-methyltranferase activities, these have not yet been identified among the many replicase proteins.

RNA viruses often expropriate and redirect host cell components, to assist in mechanisms of their own gene expression (Ahlquist et al., 2003) . A number of host factors have been proposed to participate in coronavirus RNA synthesis. To date, all of these have been discovered with either MHV or BCoV, and all were originally identified on the basis of their ability to bind in vitro to RNA segments of functional importance. The most completely characterized coronavirus host factor is heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1), which was initially found as a member of a set of proteins that bound to the negative strand of the MHV TRS (Furuya and Lai, 1993; Li et al., 1997; Zhang and Lai, 1995) . Its RNA-binding property, its affinity for MHV N protein, and its propensity to dimerize, all made hnRNP A1 attractive as a potential mediator of the antigenome looping-out event envisaged by the leader-primed transcription model (Wang and Zhang, 1999; Zhang and Lai, 1995; Zhang et al., 1999) . Overexpression of hnRNP A1 was shown to result in a marked increase in the kinetics of MHV RNA synthesis, suggesting that this factor affects genome replication as well as transcription. Additionally, expression of a truncated form of hnRNP A1 had a dominant-negative effect on MHV replication (Shi et al., 2000) . The role of hnRNP A1 was questioned on the basis of the finding that MHV replication and RNA synthesis were completely unimpaired in CB3 cells, a mutant murine cell line that does not express hnRNP A1 (Ben-David et al., 1992; Shen and Masters, 2001) . In addition, high-affinity hnRNP A1 binding sites (Burd and Dreyfuss, 1994) , when placed in the MHV genome, did not act in lieu of a TRS and did not displace the site of leader-body fusion away from a TRS (Shen and Masters, 2001) . However, it was subsequently shown that other hnRNP A/B family members, which are present in CB3 cells, could replace hnRNP A1; further, overexpression of hnRNP A/B was shown to enhance MHV RNA synthesis (Shi et al., 2003) .

Other members of the hnRNP family have also been implicated in MHV RNA synthesis. Pyrimidine tract-binding protein (PTB, also known as hnRNP I) was shown to bind to pentanucleotide repeats upstream of the positive-strand leader copy of the TRS . In addition, PTB bound the negative strand of the 3 0 UTR, specifically at the complement of the invariant octanucleotide motif . The positive strand of the same region of the 3 0 UTR was also bound by hnRNP A1, and deletions in this region inhibited DI RNA synthesis (Huang and Lai, 2001) . Another hnRNP, synaptotagmin-binding cytoplasmic RNA-interacting protein (SYN-CRIP), was found to bind to both positive-and negative-strand MHV RNA near the region of the leader pentanucleotide repeats (Choi et al., 2004) . Moreover, RNAi-mediated downregulation of SYNCRIP delayed the kinetics of MHV RNA synthesis. In the BCoV 5 0 UTR, multiple complexes of six proteins have been found to bind specifically to the stem-loop IV that is required for DI RNA replication (Raman and Brian, 2005) . It is not yet clear whether some of these proteins are previously identified hnRNPs or whether they represent new cellular factors.

In the 3 0 UTR of MHV, a complex of proteins was found to bind to two similar 11-base motifs in positive-strand RNA, at distances of 26-36 and 129-139 nucleotides from the poly(A) tail (Liu et al., 1997; Yu and Leibowitz, 1995a,b) . DI RNAs with mutations in either of these elements were defective in replication. The largest member of the protein complex was identified as mitochondrial aconitase, a protein not previously known to have RNA-binding activity (Nanda and Leibowitz, 2001) . Other components of the complex were then found to be the chaperones HSP60, HSP40, and mitochondrial HSP70 (Nanda et al., 2004) . Although MHV replication does not have any known involvement with mitochondria, both mitochondrial aconitase and mitochondrial HSP70 have substantial cytoplasmic fractions. Finally, at the furthest downstream ends of the genomes of MHV and BCoV, poly(A) binding protein binds to the poly(A) tail and appears to play a role in RNA synthesis beyond its function in translation .

Among the array of candidate host factors in coronavirus RNA synthesis, it remains to be established which are essential and which play enhancing roles, either as RNA chaperones or in some other capacity.

Such assessments can be difficult, because many of these factors are critical or essential to normal cellular functions. Thus, the validation of host factors will likely require the establishment of an efficient in vitro RNA replication and transcription system, in which reconstitution of coronavirus RNA synthesis can be achieved from isolated components and precursors.

Numerous classical coronavirus mutants have been isolated over the past 25 years, mainly with MHV (Lai and Cavanagh, 1997) . Mutants were either identified as naturally occurring viral variants (often on the basis of causing atypical pathogenesis), or else they were obtained through selection criteria such as escape from neutralization by monoclonal antibodies. A number of sets of MHV mutants were generated by chemical mutagenesis, followed by screening for temperature-sensitive phenotypes (Koolen et al., 1983; Martin et al., 1988; Robb et al., 1979; Schaad et al., 1990) or, in one case, for aberrant cytopathic effects or plaque morphologies . Although the latter search yielded an unusually high proportion of structural protein mutants, viruses with conditionally lethal, RNA-negative phenotypes were the predominant isolates in all searches. The arrangement of the coronavirus genome dictates that the vast majority of randomly generated mutations will fall in the replicase gene, owing to its large target size. Despite assiduous efforts that applied classical genetic methods to the study of the replicase Baric, 1992, 1994; Schaad et al., 1990) , progress was limited by the technology available at the time, and exploitation of the full value of these mutants would await the development of reverse genetic techniques.

The basic blueprint for positive-strand RNA virus reverse geneticsthe transcription of infectious RNA from a full-length cDNA copy of the viral genome-was established more than two decades ago with poliovirus (Racaniello and Baltimore, 1981) . It became possible only recently to apply this scheme to coronaviruses, however, owing to the need to surmount a number of formidable hurdles. Most notable were the obstacles posed by the huge sizes of coronavirus genomes and the high instabilities of various regions of the replicase gene when they were propagated as cloned cDNA in E. coli. The first reverse genetic system for coronaviruses, targeted RNA recombination, was developed to circumvent these barriers, at a time when it was far from clear whether the construction of full-length infectious cDNA clones would ever be technically feasible (Masters, 1999; Masters and Rottier, 2005) . This method, originally developed in MHV, takes advantage of the high rate of homologous RNA recombination in coronaviruses. A synthetic donor RNA bearing the mutation of interest is introduced into cells that have been infected with a recipient parent virus possessing some characteristic that can be selected against. Mutant recombinants that arise among progeny viruses are then identified by counterselection of the recipient parent virus.

The earliest form of targeted RNA recombination employed, as the recipient parent virus, a classical MHV mutant that was thermolabile owing to an internal deletion in the N gene (Koetzner et al., 1992; Peng et al., 1995a) , which is the 3 0 -most gene in the genome. Mutations were introduced into the N gene or the 3 0 UTR by means of in vitrosynthesized donor RNAs corresponding to the smallest MHV sgRNA. Recombinants, which were identified as survivors of a heat-killing selection, had restored the region deleted in the parent virus and, concomitantly, had acquired marker mutations planted in the donor RNA. The efficiency of this system was subsequently increased by the incorporation of 5 0 -cis-acting elements that converted the donor RNA into a replicating DI RNA (Masters et al., 1994; van der Most et al., 1992) . The scope of this technique was then extended through the addition of 3 0 -contiguous genomic sequence to donor RNAs, ultimately allowing reverse-genetic access to all of the structural genes of MHV (Fischer et al., 1997a (Fischer et al., ,b, 1998 Peng et al., 1995b) . The strength and versatility of targeted RNA recombination were substantially enhanced as a result of the construction of the interspecies coronavirus mutant fMHV, a chimera in which the S protein ectodomain of MHV was replaced by the S protein ectodomain from FIPV (Kuo et al., 2000) . This replacement resulted in a virus that had acquired the ability to grow in feline cells and had simultaneously lost the ability to grow in murine cells. Although the immediate rationale for the creation of fMHV was to dissect domain requirements for virion assembly (Section IV.B.2), it was readily apparent that this chimera offered a tremendous selective advantage in targeted RNA recombination. The use of fMHV as the recipient parent virus allowed the selection of recombinants harboring virtually any nonlethal MHV mutation in the 3 0 -most 10 kb of the genome, on the basis of their having regained the ability to grow in murine cells. Numerous mutants, many with extremely fragile phenotypes, have since been obtained by this method (de Haan et al., 2002a,b; Goebel et al., 2004a,b; Hurst et al., 2005; Kuo et al., 2002 Kuo et al., , 2003 . The generality of this host-range-based selection system has been established by the extension of the method to another strain of MHV ( Ontive ros et al. , 2001 ) and by use of an analogo us chimera , mFIPV, for the con struction of FIPV mutan ts (Haij ema et al. , 2003 (Haij ema et al. , , 2004 .

Despite its value, however, targeted RNA recom bin ation can be used to engi neer only the downstr eam one-thi rd of the geno me. The complete exten t of revers e gene tics did not become available to coronav irus rese arch unt il relativel y recent ly. Thr ough the excepti onal persev erance and in ventiv eness of three in dependen t laborator ies, syste ms based on full-len gth cDNA clones have been dev eloped , each using a differen t str ategy to overcom e the stabilit y problem s inhe rent to coronavi rus cDN A. These syste ms all provide a ca pability of great impor tance tha t is effe ctively beyon d the scope of targeted RNA recombina tion: acces s to th e replic ase gene. In the first such method , a full-length cDNA copy of the TGE V geno me was asse mbled in a low copy-num ber bac terial artificial chrom osom e (BAC) vector. Infectious coronavir us RNA was produced in this syste m by a "DN A-lau nch," in vivo nuclear transc ription by host RNA polymeras e II from an engineered CM V prom oter (Alm azan et al. , 2000) . The DNA laun ch ensure d complete capping of the viral RNA, and it bypa ssed potentia l limitat ions of the syste m arising from the efficie ncy of in vitro transcript ion of genomic RNA. Hete rologous sequence was removed from the 3 0 end of the transc ribed RNA through the action of an incorporated hepatitis delta virus ribozyme. Further stabilization of the full-length BAC clone in bacteria was achieved through the insertion of a eukaryotic intron into either of two positions in the mapp ed tox ic region of th e TGEV cDN A (G onzá lez et al. , 2002) . This allowed stable propagation of the BAC for over 200 bacterial generations.

In the second method, full-length genomic cDNAs were assembled by in vitro ligation of smaller, more stable subcloned cDNAs . Infectious RNA was then transcribed in vitro from the ligated product. The boundaries of the subcloned genomic cDNA fragments were chosen so as to allow ease of manipulation for site-directed mutagenesis applications. Most importantly, some fragment boundaries were arranged in such a way as to interrupt regions of cloned cDNA instability. This is essentially the same scheme that had been earlier used to produce infectious RNA for yellow fever virus, a flavivirus (Rice et al., 1989) . However, for coronaviruses, the scheme had to be executed on a much grander scale, with five to seven fragments instead of two. To facilitate this approach, the innovation was introduced of directing the unique assembly of fragments by means of nonsymmetric overhangs generated by restriction enzymes that cut at a distance from their recognition sequences. This ensured that the fragments became connected in a predetermined order by ligation, without the generation of rearranged byproducts. Originally demonstrated with TGEV , this in vitro assembly technique has subsequently been successfully used to engineer the genomes of MHV , SARS-CoV (Yount et al., 2003) , and IBV (Youn et al., 2005) .

In the third method, entire coronavirus cDNAs, generated by longrange RT-PCR (Thiel et al., 1997) , were inserted into a unique restriction site in the genome of vaccinia virus . In this scheme, vaccinia virus served as a huge cloning vehicle, in which the coronavirus genome cDNAs did not exhibit the instabilities encountered in E. coli plasmids. Infectious RNA was produced by in vitro transcription from purified vaccinia virus DNA (Thiel et al., 2001a) . Alternatively, a DNA launch was carried out in vivo with transfected cDNA and fowlpox-encoded T7 RNA polymerase . The use of vaccinia as a vector has allowed manipulation of the resulting cloned cDNA by any among the suite of methods that have been developed for poxvirus reverse genetics. In particular, transient dominant selection has been used to carry out site-directed mutagenesis . Engineered mutations have also been directly recombined from PCR products into vaccinia clones, through exploitation of both negative and positive selection of a gpt cassette (Coley et al., 2005) . A further innovation came from the rescue of recombinant coronaviruses from cell lines expressing N protein, given that N protein has been shown to greatly enhance recovery of virus in all three full-length cDNA systems (Almazan et al., 2004; Schelle et al., 2005; Thiel et al., 2001a; Yount et al., 2002) . This poxvirus-vectored technique was originally applied to HCoV-229E (Thiel et al., 2001a) , and it has since been used to engineer the genomes of IBV and MHV (Coley et al., 2005) .

The two main options for reverse genetic systems both have their own relative advantages. For reverse genetic studies involving coronavirus structural genes or the 3 0 UTR, targeted RNA recombination is currently the easier system to manipulate, and it has the power to recover extremely defective mutants. Another asset of targeted RNA recombination is that it lends itself well to studies involving domain exchange between different proteins (Peng et al., 1995b) or the exchange of genomic elements (Hsue and Masters, 1997) . In these cases, the system, through its own selection of allowable crossover sites, can reveal which substitutions retain functionality and which are lethal. On the other hand, full-length cDNA reverse-genetic strategies provide the capacity to site-specifically mutagenize the exceedingly large viral RNA replicase gene. This advantage is just beginning to be exploited, and it can be expected to play a major role in the future in the acquisition of an understanding of the workings of the complex RNA synthesis machinery. In addition to molecular biological studies, coronavirus reverse-genetic investigations have opened the door to the development of these viruses, and their derivative replicons, for vaccines (Alonso et al., 2002; Haijema et al., 2004) , expression systems (de Haan et al., 2003b , and gene delivery vectors (Thiel et al., 2001b (Thiel et al., , 2003b .

Deduced sequence of the bovine coronavirus spike protein and identification of the internal proteolytic cleavage site

Host factors in positive-strand RNA virus genome replication

Engineering the largest RNA virus genome as an infectious bacterial artificial chromosome

The nucleoprotein is required for efficient coronavirus genome replication

The morphology of three previously uncharacterized human respiratory viruses that grow in organ culture

In vitro and in vivo expression of foreign genes by transmissible gastroenteritis coronavirus-derived minigenomes

Characterizations of coronavirus cis-acting RNA elements and the transcription step affecting its transcription efficiency

Coronavirus transcription early in infection

Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain

Coronavirus main proteinase (3CLpro) structure: Basis for design of anti-SARS drugs

A highly unusual palindromic transmembrane helical hairpin formed by SARS coronavirus E protein

Sequence and topology of a model intracellular membrane protein, E1 glycoprotein, from a coronavirus

Amino acids 270 to 510 of the severe acute respiratory syndrome coronavirus spike protein are required for interaction with receptor

Identification of the catalytic sites of a papain-like cysteine proteinase of murine coronavirus

Random nature of coronavirus RNA recombination in the absence of selective pressure

A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of murine coronavirus

Programmed ribosomal frameshifting in decoding the SARS-CoV genome

Development of mouse hepatitis virus and SARS-CoV infectious cDNA constructs

Subgenomic negative-strand RNA function during mouse hepatitis virus infection

Interactions between coronavirus nucleocapsid protein and viral RNAs: Implications for viral transcription

Establishing a genetic recombination map for murine coronavirus strain A59 complementation groups

Episodic evolution mediates interspecies transfer of a murine coronavirus

Persistent infection promotes cross-species transmissibility of mouse hepatitis virus

Coronavirus pseudoparticles formed with recombinant M and E proteins induce alpha interferon synthesis by leukocytes

Morphogenesis of avian infectious bronchitis virus and a related human virus (strain 229E)

Interspecies aminopeptidase-N chimeras reveal species-specific receptor recognition by canine coronavirus, feline infectious peritonitis virus, and transmissible gastroenteritis virus

Retroviral insertions downstream of the heterogeneous nuclear ribonucleoprotein A1 gene in erythroleukemia cells: Evidence that A1 is not essential for cell growth

The structure of infectious bronchitis virus

The severe acute respiratory syndrome coronavirus Nsp15 protein is an endoribonuclease that prefers manganese as a cofactor

Localization of mouse hepatitis virus open reading frame 1A derived proteins

Identification of a receptor-binding domain of the spike glycoprotein of human coronavirus HCoV-229E

Pathogenic murine coronaviruses: II. characterization of virus-specific proteins of murine coronaviruses JHMV and A59V

Characterization of a second cleavage site and demonstration of activity in trans by the papain-like proteinase of the murine coronavirus mouse hepatitis virus strain A59

Mutational analysis of the murine coronavirus spike protein: Effect on cell-to-cell fusion

The production of recombinant infectious DI-particles of a murine coronavirus in the absence of helper virus

A subgenomic mRNA transcript of the coronavirus mouse hepatitis virus strain A59 defective interfering (DI) RNA is packaged when it contains the DI packaging signal

The coronavirus spike protein is a class I virus fusion protein: Structural and functional characterization of the fusion core complex

Severe acute respiratory syndrome coronavirus (SARS-CoV) infection inhibition using spike protein heptad repeat-derived peptides

Spike protein assembly into the coronavirion: Exploring the limits of its sequence requirements

Four proteins processed from the replicase gene polyprotein of mouse hepatitis virus colocalize in the cell periphery and adjacent to sites of virion assembly

Mouse hepatitis virus replicase protein complexes are translocated to sites of M protein accumulation in the ERGIC at late times of infection

Sequencing of coronavirus IBV genomic RNA: Three open reading frames in the 5 0 'unique' region of mRNA D

Host cell nuclear function and murine hepatitis virus replication

Coronavirus genome structure and replication

The Coronaviridae

An efficient ribosomal frame-shifting signal in the polymeraseencoding region of the coronavirus IBV

Characterization of an efficient coronavirus ribosomal frameshifting signal: Requirement for an RNA pseudoknot

Mutational analysis of the RNA pseudoknot component of a coronavirus ribosomal frameshifting signal

Mutational analysis of the "slipperysequence" component of a coronavirus ribosomal frameshifting signal

Generation of a recombinant avian coronavirus infectious bronchitis virus using transient dominant selection

Characterization of the expression, intracellular localization, and replication complex association of the putative mouse hepatitis virus RNA-dependent RNA polymerase

In vitro synthesis of two polypeptides from a nonstructural gene of coronavirus mouse hepatitis virus strain A59

RNA binding specificity of hnRNP A1: Significance of hnRNP A1 high-affinity binding sites in pre-mRNA splicing

Characterization and isolation of structural polypeptides in haemagglutinating encephalomyelitis virus

Phosphorylation and subcellular localization of transmissible gastroenteritis virus nucleocapsid protein in infected cells

Reverse genetics system for the avian coronavirus infectious bronchitis virus

Gene 5 of the avian coronavirus infectious bronchitis virus is not essential for replication

Preliminary studies on the isolation of coronaviruse 229E nucleocapsids

The Coronaviridae

Evolution of avian coronavirus IBV: Sequence of the matrix glycoprotein gene and intergenic region of several serotypes

Coronavirus IBV glycopolypeptides: Locational studies using proteases and saponin, a membrane permeabilizer

Coronavirus IBV: Partial amino terminal sequencing of spike polypeptide S2 identifies the sequence Arg-Arg-Phe-Arg-Arg at the cleavage site of the spike precursor propolypeptide of IBV strains Beaudette and M41

Infectious bronchitis virus: Evidence for recombination within the Massachusetts serotype

Coronavirus-induced membrane fusion requires the cysteine-rich domain in the spike protein

A cis-acting function for the coronavirus leader in defective interfering RNA replication

The UCUAAAC promoter motif is not required for high-frequency leader recombination in bovine coronavirus defective interfering RNA

Induction of alpha interferon by transmissible gastroenteritis coronavirus: Role of transmembrane glycoprotein E1

Murine coronavirus replication induces cell cycle arrest in G0/G1 phase

Murine coronavirus nonstructural protein p28 arrests cell cycle in G0/G1 phase

Interaction of the coronavirus nucleoprotein with nucleolar antigens and the host cell

Mass spectroscopic characterization of the coronavirus infectious bronchitis virus nucleoprotein and elucidation of the role of phosphorylation in RNA binding by using surface plasmon resonance

Expression, purification, and characterization of SARS coronavirus RNA polymerase

Polypyrimidine-tract-binding protein affects transcription but not translation of mouse hepatitis virus RNA

SYNCRIP, a member of the heterogeneous nuclear ribonucleoprotein family, is involved in mouse hepatitis virus RNA synthesis

Murine coronavirus requires lipid rafts for virus entry and cell-cell fusion but not for virus release

Recombinant mouse hepatitis virus strain A59 from cloned, full-length cDNA replicates to high titers in vitro and is fully pathogenic in vivo

Monoclonal antibodies to murine hepatitis virus-4 (strain JHM) define the viral glycoprotein responsible for attachment and cell-cell fusion

Identification of a bovine coronavirus packaging signal

Identification of nucleocapsid binding sites within coronavirus-defective genomes

Enterotropic strains of mouse coronavirus differ in their use of murine carcinoembryonic antigen-related glycoprotein receptors

Hemagglutinin-esterase, a novel structural protein of torovirus

Infectious bronchitis virus E protein is targeted to the Golgi complex and directs release of virus-like particles

The cytoplasmic tail of infectious bronchitis virus E protein directs Golgi targeting

The cytoplasmic tails of infectious bronchitis virus E and M proteins mediate their interaction

Gill-associated virus of Penaeus monodon prawns: An invertebrate virus with ORF1a and ORF1b genes related to arteri-and coronaviruses

Heterologous gene expression from transmissible gastroenteritis virus replicon particles

Reverse genetic analysis of the transcription regulatory sequence of the coronavirus transmissible gastroenteritis virus

cis-acting sequences required for coronavirus infectious bronchitis virus defective-RNA replication and packaging

Ribonucleoprotein of avian infectious bronchitis virus

Evidence for a coiled-coil structure in the spike proteins of coronaviruses

Coronavirus particle assembly: Primary structure requirements of the membrane protein

Structural requirements for O-glycosylation of the mouse hepatitis virus membrane protein

Mapping of the coronavirus membrane protein domains involved in interaction with the spike protein

Assembly of the coronavirus envelope: Homotypic interactions between the M proteins

The group-specific murine coronavirus genes are not essential, but their deletion, by reverse genetics, is attenuating in the natural host

Coronaviruses maintain viability despite dramatic rearrangements of the strictly conserved genome organization

The glycosylation status of the murine hepatitis coronavirus M protein affects the interferogenic capacity of the virus in vitro and its ability to replicate in the liver but not the brain

Coronaviruses as vectors: Position dependence of foreign gene expression

Cleavage inhibition of the murine coronavirus spike protein by a furin-like enzyme affects cell-cell but not virus-cell fusion

Coronaviruses as vectors: Stability of foreign gene expression

Assembly of coronavirus spike protein into trimers and its role in epitope expression

Aminopeptidase N is a major receptor for the entero-pathogenic coronavirus TGEV

Determinants essential for the transmissible gastroenteritis virus-receptor interaction reside within a domain of aminopeptidase-N that is distinct from the enzymatic site

Further characterization of aminopeptidase-N as a receptor for coronaviruses

Another triple-spanning envelope protein among intracellularly budding RNA viruses: The torovirus E protein

Identification of putative polymerase gene product in cells infected with murine coronavirus A59

Translation and processing of mouse hepatitis virus virion RNA in a cell-free system

Genomic organization, biology, and diagnosis of Taura syndrome virus and yellowhead virus of penaeid shrimp

Central ions and lateral asparagine/glutamine zippers stabilize the post-fusion hairpin conformation of the SARS coronavirus spike glycoprotein

Cloning of the mouse hepatitis virus (MHV) receptor: Expression in human and hamster cell lines confers susceptibility to MHV

Several members of the mouse carcinoembryonic antigen-related glycoprotein family are functional receptors for the coronavirus mouse hepatitis virus-A59

Mouse hepatitis virus strain A59 and blocking antireceptor monoclonal antibody bind to the N-terminal domain of cellular receptor

The severe acute respiratory syndrome-coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world

Complete sequence (20 kilobases) of the polyprotein-encoding gene 1 of transmissible gastroenteritis virus

Coronavirus replication and reverse genetics

Nidovirales. In "Virus Taxonomy. Seventh Report of the International Committee on Taxonomy of Viruses

Virus Taxonomy. Seventh Report of the International Committee on Taxonomy of Viruses

Coronavirus reverse genetics and development of vectors for gene expression

Detection of a group 2 coronavirus in dogs with canine infectious respiratory disease

The membrane M protein carboxy terminus binds to transmissible gastroenteritis coronavirus core and contributes to core stability

Transmissible gastroenteritis coronavirus packaging signal is located at the 5 0 end of the virus genome

The coronavirus avian infectious bronchitis virus requires the cell nucleus and host transcriptional factors

The internal open reading frame within the nucleocapsid gene of mouse hepatitis virus encodes a structural protein that is not essential for viral replication

Analysis of a recombinant mouse hepatitis virus expressing a foreign gene reveals a novel aspect of coronavirus transcription

Analysis of constructed E gene mutants of mouse hepatitis virus confirms a pivotal role for E protein in coronavirus assembly

Identification and characterization of a coronavirus packaging signal

Proteolytic cleavage of the E2 glycoprotein of murine coronavirus: Host-dependent differences in proteolytic cleavage and cell fusion

Evidence for variable rates of recombination in the MHV genome

Map locations of mouse hepatitis virus temperaturesensitive mutants: Confirmation of variable rates of recombination

Three different cellular proteins bind to complementary sites on the 5 0 -end-positive and 3 0 -end-negative strands of mouse hepatitis virus RNA

Attachment glycoproteins and receptor specificity of rat coronaviruses

A role for naturally occurring variation of the murine coronavirus spike protein in stabilizing association with the cellular receptor

Coronavirus spike proteins in viral entry and pathogenesis

Neutralization-resistant variants of a neurotropic coronavirus are generated by deletions within the aminoterminal half of the spike glycoprotein

Alteration of the pH dependence of coronavirus-induced cell fusion: Effect of mutations in the spike glycoprotein

Cell receptor-independent infection by a neurotropic murine coronavirus

Isolation of subviral components from transmissible gastroenteritis virus

Defective replication of porcine transmissible gastroenteritis virus in a continuous cell line

Discovery of novel human and animal cells infected by the severe acute respiratory syndrome coronavirus by replication-specific multiplex reverse transcription-PCR

Retroviral vectors pseudotyped with severe acute respiratory syndrome coronavirus S protein

Assembly of spikes into coronavirus particles is mediated by the carboxy-terminal domain of the spike protein

TGEV corona virus ORF4 encodes a membrane protein that is incorporated into virions

Major receptor-binding and neutralization determinants are located within the same domain of the transmissible gastroenteritis virus (coronavirus) spike protein

Characterization of the RNA components of a putative molecular switch in the 3 0 untranslated region of the murine coronavirus genome

The 3 0 cis-acting genomic replication element of the severe acute respiratory syndrome coronavirus can function in the murine coronavirus genome

Fusion-defective mutants of mouse hepatitis virus A59 contain a mutation in the spike protein cleavage signal

Stabilization of a full-length infectious cDNA clone of transmissible gastroenteritis coronavirus by insertion of an intron

A comparative sequence analysis to revise the current taxonomy of the family Coronaviridae

Coronavirus genome: Prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis

Severe acute respiratory syndrome coronavirus phylogeny: Toward consensus

RNA replication of mouse hepatitis virus takes place at double-membrane vesicles

The nsp2 replicase proteins of murine hepatitis virus and severe acute respiratory syndrome coronavirus are dispensable for viral replication

Isolation and characterization of viruses related to the SARS coronavirus from animals in Southern China

Identification of the membrane-active regions of the severe acute respiratory syndrome coronavirus spike membrane glycoprotein using a 16/18-mer peptide scan: Implications for the viral fusion mechanism

Characterization of a coronavirus isolated from a diarrheic foal

Switching species tropism: An effective way to manipulate the feline coronavirus genome

Live, attenuated coronavirus vaccines through the directed deletion of group-specific genes provide protection against feline infectious peritonitis

The coronavirus transmissible gastroenteritis virus causes infection after receptor-mediated endocytosis and acid-dependent fusion with an intracellular compartment

Identification of severe acute respiratory syndrome coronavirus replicase products and characterization of papainlike protease activity

Characterization of determinants involved in the feline infectious peritonitis virus receptor function of feline aminopeptidase N

Ceacam1aÀ/À mice are completely resistant to infection by murine coronavirus mouse hepatitis virus A59

An 'elaborated' pseudoknot is required for high frequency frameshifting during translation of HCV 229E polymerase mRNA

Feline coronavirus type II strains 79-1683 and 79-1146 originate from a double recombination between feline coronavirus type I and canine coronavirus

The spike protein of murine coronavirus mouse hepatitis virus strain A59 is not cleaved in primary glial cells and primary hepatocytes

Investigation of the control of coronavirus subgenomic mRNA transcription by using T7-generated negative-sense RNA transcripts

The coronavirus infectious bronchitis virus nucleoprotein localizes to the nucleolus

S protein of severe acute respiratory syndrome-associated coronavirus mediates entry into hepatoma cell lines and is targeted by neutralizing antibodies in infected patients

Human coronavirus NL63 employs the severe acute respiratory syndrome coronavirus receptor for cellular entry

Propagation of the virus of porcine epidemic diarrhea in cell culture

The 5 0 end of coronavirus minus-strand RNAs contain a short poly(U) tract

Bovine coronavirus mRNA replication continues throughout persistent infection in cell culture

Bovine coronavirus nucleocapsid protein processing and assembly

Synthesis and processing of the bovine enteric coronavirus haemagglutinin protein

Differences in virus receptor for type I and type II feline infectious peritonitis virus

Tunicamycin resistant glycosylation of a coronavirus glycoprotein: Demonstration of a novel type of viral glycoprotein

A bulged stem-loop structure in the 3 0 untranslated region of the genome of the coronavirus mouse hepatitis virus is essential for replication

Characterization of an essential RNA secondary structure in the 3 0 untranslated region of the murine coronavirus genome

Polypyrimidine tract-binding protein binds to the complementary strand of the mouse hepatitis virus 3 0 untranslated region, thereby altering RNA conformation

Heterogeneous nuclear ribonucleoprotein a1 binds to the 3 0 -untranslated region and mediates potential 5 0 -3 0 -end cross talks of mouse hepatitis virus RNA

Structure of the N-terminal RNA-binding domain of the SARS CoV nucleocapsid protein

Generation of synthetic severe acute respiratory syndrome coronavirus pseudoparticles: Implications for assembly and vaccine production

A major determinant for membrane protein interaction localizes to the carboxy-terminal domain of the mouse coronavirus nucleocapsid protein

Evolutional insights on uncharacterized SARS coronavirus genes

Structural characterization of the fusion-active complex of severe acute respiratory syndrome (SARS) coronavirus

Human coronavirus 229E nonstructural protein 13: Characterization of duplex-unwinding, nucleoside triphosphatase, and RNA 5 0 -triphosphatase activities

Multiple enzymatic activities associated with severe acute respiratory syndrome coronavirus helicase

Major genetic marker of nidoviruses encodes a replicative endoribonuclease

Replication and packaging of transmissible gastroenteritis coronavirus-derived synthetic minigenomes

Signals for ribosomal frameshifting in the Rous sarcoma virus gag-pol region

Subversion of cellular autophagosomal machinery by RNA viruses

Synthesis of subgenomic mRNAs of mouse hepatitis virus is initiated independently: Evidence from UV transcription mapping

Characterization and translation of transmissible gastroenteritis virus mRNAs

The polymerase in its labyrinth: Mechanisms and implications of RNA recombination

Poliovirus RNA recombination: Mechanistic studies in the absence of selection

CD209L (L-SIGN) is a receptor for severe acute respiratory syndrome coronavirus

Effect of mutations in the mouse hepatitis virus 3 0 (þ)42 protein binding element on RNA replication

The effect of two closely inserted transcription consensus sequences on coronavirus transcription

Molecular identification and characterization of novel coronaviruses infecting graylag geese (Anser anser), feral pigeons (Columbia livia) and mallards (Anas platyrhynchos)

Identification of the murine coronavirus MP1 cleavage site recognized by papain-like proteinase 2

The amino-terminal signal peptide on the porcine transmissible gastroenteritis coronavirus matrix protein is not an absolute requirement for membrane translocation and glycosylation

Expression of hemagglutinin-esterase protein from recombinant mouse hepatitis virus enhances neurovirulence

Multiple recombination sites at the 5 0 -end of murine coronavirus RNA

In vivo RNA-RNA recombination of coronavirus in mouse brain

RNA recombination of murine coronaviruses: Recombination between fusion-positive mouse hepatitis virus A59 and fusion-negative mouse hepatitis virus 2

/76). Isolation and morphology of the internal component of human coronavirus, strain 229E

Inhibition of coronavirus 229E replication by actinomycin D

Structure and orientation of expressed bovine coronavirus hemagglutinin-esterase protein

Analysis of cis-acting sequences essential for coronavirus defective interfering RNA replication

Bovine coronavirus structural proteins

Bovine coronavirus hemagglutinin protein

The mechanism of RNA recombination in poliovirus

Identification of a coronavirus hemagglutinin-esterase with a substrate specificity different from those of influenza C virus and bovine coronavirus

Coronavirus M proteins accumulate in the Golgi complex beyond the site of virion budding

Repair and mutagenesis of the genome of a deletion mutant of the coronavirus mouse hepatitis virus by targeted RNA recombination

Characterization of functional domains in the human coronavirus HCV 229E receptor

Identification of residues critical for the human coronavirus 229E receptor function of human aminopeptidase N

Temperature-sensitive mutants of mouse hepatitis virus strain A59: Isolation, characterization and neuropathogenic properties

Experimental evidence of recombination in coronavirus infectious bronchitis virus

Differential localization of sialyltransferases in N-and O-linked glycosylation

Membrane assembly of the triple-spanning coronavirus M protein. Individual transmembrane domains show preferred orientation

Characterization of the budding compartment of mouse hepatitis virus: Evidence that transport from the RER to the Golgi complex requires only one vesicular transport step

Oligomerization of a trans-Golgi/trans-Golgi network retained protein occurs in the Golgi complex and may be part of its recognition

Tandem placement of a coronavirus promoter results in enhanced mRNA synthesis from the downstream-most initiation site

Variations in disparate regions of the murine coronavirus spike protein impact the initiation of membrane fusion

Mass spectrometric characterization of proteins from the SARS virus: A preliminary report

A novel coronavirus associated with severe acute respiratory syndrome

Localization of neutralizing epitopes and the receptor-binding site within the amino-terminal 330 amino acids of the murine coronavirus spike protein

Genetic evidence for a structural interaction between the carboxy termini of the membrane and nucleocapsid proteins of mouse hepatitis virus

The small envelope protein E is not essential for murine coronavirus replication

Retargeting of coronavirus by substitution of the spike glycoprotein ectodomain: Crossing the host cell species barrier

Sequence evidence for RNA recombination in field isolates of avian coronavirus infectious bronchitis virus

Involvement of aminopeptidase N (CD13) in infection of human neural cells by human coronavirus 229E

Coronavirus leader-RNA-primed transcription: An alternative mechanism to RNA splicing

RNA recombination in animal and plant viruses

The molecular biology of coronaviruses

Coronaviridae: The viruses and their replication

RNA of mouse hepatitis virus

Comparative analysis of RNA genomes of mouse hepatitis viruses

Coronavirus: How a large RNA viral genome is replicated and transcribed

Sequence analysis of the bovine coronavirus nucleocapsid and matrix protein genes

Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats

The Coronaviridae

Sequence and N-terminal processing of the transmembrane protein E1 of the coronavirus transmissible gastroenteritis virus

Single amino acid changes in the viral glycoprotein M affect induction of alpha interferon by the coronavirus transmissible gastroenteritis virus

Functional domains in the spike protein of transmissible gastroenteritis virus

The complete sequence (22 kilobases) of murine coronavirus gene 1 encoding the putative proteases and RNA polymerase

Detection of a murine coronavirus nonstructural protein encoded in a downstream open reading frame

Quaternary structure of coronavirus spikes in complex with carcinoembryonic antigen-related cell adhesion molecule cellular receptors

Differential in vitro inhibition of feline enteric coronavirus and feline infectious peritonitis virus by actinomycin D

Structure of SARS coronavirus spike receptor-binding domain complexed with receptor

Heterogeneous nuclear ribonucleoprotein A1 binds to the transcription-regulatory region of mouse hepatitis virus RNA

Polypyrimidine tract-binding protein binds to the leader RNA of mouse hepatitis virus and serves as a regulator of viral transcription

Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus

Efficient replication of severe acute respiratory syndrome coronavirus in mouse cells is limited by murine angiotensin-converting enzyme 2

Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2

Bats are natural reservoirs of SARSlike coronaviruses

RNA recombination in a coronavirus: Recombination between viral genomic RNA and transfected RNA fragments

Characterization of the two overlapping papain-like proteinase domains encoded in gene 1 of the coronavirus infectious bronchitis virus and determination of the C-terminal cleavage site of an 87-kDa protein

Deletion mapping of a mouse hepatitis virus defective interfering RNA reveals the requirement of an internal and discontiguous sequence for replication

Identification of the cis-acting signal for minu-strand RNA synthesis of a murine coronavirus: Implications for the role of minus-strand RNA in RNA replication and transcription

Luxury at a cost? Recombinant mouse hepatitis viruses expressing the accessory hemagglutinin-esterase protein display reduced fitness in vitro

Association of the infectious bronchitis virus 3c protein with the virion envelope

A polycistronic mRNA specified by the coronavirus infectious bronchitis virus

Proteolytic mapping of the coronavirus infectious bronchitis virus 1b polyprotein: Evidence for the presence of four cleavage sites of the 3C-like proteinase and identification of two novel cleavage products

A specific host cellular protein binding element near the 3 0 end of mouse hepatitis virus genomic RNA

Secondary structural elements within the 3 0 untranslated region of mouse hepatitis virus strain JHM genomic RNA

Interaction between heptad repeat 1 and 2 regions in spike protein of SARS-associated coronavirus: Implications for virus fusogenic mechanism and identification of fusion inhibitors

Biological properties of avian coronavirus RNA

Genome of infectious bronchitis virus

Polypeptides of infectious bronchitis virus. I. Polypeptides of the virion

Intracellular targeting signals contribute to the localization of coronavirus spike proteins near the virus assembly site

Determinants of mouse hepatitis virus 3C-like proteinase activity

Identification and characterization of a serinelike proteinase of the murine coronavirus MHV-A59

Roles in cell-to-cell fusion of two conserved hydrophobic regions in the murine coronavirus spike protein

Amino acid substitutions within the leucine zipper domain of the murine coronavirus spike protein cause defects in oligomerization and the ability to induce cell-to-cell fusion

Primary structure of the glycoprotein E2 of coronavirus MHV-A59 and identification of the trypsin cleavage site

Sequence of mouse hepatitis virus A59 mRNA 2: Indications for RNA recombination between coronaviruses and influenza C virus

Replication of synthetic interfering RNAs derived from coronavirus mouse hepatitis virus-A59

Characterization of two temperature-sensitive mutants of coronavirus mouse hepatitis virus strain A59 with maturation defects in the spike protein

A specific transmembrane domain of a coronavirus E1 glycoprotein is required for its retention in the Golgi region

The E1 glycoprotein of an avian coronavirus is targeted to the cis Golgi complex

Ribonucleoprotein-like structures from coronavirus particles

Release of E protein in membrane vesicles from virus-infected cells and E protein-expressing cells

Membrane topology of coronavirus E protein

Structure of the intracellular defective viral RNAs of defective interfering particles of mouse hepatitis virus

High-frequency RNA recombination of murine coronaviruses

RNA recombination of coronaviruses: Localization of neutralizing epitopes and neuropathogenic determinants on the carboxyl terminus of peplomers

Defective interfering particles of murine coronavirus: Mechanism of synthesis of defective viral RNAs

Analysis of efficiently packaged defective interfering RNAs of murine coronavirus: Localization of a possible RNApackaging signal

A system for the study of coronavirus mRNA synthesis: A regulated expressed subgenomic defective interfering RNA results from intergenic site insertion

The genome sequence of the SARS-associated coronavirus

Temperature-sensitive mutants of mouse hepatitis virus type 3 (MHV-3): Isolation, biochemical and genetic characterization

DC-SIGN and DC-SIGNR interact with the glycoprotein of Marburg virus and the S protein of severe acute respiratory syndrome coronavirus

Localization of an RNA-binding domain in the nucleocapsid protein of the coronavirus mouse hepatitis virus

Reverse genetics of the largest RNA viruses

Coronavirus reverse genetics by targeted RNA recombination

Optimization of targeted RNA recombination and mapping of a novel nucleocapsid gene mutation in the coronavirus mouse hepatitis virus

Receptor-induced conformational changes of murine coronavirus spike protein

Proteasemediated enhancement of severe acute respiratory syndrome coronavirus infection

Membrane integration and intracellular transport of the coronavirus glycoprotein E1, a class III membrane glycoprotein

Detection of novel members, structure-function analysis and evolutionary classification of the 2H phosphoesterase superfamily

Molecular characterization of transmissible gastroenteritis coronavirus defective interfering genomes: Packaging and heterogeneity

N-terminal domain of the murine coronavirus receptor CEACAM1 is responsible for fusogenic activation and conformational changes of the spike protein

Nascent synthesis of leader sequencecontaining subgenomic mRNAs in coronavirus genome-length replicative intermediate RNA

Identification of a specific interaction between the coronavirus mouse hepatitis virus A59 nucleocapsid protein and packaging signal

Retroviruses pseudotyped with the severe acute respiratory syndrome coronavirus spike protein efficiently infect cells expressing angiotensin-converting enzyme 2

Efficient assembly and release of SARS coronavirus-like particles by a heterologous expression system

Exogenous ACE2 expression allows refractory cell lines to support severe acute respiratory syndrome coronavirus replication

Comparison of the amino acid sequence and phylogenetic analysis of the peplomer, integral membrane and nucleocapsid proteins of feline, canine and porcine coronaviruses

Human coronavirus OC43 RNA 4 lacks two open reading frames located downstream of the S gene of bovine coronavirus

Differential maturation and subcellular localization of severe acute respiratory syndrome coronavirus surface proteins

Mitochondrial aconitase binds to the 3 0 untranslated region of the mouse hepatitis virus genome

Mitochondrial HSP70, HSP40, and HSP60 bind to the 3 0 untranslated region of the Murine hepatitis virus genome

The role of RNA pseudoknot stem 1 length in the promotion of efficient À1 ribosomal frameshifting

Cooperation of an RNA packaging signal and a viral envelope protein in coronavirus RNA packaging

Characterization of the coronavirus M protein and nucleocapsid interaction in infected cells

Nucleocapsid-independent specific viral RNA packaging via viral envelope protein and viral RNA signal

Characterization of N protein selfassociation in coronavirus ribonucleoprotein complexes

Spike glycoprotein-mediated fusion in biliary glycoprotein-independent cell-associated spread of mouse hepatitis virus infection

Entry of mouse hepatitis virus into cells by endosomal and nonendosomal pathways

Murine coronavirus spike protein determines the ability of the virus to replicate in the liver and cause hepatitis

Bgp2, a new member of the carcinoembryonic antigen-related gene family, encodes an alternative receptor for mouse hepatitis viruses

Localization of the RNA-binding domain of mouse hepatitis virus nucleocapsid protein

High affinity interaction between nucleocapsid protein and leader/intergenic sequence of mouse hepatitis virus RNA

Topographic changes in SARS coronavirus-infected cells at late stages of infection

Protein interactions during coronavirus assembly

Coronavirus glycoprotein E1, a new type of viral glycoprotein

Posttranslational glycosylation of coronavirus glycoprotein E1: Inhibition by monensin

Human coronavirus 229E binds to CD13 in rafts and enters the cell through caveolae

Infectious nucleic acid from a transmissible agent causing gastroenteritis in pigs

The major product of porcine transmissible gastroenteritis coronavirus gene 3b is an integral membrane glycoprotein of 31 kDa

Identification of a putative cellular receptor 150 kDa polypeptide for porcine epidemic diarrhea virus in porcine enterocytes

Mouse susceptibility to mouse hepatitis virus infection is linked to viral receptor genotype

Difference in virus-binding activity of two distinct receptor proteins for mouse hepatitis virus

Inactivation of expression of gene 4 of mouse hepatitis virus strain JHM does not affect virulence in the murine CNS

Disulfide bonds in folding and transport of mouse hepatitis coronavirus glycoproteins

Envelope glycoprotein interactions in coronavirus assembly

Generation of a replicationcompetent, propagation-deficient virus vector based on the transmissible gastroenteritis coronavirus genome

Transmissible gastroenteritis coronavirus gene 7 is not essential but influences in vivo virus replication and virulence

Coronaviruses

Downstream sequences influence the choice between a naturally occurring noncanonical and closely positioned upstream canonical heptameric fusion motif during bovine coronavirus subgenomic mRNA synthesis

Infection of a calf with the enteric coronavirus strain Paris

Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein

Sequence analysis reveals extensive polymorphism and evidence of deletions within the E2 glycoprotein gene of several strains of murine hepatitis virus

Sequence requirements for RNA strand transfer during nidovirus discontinuous subgenomic RNA synthesis

The stability of the duplex between sense and antisense transcription-regulating sequences is a crucial factor in arterivirus subgenomic mRNA synthesis

Regulation of relative abundance of arterivirus subgenomic mRNAs

Coronavirus as a possible cause of severe acute respiratory syndrome

Analysis of second-site revertants of a murine coronavirus nucleocapsid protein deletion mutant and construction of nucleocapsid protein mutants by targeted RNA recombination

Construction of murine coronavirus mutants containing interspecies chimeric nucleocapsid proteins

Binding of the coronavirus mouse hepatitis virus A59 to its receptor expressed from a recombinant vaccinia virus depends on posttranslational processing of the receptor glycoprotein

Characterization of a replicating and packaged defective RNA of avian coronavirus infectious bronchitis virus

Structural genomics of the severe acute respiratory syndrome coronavirus: Nuclear magnetic resonance structure of the protein nsP7

A severe acute respiratory syndrome-associated coronavirus-specific protein enhances virulence of an attenuated murine coronavirus

Pathogenesis of chimeric MHV4/MHV-A59 recombinant viruses: The murine coronavirus spike protein is a major determinant of neurovirulence

Efficient autoproteolytic processing of the MHV-A59 3C-like proteinase from the flanking hydrophobic domains requires membranes

A three-stemmed mRNA pseudoknot in the SARS coronavirus frameshift signal

Identification of a novel coronavirus in bats

The spike but not the hemagglutinin/esterase protein of bovine coronavirus is necessary and sufficient for viral infection

Coronavirus replication complex formation utilizes components of cellular autophagy

Identification and characterization of severe acute respiratory syndrome coronavirus replicase proteins

ADP-ribose-1 0 -monophosphatase: A conserved coronavirus enzyme that is dispensable for viral replication in tissue culture

Characterization of the coronavirus mouse hepatitis virus strain A59 small membrane protein E

Cloned poliovirus cDNA is infectious in mammalian cells

Stem-loop IV in the 5 0 untranslated region is a cisacting element in bovine coronavirus defective interfering RNA replication

Stem-loop III in the 5 0 untranslated region is a cis-acting element in bovine coronavirus defective interfering RNA replication

Programmed À1 ribosomal frameshifting in the SARS coronavirus

Identification of a contiguous 6-residue determinant in the MHV receptor that controls the level of virion binding to cells

The hemagglutinin-esterase of mouse hepatitis virus strain S is a sialate-4-O-acetylesterase

SARS associated coronavirus has a recombinant polymerase and coronaviruses have a history of host-shifting

A conditionallethal murine coronavirus mutant that fails to incorporate the spike glycoprotein into assembled virions

Transcription of infectious yellow fever RNA from full-length cDNA templates produced by in vitro ligation

Membrane protein molecules of transmissible gastroenteritis coronavirus also expose the carboxy-terminal region on the external surface of the virion

The transmissible gastroenteritis coronavirus contains a spherical core shell consisting of M and N proteins

Pathogenic murine coronaviruses. III. Biological and biochemical characterization of temperature-sensitive mutants of JHMV

RNAbinding proteins of coronavirus MHV: Detection of monomeric and multimeric N protein with an RNA overlay-protein blot assay

Bovine enteric coronavirus structure as studied by a freeze-drying technique

Characterization of a novel coronavirus associated with severe acute respiratory syndrome

The Coronaviridae

Coronavirus E1 protein expressed from cloned cDNA localizes in the Golgi region

Viral protein synthesis in mouse hepatitis virus strain A59-infected cells: Effects of tunicamycin

Assembly in vitro of a spanning membrane protein of the endoplasmic reticulum: The E1 glycoprotein of coronavirus mouse hepatitis virus A59

Predicted membrane topology of the coronavirus protein E1

Evolution of mouse hepatitis virus: Detection and characterization of spike deletion variants during persistent infection

Intracellular localization of the severe acute respiratory syndrome coronavirus nucleocapsid protein: Absence of nucleolar accumulation during infection and after expression as a recombinant protein in Vero cells

Animal coronaviruses: What can they teach us about the severe acute respiratory syndrome?

Identification and characterization of the putative fusion peptide of the severe acute respiratory syndrome-associated coronavirus spike protein

Antigenic homology among coronaviruses related to transmissible gastroenteritis virus

Targeted recombination demonstrates that the spike gene of transmissible gastroenteritis coronavirus is a determinant of its enteric tropism and virulence

Novel variation in the N protein of avian infectious bronchitis virus

The RNA structures engaged in replication and transcription of the A59 strain of mouse hepatitis virus

Coronavirus minus strand RNA synthesis and effect of cycloheximide on coronavirus RNA synthesis

Coronavirus transcription: Subgenomic mouse hepatitis virus replicative intermediates function in RNA synthesis

A new model for coronavirus transcription

Coronavirus transcription: A perspective

Genetics of mouse hepatitis virus transcription: Evidence that subgenomic negative strands are functional templates

Genetics of mouse hepatitis virus transcription: Identification of cistrons which may function in positive and negative strand RNA synthesis

Selective replication of coronavirus genomes that express nucleocapsid protein

The murine coronavirus mouse hepatitis virus strain A59 from persistently infected murine cells exhibits an extended host range

The N-terminal region of the murine coronavirus spike glycoprotein is associated with the extended host range of viruses from persistently infected murine cells

Processing of the coronavirus MHV-JHM polymerase polyprotein: Identification of precursors and proteolytic products spanning 400 kilodaltons of ORF1a

Presence of infectious polyadenylated RNA in coronavirus avian bronchitis virus

The S protein of bovine coronavirus is a hemagglutinin recognizing 9-O-acetylated sialic acid as a receptor determinant

Murine coronavirus nonstructural protein ns2 is not essential for virus replication in transformed cells

The nucleocapsid protein gene of bovine coronavirus is bicistronic

Coronavirus subgenomic minusstrand RNAs and the potential for mRNA replicons

Minus-strand copies of replicating coronavirus mRNAs contain antileaders

The human coronavirus 229E superfamily 1 helicase has RNA and DNA duplex-unwinding activities with 5 0 -to-3 0 polarity

A single amino acid mutation in the spike protein of coronavirus infectious bronchitis virus hampers its maturation and incorporation into virions at the nonpermissive temperature

Evaluation of the role of heterogeneous nuclear ribonucleoprotein A1 as a host factor in murine coronavirus discontinuous transcription and genome replication

Colocalization and membrane association of murine hepatitis virus gene 1 products and De novo-synthesized viral RNA in infected cells

Heterogeneous nuclear ribonucleoprotein A1 regulates RNA synthesis of a cytoplasmic virus

Multiple type A/B heterogeneous nuclear ribonucleoproteins (hnRNPs) can replace hnRNP A1 in mouse hepatitis virus RNA synthesis

The Coronaviridae

Coronavirus JHM: A virionassociated protein kinase

Characterization of severe acute respiratory syndrome-associated coronavirus (SARS-CoV) spike glycoprotein-mediated viral entry

Mouse hepatitis virus replicase proteins associate with two distinct populations of intracellular membranes

Coronavirus MHV-JHM mRNA 5 has a sequence arrangement which potentially allows translation of a second, downstream open reading frame

Monoclonal antibody to the receptor for murine coronavirus MHV-A59 inhibits viral replication in vivo

Nidovirus sialate-O-acetylesterases: Evolution and substrate specificity of coronaviral and toroviral receptor-destroying enzymes

Toroviruses: Replication, evolution and comparison with other members of the coronavirus-like superfamily

The molecular biology of arteriviruses

The carboxy-terminal part of the putative Berne virus polymerase is expressed by ribosomal frameshifting and contains sequence motifs which indicate that toro-and coronaviruses are evolutionarily related

Comparison of the genome organization of toro-and coronaviruses: Evidence for two nonhomologous RNA recombination events during Berne virus evolution

Unique and conserved features of genome and proteome of SARS coronavirus, an early split-off from the coronavirus group 2 lineage

Sequence and translation of the murine coronavirus 5 0 -end genomic RNA reveals the N-terminal structure of the putative RNA polymerase

Role of nucleotides immediately flanking the transcription-regulating sequence core in coronavirus subgenomic mRNA synthesis

Ribosomal pausing during translation of an RNA pseudoknot

Synthesis and characterization of a native, oligomeric form of recombinant severe acute respiratory syndrome coronavirus spike glycoprotein

Host protein interactions with the 3 0 end of bovine coronavirus RNA and the requirement of the poly(A) tail for coronavirus defective genome replication

Single-amino-acid substitutions in open reading frame (ORF) 1b-nsp14 and ORF 2a proteins of the coronavirus mouse hepatitis virus are attenuating in mice

Evidence from the evolutionary analysis of nucleotide sequences for a recombinant history of SARS-CoV

Proteolytic cleavage of the murine coronavirus surface glycoprotein is not required for fusion activity

Mosaic evolution of the severe acute respiratory syndrome coronavirus

Synthesis of coronavirus mRNAs: Kinetics of inactivation of infectious bronchitic virus RNA synthesis by UV light

Coronavirus proteins: Structure and function of the oligosaccharides of the avian infectious bronchitis virus glycoproteins

Phosphoproteins of murine hepatitis virus

Synthesis and subcellular localization of the murine coronavirus nucleocapsid protein

Specific interaction between coronavirus leader RNA and nucleocapsid protein

Characterization of a coronavirus: I. Structural proteins: Effect of preparative conditions on the migration of protein in polyacrylamide gels

The molecular biology of coronaviruses

Isolation of coronavirus envelope glycoproteins and interaction with the viral nucleocapsid

Proteolytic cleavage of the E2 glycoprotein of murine coronavirus: Activation of cell-fusing activity of virions by trypsin and separation of two different 90K cleavage fragments

Temperature-sensitive mutants of MHV-A59

Conformational change of the coronavirus peplomer glycoprotein at pH 8.0 and 37 degrees C correlates with virus aggregation and virus-induced cell fusion

Morphological and biological properties of a new coronavirus associated with diarrhea in infant mice

Potent neutralization of severe acute respiratory syndrome (SARS) coronavirus by a human mAb to S1 protein that blocks receptor association

Structure of a proteolytically resistant core from the severe acute respiratory syndrome coronavirus S2 fusion protein

The nsp9 replicase protein of SARS-coronavirus

Analysis of the receptor-binding site of murine coronavirus spike protein

A Golgi retention signal in a membranespanning domain of coronavirus E1 protein

Fusion formation by the uncleaved spike protein of murine coronavirus JHMV variant cl-2

The S2 subunit of the murine coronavirus spike protein is not involved in receptor binding

A murine coronavirus MHV-S isolate from persistently infected cells has a leader and two consensus sequences between the M and N genes

Coronavirus translational regulation: Leader affects mRNA efficiency

Mouse hepatitis virus nucleocapsid protein as a translational effector of viral mRNAs

Crystal structure of murine sCEACAM1a[1,4]: A coronavirus receptor in the CEA family

Amino acid substitutions and an insertion in the spike glycoprotein extend the host range of the murine coronavirus MHV-A59

Substitutions of conserved amino acids in the receptor-binding domain of the spike glycoprotein affect utilization of murine CEACAM1a by the murine coronavirus MHV-A59

Internal ribosome entry in the coding region of murine hepatitis virus mRNA5

Reverse genetics of coronaviruses using vaccinia virus vectors

Effective amplification of 20-kb DNA by reverse transcription PCR

Infectious RNA transcribed in vitro from a cDNA copy of the human coronavirus genome cloned in vaccinia virus

Viral replicase gene products suffice for coronavirus discontinuous transcription

Mechanisms and enzymes involved in SARS coronavirus genome expression

Multigene RNA vector based on coronavirus transcription

Requirements for CEACAMs and cholesterol during murine coronavirus cell entry

Replication of coronavirus MHV-A59 in Sac-cells: Determination of the first site of budding of progeny virions

Site of addition of N-acetyl-galactosamine to the E1 glycoprotein of mouse hepatitis virus-A59

The transmembrane oligomers of coronavirus protein E

ACE2 X-ray structures reveal a large hinge-bending motion important for inhibitor binding and catalysis

Feline aminopeptidase N serves as a receptor for feline, canine, porcine, and human coronaviruses in serogroup I

Structural characterization of the SARS-coronavirus spike S fusion protein core

Isolation of coronaviruses antigenically indistinguishable from bovine coronavirus from wild ruminants with diarrhea

Localization of mouse hepatitis virus nonstructural proteins and RNA synthesis indicates a role for late endosomes in viral replication

The Coronaviridae

A domain at the 3 0 end of the polymerase gene is essential for encapsidation of coronavirus defective interfering RNAs

Homologous RNA recombination allows efficient introduction of site-specific mutations into the genome of coronavirus MHV-A59 via synthetic co-replicating RNAs

Translation but not the encoded sequence is essential for the efficient propogation of defective interfering RNAs of the coronavirus mouse hepatitis virus

Regulation of coronavirus mRNA transcription

Arterivirus discontinuous mRNA transcription is guided by base pairing between sense and antisense transcription-regulating sequences

Discontinuous and non-discontinuous subgenomic RNA transcription in a nidovirus

Intracellular transport of recombinant coronavirus spike proteins: Implications for virus assembly

Enhancement of the vaccinia virus/phage T7 RNA polymerase expression system using encephalomyocarditis virus 5 0 -untranslated region sequences

Nucleocapsid-independent assembly of coronavirus-like particles by co-expression of viral envelope protein genes

Human and bovine coronaviruses recognize sialic acid-containing receptors similar to those of influenza C viruses

The E3 protein of bovine coronavirus is a receptor-destroying enzyme with acetylesterase activity

mRNA cap-1 methyltransferase in the SARS genome

The nucleocapsid protein of mouse hepatitis virus interacts with the cellular heterogeneous nuclear ribonucleoprotein A1 in vitro and in vivo

Evidence of natural recombination within the S1 gene of infectious bronchitis virus

Evolutionary implications of genetic variations in the S1 gene of infectious bronchitis virus

Expression cloning of functional receptor used by SARS coronavirus

Genomic RNA of the murine coronavirus JHM

Monoclonal antibodies to the peplomer glycoprotein of coronavirus mouse hepatitis virus identify two subunits and detect a conformational change in the subunit released under mild alkaline conditions

The ns 4 gene of mouse hepatitis virus (MHV), strain A 59 contains two ORFs and thus differs from ns 4 of the JHM and S strains

Oligomerization of a membrane protein correlates with its retention in the Golgi complex

Molecular determinants of species specificity in the coronavirus receptor aminopeptidase N (CD13): Influence of N-linked glycosylation

Mutational analysis of the virus and monoclonal antibody binding sites in MHVR, the cellular receptor of the murine coronavirus mouse hepatitis virus strain A59

Phosphorylation of the mouse hepatitis virus nucleocapsid protein

The replication of murine coronaviruses in enucleated cells

A phylogenetically conserved hairpin-type 3 0 untranslated region pseudoknot functions in coronavirus RNA replication

Purification of the 110-kilodalton glycoprotein receptor for mouse hepatitis virus (MHV)-A59 from mouse liver and identification of a nonfunctional, homologous protein in MHV-resistant SJL/J mice

Receptor for mouse hepatitis virus is a member of the carcinoembryonic antigen family of glycoproteins

SARS coronavirus E protein forms cation-selective ion channels

A 193-amino acid fragment of the SARS coronavirus S protein efficiently binds angiotensin-converting enzyme 2

Murine coronavirus packaging signal confers packaging to nonviral RNA

Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia

Common RNA replication signals exist among group 2 coronaviruses: Evidence for in vivo recombination between animal and human coronavirus molecules

Localization to the nucleolus is a common feature of coronavirus nucleoproteins, and the protein may disrupt host cell division

The SARS-CoV S glycoprotein: Expression and functional characterization

Further identification and characterization of novel intermediate and mature cleavage products released from the ORF 1b region of the avian coronavirus infectious bronchitis virus 1a/1b polyprotein

Structural basis for coronavirus-mediated membrane fusion. Crystal structure of mouse hepatitis virus spike protein fusion core

Crystal structure of severe acute respiratory syndrome coronavirus spike protein fusion core

Unique N-linked glycosylation of murine coronavirus MHV-2 membrane protein at the conserved O-linked glycosylation site

The crystal structures of severe acute respiratory syndrome virus main protease and its complex with an inhibitor

pH-dependent entry of severe acute respiratory syndrome coronavirus is mediated by the spike glycoprotein and enhanced by dendritic cell transfer through DC-SIGN

Genetic analysis of determinants for spike glycoprotein assembly into murine coronavirus virions: Distinct roles for charge-rich and cysteine-rich regions of the endodomain

Human aminopeptidase N is a receptor for human coronavirus 229E

Mouse hepatitis virus S RNA sequence reveals that nonstructural proteins ns4 and ns5a are not essential for murine coronavirus replication

Mouse hepatitis virus utilizes two carcinoembryonic antigens as alternative receptors

Biosynthesis, structure, and biological activities of envelope protein gp65 of murine coronavirus

In vitro assembled, recombinant infectious bronchitis viruses demonstrate that the 5a open reading frame is not essential for replication

Strategy for systematic assembly of large RNA and DNA genomes: Transmissible gastroenteritis virus model

Systematic assembly of a full-length infectious cDNA of mouse hepatitis virus strain A59

Reverse genetics with a full-length infectious cDNA of severe acute respiratory syndrome coronavirus

Specific binding of host cellular proteins to multiple sites within the 3 0 end of mouse hepatitis virus genomic RNA

A conserved motif at the 3 0 end of mouse hepatitis virus genomic RNA required for host protein binding and viral RNA replication

Mouse hepatitis virus gene 5b protein is a new virion envelope protein

Severe acute respiratory syndrome coronavirus nucleocapsid protein expressed by an adenovirus vector is phosphorylated and immunogenic in mice

Conformational changes in the spike glycoprotein of murine coronavirus are induced at 37 degrees C either by soluble murine CEACAM1 receptors or by pH 8

Insights into SARS-CoV transcription and replication from the structure of the nsp7-nsp8 hexadecamer

Interactions between the cytoplasmic proteins and the intergenic (promoter) sequence of mouse hepatitis virus RNA: Correlation with the amounts of subgenomic mRNA transcribed

Coronavirus leader RNA regulates and initiates subgenomic mRNA transcription both in trans and in cis

Formation of a ribonucleoprotein complex of mouse hepatitis virus involving heterogeneous nuclear ribonucleoprotein A1 and transcription-regulatory elements of viral RNA

Biological and genetic characterization of a hemagglutinating coronavirus isolated from a diarrhoeic child

Presence of subgenomic mRNAs in virions of coronavirus IBV

The amino and carboxyl domains of the infectious bronchitis virus nucleocapsid protein interact with 3 0 genomic RNA

The infectious bronchitis virus nucleocapsid protein binds RNA sequences in the 3 0 terminus of the genome

The coronavirus replicase

Processing of the human coronavirus 229E replicase polyproteins by the virus-encoded 3C-like proteinase: Identification of proteolytic products and cleavage sites common to pp1a and pp1ab

Virus-encoded proteinases and proteolytic processing in the Nidovirales

The autocatalytic release of a putative RNA virus transcription factor from its polyprotein precursor involves two paralogous papain-like proteases that cleave the same peptide bond

Sequence motifs involved in the regulation of discontinuous coronavirus subgenomic RNA synthesis

I am indebted to Lawrence Sturman for first telling me what coronaviruses are and why I should care. I am grateful to Adriana Verschoor, Wadsworth Center Publications Editor, for improving the style and accuracy of the manuscript. This work was supported in part by Public Health Service grants AI 45695, AI 60755, and AI 64603 from the National Institutes of Health.