key: cord-253178-c41xejo3
authors: Neuman, B.W.; Buchmeier, M.J.
title: Supramolecular Architecture of the Coronavirus Particle
date: 2016-09-15
journal: Adv Virus Res
DOI: 10.1016/bs.aivir.2016.08.005
sha:
doc_id: 253178
cord_uid: c41xejo3

Coronavirus particles serve three fundamentally important functions in infection. The virion provides the means to deliver the viral genome across the plasma membrane of a host cell. The virion is also a means of escape for newly synthesized genomes. Lastly, the virion is a durable vessel that protects the genome on its journey between cells. This review summarizes the available X-ray crystallography, NMR, and cryoelectron microscopy structural data for coronavirus structural proteins, and looks at the role of each of the major structural proteins in virus entry and assembly. The potential wider conservation of the nucleoprotein fold identified in the Arteriviridae and Coronaviridae families and a speculative model for the evolution of corona-like virus architecture are discussed.

A virus particle is essentially a ruggedized viral genome that contains at least the minimal set of components necessary to propagate a virus infection. While virions from most viruses are considered to be metabolically inactive, some undergo internal structural changes after release, including protease-dependent retrovirus maturation (Konvalinka et al., 2015) and bicaudovirus elongation (Haring et al., 2005; Scheele et al., 2011). In addition to the viral genome and four conserved virally encoded structural proteins, coronavirus particles have been shown to contain a variety of packaged host-encoded proteins (Dent et al., 2015; Kong et al., 2010; Neuman et al., 2008; Nogales et al., 2012), including enzymes that may play important roles in promoting or preventing infection, such as protein kinases (Neuman et al., 2008; Siddell et al., 1981), cyclophilin A (Neuman et al., 2008; Pfefferle et al., 2011), and APOBEC3G (Wang and Wang, 2009).
Purified virions can also contain low levels of some virus-encoded replicase proteins (Neuman et al., 2008; Nogales et al., 2012), though packaged replicase proteins have not been shown to enhance infectivity. While the positive-sense ssRNA genome is infectious for members of the genera Alphacoronavirus (Almazan et al., 2000; Donaldson et al., 2008b; Jengarn et al., 2015; Tekes et al., 2008; Thiel et al., 2001; Yount et al., 2000), Betacoronavirus (Donaldson et al., 2008a; Scobey et al., 2013; Yount et al., 2003), and Gammacoronavirus (Casais et al., 2001), it has been shown that expression of the viral nucleoprotein and nsp3 can together or separately promote infection (Hurst et al., 2010, 2013; Pan et al., 2008; Schelle et al., 2005; Thiel et al., 2001). Taken together, these data demonstrate that packaged virion proteins are not essential for infection, but suggest that packaged proteins may confer a small replication advantage.

Coronaviruses encode three conserved membrane-associated proteins that are incorporated in virions, spike (S), envelope (E), and membrane (M), along with the nucleoprotein (N; Fig. 1). These four proteins occur in the order S-E-M-N in every known coronavirus lineage (Woo et al., 2014). In between the S-E-M-N genes, coronaviruses encode species-specific accessory proteins, many of which appear to be incorporated in virions at low levels, ranging from one accessory in alphacoronaviruses including human coronavirus NL63 (Pyrc et al., 2004) to a predicted nine accessories in the gammacoronavirus HKU22 (Woo et al., 2014). The genomic position of these accessory genes varies, with accessories encoded before S in some betacoronaviruses, between S and E in most lineages, between M and N in most lineages, and after N rarely in alphacoronaviruses and gammacoronaviruses and commonly in deltacoronaviruses.
Interestingly, the M gene appears to directly follow the E gene throughout the Coronaviridae, though there is no obvious transcriptional or translational reason why this should necessarily be the case. In one study, deletion of E resulted in the evolution of spontaneously joined gene fragments of M encoded in the position normally occupied by E (Kuo and Masters, 2010).

[Fig. 1, partial caption] ... (Neuman et al., 2011). An alternative shorter, wider cryo-EM reconstruction of SARS-CoV virion proteins is shown for comparison (G; EMD-1423). Schematics are based on MHV proteins, following the annotation of Kuo et al. (2016). The orientation of N protein domains shown here is hypothetical, and is intended for illustrative purposes only. (H) Domain structure and annotation of MHV S, E, M, and N proteins showing domains outside the virion (solid blue), inside the virion (striped blue), and the position of solved structures. Transmembrane regions (TM), sites of palmitoylation (Acyl), a conserved sequence preceding the transmembrane of S (KW), a serine-arginine-rich unstructured region (SR), phosphorylation sites (stars), and the C-terminal M-interacting domain of N (N3) are marked. Comparison of the appearance of dimeric M LONG and M COMPACT (I).

The coronavirus particle provides three important things for the genome: a durable transport vessel, a means of escape for newly synthesized genomes, and a means of entry into a cell. The following sections will examine each of the key functions of the coronavirus particle in durability, budding, and entry.

The genome likely comprises a relatively small part of the internal volume of a virion. MHV produces virions that are approximately 80 nm in diameter, which is typical for hydrated coronavirus particles in vitreous ice (Barcena et al., 2009; Neuman et al., 2006, 2011).
The average radius of MHV virions is about 42 nm (Neuman et al., 2011); subtracting 8 nm occupied by the viral membrane and M protein at each side (Neuman et al., 2011), and assuming a spherical virion, gives a predicted interior volume of 1.6 × 10^5 nm^3 for a coronavirus particle. The volume of a coronavirus genome, calculated using a partial specific volume of 0.57 cm^3/g for ssRNA (Voss and Gerstein, 2005) and the molecular mass of the smallest coronavirus genome (porcine deltacoronavirus HKU15, 25.4 kb) and the largest (bottlenose dolphin coronavirus HKU22, 31.8 kb), is 0.8-1.0 × 10^4 nm^3 per genome, equivalent to 5-6% of the virion interior. From the estimated 0.7-2.2 × 10^3 N proteins per virion (Neuman et al., 2011) and the volume of about 60 nm^3 per N protein (Neuman et al., 2006), we can calculate that N proteins should occupy 0.4-1.3 × 10^5 nm^3 or 25-80% of the virion interior, and that each N is associated with 14-40 nt of genomic RNA (Neuman et al., 2011). In their native state, virions are filled with water, and preimaging procedures that remove the water from the virion have led to the mistaken but strangely persistent ideas that both arteriviruses (Horzinek et al., 1971) and coronaviruses (Risco et al., 1996) contain an icosahedrally organized ribonucleoprotein core. The shape of hydrated coronavirus particles in water ice as observed by cryoelectron microscopy is roughly spherical, and shows no sign of icosahedral organization (Barcena et al., 2009; Beniac et al., 2006, 2007; Neuman et al., 2006, 2011). The same is true of Arteriviridae (Spilman et al., 2009), and probably also of Mesoniviridae (Nga et al., 2011). Other members of the Nidovirales produce bacillus-shaped particles, including Bafinivirus (Schutze et al., 2006) and Okavirus (Spann et al., 1997). In cell culture, toroviruses produce a mixture of spherical and bacilliform particles (B. Neuman, unpublished data).
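The interior, genome, and N protein volume estimates given above can be checked with a few lines of arithmetic. The following is a minimal sketch rather than the authors' calculation: the average nucleotide mass of 330 Da is an assumption that the text does not state, and the function and variable names are hypothetical.

```python
import math

AVOGADRO = 6.022e23      # molecules per mole
NM3_PER_CM3 = 1e21       # 1 cm^3 = 1e21 nm^3
NT_MASS_DA = 330.0       # ASSUMPTION: average mass of one ssRNA nucleotide (Da)


def sphere_volume_nm3(radius_nm):
    """Volume of a sphere of the given radius, in nm^3."""
    return 4.0 / 3.0 * math.pi * radius_nm ** 3


def genome_volume_nm3(length_nt, psv_cm3_per_g=0.57):
    """Volume of an ssRNA genome from its length and partial specific volume."""
    mass_g = length_nt * NT_MASS_DA / AVOGADRO   # mass of one genome, in grams
    return mass_g * psv_cm3_per_g * NM3_PER_CM3


# Interior radius: 42 nm virion radius minus 8 nm of membrane and M protein.
interior_nm3 = sphere_volume_nm3(42.0 - 8.0)

# Smallest (25.4 kb) and largest (31.8 kb) genomes quoted in the text.
genome_small_nm3 = genome_volume_nm3(25_400)
genome_large_nm3 = genome_volume_nm3(31_800)

# 0.7-2.2 x 10^3 N proteins per virion at about 60 nm^3 each.
n_low_nm3 = 700 * 60.0
n_high_nm3 = 2200 * 60.0

print(f"interior volume: {interior_nm3:.3g} nm^3")
for label, vol in [("smallest genome", genome_small_nm3),
                   ("largest genome", genome_large_nm3),
                   ("N protein, low estimate", n_low_nm3),
                   ("N protein, high estimate", n_high_nm3)]:
    print(f"{label}: {vol:.3g} nm^3, {100 * vol / interior_nm3:.0f}% of interior")
```

Under these assumptions the results fall close to the ranges quoted in the text. The sketch does not attempt to reproduce the 14-40 nt per N protein figure, which is taken from Neuman et al. (2011).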
Careful measurement of coronavirus particles in cryoelectron micrographs and tomograms reveals that most coronavirus particles are slightly prolate spheroids that differ from the shape of more spherical exosomal vesicles that appear in the same images (Neuman et al., 2011). While the rigidity of coronavirus particles remains to be investigated, the shape difference, along with the observation that coronavirus particles remain roughly spherical despite repeated prodding with a carbon nanotube during atomic force microscopy (Ng et al., 2004), suggests that coronavirus particles are relatively resistant to deformation, as recently reported for influenza A virus. An enveloped virion is not necessarily fragile or quickly inactivated, as demonstrated by the enveloped pithovirus that was successfully recovered from 30,000-year-old permafrost (Legendre et al., 2014). Coronavirus particles are relatively robust compared to HIV-1, with SARS-CoV virions remaining infectious for 1-4 days in the relatively harsh environment of hard surfaces (reviewed in Sobsey and Meschke, 2003). MERS-CoV virions are somewhat less robust than SARS-CoV, with half-lives on the order of an hour on hard surfaces and a maximum survival time of 2-3 days, but are considerably more durable than pandemic influenza A virus under the same conditions (van Doremalen et al., 2013). The persistent infectivity of coronaviruses outside the body has been used as evidence to suggest that direct contact with contaminated surfaces, as well as respiratory droplets, is a potential route of MERS-CoV spread (Assiri et al., 2013; Goh et al., 2013).

The M protein facilitates viral assembly by interacting with other M (Arndt et al., 2010; de Haan et al., 2000), E (Boscarino et al., 2008; Corse and Machamer, 2003; Lim and Liu, 2001), S (de Haan et al., 1999; Godeke et al., 2000), and N proteins (Escors et al., 2001; Hurst et al., 2005; Narayanan et al., 2000; Sturman et al., 1980).
The MHV M protein may also interact with the RNA packaging signal that mediates incorporation of the viral genome into viral particles (Narayanan et al., 2003), but direct M-genome interactions are not efficient enough to rescue viruses in which the C-terminal region of N that interacts with M has been perturbed, suggesting that M-genome interactions are less important than M-N interactions for recovery of infectious virus (Kuo et al., 2016). Communication between the carboxyl termini of the M and N proteins has also been observed from mutagenesis and second-site reversion studies (Hurst et al., 2005; Kuo and Masters, 2002). M proteins in SARS-CoV, FCoV, and MHV virions and virus-like particles (VLPs) form homodimers (Neuman et al., 2011), which appear to be functionally analogous to the M-GP5 heterodimers of Arteriviridae (de Vries et al., 1995; Faaberg et al., 1995; Snijder et al., 2003). Coronavirus M dimers resemble the shape of a Greek amphora, with the ectodomain forming the lip, the transmembrane region forming the narrow neck, and the endodomain forming the lower chamber (Fig. 1I). In virions it appears that the transmembrane region does not make contact between adjacent M dimers, suggesting that reported M-M interaction domains in the transmembrane region are between the two monomers that make up an M dimer. M protein deletion mutants were not assembly competent in the absence of wild-type M. Thus M-M interactions are necessary, but not always sufficient, for VLP assembly. In contrast, the endodomains of M dimers appeared to make close contact in the cryo-EM reconstructions (Neuman et al., 2011), suggesting that interactions between M dimers are likely to occur in the endodomain. The endodomain of MHV M is important for M-M interactions that could involve monomers or dimers.
Purified SARS-CoV M endodomains can dimerize and can form M dimer-dimer interactions (Neuman et al., 2011), suggesting that the endodomain is the primary site of M dimer-dimer interaction. Cryo-EM and cryoelectron tomography analysis also suggests that M can exist in two forms (Neuman et al., 2011). The main form found on virions and VLPs is an elongated conformation that makes contact with the ribonucleoprotein and imparts a spherical membrane curvature of about 5-6 degrees per M dimer. The minor form is more compact, has indistinct boundaries suggestive of a disordered aggregate, and does not appear to impart membrane curvature. The long conformation could be partially converted to the shorter conformation by transient acidification, weakening M-RNP interactions. This suggests a model in which formation of M LONG-RNP interactions drives the budding process, and membrane fusion is preceded by release of M LONG-RNP interactions. However, further work is needed to test the accuracy of this model. The structure of the M protein is not known, but may be partially inferred from sequence comparison and secondary structure prediction algorithms. M proteins possess a short glycosylated ectodomain of variable sequence (Oostra et al., 2006), followed by three closely spaced hydrophobic transmembrane helix signatures, and a relatively long cytoplasmic tail region that may fold into a compact beta-dominated structure (Masters et al., 2006). The weight of evidence suggests that coronavirus M and E proteins are the critical components required for assembly of coronavirus virions and VLPs (Bos et al., 1996; Corse and Machamer, 2000; Vennema et al., 1996b). However, the SARS-CoV M protein appears to readily form VLPs in the absence of E (Tseng et al., 2010), as do the M and S proteins of IBV (Liu et al., 2013).
M protein sequences from different coronaviruses are highly conserved, approaching the level of conservation of some of the viral enzymes and replicase accessory proteins from pp1a (Stadler et al., 2003). During the assembly process, N protein contributes to the formation of M LONG, narrowing the size range of resulting VLPs (Fig. 2; Neuman et al., 2011) and increasing the efficiency of VLP production (Boscarino et al., 2008; Siu et al., 2008). A study of VLP size and organization showed that MHV VLPs and virions with different components formed particles with a constrained minimum size, and that the size range of particles became narrower and approached the minimum size of VLPs as more structural components were incorporated (Fig. 2; Neuman et al., 2011). This suggests that incorporation of S and the genome, which are both essential for virion infectivity but dispensable for VLP production, affected the efficiency of budding in the sense that larger particles incorporate more M proteins.

Coronavirus nucleoproteins are phosphoproteins, and are encoded near the 3′ end of the genome. MHV N is phosphorylated at six sites (S162, S170, T177, S389, S424, and T428; White et al., 2007) by host kinases like cyclin-dependent kinase, glycogen synthase kinase, mitogen-activated protein kinase, and casein kinase II (Surjit et al., 2005). The protein is also sumoylated (at Lys62 in the SARS-CoV N protein), a posttranslational modification that enhances the protein's tendency to homooligomerize and affects typical N protein-mediated interference in host cell division (Li et al., 2005b). Several groups have successfully expressed soluble protein (both full-length and partial domains), but its unusually high positive charge, tendency to oligomerize, structural flexibility, and extremely low stability have hampered structural studies of the whole protein (Chang et al., 2006).
N possesses two RNA-binding domains: an N-terminal domain with an adjacent S/R-rich motif and the C-terminal 209 amino acids (Surjit et al., 2004; Yu et al., 2005). The N protein also binds with nanomolar affinity to human cyclophilin A, though the physiological significance of this finding is still unknown (Luo et al., 2004; Pfefferle et al., 2011). N protein supports coronavirus infection in several ways: the C-terminal domain (CTD) of N is important for binding the genomic RNA packaging signal leading to selective genome incorporation (Kuo et al., 2014; Molenkamp and Spaan, 1997), the N3 domain interacts with the endodomain of M to form virions (Kuo et al., 2016), and the serine-arginine repeat region of N (SR) interacts with the first ubiquitin-like domain of nsp3 in a critical early replication step (Hurst et al., 2010, 2013). It has also been demonstrated that N can oligomerize through interactions in the CTD (Chang et al., 2013), bind viral RNA through the N-terminal domain (Fan et al., 2005), unwind double-stranded nucleic acid in the manner of an RNA chaperone (Neuman et al., 2008; Zuniga et al., 2007), and pack in a helix through the N-terminal domain (Saikatendu et al., 2007), though none of these other functions has yet been demonstrated to be important for infection. N protein is dynamically associated with sites of viral RNA replication, suggesting that N may also function to protect the genome or possibly mediate genome transport to the budding site (Verheije et al., 2010). The nucleoprotein of coronavirus is not a good match for nucleoproteins of other members of the Nidovirales at the level of amino acid sequence. However, the small N protein of arteriviruses EAV (Deshpande et al., 2007) and PRRSV (Doan and Dokland, 2003) adopts a similar fold to the CTD of coronavirus N.
Alignment of predicted protein secondary structures from N proteins from other nidoviruses with the structures of coronavirus and arterivirus N suggests that a helix-strand-strand-helix motif may form a conserved functional domain near the C-terminus of all nidovirus N proteins (Fig. 3).

E proteins are encoded by all known coronavirus genomes and are found at low levels in the virion (Godet et al., 1992; Liu and Inglis, 1991). As pointed out by Kuo and Masters (Kuo et al., 2016), E appears to have three distinct functions that contribute to infection: regulating aggregation-prone M-M interactions (Boscarino et al., 2008), disrupting Golgi organization in a way that produces larger vesicles capable of transporting virions (Machamer and Youn, 2006; Machamer, 2011, 2012), and interacting with host factors in a way that affects pathogenesis (DeDiego et al., 2007; Dediego et al., 2008; Nieto-Torres et al., 2015; Regla-Nava et al., 2015; Teoh et al., 2010). E proteins of several coronaviruses have been reported to have ion channel activity (Liao et al., 2004; Madan et al., 2005; Wilson et al., 2004), which appears to enhance viral growth (Wilson et al., 2006; Ye and Hogue, 2007). In addition to these three roles, E proteins have been speculated to be involved in scission of the membrane to free newly budded virions, based on evidence from MHV VLP formation in the presence of E (Vennema et al., 1996a) and aberrant virion formation in E mutant viruses (Fischer et al., 1998), analogous to the function of M2 in influenza A virus (Rossman et al., 2010). However, it is not clear whether the role of E in virion release is distinct from its role in limiting M aggregation (Boscarino et al., 2008), and SARS-CoV M does not appear to require the presence of E to generate VLPs (Tseng et al., 2010), suggesting that the role of E differs, or differs in importance, in some coronaviruses.
E is probably best viewed as a multifunctional accessory gene that contributes to both virus growth and pathogenesis. Expression of SARS-CoV E protein is not strictly required for virus growth, but its deletion results in severe growth defects, presumably related to inefficient assembly (DeDiego et al., 2007; Kuo and Masters, 2003). Interestingly, deletion of the SARS-CoV E gene has less effect on replication than the corresponding E gene deletion in MHV, suggesting that the function of E may be duplicated elsewhere in the genome. A candidate for the compensating factor could be the SARS-CoV 3a protein, which similarly displays ion channel activity (Lu et al., 2006) and serves to increase the viral growth rate when present (Yount et al., 2005), or possibly the SARS-CoV ORF6-encoded protein, which is predicted to adopt a similar membrane topology to E and induces intracellular membrane rearrangement (Zhou et al., 2010). In MHV, duplication of the amino-terminal part of M can partially compensate for the deletion of E, suggesting that the functions of MHV M and E proteins may overlap (Kuo and Masters, 2010). The 229E-4a accessory protein also has ion channel activity, but it is not known whether this can compensate for deletion of E (Zhang et al., 2014). In contrast, deletion of E in TGEV prevented the formation of infectious virions (Ortego et al., 2007). A number of general structural features can be identified in all coronavirus E proteins, including a short hydrophilic N-terminal region, followed by a hydrophobic putative transmembrane region, and a relatively long hydrophilic C-terminal tail. SARS-CoV E protein forms pentamers that conduct cations (Pervushin et al., 2009), which is likely a conserved feature of E proteins.
There is evidence that IBV E protein has different functions that correlate to different oligomerization states in mutant E proteins: monomeric E is sufficient to disrupt the Golgi, while oligomerization-competent E supports VLP release (Westerbeck and Machamer, 2015). The target of ion channel activity may be disruption of intracellular processes by releasing calcium ions stored in the endoplasmic reticulum, ultimately driving an inflammatory immune response. The E protein is palmitoylated in at least some coronaviruses (Corse and Machamer, 2002; Liao et al., 2006), and palmitoylation is important for the role of MHV E protein in limiting M-M aggregation (Boscarino et al., 2008). The transmembrane and palmitoylated domains of E are each sufficient to colocalize with M protein.

The first open reading frame downstream of the SARS-CoV replicase encodes the S glycoprotein, which is conserved in all coronaviruses. The ~180-kDa spike protein plays a central role in the host cell attachment and entry processes. The spike is organized into an amino-terminal S1 domain that contains receptor-binding determinants, and a carboxyl-terminal S2 domain that contains the membrane anchor and fusion motor domains (Belouzard et al., 2012; Heald-Sargent and Gallagher, 2012). Coronavirus S proteins contain short amino-terminal hydrophobic signal sequence motifs (von Heijne, 1984). Although some coronavirus spike proteins are cleaved between the S1 and S2 regions as part of the activation process, the SARS-CoV spike is not appreciably cleaved before it is internalized in a host cell (Xiao et al., 2003). The more variable amino-terminal region of the spike protein (S1) has been demonstrated to contain the receptor-binding activity (Wong et al., 2004).
The more conserved S2 region contains the transmembrane anchor, a palmitic acid acylation site (Thorp et al., 2006) that is important for membrane fusion (McBride and Machamer, 2010; Shulla and Gallagher, 2009), and the coiled-coil fusion motor domain (Bosch et al., 2003; Duquerroy et al., 2005; Liu et al., 2004; Tripet et al., 2004; Xu et al., 2004a,b). One or more protease cleavage events are necessary to prime S for membrane fusion. An apparent fusion peptide of coronaviruses resides in the S2 region (Bosch et al., 2003; Sainz et al., 2005), where it is exposed by cleavage that takes place on tetraspanin-enriched membranes where host proteases including TMPRSS2 and HAT localize for some coronaviruses including MERS-CoV, while other spikes can be primed for entry by cathepsins (Bertram et al., 2013; Earnest et al., 2015; Glowacka et al., 2011; Heurich et al., 2014; Huang et al., 2006; Shulla et al., 2011; Simmons et al., 2005). This is consistent with host gene knockdown experiments that show that coronavirus entry is dependent on several elements that are important in endosomal and lysosomal trafficking (Burkard et al., 2014; Wong et al., 2015). Expression of IFITM proteins inhibits entry driven by several coronavirus spike proteins (Huang et al., 2011; Wrensch et al., 2014), but paradoxically appears to promote infection by HCoV-OC43 (Zhao et al., 2014). Near-atomic resolution cryo-EM structures have been published for the ectodomains of trimeric MHV (Walls et al., 2016) and HKU1 (Kirchdoerfer et al., 2016) up to the second heptad repeat region. The high-resolution spike ectodomain structures are similar in profile to the upper part of SARS-CoV S in one study (Neuman et al., 2006), and are taller and less square than the spikes reconstructed by another group (Beniac et al., 2006, 2007). High-resolution X-ray crystallography structures have also been obtained for two domains of the coronavirus spike protein.
The structure of the minimal receptor-binding domain from SARS-CoV was solved first in conjunction with angiotensin-I converting enzyme 2 (ACE2; Towler et al., 2004), the primary cellular receptor for SARS-CoV (Li et al., 2003) and human coronavirus NL63 (Hofmann et al., 2005). More recently, S1 receptor structures have been solved for betacoronaviruses MHV (Peng et al., 2011) and MERS-CoV and alphacoronaviruses NL63 (Wu et al., 2009), TGEV (Reguera et al., 2012), and PRCoV (Reguera et al., 2012). The receptor-binding domain structure of SARS-CoV consists of a core subdomain containing a five-stranded antiparallel β sheet with three short connecting α helices, and an extended loop subdomain that contacts ACE2 (Li et al., 2005a), and the domains that mediate S1 receptor contact in other betacoronaviruses also seem to involve curved β sheets at the center with stabilizing loop interactions at the side. In contrast, the receptor-binding domain of alphacoronaviruses makes contact via a series of loops positioned at the edge of a β sheet. Image analysis of cryoelectron micrographs of SARS-CoV (Beniac et al., 2006; Neuman et al., 2006) and other coronaviruses (Neuman et al., 2006) confirms that spikes exist as a homotrimer in the native prefusion state on virions. However, some biochemical characterizations have revealed that S1 interacts with the receptor protein as a dimer, even within the context of the trimeric spike (Lewicki and Gallagher, 2002; Xiao et al., 2004). In the model of coronavirus spike protein-mediated fusion, receptor binding triggers conformational changes including disulfide reshuffling (Gallagher, 1996; Lavillette et al., 2006) that release HR1 and HR2 to form the coiled-coil fusion motor structure, thereby driving fusion of the viral and host cell membranes and release of the viral ribonucleoprotein into the cytosol (Bosch et al., 2003).
The fusion motor complex of S2, consisting of two hydrophobic 4-3 heptad repeat regions (HR1 and HR2) that form the amphipathic helices of a coiled-coil structure, has also been solved in several forms (Bosch et al., 2003; Duquerroy et al., 2005; Liu et al., 2004; Tripet et al., 2004; Xu et al., 2004a,b). In the structure representing the largest region of S2, HR1 forms a trimeric 120 Å coiled coil (Duquerroy et al., 2005). Coordinated chloride ions in the structure are instrumental in the formation of hydrogen bonds that stretch both ends of the HR2 region into an extended conformation surrounding the central alpha helix (Duquerroy et al., 2005). A single-particle fusion study revealed that, after insertion of the fusion peptide into the target membrane, it takes about 15 s to go from membrane insertion to hemifusion, 15-60 more seconds until a pore is formed, then another 30 s for complete lipid mixing between the virion and cell (Costello et al., 2013). Spike protein is incorporated into virions through interactions with the membrane protein M. It is generally accepted that all the determinants for virion incorporation reside in the transmembrane and carboxyl-terminal regions of the spike protein (Kuo et al., 2000). However, a recent study found that a chimeric MHV with SARS-CoV M and S transmembrane and endodomain was severely deficient in incorporating S into virions, suggesting that cellular localization signals or more complex interactions among the structural proteins may help support S incorporation (Kuo et al., 2016).

Working on nidoviruses is a mixed blessing: on one hand, there is sufficient evolutionary divergence and complexity with a few common threads to make it seem possible to reconstruct an evolutionary path from a simple coronavirus-like progenitor to the present array of nidoviruses.
On the other hand, the extreme divergence between homologous proteins means that common ancestry is sometimes only evident at the level of protein fold and conserved function, meaning that homologs cannot necessarily be recognized by amino acid alignment. Two structural proteins stand out as being conserved: coronavirus M and N. One or more M-like three-pass transmembrane proteins with endodomains rich in predicted β structure are found in coronaviruses, toroviruses, bafiniviruses, arteriviruses, the arteri-like possum nidovirus (Dunowska et al., 2012), and the newly discovered toro-like ball python nidovirus (Stenglein et al., 2014). Roniviridae also encode a structural polyprotein of GP116 and GP64 that includes a superficially similar three-pass transmembrane protein known as 3N at its amino terminus, though 3N has not yet been detected in infected cells or virions. Mesoniviridae lack a three-pass M protein, but two single-pass transmembrane proteins with predicted β-rich endodomains known as M and 3b may serve the same function as coronavirus M in the virion. Every member of the Nidovirales also encodes a positively charged N-like protein. Although the proteins differ in size, common predicted structure elements near the C-terminus suggest that nidovirus N proteins may in fact be homologous, as shown in Fig. 3. In comparison, proposed S-like proteins are highly divergent, and E-like proteins are absent in several nidovirus lineages, suggesting S and E are later refinements to nidovirus virion architecture. We can therefore imagine two evolutionary paths that gradually built up to the current complexity of nidovirus structural proteins (Fig. 4).

Fig. 4 Models of coronavirus virion evolution by gradual accumulation of structural proteins. Evolution of potential progenitors with enveloped pleomorphic or helical encapsidated virion architecture is shown leading to a filamentous enveloped intermediate stage, superficially resembling virions of the genus Bafinivirus or the family Roniviridae. Structural diversification by capture of attachment and fusion proteins from an unknown source, and partial duplication of M to make E in some lineages, then leads to modern nidovirus lineages.

The first model involves an ancestral enveloped virus in which a hypothetical protein similar to coronavirus M (but capable of RNA packaging and attachment to host cells) served as the original structural protein in a progenitor of the last common ancestor of nidoviruses. In this model, first proto-N, then S-like and E-like proteins would be added. M protein equivalents of coronavirus, arterivirus, and torovirus are similar in appearance, and are consistent with the size of two protein chains (B. Neuman, unpublished data), suggesting that networks formed of dimers may be an ancestral trait that originated in the hypothetical proto-M. M has many of the characteristics that would be expected of a primitive movement factor. M is generally the most abundant protein in virions, and is essential for VLP formation and for S and N incorporation into VLPs. MHV M protein has been reported to inefficiently mediate incorporation of RNA containing the genomic packaging signal into VLPs in the absence of N (Narayanan et al., 2003), and the structurally similar 3a accessory protein of SARS-CoV may also bind RNA (Sharma et al., 2007), suggesting that RNA packaging could be an ancestral feature of M-like proteins. The M-like GP5 protein of arteriviruses may also be involved in attachment to host cells via its glycosylated ectodomain (Tian et al., 2012), though studies with chimeric PRRSV proteins demonstrated that M and GP5 are not solely responsible for differences in tropism (Lu et al., 2012).
These observations, together with the previously mentioned instance of M protein duplication compensating for deletion of E (Kuo and Masters, 2010), suggest that M-like proteins as a group have the potential to carry out some of the essential functions of E, N, and S.

The other potential evolutionary path we will consider is an encapsidated helical progenitor virus that encoded a proto-N related to the CTD of coronavirus N. Evidence for this is the potential conservation of fold in the CTD, suggesting a common origin for nidovirus N proteins. As described earlier, N selectively binds the genomic RNA and interacts with other N proteins to form a helical ribonucleoprotein (Barcena et al., 2009) reminiscent of the encapsidated forms of helical +RNA viruses of the Tymovirales, Virgaviridae, or Closteroviridae. In this model, a primitive virus with a helical capsid would gain an advantage in attaching to host cells by capturing a gene encoding a membrane-spanning protein like M.

Considered individually, or as a group, viral particles formed by nidoviruses are pleomorphic. Whatever their origin, coronavirus structural proteins demonstrate a remarkable plasticity to accommodate gene deletion, gene duplication, and genetic divergence, while still facilitating the entry, egress, and protection of the genome. It seems fitting, therefore, that pleomorphic nidovirus particles should be formed from a set of structural components that could themselves collectively also be described as pleomorphic.
Engineering the largest RNA virus genome as an infectious bacterial artificial chromosome
A conserved domain in the coronavirus membrane protein tail is important for virus assembly
Hospital outbreak of Middle East respiratory syndrome coronavirus
Cryo-electron tomography of mouse hepatitis virus: insights into the structure of the coronavirion
Mechanisms of coronavirus cell entry mediated by the viral spike protein
Architecture of the SARS coronavirus prefusion spike
Conformational reorganization of the SARS coronavirus spike following receptor binding: implications for membrane fusion
TMPRSS2 activates the human coronavirus 229E for cathepsin-independent host cell entry and is expressed in viral target cells in the respiratory epithelium
The production of recombinant infectious DI-particles of a murine coronavirus in the absence of helper virus
Envelope protein palmitoylations are crucial for murine coronavirus assembly
The coronavirus spike protein is a class I virus fusion protein: structural and functional characterization of the fusion core complex
Coronavirus cell entry occurs through the endo-/lysosomal pathway in a proteolysis-dependent manner
Reverse genetics system for the avian coronavirus infectious bronchitis virus
Modular organization of SARS coronavirus nucleocapsid protein
Transient oligomerization of the SARS-CoV N protein-implication for virus ribonucleoprotein packaging
Infectious bronchitis virus E protein is targeted to the Golgi complex and directs release of virus-like particles
The cytoplasmic tail of infectious bronchitis virus E protein directs Golgi targeting
The cytoplasmic tails of infectious bronchitis virus E and M proteins mediate their interaction
Single particle assay of coronavirus membrane fusion with proteinaceous receptor-embedded supported bilayers
Mapping of the coronavirus membrane protein domains involved in interaction with the spike protein
Assembly of the coronavirus envelope: homotypic interactions between the M proteins
The two major envelope proteins of equine arteritis virus associate into disulfide-linked heterodimers
A severe acute respiratory syndrome coronavirus that lacks the E gene is attenuated in vitro and in vivo
Pathogenicity of severe acute respiratory coronavirus deletion mutants in hACE-2 transgenic mice
Severe acute respiratory syndrome coronavirus envelope protein regulates cell stress response and apoptosis
The proteome of the infectious bronchitis virus Beau-R virion
Structure of the equine arteritis virus nucleocapsid protein reveals a dimer-dimer arrangement
Structure of the nucleocapsid protein of porcine reproductive and respiratory syndrome virus
Systematic assembly and genetic manipulation of the mouse hepatitis virus A59 genome
Systematic assembly of a full-length infectious clone of human coronavirus NL63
Identification of a novel nidovirus associated with a neurological disease of the Australian brushtail possum (Trichosurus vulpecula)
Central ions and lateral asparagine/glutamine zippers stabilize the post-fusion hairpin conformation of the SARS coronavirus spike glycoprotein
Coronavirus and influenza virus proteolytic priming takes place in tetraspanin-enriched membrane microdomains
The membrane M protein carboxy terminus binds to transmissible gastroenteritis coronavirus core and contributes to core stability
Disulfide bonds between two envelope proteins of lactate dehydrogenase-elevating virus are essential for viral infectivity
The nucleocapsid protein of coronavirus infectious bronchitis virus: crystal structure of its N-terminal domain and multimerization properties
Analysis of constructed E gene mutants of mouse hepatitis virus confirms a pivotal role for E protein in coronavirus assembly
Murine coronavirus membrane fusion is blocked by modification of thiols buried within the spike protein
Evidence that TMPRSS2 activates the severe acute respiratory syndrome coronavirus spike protein for membrane fusion and reduces viral control by the humoral immune response
Assembly of spikes into coronavirus particles is mediated by the carboxy-terminal domain of the spike protein
TGEV corona virus ORF4 encodes a membrane protein that is incorporated into virions
Prediction of intrinsic disorder in MERS-CoV/HCoV-EMC supports a high oral-fecal transmission
Virology: independent virus development outside a host
Analysis of multimerization of the SARS coronavirus nucleocapsid protein
Ready, set, fuse! The coronavirus spike protein and acquisition of fusion competence
TMPRSS2 and ADAM17 cleave ACE2 differentially and only proteolysis by TMPRSS2 augments entry driven by the severe acute respiratory syndrome coronavirus spike protein
Human coronavirus NL63 employs the severe acute respiratory syndrome coronavirus receptor for cellular entry
Studies on the substructure of togaviruses. II. Analysis of equine arteritis, rubella, bovine viral diarrhea, and hog cholera viruses
SARS coronavirus, but not human coronavirus NL63, utilizes cathepsin L to infect ACE2-expressing cells
Distinct patterns of IFITM-mediated restriction of filoviruses, SARS coronavirus, and influenza A virus
A major determinant for membrane protein interaction localizes to the carboxy-terminal domain of the mouse coronavirus nucleocapsid protein
An interaction between the nucleocapsid protein and a component of the replicase-transcriptase complex is crucial for the infectivity of coronavirus genomic RNA
Characterization of a critical interaction between the coronavirus nucleocapsid protein and nonstructural protein 3 of the viral replicase-transcriptase complex
Genetic manipulation of porcine epidemic diarrhoea virus recovered from a full-length infectious cDNA clone
Pre-fusion structure of a human coronavirus spike protein
Proteomic analysis of purified coronavirus infectious bronchitis virus particles
Retroviral proteases and their roles in virion maturation
Genetic evidence for a structural interaction between the carboxy termini of the membrane and nucleocapsid proteins of mouse hepatitis virus
The small envelope protein E is not essential for murine coronavirus replication
Evolved variants of the membrane protein can partially replace the envelope protein in murine coronavirus assembly
Retargeting of coronavirus by substitution of the spike glycoprotein ectodomain: crossing the host cell species barrier
Recognition of the murine coronavirus genomic RNA packaging signal depends on the second RNA-binding domain of the nucleocapsid protein
Analyses of coronavirus assembly interactions with interspecies membrane and nucleocapsid protein chimeras
Significant redox insensitivity of the functions of the SARS-CoV spike glycoprotein: comparison with HIV envelope
Thirty-thousand-year-old distant relative of giant icosahedral DNA viruses with a pandoravirus morphology
Quaternary structure of coronavirus spikes in complex with carcinoembryonic antigen-related cell adhesion molecule cellular receptors
Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus
Structure of SARS coronavirus spike receptor-binding domain complexed with receptor
Sumoylation of the nucleocapsid protein of severe acute respiratory syndrome coronavirus
Bending and puncturing the influenza lipid envelope
Expression of SARS-coronavirus envelope protein in Escherichia coli cells alters membrane permeability
Biochemical and functional characterization of the membrane association and membrane permeabilizing activity of the severe acute respiratory syndrome coronavirus envelope protein
The missing link in coronavirus assembly. Retention of the avian coronavirus infectious bronchitis virus envelope protein in the pre-Golgi compartments and physical interaction between the envelope and membrane proteins
Association of the infectious bronchitis virus 3c protein with the virion envelope
Interaction between heptad repeat 1 and 2 regions in spike protein of SARS-associated coronavirus: implications for virus fusogenic mechanism and identification of fusion inhibitors
Coronavirus envelope protein: a small membrane protein with multiple functions
Responses of three different ecotypes of reed (Phragmites communis Trin.) to their natural habitats: leaf surface micro-morphology, anatomy, chloroplast ultrastructure and physio-chemical characteristics
Assembly and immunogenicity of coronavirus-like particles carrying infectious bronchitis virus M and S proteins
Severe acute respiratory syndrome-associated coronavirus 3a protein forms an ion channel and modulates virus release
Chimeric viruses containing the N-terminal ectodomains of GP5 and M proteins of porcine reproductive and respiratory syndrome virus do not change the cellular tropism of equine arteritis virus
Nucleocapsid protein of SARS coronavirus tightly binds to human cyclophilin A
Structures of the N- and C-terminal domains of MHV-A59 nucleocapsid protein corroborate a conserved RNA-protein binding mechanism in coronavirus
The transmembrane domain of the infectious bronchitis virus E protein is required for efficient virus release
Viroporin activity of murine hepatitis virus E protein
Genetic and molecular biological analysis of protein-protein interactions in coronavirus assembly
Palmitoylation of SARS-CoV S protein is necessary for partitioning into detergent-resistant membranes and cell-cell fusion but not interaction with M protein
Identification of a specific interaction between the coronavirus mouse hepatitis virus A59 nucleocapsid protein and packaging signal
Characterization of the coronavirus M protein and nucleocapsid interaction in infected cells
Nucleocapsid-independent specific viral RNA packaging via viral envelope protein and viral RNA signal
Supramolecular architecture of severe acute respiratory syndrome coronavirus revealed by electron cryomicroscopy
Proteomics analysis unravels the functional repertoire of coronavirus nonstructural protein 3
A structural analysis of M protein in coronavirus assembly and morphology
Topographic changes in SARS coronavirus-infected cells at late stages of infection
Discovery of the first insect nidovirus, a missing evolutionary link in the emergence of the largest RNA virus genomes
Subcellular location and topology of severe acute respiratory syndrome coronavirus envelope protein
Severe acute respiratory syndrome coronavirus E protein transports calcium ions and activates the NLRP3 inflammasome
Transmissible gastroenteritis coronavirus RNA-dependent RNA polymerase and nonstructural proteins 2, 3, and 8 are incorporated into viral particles
Glycosylation of the severe acute respiratory syndrome coronavirus triple-spanning membrane proteins 3a and M
Absence of E protein arrests transmissible gastroenteritis coronavirus maturation in the secretory pathway
Genome-wide analysis of protein-protein interactions and involvement of viral proteins in SARS-CoV replication
Crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor
Structure and inhibition of the SARS coronavirus envelope protein ion channel
The SARS-coronavirus-host interactome: identification of cyclophilins as target for pan-coronavirus inhibitors
Genome structure and transcriptional regulation of human coronavirus NL63
Severe acute respiratory syndrome coronaviruses with mutations in the E protein are attenuated and promising vaccine candidates
Structural bases of coronavirus attachment to host aminopeptidase N and its inhibition by neutralizing antibodies
The transmissible gastroenteritis coronavirus contains a spherical core shell consisting of M and N proteins
Influenza virus M2 protein mediates ESCRT-independent membrane scission
The hydrophobic domain of infectious bronchitis virus E protein alters the host secretory pathway and is important for release of infectious virus
A single polar residue and distinct membrane topologies impact the function of the infectious bronchitis coronavirus E protein
Ribonucleocapsid formation of severe acute respiratory syndrome coronavirus through molecular action of the N-terminal domain of N protein
Identification and characterization of the putative fusion peptide of the severe acute respiratory syndrome-associated coronavirus spike protein
Chaperone role for proteins p618 and p892 in the extracellular tail development of Acidianus two-tailed virus
Selective replication of coronavirus genomes that express nucleocapsid protein
Characterization of White bream virus reveals a novel genetic cluster of nidoviruses
Reverse genetics with a full-length infectious cDNA of the Middle East respiratory syndrome coronavirus
The 3a accessory protein of SARS coronavirus specifically interacts with the 5'UTR of its genomic RNA, using a unique 75 amino acid interaction domain
Role of spike protein endodomains in regulating coronavirus entry
A transmembrane serine protease is linked to the severe acute respiratory syndrome coronavirus receptor and activates virus entry
Coronavirus JHM: a virion-associated protein kinase
Inhibitors of cathepsin L prevent severe acute respiratory syndrome coronavirus entry
The M, E, and N structural proteins of the severe acute respiratory syndrome coronavirus are required for efficient assembly, trafficking, and release of virus-like particles
Heterodimerization of the two major envelope proteins is essential for arterivirus infectivity
WHO Virus Survival Report: Survival in the Environment with Special Attention to Survival in Sewage Droplets and Other Media of Fecal or Respiratory Origin
A yellow-head-like virus from Penaeus monodon cultured in Australia
Cryo-electron tomography of porcine reproductive and respiratory syndrome virus: organization of the nucleocapsid
SARS-beginning to understand a new virus
Ball python nidovirus: a candidate etiologic agent for severe respiratory disease in Python regius
Isolation of coronavirus envelope glycoproteins and interaction with the viral nucleocapsid
The nucleocapsid protein of the SARS coronavirus is capable of self-association through a C-terminal 209 amino acid interaction domain
The severe acute respiratory syndrome coronavirus nucleocapsid protein is phosphorylated and localizes in the cytoplasm by 14-3-3-mediated translocation
Genome organization and reverse genetic analysis of a type I feline coronavirus
The SARS coronavirus E protein interacts with PALS1 and alters tight junction formation and epithelial morphogenesis
Infectious RNA transcribed in vitro from a cDNA copy of the human coronavirus genome cloned in vaccinia virus
Palmitoylations on murine coronavirus spike proteins are essential for virion assembly and infectivity
Arterivirus minor envelope proteins are a major determinant of viral tropism in cell culture
ACE2 X-ray structures reveal a large hinge-bending motion important for inhibitor binding and catalysis
Structural characterization of the SARS-coronavirus spike S fusion protein core
Stability of Middle East respiratory syndrome coronavirus (MERS-CoV) under different environmental conditions
Nucleocapsid-independent assembly of coronavirus-like particles by co-expression of viral envelope protein genes
The coronavirus nucleocapsid protein is dynamically associated with the replication-transcription complexes
Analysis of the distribution of charged residues in the N-terminal region of signal sequences: implications for protein export in prokaryotic and eukaryotic cells
Calculation of standard atomic volumes for RNA and comparison with proteins: RNA is packed more tightly
Cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer
APOBEC3G cytidine deaminase association with coronavirus nucleocapsid protein
Structure of MERS-CoV spike receptor-binding domain complexed with human receptor DPP4
A coronavirus E protein is present in two distinct pools with different effects on assembly and the secretory pathway
Identification of mouse hepatitis coronavirus A59 nucleocapsid protein phosphorylation sites
SARS coronavirus E protein forms cation-selective ion channels
Hexamethylene amiloride blocks E protein ion channels and inhibits coronavirus replication
A 193-amino acid fragment of the SARS coronavirus S protein efficiently binds angiotensin-converting enzyme 2
Genome-wide screen reveals valosin-containing protein requirement for coronavirus exit from endosomes
Discovery of a novel bottlenose dolphin coronavirus reveals a distinct species of marine mammal coronavirus in Gammacoronavirus
IFITM proteins inhibit entry driven by the MERS-coronavirus spike protein: evidence for cholesterol-independent mechanisms
Crystal structure of NL63 respiratory coronavirus receptor-binding domain complexed with its human receptor
The SARS-CoV S glycoprotein: expression and functional characterization
Oligomerization of the SARS-CoV S glycoprotein: dimerization of the N-terminus and trimerization of the ectodomain
Structural basis for coronavirus-mediated membrane fusion. Crystal structure of mouse hepatitis virus spike protein fusion core
Crystal structure of severe acute respiratory syndrome coronavirus spike protein fusion core
Role of the coronavirus E viroporin protein transmembrane domain in virus assembly
Strategy for systematic assembly of large RNA and DNA genomes: transmissible gastroenteritis virus model
Reverse genetics with a full-length infectious cDNA of severe acute respiratory syndrome coronavirus
Severe acute respiratory syndrome coronavirus group-specific open reading frames encode nonessential functions for replication in cell cultures and mice
Recombinant severe acute respiratory syndrome (SARS) coronavirus nucleocapsid protein forms a dimer through its C-terminal domain
Crystal structure of the severe acute respiratory syndrome (SARS) coronavirus nucleocapsid protein dimerization domain reveals evolutionary linkage between corona- and arteriviridae
The ORF4a protein of human coronavirus 229E functions as a viroporin that regulates viral production
Interferon induction of IFITM proteins promotes infection by human coronavirus OC43
The N-terminal region of severe acute respiratory syndrome coronavirus protein 6 induces membrane rearrangement and enhances virus replication
Coronavirus nucleocapsid protein is an RNA chaperone