key: cord-0822874-i35azado authors: Ruigrok, Katinka; Vaney, Marie-Christine; Buchrieser, Julian; Baquero, Eduard; Hellert, Jan; Baron, Bruno; England, Patrick; Schwartz, Olivier; Rey, Felix A.; Backovic, Marija title: X-ray Structures of the Post-fusion 6-Helix Bundle of the Human Syncytins and their Functional Implications date: 2019-12-06 journal: J Mol Biol DOI: 10.1016/j.jmb.2019.10.020 sha: adf19a9b6146fd075d668ad7a41b4f9e6dc48a51 doc_id: 822874 cord_uid: i35azado The retroviral envelope-derived proteins syncytin-1 and syncytin-2 (syn1 and syn2) drive placentation in humans by forming a syncytiotophoblast, a structure allowing for an exchange interface between maternal and fetal blood during pregnancy. Despite their essential role, little is known about the molecular mechanism underlying the syncytins' function. We report here the X-ray structures of the syn1 and syn2 transmembrane subunit ectodomains, featuring a 6-helix bundle (6HB) typical of the post-fusion state of gamma-retrovirus and filovirus fusion proteins. Contrary to the filoviruses, for which the fusion glycoprotein was crystallized both in the post-fusion and in the spring-loaded pre-fusion form, the highly unstable nature of the syncytins' prefusion form has precluded structural studies. We undertook a proline-scanning approach searching for regions in the syn1 6HB central helix that tolerate the introduction of helix-breaker residues and still fold correctly in the pre-fusion form. We found that there is indeed such a region, located two α-helical turns downstream a stutter in the central coiled-coil helix - precisely where the breaks of the spring-loaded helix of the filoviruses map. These mutants were fusion-inactive as they cannot form the 6HB, similar to the “SOSIP” mutant of HIV Env that allowed the high-resolution structural characterization of its labile pre-fusion form. These results now open a new window of opportunity to engineer more stable variants of the elusive pre-fusion trimer of the syncytins and other gamma-retroviruses envelope proteins for structural characterization. Syncytins are cellecell fusion proteins that drive placenta formation by inducing the merger of peripheral blastocyst cells, called cytotrophoblasts, into a giant syncytium called syncytiotrophoblast. This structure provides an exchange interface between maternal and fetal blood, thereby establish-ing a nutrient supply to the fetus [1] . Despite the syncytins' key role in placental morphogenesis and likely in a number of pregnancy diseases [2] , as well as their involvement in osteoclast fusion during bone formation [3] , they remain poorly characterized molecularly. The syncytin genes are among the numerous endogenous retroviral elements [4] that account for 8% of the human genome [5] . Although the majority of such elements have accumulated mutations and sequence rearrangements that make them non-functional, some have preserved their open reading frames due to selective advantages for the host. The syncytins are prime examples of such appropriations, involving domestication of the retroviral genes coding for the viral envelope protein (Env) , which mediates fusion of the viral lipid envelope with the plasma membrane of target cells during entry. One or more syncytins have been found in all so far investigated placental mammals, including marsupials [2, 6] , and also in more distant vertebrates such as viviparous lizards [7] . They have been the product of independent endogenization events, with the earliest syncytin capture envisioned as the driving force behind the evolution of egg-laying vertebrates into placental mammals [1] . In humans, there are two syncytins, syncytin-1 and syncytin-2 (syn1 and syn2), that derive from the env genes of the human endogenous retroviruses W and FRD (HERV-W and HERV-FRD), respectively [8] . Both are gamma-retroviruses [9] that entered the human genome around 25 and 45 million years ago, respectively [10e12] . Human syncytins possess a gamma-type Env protein, which is present not only in gamma-retroviruses but also in delta-, alpha-, and in a subset of the beta-retroviruses [13] . The Env protein of all retroviruses infecting vertebrates is a class I viral fusion protein, a class that also includes the fusion glycoproteins (GPs) of the otherwise unrelated influenza viruses, coronaviruses, paramyxoviruses, pneumoviruses, filoviruses, and arenaviruses [14] . Class I proteins are type 1 single-pass transmembrane proteins that fold in the endoplasmic reticulum of the infected cell as a precursor that in most cases trimerizes directly upon folding. An activating proteolytic step cleaves the precursor in two -an N-terminal portion, termed "surface" subunit (SU) in retroviruses -which in general has a receptor binding function, and a Cterminal subunit, which carries the membrane fusion function and is termed "trans-membrane" (TM) subunit, as it includes the trans-membrane anchor (TM A) near its C-terminal end (Fig. 1A) . The SU subunit remains peripherally associated and in some cases is covalently linked to TM by a disulfide bond [13] . Cleavage of the precursor into SU and TM traps the complex in a metastable state (the "pre-fusion" form), with the TM subunit spring-loaded underneath a crown made by SU. Interactions with target cells induce the release of the SU crown, allowing TM to spring out and undergo a major conformational change that drives membrane fusion [15] . A specific feature of the gamma-type Env proteins is the presence of two cysteine-rich motifs in the primary sequence, CXXC in SU and CX 6 CC in TM (where X is any aa), which control formation of an inter-subunit disulfide bond in the pre-fusion form that is reduced upon interaction with a receptor to release SU and induce the fusogenic conformational change in TM required for entry [16] . An important functional element of the TM subunit is the "fusion peptide", a hydrophobic segment located at its N-terminal end and normally buried under the SU crown. The fusogenic conformational change entails first projection of the fusion peptide to a distance about 10 nm or more away from its original location, such that it can reach and insert into the target cell membrane. In this transient conformation, the TM protein has the N-and C-termini at opposite ends of a long trimeric rod, thereby bridging viral and cellular membranes at a distance of about 15 nm [17] . The trimer protomers bend in half, adopting a "hairpin" conformation that brings the Nand C-terminal ends of the TM ectodomain (TM E ) into contact, forcing apposition of the two membranes at a distance within 1 nm. The conformational change of TM thus lowers a kinetic barrier arising from short-range repulsive forces between the two membranes -resulting from the necessary dehydration of both outer leaflets to allow inter-membrane lipid contacts [18] -to catalyze lipid merger. The X-ray structure of TM E in the post-fusion hairpin conformation is known for Env from several retroviruses [19e25] . These structures showed a long a-helix (termed N-helix) immediately downstream of the fusion peptide, which forms a characteristic central, parallel trimeric a-helical coiled-coil. The amino acid sequence in this region has characteristic heptad repeats (HRs), represented as positions abcdefg (Fig. 1B) , where a and d are typically occupied by apolar amino acids that form the hydrophobic core of the coiled-coil. Downstream the N-helix, the polypeptide chain makes an overall 180 turn, such that the C-terminal segment runs backward, antiparallel to the coiled-coil and inserting an a-helix (the "C-helix") along the grooves in-between the long N-helices in the trimer to complete the hairpin. The resulting 6-helix bundle (6HB) thus has the C-terminal segment connecting to the trans-membrane anchors in proximity of the fusion peptides projecting from the N-terminal end. The structure of the pre-fusion form of the retroviral Env ectodomain, in which the N-helix is broken into several short helices to maintain a more compact form underneath the SU crown, remains a challenge. HIV-1, which has a "beta-type" Env, is the only retrovirus for which pre-fusion Env structure is known (reviewed in Refs. [26, 27] ). In this case, as with other class I fusion proteins, the introduction of mutations interfering with formation of the postfusion 6HB were necessary to stabilize the prefusion trimer and make it accessible to structural studies [28] . Structural studies on syncytins have been limited to the X-ray structure of a syn2-TM E fragment missing around half of the TM E [29] . We present here the . The boundaries of the crystallized constructs and the residues resolved in the structures are marked above and below the rectangular diagram of the full-length syncytin-1 and syncytin-2, respectively. The residues limiting different regions within SU and TM are indicated with numbers on the scheme of the full-length protein. The intersubunit disulfide bond is represented with dashed lines, and the intra-TM disulfide bond within CX 6 CC motif in T-loop as an arc symbol. (B) Sequence alignment of syncytin-1 and syncytin-2 TM subunits. The residues are colored according to their structures of the complete syn2-and syn1-TM E in post-fusion conformation, which allowed us to perform structural comparisons of syncytins with other class I fusion proteins, confirming the anticipated structural homology to the retroviral gamma-type Envs and also to GP2 fusion subunit of filoviruses and reptarenaviruses. Our results identified a vertical string of 9 salt bridges crisscrossing the N-terminal part of the N-helix and the C-terminal part of the C-helix, found in syn1 and not in syn2, in line with the higher thermal stability measured for syn1-TM E . The new structures, together with their close structural homology with the filovirus counterparts, guided further functional studies with full-length syn1 variants carrying helix-breaker proline substitutions along the N-helix region. We thus identified locations in which the proline appeared to block formation of the post-fusion form -as monitored by their loss of function in a syncytium formation assay -but remained compatible with folding in the springloaded pre-fusion form as they were trafficked to the cell surface as efficiently as wild-type protein. Although syn1 and syn2 have the same number of residues in total (538 aa) and their TM E are also about the same length (~140 aa), syn1 has a shorter SU and a longer cytosolic tail (Fig. 1A ). We used structural information available for a number of postfusion retroviral TM E conformations [22, 24, 25] together with aa sequence alignments and secondary structure/hydrophobicity plots [30, 31] to define the boundaries of the constructs we used for the structural studies. These analyses (Fig. S1 ) suggested that the fusion peptide spans the TM Nterminal 23e24 aa and that the HRs begin 6-7 residues downstream, encompassing around 48 aa in total for both syncytins. The N-terminal end in the previously reported X-ray structure of a syn2-TM E fragment [29] was about 10 residues downstream these predictions, suggesting that the N-helix was truncated. In the C-terminal segment, our predictions indicated a roughly 25-residue long a-helix downstream the characteristic gamma-type Env CX 6 CC motif, which would correspond to a C-helix that was absent in the reported syn2-TM E structure (Figs. 1A and 2; PDB accession code 1Y4M [29] ). A relatively hydrophobic element is present between the predicted C-helix and the TM A , most likely corresponding to the amphipathic membrane-proximal external region (MPER) identified in a number of other retroviral Env proteins. From these analyses, we designed expression vectors for a recombinant TM E core, termed here syn1-TM E (residues 342e435) and syn2-TM E (residues 375e468) starting immediately downstream of the fusion peptide and ending before the putative MPER (Fig. 1B) . We cloned these constructs into a pET28 vector for production in Escherichia coli, including a histidine tag fused at the N-terminus. Upon purification by nickel affinity followed by size exclusion chromatography (SEC), we obtained yields of about 30 mg of each protein per liter of cell culture. Both proteins formed trimers with a molecular mass of~36 kDa as judged from their SEC elution profiles (data not shown). Syn1-TM E crystallized in a rhombohedral space group (H32), and the crystals, which contained one protomer in the asymmetric unit, diffracted to 2.1 Å resolution. Syn2-TM E formed trigonal crystals (space group P3 2 21) that diffracted to 2.2 Å and had a trimer in the asymmetric unit. The corresponding structures were determined by molecular replacement using the truncated syn2-TM E structure as search model (PDB accession number 1Y4M). The final map displayed clear electron density for residues 345e433 in the crystals of syn1-TM E , with the first three and last two residues of the construct being disordered. For syn2-TM E , the trimer is local, and residues 380e465 are resolved in all three chains of the trimer, with individual protomers displaying additional N-or C-terminal extensions ordered because of stabilization by crystal packing contacts. The atomic models were refined to the maximum resolution of the diffraction data to R work /R free of 0.22/ 0.24 and 0.21/0.26 for syn1-and syn2-TM E , respectively. The statistics concerning the structure determination and refinement of the atomic model are listed in Table S1 . The crystallographic details are described in the Materials and Methods section. The syncytins are typical gamma-type Env proteins The structures showed that the syn1 and syn2 6HBs are formed by an N-helix of 12 turns (about 44 aa organized in 6 HRs) followed by a "turn" (T) conservation: identical residues in white font on a red background, nonconserved residues in black font, and the residues with similar physicochemical properties in red font on a white background. The boundaries of the expression constructs are marked with black vertical lines, and the regions that are not resolved in the structures with dashed lines. The helical register (abcdefg) and the turns in N-and C-helices are plotted above and below the sequence alignment. The stutter residues are underlined. The N-helix asparagines that coordinate chloride ions are labeled with "Cl À ," and the T-loop residues conferring the ISU þ fusion properties to syncytin-2 are highlighted in yellow. Orange arrowheads indicate the boundaries of the previously crystallized syn2-TM E . Postfusion structures of syn1-TME and syn2-TME. Our X-ray structures of syn1-and syn2-TM E are compared with the previously reported syn2 structure (top row) and the structures of closely related MPMV and XMRV TM E (bottom row). The structural elements forming the TM E dN-helix (pale gray), T-loop (blue), C-helix (dark gray)dare indicated, as well as the turns in the helices. The intrachain disulfide bonds are shown as yellow sticks. A pale green tube inside the Nhelices follows the axis of each helix, with the stutter location marked black. A black vertical line marks the 3-fold axis of the coiled-coil. A chloride ion coordinated by asparagine residues is shown as a purple sphere. The residues conferring ISU properties to syn2-TM E and the equivalent residues in other TM E have their side chains displayed as orange sticks, and their names displayed on yellow background. The conserved N-glycosylation site at the beginning of C-helix is labeled as N409, N443, and N487 in syn1-, syn2-, and MPMV TM E , respectively. The PDB codes are displayed below the structure names. region (21 aa), where the chain reverses orientation, and a C-helix of 6 turns (23 aa) (Fig. 2) . Aligning the 6HBs of syn1 and syn2 resulted in 0.9 Å root mean square deviation (rmsd) for 258 aligned Ca atoms (Table S2) . A DALI search [32] using the syn1-and syn2-TM E structures as queries showed that they are most similar to the 6HB of the beta-retrovirus MasonePfizer monkey virus (MPMV), which has a gamma-type Env [33] , and also to the prototypical gamma-retrovirus xenotropic murine leukemia viruserelated virus (XMRV) (Fig. 2 , lower panel). The observed structural similarity is in line with the 51% and 45% aa sequence identity of the MPMV TM E core with syn1 and syn2, respectively, which is higher than between syn1 and syn2 (44% aa sequence identity). In three dimensions (3D), the 6HB of syn1 and syn2 align with 0.9 Å and 1.2 Å rmsd, respectively, onto the MPMV counterpart, with 258 aligned Ca atoms (Table S2 ). The XMRV 6HB is slightly more distant, but still highly similar, as are the 6HBs of the delta-retroviruses human T-lymphotropic virus (HTLV) and bovine leukemia virus (BLV) Env proteins, which are also gamma-type Envs. The most obvious difference is that the BLV and HTLV proteins have two or a single short helix, respectively, replacing the C-helix in the 6HB [24, 25] (Fig. S2A) . The corresponding rmsd values obtained upon 3D alignment on the syn1 and syn2 6HBs are given in Table S2 and Fig. S2B . As also observed in the previously reported gamma-type TM E structures [22, 24, 25] , the 6HB of the syncytins features two "belts" or networks of hydrogen bonds: belt 1 ties the N-helix at turn 7 to its adjacent protomers in the trimer and belt 2 at turns 9 and 10, close to the bottom of the bundle (Fig. 3A ). Belt 1 involves the side chains of several glutamine residues in syn1 and syn2. In the 6HB of MPMV and XMRV TM E , numerous electrostatic interactions were described between charged residues mapping to the same region ( Fig. 3D ) [22] . In addition, the syn1 6HB contains a string of 8 inter-chain and 1 intra-chain salt bridges that tie the first 5 turns of the N-helix to the C-helix of the adjacent protomer in a vertical fashion, i.e. by establishing numerous successive inter-chain and intra-chain contacts (Fig. 3B) . The corresponding residues in the Chelix of the syn2 6HB are uncharged (Fig. S2 , Fig. 3D ), and therefore no salt bridge is formed. Instead, the syn2 6HB features two different salt bridges, one inter-and one intra-chain, in the upper half of the bundle (Fig. 3C , Table S3 ). Akin to what we observe here in syn2, electrostatic interactions also clip the termini of the N-and C-helices of MPMV and XMRV at the open end of the hairpin, although they are more dispersed and less numerous (1 and 2, respectively) (Fig. 3D ). The ionic character of the interactions stabilizing the syn1 6HB is reflected in the negatively charged electrostatic surface potential of the upper half of the coiled-coil in syn1, which is uncharged in syn2 (Fig. 4A) . To understand the contribution of these differences to the stability of the 6HB, we measured the melting temperature (T m ) of the syn1 and syn2 6HBs using differential scanning fluorimetry. We found that the syn1 6HB had a T m of 85 C, about 20 C higher than that of the syn2 6HB (Fig. 4B) , highlighting the higher stability of the former. At the core of belt 2, there is a conserved QNRR motif (aa 381e384 in syn1 and 415e418 in syn2), with an asparagine in place of the hydrophobic residue expected at position a or d in the HR pattern (Fig. 3D ). The asparagine side chain coordinates a central chloride ion on the 3-fold molecular axis, as seen previously in the HTLV-1, BLV, MPMV, and XMRV TM E structures. We refer to the interacting residues in the QNRR motif according to their location relative to the conserved asparagine (N 0 ), as Q À 1 , R þ1 , and R þ2 . In this scheme, the conserved salt bridge connects R þ2 with E þ20 . Both residues were shown to be essential for the fusion activity of XMRV, MPMV, and HTLV Envs [22, 34] . The TM E of the deltaretroviruses contains a second, upstream hydrophilic layer, which is in the case of HTLV-1 gp21 organized around Asn364 that traps a second chloride ion, and in BLV gp21 around Thr353 that chelates a water molecule on the 3-fold axis. These additional hydrophilic layers of inter-chain polar contacts established around a second trapped molecule/ion map to the same region as belt 1 in XMRV, MPMV, and the syncytins 6HB (Fig. S2A ). The polar interactions in belt 2, with the asparagine side chain pointing into the coiled-coil cavity, are likely to be important for establishing the correct register and orientation of the helices, as has been demonstrated for other coiled-coils [35, 36] . The interactions between the residues of the N-and Chelices of the same or the adjacent protomer in the trimer ( Fig. 3A [belt 1] , B, and C) might play a role in stabilization of the 6HB, which is in turn important to drive fusion. A direct test of the role of the residues involved in these interactions by site-directed mutagenesis is complicated by the fact that the 6HB forms only after the fusogenic conformational change, and the lack of functionality of any given point mutant could also be because of an alternative interaction in the pre-fusion form. A detailed analysis of these residues therefore awaits the knowledge of the structure of the TM E moiety in the context of a prefusion syncytin trimer. The T-loop at the chain reversal region starts with a 3/10 helical turn, termed h T helix, followed by a short a-helix (T helix,~2 turns; Fig. 2 ). The single Syn1, which does not have ISU properties, has R393 at the corresponding position. Another residue located in the syn2-TM E T-loop, A433, is important for its fusion activity, and only the simultaneous introduction of these two mutations into syn1, R393Q and F399A (Fig. 1B) , converts syn1 into a functional protein with both ISU and fusogenic activities [37] . These pairs of residues, Q427 and A433 in syn2-TM E , are conserved in MPMV (Q471, A477), while a glutamic acid replaces the glutamine in XMRV (E526) (Fig. 2) . Although syn1 is not immunosuppressive, the main chain conformation in the region spanning the ISU is the same as in syn2, MPMV, and XMRV, which all have functional ISU properties. In addition to the expected structural similarity to the gamma-retrovirus type 6HB, syn1-and syn2-TM E are also structurally very close to the fusion proteins of filoviruses such as Ebola virus (EBOV) and Marburg virus (MARV), as well as the filoviruslike fusion protein of reptarenaviruses such as the California Academy of Sciences virus (CASV), as already pointed out [38, 39] . This is reflected in the low rmsd values obtained upon superposition of their trimers ( 1.9 Å for !83% residues aligned; Table S2 ), as well as conservation of structural motifs such as CX 6 CC, ISU [40] , and belts 1 and 2 (Fig. S2A) . Such a close conservation of features is not observed when comparing syncytins to the 6HB of retroviruses bearing a beta-type Env protein, like the lentiviruses, which lack the conserved CXXC/ CX 6 CC motifs and do not form the inter-subunit TMeSU disulfide bond. In the filoviral-like proteins, the length of the structural elements of the 6HB is similar to the gamma-retroviral counterparts, with the C-helices showing some variability (Table S2, Fig. S2B ). The QNRR-E þ20 motif at the core of belt 2 is LNRK-D þ21 and INRH-D þ21 in EBOV and MARV GP2, respectively (Fig. S2A) . The chloride ion coordinated by the central asparagine is in addition coordinated by a serine side chain located three residues upstreamdat the a position of the same HR [41, 42] (Fig. S2 ). CASV GP2 is slightly different, as the central asparagine is replaced by a histidine residue that coordinates a central water molecule on the trimer axis [39] . The high structural similarity between the syncytins 6HB and the filovirus counterpart is of particular interest, as the fusion proteins of EBOV and MARV have also been structurally characterized in their pre-fusion form [43e45]. The X-ray structures of a number of class I fusion proteins from different viral families have been determined in both pre-and post-fusion forms (reviewed in Ref. [14] ), and the breaks in the Nhelix in the spring-loaded form have been mapped. In several of these structures, including filoviral GP, the break points were found downstream of a special turn in the N-helix that makes a "stutter" in the central coiled-coil. Stutters are defined as HR disruptions in the form of 4-residue insertions in between two HRs [46] and are found in most alphahelical coiled-coils. The stutter-induced deviation from the ideal geometry results in local unwinding of the coiled-coil superhelix, and while the helical axes are almost straight for the first three turns of the Nhelix, their twist becomes evident as the coil progresses towards the C-terminus (Fig. 2) . Stutters have been proposed to impart structural plasticity to regions that function as conformational switches [39] , in agreement with their potential to suggest possible sites for spring-loading the N-helix. Analysis of the coiled-coil by program TWISTER [47] revealed the presence of a stutter at the 3rd turn of the N-helix in both syncytin structures, residues 354 LNGD 357 in syn1 and 388 IANN 391 in syn2-TM E (Figs. 1B and 2) . A stutter is present at the same location in the 6HB of MPMV, XMRV, and BLV, as well as in CASV, two helical turns upstream in HTLV and two turns downstream in the filovirus GP2 6HB (Fig. S2) . Identification of helix-breaker residues along the syncytin N-helix that do not affect transport of the full-length protein to the cell surface Our attempts to produce a biochemically tractable syncytin ectodomain (SU/TM E ) 3 for structural studies have not been successful, and a likely reason is its instability in the spring-loaded form. We therefore bonds formed between charged residues forming 8 salt bridges are shown on the left, and the single intrachain salt bridge is indicated on the right. (C) Syn2-TM E inter-and intrachain salt bridges. Hydrogen bonds formed between the charged residues forming salt bridges at the N-and C-termini of the bundle are shown. (D) Comparison of interactions within 6HBs of syncytins and gamma-type Envs. The residues forming belt 1 and 2 are displayed on orange and green background, respectively. The charged amino acids forming a vertical string in syn1-TM E and the charged residues forming salt bridges mapping to the same region in syn2-, MPMV, and XMRV TM E are indicated as red, bold letters. The numbers below the sequence alignment mark the turns in the N-and C-helices of syncytins' 6HB. Residues at positions a and d within the HRs are indicated (the a position in the stutter is in red). explored the effect of introducing helix-breaking residues (prolines) along the whole length of the Nhelix, with the goal of identifying positions that may not interfere with folding of the syncytins in their prefusion conformation. Such an approach was applied successfully to stabilize the pre-fusion state of HIV-1 Env [28] and later of other class I proteins, such as the spike protein of the Middle East respiratory syndromeerelated coronavirus [48] or the fusion (F) protein of the human respiratory syncytial virus as well as the human metapneumovirus [49, 50] . We generated a set of single proline variants of syn1, containing a helix-breaking residue at a and d positions in the N-helix: Y347P, L350P, L354P, M358P, L365P, L368P, L372P, L375P, and V379P. We monitored by flow cytometry the effect of these mutations on the surface expression of the syn1 variants upon transient transfection (described in Section 'Cell-surface expression of ST syn1 variants analyzed by FACS'), as indicative of a proper prefusion conformation. With this system, we found that the proline substitutions introduced between turns 6 and 7 of the N-helix (L365P, L368P, and L372P) led to a protein that was transported to the cell surface as efficiently as wild type (Fig. 5) . In contrast, proline substitutions further upstream (turns 1, 2, and 3, Y347P, L350P, L354P, M358P) or downstream (turns 8 and 9, L375P and V379P) resulted in The electrostatic potentials of syn1-and syn2-TM E coiled-coils are plotted on the molecular surface, with the C-helix presented as cartoon in pale gray and with numbers 1e5 designating the turns (left side of each panel). The C-helix rotated around the Y-axis for 180 is shown on the right to illustrate the complementarity of the positively charged C-helical surfaces and the highly negatively charged groove formed by the Nhelices in syn1-TM E . (B) 6HB bundle of syncytin-1 is more stable than of syncytin-2. The graph shows unfolding of syn1and syn2-TM E displayed as increase in the intrinsic fluorescence as a function of temperature. The syn1-TM E unfolds at a temperature~20 C higher than syn2-TM E . impaired transport to the cell surface compared with wild-type (Figs. S3A and S3B, respectively), while the transfection efficiency and total syn1 expression were similar in all the samples. As controls, we tested in parallel other sites in the 6HB, for instance, the effect of altering the polar layer at the QNRR motif, a mutation that had been tested for BLV Env [24] . We found that several substitutions at this location -syn1 N382D, N382A, and N382Lpartially affected its trafficking to the cell surface (Fig. S3C) , in contrast to the previously tested BLV Env variants (N367D and N367L), which were expressed at the cell surface at a level similar to that of the WT protein [25] . Mutation of the syn1 disulfide isomerization motif CXXC to SXXC in SU (C186S) also resulted in partially impaired trafficking to the cell surface. Finally, mutating the furin cleavage site (R314S, R317S) resulted in a protein that trafficked to the cell surface without significant differences to the wild-type protein (Fig. 5B) . We then used 293T cells stably transfected with either one of the two complementary fragments of the green fluorescence protein (GFP) to analyze syncytia formation upon transient transfection with full-length syn1 and selected variants. Syncytia formation leads to reconstitution of a functional GFP, allowing easy monitoring of cellecell fusion by bimolecular fluorescence complementation [51] (described in detail in Ref. [52] and in the Material and Methods Section 'Cellecell fusion assay (293T cells)'). These cells constitutively express the syn1 receptor, the alanineeserineecysteine transporter 2 molecule, which is required for syn1-driven cellecell fusion [53] . As expected, the GFP signal appeared in the cells transfected with wild-type syn1, indicative of its fusogenicity (Fig. 6, Fig. 7A ). In contrast, no GFP signal was detected when transfected with the proline variants, including those that underwent efficient trafficking to the cell surface, indicating that the mutants were unable to induce cellecell fusion because they could not form the required 6HB. The variants carrying N382 mutations did induce GFP signal, although at 20% of what was observed for the wild-type syn1. N382D was more active than N382A and N382L, which showed essentially no fusion activity (Figs. 6 and 7A, Fig. S4) , similar to what was observed when asparagine to aspartic acid or leucine mutations were introduced into BLV TM E [25] . Finally, the controls with the altered disulfide motif or the furin site knockout did not result in measurable cell fusion either, consistent with the already published data on the human syncytins [54, 55] , HTLV [16] , and MLV Envs [56] . The structures of the syn1-and syn2-TM E core revealed a 6HB very similar to that of the closely related gamma-type Env proteins of the gammaretrovirus XMRV, the beta-retrovirus MPMV, and the delta-retroviruses HTLV and BLV. They also revealed close similarity to the more distant fusion GPs of the filoviruses MARV and EBOV, and of the filovirus-like CASV. The two polar layers described in the structures of the XMRV and MPMV TM E [22] are also present in the syncytins and have their counterpart in the 6HB of all the proteins mentioned above. A number of these polar interactions on the MPMV and XMRV 6HB were tested by site-directed mutagenesis and shown to interfere with the fusion activity of the corresponding full-length protein [22] . The same residues, however, can potentially be involved in a different set of interactions in the pre-fusion form, and it is therefore difficult to draw strong conclusions about their specific role in stabilization of the post-fusion 6HB and its relation to the membrane fusogenic activity of the protein. Our studies also confirmed that the core TM E fragment of syn2 reported back in 2005 [29] was truncated roughly at half the length of the protein core (Fig. 2) , similar to the first described structure of a gammaretroviral protein in the post fusion form of the murine Moloney leukemia virus reported 23 years ago [19] on which the construct had been based. In those structures, only the second polar belt was present, and as the C-helix was absent, there was no 6HB in the trimer. To fully understand the role of the polar residues observed stabilizing the 6HB, it is important to also know the structure of the spring-loaded, (SU/TM E ) 3 prefusion form. No such structure is available for any gamma-type retroviral Env protein, and the available structure of the beta-type HIV-1 Env protein is too distant to be a useful guide. In contrast, the structure of the fusion GP of the EBOV and MARV filoviruses in the pre-fusion (GP1/ GP2 E ) 3 spring-loaded trimer is available; in the filovirus GP, GP1 and GP2 are equivalent to the retroviral SU and TM subunits, respectively. The high structural similarity of the post-fusion GP2 E 6HB suggests that the region spanning the N-helix in the gamma-type retroviral Env proteins can potentially undergo a similar change in going from the pre-fusion to the post-fusion conformation. Our helix-breaking proline-scanning mutagenesis studies, targeting the a and d positions of the HR downstream of the stutter, identified that proline residues can be introduced at turns 6 and 7 of the Nhelix (mutants L365P, L368P, L372P; Fig. 5A ) without interfering with full-length syn1 being transported to the surface of the transfected cells. This observation strongly indicates that the mutants were well-folded and consequently had passed the folding quality control system of the cell, which was not the case for the proline mutants engineered further upstream (Y347P, L350P, L354P, M358P at turns 1e3 i.e. between the N-terminus and the stutter) or downstream (L375P and L379P at turns 8 and 9, closer to belt 2), which led to decreased transport to the cell surface (Fig. S3) . The similarities between the syn1-TM E and filoviral ectodomain GP2 E post-fusion 6HBs are displayed in Fig. 7B , where the region of the Nhelix that tolerates proline mutations is highlighted, as well as the region that maps in between shorter helices (i.e., to a helix break) in the pre-fusion form of the filovirus GP2. Remarkably, this region is also located two turns downstream of the stutter. The GP1 moiety of the filovirus GP is, however, most likely unrelated to the gamma-retrovirus SU, as the two fusion proteins have very different properties. For instance, filoviruses enter cells by fusion in late endosomes, and the EBOV GP requires further proteolytic cleavage of GP1 by cellular cathepsins after uptake into a target cell, partially removing the GP1 crown [57] . The remainder of GP1 then interacts with a second internal receptor upon this cleavage, the NiemannePick C1 protein [58] , and fusion ensues only at this late-stage membrane. As a result, the pre-fusion form of the filovirus (GP1/ GP2 E ) 3 spike is particularly robust, and no additional stabilization was necessary for the structural studies. The removal of the mucin-like domain, which is heavily glycosylated, and complexation with neutralizing antibody fragments served to improve crystal packing but were not required for stabilization of the pre-fusion conformation per se [43, 44] . In contrast, the syncytins act directly at the plasma membrane, The structures are colored to highlight the stutters (black) and the regions that form helix breaks in the pre-fusion forms of EBOV and MARV GP2 (blue). The blue region in syncytin-1 indicates the locations of the proline mutations that rendered the protein inactive but did not impact its trafficking to the cell surface, indicating that the same region may form a helix break in the syncytin-1 metastable state. The open brackets illustrate the conserved distance between the stutters and the predicted or already known helix breaks in syncytin-1 and filoviral GP2, respectively. The HR2 regions are omitted for clarity. and the fusion trigger is the interaction with their cellsurface receptor (reviewed in Ref. [59] ). In this sense, they are more like the beta-type HIV-1 Env protein, which similarly drives fusion of the viral envelope with the plasma membrane for entry. The syncytins are also reminiscent of the paramyxoviruses and pneumoviruses, which also induce fusion of the viral envelope with the plasma membranes. These viral fusion proteins all required mutagenesis to maintain them in the pre-fusion form to maintain conformational epitopes to use as immunogens and to carry out structural studies [49, 50] . An initial structure of the pre-fusion form, however, guided those trials. With the syncytins, we have worked backward, devising trials by explicit homology of the postfusion form without any information of the pre-fusion form. Our results nonetheless now point to new ways of engineering the pre-fusion form of gamma-type retroviral Env proteins, using mutants with helix-breaking residues in the identified region of the N-helix to destabilize the post-fusion form. Such a strategy may also require further mutations directly stabilizing the prefusion state, in addition to those in the region identified here. Possible combinations include the SU/TM CXXC/CX 6 CC motif to lock the inter-subunit disulfide bond and avoid shedding of the SU subunit, and also the addition of a trimerization motif to keep the pre-fusion trimer together for a high-resolution structural characterization. Expression construct design, protein expression, and purification Synthetic genes encoding the ectodomain portion of the TM subunit (TM E ) of syn1 (residues 342 to 435, GenBank accession number Q9UQF0) and of syn2 (residues 375 to 468, GenBank accession number P60508) were codon optimized for protein expression in bacteria and purchased from GeneArt. Cloning into the expression vector pET28-a vector (Novagen) was done using Gibson Assembly Master Mix (New England Biolabs). The expression constructs contained the N-terminal six-histidine tag followed by a tobacco etch virus cleavage site ENLYFQS (Fig. 1A) . The expression plasmids, syn1-TM E and syn2-TM E , were transformed into E. coli BL21 DE3 cells (New England Biolabs), which were used as the expression host. Cell cultures were grown in Miller lysogeny broth (LB) containing 50 mg/L kanamycin (Sigma) at 37 C until the optical density at 600 nm reached 0.5. Protein expression was induced by addition of 0.5 mM of isopropyl-b-D-thiogalactopyranoside, and 3 h later, the cells from 1 L of culture were collected by centrifugation at 4000Âg for 20 min (Beckman Coulter centrifuge, JLA 8.1 rotor). Cells were lysed by sonication (Bioblock Scientific Vibra Cell sonicator 75041) in 50 ml of buffer containing 50 mM Tris pH 8, 150 mM NaCl, 1 mM ethylenediaminetetraacetic acid, and 0.2 mM phenylmethyl-sulfonyl fluoride. The insoluble fraction was removed by two rounds of centrifugation (3000Âg for 20 min [Eppendorf 5810R centrifuge] and 50,000Âg for 30 min [Beckman Coulter centrifuge, JA25.50 rotor]). The clarified lysate was filtered through a 0.22 mm filter (Millipore) before being loaded to 1 ml HisTrap Excel column (GE Healthcare). Protein purification was performed according to the manufacturer's protocol. Eluted material was concentrated and further purified by SEC on a Superdex 75 16/60 column (GE Healthcare) using 10 mM Tris pH 8, 100 mM NaCl as running buffer. Protein concentration was calculated using the extinction coefficient (calculated on ExPASy server [60] ) of 0.6 ml/mg and 0.7 ml/mg for syn1-and syn2-TM E , respectively. Purified syn1-TM E protein was concentrated to 10 mg/ ml in Vivaspin PES cutoff 10 kDa and crystallized at 18 C by the sitting-drop method with a reservoir solution containing 0.1 M HEPES pH 7.5, 30% v/v 2-propanol, and 0.2 M MgCl 2 . Syn2-TM E was concentrated to 6 mg/ml and crystallized under the same conditions in 0.1 M Tris pH 8.5, 25% v/v tertiary-butanol. The crystals of both proteins were harvested in their mother liquor solutions containing 25% glycerol as a cryoprotectant and flash-frozen in liquid nitrogen. X-ray diffraction data collection and structure determination X-ray diffraction data were collected at the Proxima 2 beamline at the French national synchrotron facility SOLEIL (Gif-sur-Yvette, France). The data were processed using XDS package [61] , and the initial phases were obtained by molecular replacement with Phaser [62] in CCP4 suite [63] , using the coordinates of the previously crystallized fragment of the syn2-TM [29] as a search model (PDB accession number 1Y4M). Model refinement was done in Buster [64] and structure building in Coot [65] . Coiled-coils were analyzed by program TWISTER to determine the presence and position of stutters (https:// pharm.kuleuven.be/apps/biocryst/twister.php). The sequence alignments were generated in Clustal Omega [66] , and the one shown in Fig. S2A was further manually modified, in the T-loop and C-helix regions, to correspond to the structural superpositions of the proteins. ESPript [67] was used to calculate residue conservation and generate all the images showing sequence alignments. PyMol was used to create all the structure images and to perform the structural superpositions (pair_fit command) [68] . The electrostatic potentials shown on Fig. 4A were calculated with the APBS tool in PyMol [69] , on a scale À5 to þ5, with red depicting negative and blue positive surface potentials. The Intrinsic fluorescence signal expressed by the 350 nm/ 330 nm emission ratio, which increases as the proteins unfold, is plotted as a function of temperature on Fig. 4B . The plots are one of the three independent data collections that were performed for each protein. Functional studies: syn1-mediated cellecell fusion Plasmids pQCXIP-GFP1-10 (Addgene plasmid #68715) and pQCXIP-BSR-GFP11 (Addgene plasmid #68716), encoding a GFP variant residues 1e155 (strands b1e10) and residues 156 to 172 (strand b11), respectively, were gifts from Yutaka Hata and are described in Ref. [51] . The synthetic gene encoding the full-length syn1 (residues 1e538, GenBank accession number Q9UQF0) was codon optimized for protein expression in mammalian cells, cloned into pcDNA3.1 expression vector, and purchased as such from GenScript. The expression construct containing the affinity tag, residues D WSHPQFEK GGSA WSHPQFEK, at the syn1 N-terminus was created using standard molecular biology protocols and is referred to as the ST syn1 (the two strep tags [ST] are highlighted in bold, while the short linker separating them is in italics). The single point mutations were introduced into this construct by the "QuikChange II Site-directed Mutagenesis Kit" from Agilent Technologies. HEK293T platinum-E retroviral packaging cell line ("Plat-E" from Cell Biolabs), HEK293T cells (293T, ATCC), and derivatives were cultured in DMEM medium supplemented with Glutamax (Gibco), 10% fetal bovine serum (FBS), and 1% Penicillin-Streptomycin. Puromycin resistant cells were maintained in the presence of 1 mg/ml puromycin. The ExpiCHO cells (Thermo Fisher®) were grown in ExpiCHO media at 37 C with 8% CO 2 , in 50 ml tubes or flasks, on the orbital shaker InforsHT Celltron at 130 rotations per minute (rpm). The cell-surface expression of the syn1 variants was measured by FACS analysis of nonpermeabilized Expi-CHO cells transfected with the same expression constructs as used for the fusion assays. These vectors were engineered to have the affinity ST at the N-terminus of the protein, thus being accessible for binding to the antibodies used in FACS and immunofluorescence experiments. 293T cells could not be used for FACS analysis because they fused forming syncytia, complicating cell sorting and counting. ExpiCHO cells were transiently transfected with the ST syn1 plasmids, according to the manufacturer's protocol and using ExpifectamineCHO transfection reagent and 2 mg of plasmid DNA per 2 ml of cells diluted to density of 3 Â 10 6 cells/ml. Cells were grown in 50 ml conical tubes (Corning) and shaken in a rack at 37 C with 8% CO 2 at 130 rpm. Six hours posttransfection, the cells were centrifuged for 3 min at 200Âg, washed three times with ExpiCHO media, and left to grow overnight in 2 ml of fresh ExpiCHO media. Eighteen hours later, the cells were counted, pelleted by centrifugation at 200Âg for 3 min, and treated with 1 ml of 4% paraformaldehyde diluted in PBS for 15 min at 4 C. The rest of the staining procedure was performed in round bottom 96-well plates; 100,000 cells were plated per well and in duplicate for each sample. After washing the cells with PBS containing 1% BSA (the washing buffer [WB]), the primary, StrepMAB-Immo antibody (IBA Biosciences, #2-1517-001) that recognizes the SAWSHPQFEK sequence in the ST at the N-terminus of syn1, was incubated for 30 min at room temperature (RT). The antibody was diluted 1:5000 in WB and 0.05% saponin for the samples that required permeabilization (Fig. S3) . After 2 washes with WB, the samples were incubated with a goat anti-Mouse IgG (H þ L) highly cross-adsorbed secondary antibody conjugated to Alexa Fluor 647 (Invitrogen) that was diluted 1:500 in WB, for 30 min at RT, followed by two washes in WB. FACS acquisition on 10,000 cells per sample was performed with the Attune NxT Flow Cytometer (Thermo) for Y347P, L350P, L354P, and M358P variants (Fig. S3A) and on BD FACSCanto™ cytometer (BD Biosciences) for the remaining samples (Fig. 5, Figs. S3B and C) . Two different cytometers had to be used due to technical problems and the unavailability of the latter instrument for experiments involving Y347P, L350P, L354P syn1 variants. All the data were analyzed via FlowJo software. For murine retroviral vector production, the Plat-E retroviral packaging cell line was transfected with pQCXIP-BSR-GFP11 or pQCXIP-GFP1-10 using calcium chloride at 70% confluency. Media were changed 12h after transfection. Retroviral vectors were harvested at three time points (36h, 48h, and 72h after transfection) and ultracentrifuged at 22,000Âg for 1h at 4 C. For lentiviral transduction, 2 Â 10 4 HEK293T cells were resuspended in 150 ml of DMEM-HEPES, and 15 ml of retroviral vector was added. Cells were agitated 30 s every 5 min for 2 h at 37 C in a thermomixer and then transferred to 6-well plates with complete DMEM and incubated for 2 days at 37 C before addition of puromycin. The two cell lines stably expressing the GFP parts are referred to as 293T-GFP 1À 10 and 293T-GFP 11 . Transient transfection of the 293T-GFP 1 À 10 and 293T-GFP 11 cell lines with the ST syn1 constructs was performed using Lipofectamine® 2000 (Invitrogen) following the manufacturer's protocol. Briefly, after a PBS wash, the 293T-GFP 1 À 10 and 293T-GFP 11 cells were detached with PBS-EDTA (Sigma) and resuspended in DMEM medium supplemented with Glutamax, 10% FBS, and 1 mg/ml puromycin. The two cell types were mixed in 1:1 ratio at a final density of 6 Â 10 5 cells/ml, and 100 ml of such cell mixture was seeded in a 96-well cell culture flat-bottom plate (6 Â 10 4 cells/well). The same amount of cells was seeded in Lab-Tek chamber slides (#177402, Thermo-Fisher) for immunofluorescence analysis of syncytin surface expression (Section 'Surface expression of ST syncytin-1 variants analyzed by immunofluorescence'). A transfection mixture containing 50 ml of OptiMEM (Gibco), 100 ng of ST syn1 plasmid DNA, and 0.3 ml of the lipofectamine reagent was prepared for each sample, per well. 50 ml of the mixture was added, and the cells were allowed to fuse for next 24 h when the images were collected on live cells using the "EVOS FL Cell Imaging System" setup to record GFP fluorescence. An ImageJ script created by J.B. for the ImageJ software [70] was applied to (1) convert the images collected on the GFP channel from the *tif format to black and white images, green signal being replaced by black color (Fig. 6) , and (2) to integrate the surface occupied by the black fields and represent it as the percentage of total area, thereby giving what is referred to as "the fusion index." Two fields (1142.84 Â 857.13 mm) per experiment were taken for quantifications. The experiment was repeated twice, and the bar graph shown in Fig. 7A is an average of 4 measurements. The fusion was calculated as follows: the raw data obtained by recording GFP fluorescence (Fig. 6 , left column) were processed so that the green signal (fusion events) was replaced with black color, and the background signal was colored in white, as shown in the middle column. The "fusion index" was calculated as the percent of black area in the total visualized surface. The right column on Fig. 6 represents the same cell population seeded on a separate slide for immunofluorescence staining of the surface-expressed syncytin variants (red staining). The 293T-GFP 1À 10 and 293T-GFP 11 cells were seeded in Lab-Tek chamber slides and transfected with ST syn1 constructs as described in Section 'Cellecell fusion assay (293T cells)'. Twenty-four hours later the cells were fixed in 4% paraformaldehyde in PBS for 15 min. Free aldehydes were blocked by incubation with 40 mM NH 4 Cl in PBS for 5 min. Nonspecific binding was blocked with 1% FBS in PBS containing 0.02% sodium azide for 10 min, followed by 1 h, RT incubation with the StrepMAB-Immo antibody (IBA Lifesciences, Cat. No. 2-1517-001) diluted 3000 times in blocking buffer. After 3 washes in PBS, the secondary antibody, goat antibody against the mouse IgG conjugated with Alexa Fluor 555 (Invitrogen, #A28180), diluted 1:2000 was added and incubated for 1 h. Nuclei were stained with Hoechst stain (Invitrogen) at 1 mg/mL in PBS for 5 min before mounting the coverslips onto 76 Â 26 mm slides (Thermofisher) using Fluoromount-G mounting media (Southern Biotech). Images were taken with a Leica DM LB microscope and analyzed with ImageJ software. In Fig. 7A , surface expression results of ST syn1 variants are represented as the mean of three independent experiments, fusion indices as the mean of four measurements from two independent experiments, in all cases normalized to the value of the positive control (syn1-WT) ▫ SEM. One-way ANOVA and Dunnett's multiple comparisons tests were used to determine the statistical significance of each result in comparison with syn1-WT surface expression or fusion index (*p < 0.05; **p < 0.01; ****p < 0.0001). A p value of <0.05 was accepted as indicative of a statistically significant difference. The fusion index bars are missing from most of the samples because syncytia were not observed. The ****p < 0.0001 values were calculated for the fusion indexes for all the samples and are not plotted on the graph for clarity reasons. Accession numbers: Coordinates and structure factors have been deposited in the Protein Data Bank with the accession numbers 6RX1 and 6RX3, for syn1-and syn2-TM E , respectively. Structural biology Abbreviations used: 6HB, six-helix bundle GFP, green fluorescent protein; HERV, human endogenous retrovirus; HIV, human immunodeficiency virus Marburg virus; MPER, membrane-proximal external region; MPMV, MasonePfizer monkey virus; PDB, protein data bank SEC, size exclusion chromatography; SEM, standard error of the mean value Paleovirology of 'syncytins', retroviral env genes exapted for a role in placentation From ancestral infectious retroviruses to bona fide cellular genes: role of the captured syncytins in placentation Involvement of human endogenous retroviral syncytin-1 in human osteoclast fusion Endogenous retroviruses: acquisition, amplification and taming of genome invaders Initial sequencing and analysis of the human genome Functional conservation of the lncRNA NEAT1 in the ancestrally diverged marsupial lineage: evidence for NEAT1 expression and associated paraspeckle assembly during late gestation in the opossum Monodelphis domestica An endogenous retroviral envelope syncytin and its cognate receptor identified in the viviparous placental Mabuya lizard Classification and characterization of human endogenous retroviruses Use of endogenous retroviral sequences (ERVs) and structural markers for retroviral phylogenetic inference and taxonomy Molecular characterization and placental expression of HERV-W, a new human endogenous retrovirus family Genomewide screening for fusogenic human endogenous retrovirus envelopes identifies syncytin 2, a gene conserved on primate evolution Identification of an envelope protein from the FRD family of human endogenous retroviruses (HERV-FRD) conferring infectivity and functional conservation among simians Pushing the endogenous envelope Common features of enveloped viruses and implications for immunogen design for next-generation vaccines Mechanism of membrane fusion by viral envelope proteins Intersubunit disulfide isomerization controls membrane fusion of human T-cell leukemia virus Env Single-molecule analysis of a viral fusion protein illuminates a fusion-active intermediate state Hydration forces Retrovirus envelope domain at 1.7 angstrom resolution Atomic structure of the ectodomain from HIV-1 gp41 The crystal structure of the SIV gp41 ectodomain at 1.47 A resolution Crystal structures of beta-and gammaretrovirus fusion proteins reveal a role for electrostatic stapling in viral entry Structural characterization of a fusion glycoprotein from a retrovirus that undergoes a hybrid 2-step entry mechanism Crystal structure of human T cell leukemia virus type 1 gp21 ectodomain crystallized as a maltose-binding protein chimera reveals structural evolution of retroviral transmembrane proteins Charge-surrounded pockets and electrostatic interactions with small ions modulate the activity of retroviral fusion proteins The HIV-1 envelope glycoprotein structure: nailing down a moving target How HIV-1 entry mechanism and broadly neutralizing antibodies guide structure-based vaccine design Stabilization of the soluble, cleaved, trimeric form of the envelope glycoprotein complex of human immunodeficiency virus type 1 Crystal structure of a pivotal domain of human syncytin-2, a 40 million years old endogenous retrovirus fusogenic envelope gene captured by primates A simple method for displaying the hydropathic character of a protein A hidden Markov model for predicting transmembrane helices in protein sequences Dali server update Nucleotide sequence of Mason-Pfizer monkey virus: an immunosuppressive D-type retrovirus Functional implications of the human Tlymphotropic virus type 1 transmembrane glycoprotein helical hairpin structure A buried polar interaction can direct the relative orientation of helices in a coiled coil A buried polar interaction imparts structural uniqueness in a designed heterodimeric coiled coil Placental syncytins: genetic disjunction between the fusogenic and immunosuppressive activity of retroviral envelope proteins Similar structural models of the transmembrane proteins of Ebola and avian sarcoma viruses Structural characterization of the glycoprotein GP2 core domain from the CAS virus, a novel arenavirus-like species The secret life of viral entry glycoproteins: moonlighting in immune evasion Crystal structure of the Ebola virus membrane fusion subunit, GP2, from the envelope glycoprotein ectodomain Crystal structure of the Marburg virus GP2 core domain in its postfusion conformation Toremifene interacts with and destabilizes the Ebola virus glycoprotein Structure of the Ebola virus glycoprotein bound to an antibody from a human survivor Structural basis for Marburg virus neutralization by a cross-reactive human antibody Heptad breaks in alphahelical coiled coils: stutters and stammers Analysis of alpha-helical coiled coils with the program TWISTER reveals a structural mechanism for stutter compensation Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen A highly stable prefusion RSV F vaccine derived from structural analysis of the fusion mechanism Structure and immunogenicity of prefusion-stabilized human metapneumovirus F glycoprotein A new cell-based assay to evaluate myogenesis in mouse myoblast C2C12 cells IFITM proteins inhibit placental syncytiotrophoblast formation and promote fetal demise An envelope glycoprotein of the human endogenous retrovirus HERV-W is expressed in the human placenta and fuses cells expressing the type D mammalian retrovirus receptor Functional characterization of the human placental fusogenic membrane protein syncytin 2 Synthesis, assembly, and processing of the Env ERVWE1/syncytin human endogenous retroviral envelope Analysis of cysteine mutations on the transmembrane protein of Moloney murine leukemia virus Endosomal proteolysis of the Ebola virus glycoprotein is necessary for infection Ebola virus entry requires the cholesterol transporter Niemann-Pick C1 Transmission, evolution, and endogenization: lessons learned from recent retroviral invasions, Microbiol ExPASy: the proteomics server for in-depth protein knowledge and analysis Phaser crystallographic software Overview of the CCP4 suite and current developments BUSTER Version 2.8.0, Global Phasing Ltd Features and development of Coot Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega ESPript: analysis of multiple sequence alignments in PostScript The PyMOL Molecular Graphics System PDB2PQR: an automated pipeline for the setup of PoissoneBoltzmann electrostatics calculations NIH Image to ImageJ: 25 years of image analysis This work was funded by the European Research council (ERC "CelCelFus" grant #340371) awarded to F.A.R. and recurrent funding from the Institut Pasteur (IP) and CNRS. We thank Ahmed Haouz and staff at the IP core facility for protein crystallization and staff at the Proxima-2 beamline at the SOLEIL synchrotron for assistance with data collection and processing. We also thank Ignacio Fernandez for helpful discussions and critical reading of the manuscript, Delphine Brun for technical assistance, and Etienne Simon-Loriere for discussions and help with phylogenetic analyses. Supplementary data to this article can be found online at https://doi.org/10.1016/j.jmb.2019.10.020.