key: cord-0891382-4jg4t8po authors: Parthasarathy, Krupakar; Lu, Huang; Surya, Wahyu; Vararattanavech, Ardcharaporn; Pervushin, Konstantin; Torres, Jaume title: Expression and purification of coronavirus envelope proteins using a modified β-barrel construct date: 2012-07-20 journal: Protein Expr Purif DOI: 10.1016/j.pep.2012.07.005 sha: 00623268d0ce12f11d6b3acb0cc012821f592b93 doc_id: 891382 cord_uid: 4jg4t8po Coronavirus envelope (E) proteins are short (∼100 residues) polypeptides that contain at least one transmembrane (TM) domain and a cluster of 2–3 juxtamembrane cysteines. These proteins are involved in viral morphogenesis and tropism, and their absence leads in some cases to aberrant virions, or to viral attenuation. In common to other viroporins, coronavirus envelope proteins increase membrane permeability to ions. Although an NMR-based model for the TM domain of the E protein in the severe acute respiratory syndrome virus (SARS-CoV E) has been reported, structural data and biophysical studies of full length E proteins are not available because efficient expression and purification methods for these proteins are lacking. Herein we have used a novel fusion protein consisting of a modified β-barrel to purify both wild type and cysteine-less mutants of two representatives of coronavirus E proteins: the shortest (76 residues), from SARS-CoV E, and one of the longest (109 residues), from the infectious bronchitis virus (IBV E). The fusion construct was subsequently cleaved with cyanogen bromide and all polypeptides were obtained with high purity. This is an approach that can be used in other difficult hydrophobic peptides. Envelope (E) proteins are found in coronaviruses (family Coronaviridae, genus Coronavirus [1] ), a group of enveloped viruses that cause common cold in humans and a variety of lethal diseases in birds and mammals [2] . The species in the genus Coronavirus has been organized into three groups [3] . Group 1 includes the porcine Transmissible gastroenteritis virus (TGEV 1 ) and Human coronaviruses 229E (HCoV-229E) or NL63 (HCoV-NL63), Group 2 includes Murine hepatitis virus (MHV) and Human coronavirus OC43 (HCoV-OC43). Group 3 includes the avian Infectious bronchitis virus (IBV) and the Turkey coronavirus (TCoV). The virus responsible for the severe acute respiratory syndrome (SARS-CoV), a respiratory disease in humans [4] , is close to group 2 [5] . Coronavirus E proteins are minor components in virions, with varying lenghts of around $100 amino acids. All of them have at least one predicted a-helical transmembrane (TM) domain followed by a juxtamembrane cluster of 2-3 cysteines. The TM domain is flanked by a short hydrophilic N-terminus ($10 residues) and a C-terminal tail (39-76 residues) . E proteins are not essential for in vitro or in vivo coronavirus replication, but their absence results in attenuated viruses, as shown for SARS-CoV [6] and others [7] [8] [9] [10] [11] [12] . In some coronaviruses, absence of E protein results in deletereous morphological changes in the virion [13] [14] [15] [16] . E proteins has also been shown to have membrane permeabilizing [17] and ion channel [18] [19] [20] activity, although the purpose of this is not known. Biophysical or structural studies so far have been performed using synthetic peptides corresponding to the predicted a-helical TM domain [17, 21, 22] or with synthetic full length protein [23, 24] . In the latter case, however, the peptides were only partially purified and contained truncations or deletions. Thus, despite the evident biological interest of coronavirus E proteins, to our knowledge, none of the envelope coronavirus proteins has been efficiently expressed and purified. Our initial attempts to overexpress SARS-CoV E, using poly-Histidine, GST, ketosteroid isomerase (KSI) [25, 26] or maltose binding protein (MBP) fusion tags were not successful. While the first three resulted in very low expression levels, MBP could not be efficiently purified from SARS-CoV E (results not shown). Thus, we used a novel construct in which the E protein is Nterminally fused to an integral membrane protein, a modified eight strand b-barrel protein derived from the Escherichia coli outer membrane protein A (OmpA) [27] . This b-barrel, which has shortened extramembrane loops, has been previously characterized by NMR [28] , and is referred herein as b-barrel protein platform (BBP). A 6His-tag was added to the N-terminus of BBP to further facilitate purification of the fusion construct. BBP was linked to the N-terminus of the E protein with a flexible linker region which contained the restriction site for factor Xa. Although we obtained high expression levels (20 mg/L), proteolysis with factor Xa was not effective. Thus, the construct 6His-BBP-E was cleaved chemically with cyanogen bromide (CNBr), which cleaves polypeptides at methionine residues [29] [30] [31] . CNBr therefore cannot be used when internal methionines are present in the target protein, which fortunately is not the case for neither SARS-CoV E or IBV E. However, the b-barrel used as fusion protein contains four internal Met residues, and cleavage of the fusion construct with CNBr would produce in this case many fragments that could complicate purification. To avoid this, all four internal BBP Met residues were mutated to Ala (mutant BBP DMet ). To demonstrate that this method is efficient, we have expressed and purified two representatives of coronavirus E proteins, the shortest, SARS-CoV E (76 aa, 8.3 kDa), and the longest, IBV E (108 aa, 12.3 kDa). These two proteins, like other coronavirus envelope proteins, contain several (2-3) juxtamembrane cysteines of as yet unknown function that are conserved. It is known, however, that they can be palmitoylated in vivo [21, 32, 33] , and that removal of these cysteines in MHV E resulted in deformed viruses [34, 35] . The study of the role of these cysteines is therefore an objective in their own right. In addition, these cysteines could potentially produce inter or intra-molecular disulfide bonds, leading to sample heterogeneity. Thus, the potential negative effect of disulfide bond formation in the final samples was examined by also purifying these proteins without cysteines. The latter samples may also be more amenable for structural studies. Both envelope proteins and their cysteine-less mutants could be purified by reverse phase high performance liquid chromatography (RP-HPLC). This method should facilitate future detailed biophysical and structural studies of coronavirus envelope proteins. An additional benefit of this fusion protein is that BBP was originally engineered by inserting additional 12 amino acids in a loop region of the barrel, corresponding to loop 3 of calmodulin from X. laevis. This modified loop binds lanthanides [36, 37] , and the use of BBP can enable measurement of residual dipolar couplings (RDC) in membrane proteins [28] . Constructs pET3a/ 6H BBP DM expression vector was modified from the plasmid pET3a-BBP described in a previous work [28] . The mutated BBP (BBP DM ) was generated with its four internal methionine residues replaced with alanine using the QuickChange Ò SDM kit (Stratagene) with the relevant primers. Additionally, a hexa-His tag was introduced at the N-terminus of BBP to facilitate purification. The forward and reverse primers used to amplify the target gene SARS-CoV E and IBV-E are as follows: SARS-CoV E (NCBI Reference Sequence: NC_004718.3, protein ID NP_828854.1): forward primer 5 0 CAGGATCCATCGAAGGTCGTAT GTACTCATTCG3 0 and reverse primer 5 0 GAGGATCCTCAGACCAGA AGATCAGGAAC3 0 . IBV E (NCBI Reference Sequence: NC_004718.3, protein ID AAU14251): forward primer 5 0 CAGGATCCATCGAAGGTCGTATGT TTAATTTATTCC3 0 and reverse primer 5 0 GAGGATCCTTAGATATCA TCCACCTCTAATAG 3 0 . Both forward and reverse primers contain BamHI restriction sites. Factor Xa cleavage sequence was introduced at the N-terminus of the genes. The amplified inserts were digested with BamHI and then ligated to the similarly digested pET3a/ 6H BBP DM expression vector. The identity of the constructs was confirmed by DNA sequencing. Mutants of SARS-CoV E and IBV-E constructs at cysteine residues were generated using the QuickChange Ò SDM kit (Stratagene). In SARS-CoV E, three cysteine residues at position 40, 43 and 44 (represented by the first, second and third letters, respectively) were mutated to serine or alanine, generating triple Cys mutants (SSS and AAA). In IBV E, the two cysteine residues at positions 44 and 45 were mutated to Alanine, generating a doublecysteine mutant (AA). The plasmid containing the desired construct was transformed into the E. coli strain BL21-CodonPlus (DE3)-RIL (Stratagene). One colony was inoculated into LB media (3 ml) containing 50 lg/ml of ampicillin and 34 lg/ml of chloramphenicol and incubated overnight at 37°C with shaking at 200-225 rpm. The culture was inoculated into 500 ml LB media in a 2.5 L flask containing the same concentrations of antibiotics. Protein expression was induced with 1 mM isopropyl b-d-thiogalactopyranoside (IPTG) at an OD 600 of 0.5-0.6. Expression of IBV AA mutant was much lower than the wild type, and best expression was obtained by using 0.4 mM IPTG. Cells were harvested by centrifugation when cell growth reached its stationary phase. Expression was checked by running the whole-cell lysate on SDS-PAGE. The fusion protein was found in inclusion bodies. For a 0.5 L culture, the cell pellet obtained after centrifugation was first washed with 20 mM Tris-HCl and 5 mM EDTA (TE buffer, pH 8) and then resuspended in 20 ml of 20 mM Tris-HCl and 5 mM EDTA. The resuspended cells was lysed by sonicating for 10-15 min in ice, followed by microfluidization 5-10 times and centrifugation at 8,000 rpm at 4°C for 20 min. The white pellet was collected and resuspended in 20 ml TE buffer (pH 8) containing 2% Triton X-100, shaken at 37°C for 1 h, and centrifuged at 8000 rpm at 4°C for 30 min. The pellet was resuspended in TE buffer, incubated at 37°C for 1 h and centrifuged at 8000 rpm at 4°C for 20 min. The pellet was then resuspended completely in the binding buffer (8 M urea, 5 mM imidazole in 20 mM Tris pH 8.0) and centrifuged at 12,000 rpm at 4°C for 30 min. The protein was allowed to bind overnight onto Ni-NTA resin (Bio-rad) pre-equilibrated with the binding buffer. After incubating, the resin was washed with 5 column volumes of washing buffer (8 M urea, 20 mM imidazole in 20 mM Tris pH 8.0) followed by elution (elution buffer: 8 M urea, 250 mM imidazole in 20 mM Tris pH 8.0). All fractions were analyzed by SDS-PAGE. The desired fraction was dialysed to remove urea, and the precipitate was collected and lyophilized for CNBr cleavage. The same protocol was followed for the expression of the 15 N labeled SARS-CoV E, although with some modifications. Initially, the cells were grown in normal LB medium and after the OD 600 reached 0.6, the cells were harvested by centrifugation and resuspended in minimal media. After two cycles of resuspension and centrifugation, cells were inoculated in M9 minimal medium enriched with 15 N labeled ammonium chloride (1 g/L, >99% 15 N) (Cambridge Isotopes, USA). In addition, the medium was supplied with 1% basic mineral elements solution (Sigma chemicals) which includes trace elements and vitamins. MALDI-TOF/MS (Applied Biosystems) was used for identification of the cleavage products. The protein was solubilized in 60% isopropanol, 35% acetonitrile, 5% water and 0.1% TFA and deposited onto a matrix of 2-(4-hydroxyphenylazo) benzoic acid (HABA). The lyophilized powder of purified construct was solubilized in 80% formic acid (to a few ml). In a different flask, five-fold molar excess of CNBr versus methionine content of the construct was also dissolved in 80% formic acid. After mixing both solutions the reaction proceeded in the dark under anaerobic conditions [38] . After 12-15 h, the solution was diluted and dialysed against water. The precipitate was lyophilized and solubilized by addition of 10 lL of trifluoroacetic acid (TFA). Acetonitrile was added to reach a 99.9:0.1 (acetonitrile to TFA) volume ratio. The solubilized CNBr cleavage products were injected into a Jupiter 5 l C4 300 Å analytical column with a linear gradient of water (Solvent A: 95% H 2 O, 5% acetonitrile, 0.1% TFA), and 2-propanol (Solvent B: 80% 2-propanol, 20% acetonitrile, 0.1% TFA). The presence and purity of SARS-CoV E or IBV E in the HPLC peaks was assessed by MALDI-TOF/MS using a HABA matrix in a 1:1 (v/v) ratio with HPLC fractions. Fractions with peaks matching the molecular weight of the protein were concentrated using a Scanvac centrifuge (Chemoscience) to remove excess organic solvents, diluted with H 2 O and lyophilized. Lyophilized SARS-CoV E or IBV E powder was solubilized in 20-30 ll of 1,1,1,3,3,3-hexafluoro-2-propanol (HFIP), and the concentration of the protein was estimated using its absorbance at 280 nm. SDS solubilizing buffer was added to the lyophilized protein to a final concentration of 1 mg/mL, with or without DTT. After mixing for 5 min, the sample was heated at 65°C for 5 min and loaded. The loading volume was 10 ll (10 lg of peptide). Four percent to 12% NuPAGE Bis-tris gel (Invitrogen) was run at 150 V for 1 h with 2-(N-morpholino)-ethanesulfonic acid (MES)-SDS running buffer. After completion, the SDS-PAGE gel was stained with Coomassie blue. For PFO electrophoresis, peptides were dried from HFIP and dissolved in sample buffer containing 4% PFO to $1 mg/ml. Samples were heated at 65°C for 4 min and loaded on a 4-12% NuPAGE Ò-Bis-tris gel (Invitrogen). The gel was electrophoresed at 80 V for 2-3 h with 2-(N-morpholino)-ethanesulfonic acid (MES) PFO running buffer. After completion, the gel was washed twice with water, followed by Coomassie blue staining. Protein identity was confirmed by tryptic digestion after running SDS-PAGE. The protein band corresponding to SARS-CoV E or IBV E protein was excised and subjected to in-gel trypsin digestion. The gel fragments were diced into small fragments (1 mm 2 ) and washed three times with 200 ll 50% acetonitrile and 25 mM ammonium bicarbonate to remove the Coomassie blue dye. Gel fragments were then soaked in 100% acetonitrile for 10 min. Using the gel loading tip, acetonitrile was removed and residual acetonitrile was removed by Speedvac. Prior to enzymatic digestion, gel fragments were reduced with 10 mM DTT in 25 mM ammonium bicarbonate at 60°C for 30 min. Alkylation was then performed with 55 mM iodoacetamide in 25 mM ammonium bicarbonate for 30 min at room temperature in the dark. The reduced and alkylated gel slices were rehydrated in 20 ll of 50 mM ammonium bicarbonate, pH 8.3, containing 200 ng of trypsin. Once this solution was fully absorbed by the gel slices, enzyme-free ammonium bicarbonate buffer was added until the gel pieces were completely covered. The samples were digested for 18 h at 37°C [39] . After in gel tryptic digestion, the peptides were extracted and analyzed using MALDI-TOF/TOF-MS as described previously [40] . CD spectra were measured on a Chirascan CD Spectrometer (Applied Photophysics, UK). Data were collected at 20°C at 1 nm interval, from 190 nm to 240 nm, on a 1 mm pathlength cell. SARS-CoV E and detergents were first dissolved in HFIP and dried, followed by addition of 1 mM phosphate buffer at pH 7. Final detergent concentrations were: 13 mM SDS and 10 mM DPC. The final protein concentration was approximately 10 lM. Spectra were smoothed with a Salvitsky-Golay algorithm. Infrared spectra were acquired as described previously on a Nicolet Nexus spectrometer (Madison, USA) [20] . The protein was incorporated in multilamellar liposomes by first dissolving a dry mixture of lipid and lyophilized peptide (50:1 M ratio) in HFIP. The solution was dried by a stream of N 2 , and the resulting peptide-lipid film was dissolved in phosphate buffer (1 mM, pH 7) followed by sonication. The compositions of lipid mixtures used were as follows: DMPC alone, DMPC:DMPG (1:1 M ratio) and POPC:SM:CH (2:1:1, w/w/w). The liposome solution was deposited on a germanium internal reflection element (IRE) and dried slowly by evaporation. NMR experiments were performed using a Bruker Avance-II 600 or 700 NMR spectrometers, equipped with cryogenic probes (Bruker BioSpin, GmbH), with three channels and triple resonance probes with shielded z-gradient coils. The carrier position was set to 4.70 ppm for 1 H and 119 ppm for 15 N. Sodium 2,2-dimethyl-2silapentane-5-sulfonate (DSS) was used as internal reference for 1 H nuclei. The chemical shifts of 15 N nuclei were indirectly calculated from the 1 H chemical shifts. SARS-CoV E protein was screened for solubilization in detergents DPC and SDS in phosphate buffer at pH 5.5 at various protein to detergent molar ratios, from 1:100 to 1:150. DTT was also used to reduce possible disulphide bonds. Sedimentation equilibrium experiments were performed at 25°C using a Beckman XL-I analytical ultracentrifuge [41] in the presence or absence of 6 mM of reducing agent tris(2-carboxyethyl)phosphine (TCEP). Absorbance was measured at 230 nm, and for SARS-CoV E we used the estimated extinction coefficient e 230 = 27,000 M À1 cm À1 . The buffer composition was 50 mM Tris, 100 mM NaCl, pH 7.3 and 5 mM C14 betaine (c.m.c., 0.1 mM). To match the density of the detergent, D 2 O was added to the buffer to a final volume ratio of 29.4% [42] . The density-matched buffer was added to the dry peptide. The molar ratio peptide:detergent was 1:25, 1:50 and 1:100 for inner, medium and outer cell, respectively. The samples were centrifuged in three-compartment carbon-epoxy centerpieces with quartz windows for lengths of time sufficient to achieve equilibrium, tested with WinMatch. This typically was achieved $26 h after reaching the speeds at 38,000, 45,000 or 48,000 rpm. Data was processed and analyzed using Sedfit and Sedphat [43] . The monomeric molecular mass of the protein and its partial specific volume were calculated with SEDNTERP [44] after correction for partial hydrogen/deuterium exchange [41] : 8,229.84 Da and 0.7687 cm 3 /g. The log plot (ln A vs. r 2 À r 2 0 ) was not linear, indicating the presence of more than one species at these peptide-to-detergent ratios. The data was fitted to several monomer-Nmer models. For IBV E, the conditions were identical to that of SARS-CoV E except that the concentration of detergent (C14 betaine) was 15 mM. The protein:detergent molar ratio was 1:300, 1:450 and 1:600. The samples were centrifuged at 28,000, 34,500, 42,000 or 48,000 rpm. For the preparation of SARS-CoV E proteoliposomes, the protein was reconstituted into liposomes formed by DPhPC and cholesterol (10:1 M ratio, Avanti Polar Lipids), at a 1:100 protein/lipid (mol/ mol) ratio. Giant unilamellar vesicles were prepared by electroformation on a Nanion Vesicle Prep Pro setup (Nanion Technologies GmbH, Munich, Germany). Electrophysiological recordings were carried out with a Port-a-Patch automated patch clamp system (Nanion Technologies GmbH, Munich, Germany). First, a planar bilayer was obtained by applying 5 ll of GUV solution onto the borosilicate glass chips, $1 lm aperture, in presence of a slightly negative pressure (À20 mbar). The formation of the planar lipid bilayer on the aperture provided a seal resistance of >10 GO. Then, proteoliposomes containing the protein were pipetted onto the planar lipid bilayer. Currents were amplified with Axon Patch 200B amplifier (Axon instruments, USA). The data were acquired by using Axon Digidata 1322A and analyzed with Clampfit (Axon instruments, USA). The experiment was performed in a symmetric solution of 150 mM NaCl, 5 mM HEPES at pH 5.1, varying the voltages across the lipid bilayer. To determine the optimal temperature for expression, yield was tested at 37, 30, 20 and 18°C. For both SARS-CoV E (Fig. 1A) and IBV E (Fig. 1B) , highest expression was observed at 18°C. Elution of the fusion construct containing SARS-CoV E ( Fig. 2A and B) , from a Ni-NTA column with 250 mM imidazole produced a relatively pure sample (Fig. 2C, lanes E1 and E2) . The yield of fusion construct after this step was approximately 20 mg/L of culture. After CNBr cleavage, the HPLC elution profile showed two distinct peaks labeled 1 and 2, respectively (Fig. 3A, insert) , eluting at 38% and 72% of solvent B. MALDI-TOF/MS analysis of peak 1 produced a mass of 18,415.1 Da corresponding to BBP DM (not shown), and peak 2 produced a main peak at 8,295.1 Da (Fig. 3A) , which corresponds to the monomeric SARS-CoV E (calculated mass 8,230 Da). Additional peaks at multiples of the monomeric form were observed, e.g., 16 ,623 Da, likely to represent dimers of SARS-CoV E. The 15 N labeled SARS-CoV E protein was observed at 8,436 Da (see Supplementary File, Fig. S1B ), whereas the calculated molecular weight was 8,316 Da. Like in unlabeled protein, multiples of this peak at 17,016, 25,532 and 34,901 Da were observed, corresponding to higher oligomers. We note that in both labeled and unlabeled protein, not only the observed molecular weight was higher than the calculated one, but the value obtained fluctuated between measurements. For example, in a total of five measurements, the average was 8,339 ± 23 Da for unlabeled SARS-CoV E protein. These alliquotes originated from the same volume, and the machine was properly calibrated. In any case, we have further confirmed the identity of the purified protein by in-gel tryptic digestion and identified the fragments by MALDI-TOF (see Supplementary File, Fig. S2) . The final yield of non-labeled SARS-CoV E protein was 4 mg/L of culture, which is comparable or higher than other methods of expression for this class of proteins [45] , whereas the yield for 15 N-labeled protein was $15 mg/L. For cysteine-less mutants of SARS-CoV E, expression and purification results were similar (not shown), although in this case MALDI-TOF/MS mass spectra produced no multimers ( Fig. 4A and B) . It is thus likely that the presence of multimers in MALDI mass spectra is related to the presence of inter-molecular disulfide bonds. The fusion construct formed by 6His-BBP DM and IBV E ( Fig. 2A and B) , also accumulated in inclusion bodies. The construct was eluted from a Ni-NTA column with 250 mM imidazole (Fig. 2D , lane E1) and the yield after this step was similar to that of SARS-CoV E. The HPLC chromatogram of this sample after CNBr cleavage (Fig. 3B) shows two main peaks. Peak 1 corresponds to BBP DM (not shown), whereas the MALDI-TOF/MS spectrum corresponding to peak 2 produced a peak at 12,196.4 Da corresponding to IBV E (calculated 12,116 Da). In this case, a peak at 6,075.6 Da, resulting from double protonation, was observed. The cysteine less mutant also could be purified (Fig. 4C) . The identity of IBV E was confirmed by tryptic digestion (see Supplementary File, Fig. S3 ). The fraction corresponding to peak 2 in the HPLC chromatograms (Fig. 3, inserts) and the equivalent peaks for other similar chromatograms was analyzed by SDS electrophoresis. Both wild type SARS-CoV E and IBV E contain cysteines, therefore the gels were run with or without reductant (DTT). The monomer of wild type SARS-CoV E migrated with its expected mobility in 4-12% Nu-PAGE ($8 kDa) together with several higher oligomers, and mutants SSS and AAA produced only monomers (Fig. 5A ). In the presence of DTT (Fig. 5B) , wild type SARS-CoV E produced only two bands consistent with monomers and trimers, whereas the mutants were unaffected. The difference between triple mutants and wild type in presence of DTT suggests that cysteine side chains are involved in SARS-CoV E inter-monomeric contacts, but this involvement does not necessarily imply the formation of disulfide bonds. The side chain of cysteine is more hydrophobic [46] than alanine or serine, and the latter residues may lead to different secondary structure propensities relative to cysteine [23, 24] . Wild type IBV E produced only one main band in SDS (Fig. 5C) , consistent with monomers, and only very faint bands corresponding to higher oligomers. Similar bands were observed for the double cysteine mutant AA, with or without DTT, which indicates that IBV E as prepared did not contain significant amount of intermonomeric disulfide bonds. To confirm that the proteins purified are able to form pentameric oligomers, we used perfluorooctanoic acid (PFO) electrophoresis. PFO is a detergent known to facilitate native interactions between transmembrane a-helices [47, 48] . We have shown previously [19] that the synthetic transmembrane domain of SARS-CoV E (SARS-CoV E-TM) forms pentamers in this detergent. However, neither full length wild type SARS-CoV E or the SSS mutant showed any band consistent with pentamers (Fig. 5D) , and produced instead aggregation well above 100 kDa. The AAA mutant, in contrast, showed a clear band consistent with pentamers, $40 kDa (Fig. 6D) . The difference in behavior of this mutant may be due to a change in secondary structure, possibly increased helicity, induced by substitution of Cys to Ala [50] . The presence of DTT did not have any effect on mobility in PFO gels (not shown). In the case of IBV E (Fig. 5E) , both for wild type and AA mutant, a large smear was observed around 66 kDa, which is consistent with a predominantly pentameric form. To further confirm the presence of a pentameric arrangement in these coronavirus envelope proteins, we used a milder, zwitterionic detergent, C14 betaine, in a sedimentation equilibrium (SE) experiment. For wild type full length SARS-CoV E no simple reversible association 'monomer:N-mer' model could be fitted to the data (N was tested from 2 to 15), probably because of the presence of disulfide bonds. However, in the presence of reductant TCEP, the data could be best fitted to a simple monomer-pentamer self-asso-ciation model (Fig. 6A-D) , with log K a = 17.5 ± 0.2, similar to that obtained previously in similar conditions [19] for synthetic peptide TM in similar conditions. This suggests that the main contribution to pentamerization in SARS-CoV E is provided by the TM domain. The fact that we could fit the data in presence of TCEP to a monomer pentamer model demonstrates that disulfide bonds are not required to form pentamers. When we used the cysteine-less mutants (SSS or AAA) or IBV E and mutant AA, the data could still be fitted to a reversible monomer-pentamer equilibrium (not shown). For IBV E a good fit of the data could be obtained without the need of TCEP, consistent with the observation that no disulfide bonds were observed in SDS (Fig. 5C ). Purified SARS-CoV E has channel activity in lipid bilayers, although the film was unstable and recordings could only be obtained at two potentials ( Fig. 6E and F). After confirming that SARS-CoV E is pentameric and functinal for channel activity, we tested with independent techniques if it can be suitable for structural studies and can be properly incorporated in membranes. Thus we obtained CD and FTIR spectra to assess the secondary structure of SARS-CoV E. The CD spectrum of SARS-CoV E protein incorporated in anionic SDS or lipid-like DPC detergent micelles showed characteristic minima at 208 nm and 222 nm ( Fig. 7A) , typical of a-helical conformation. This is confirmed by the infrared amide I band envelope (Fig. 7B ) obtained after reconstitution of SARS-CoV E protein in lipidic membranes, which shows a predominant band centered at $1655 cm À1 , also characteristic of a-helices, although in some cases (e.g., DMPC) a small shoulder was observed consistent with some b-structure. Finally, 2D-HSQC NMR spectra were collected in DPC detergent (Fig. 7C ) or a mixture of DPC/SDS (Fig. 7D) . Although the spectra are not yet of sufficient quality for complete structure determination, the best spectra were obtained using 0.5 mM SARS-CoV E, 100 mM DPC, 100 mM SDS and 40 mM DTT, at a temperature of 328 K (Fig. 7D) . From visual inspection, about 70% of the 1 H-15 N peaks can be detected. Using the triple cysteine mutant did not improve the quality of the spectra (not shown). Nevertheless, the NMR spectrum, combined with CD and FTIR spectra, show that the protein is folded and after further optimization to improve signal intensity and peak dispersion, structural determination should be possible. We have also shown that these proteins form pentameric oligomers, are functional and are likely be properly folded when incorporated in detergent micelles or in lipid bilayers. We stress the fact that (1) this is the first report that shows unequivocally that coronavirus envelope proteins (full length) are pentameric, and (2) that the mass spectroscopy determination of these proteins is by no means routine. Indeed, the fluctuation between measurements observed for SARS-CoV E was not observed previously in TM fragments of SARS-CoV E [18, 19] , in proteolytic fragments after tryptic digestion (Supplementary File), or in similar viroporins, e.g., small hydrophobic (SH) protein from RSV [51] , which do not form large aggregates in SDS and PFO gels. Therefore, we attribute this variability to the high aggregation tendencies of these proteins. Overall, these results show that the two coronavirus E proteins could be expressed and purified using a novel fusion protein consisting of a modified transmembrane b-barrel. This is an approach that can be used to study the structural variations and functional characteristics of this so far unexplored family of membrane proteins. This approach can also be used in other small membrane proteins, as long as they do not contain internal methionines. A comparative sequence analysis to revise the current taxonomy of the family Coronaviridae The Coronaviridae; An Introduction Fig. 7. Structural integrity of purified SARS-CoV E in detergent micelles and lipid bilayers. (A) Circular dichroism spectra of SARS-CoV E in detergent micelles formed by SDS Sample labeled 'no lipid' refers to SARS-CoV E dry after being exposed to HFIP solvent. The deconvolved spectrum corresponding to the sample in DMPC is shown at the bottom (broken line), with the small -structure contribution indicated by an arrow; (C) 2D 1 H-15 N HSQC spectra for 15 N-labeled SARS-CoV E wild type protein in presence of 100 mM DPC and 40 mM DTT at 328 K; (D) same conditions as in (C) but after addition of 100 mM SDS (see details in Materials and Methods) Severe acute respiratory syndrome coronavirus phylogeny: toward consensus Advantages of combined transmembrane topology and signal peptide prediction -the Phobius web server Pathogenicity of severe acute respiratory coronavirus deletion mutants in hACE-2 transgenic mice The production of recombinant infectious DI-particles of a murine coronavirus in the absence of helper virus Infectious bronchitis virus E protein is targeted to the Golgi complex and directs release of virus-like particles Nucleocapsid-independent assembly of coronavirus-like particles by co-expression of viral envelope protein genes Coronavirus pseudo particles formed with recombinant M and E proteins induce alpha interferon synthesis by leukocytes The cytoplasmic tails of infectious bronchitis virus E and M proteins mediate their interaction Efficient assembly and release of SARS coronavirus-like particles by a heterologous expression system Analysis of constructed E gene mutants of mouse hepatitis virus confirms a pivotal role for E protein in coronavirus assembly Heterologous gene expression from transmissible gastroenteritis virus replicon particles Generation of a replicationcompetent, propagation-deficient virus vector based on the transmissible gastroenteritis coronavirus genome Exceptional flexibility in the sequence requirements for coronavirus small envelope protein function Expression of SARS-coronavirus envelope protein in Escherichia coli cells alters membrane permeability Structure and inhibition of the SARS coronavirus envelope protein ion channel Structural flexibility of the pentameric SARS coronavirus envelope protein ion channel Model of a putative pore: the pentameric a-helical bundle of SARS coronavirus E protein in lipid bilayers Biochemical and functional characterization of the membrane association and membrane permeabilizing activity of the severe acute respiratory syndrome coronavirus envelope protein Biochemical evidence for the presence of mixed membrane topologies of the Severe Acute Respiratory Syndrome coronavirus envelope protein expressed in mammalian cells Hexamethylene amiloride blocks E protein ion channels and inhibits coronavirus replication SARS coronavirus E protein forms cation-selective ion channels Production, purification, and cleavage of tandem repeats of recombinant peptides Cloning, expression, isotope labeling, purification, and characterization of bovine antimicrobial peptide, lactophoricin in Escherichia coli Structure of the outer membrane protein A transmembrane domain A minimal transmembrane beta-barrel platform protein studied by nuclear magnetic resonance Reliable expression and purification of highly insoluble transmembrane domains Effective high-throughput overproduction of membrane proteins in Escherichia coli Nonenzymatic cleavage of peptide bonds: the methionine residues in bovine pancreatic ribonuclease The cytoplasmic tail of infectious bronchitis virus E protein directs Golgi targeting Importance of conserved cysteine residues in the coronavirus envelope protein Envelope protein palmitoylations are crucial for murine coronavirus assembly Palmitoylations on murine coronavirus spike proteins are essential for virion assembly and infectivity Tuning the affinity for lanthanides of calcium binding proteins Calcium and lanthanide affinity of the EFloops from the C-terminal domain of calmodulin The influenza virus M2 ion channel protein: probing the structure of the transmembrane domain in intact cells by using engineered disulfide cross-linking In-gel digestion for mass spectrometric characterization of proteins and proteomes One-step procedure for peptide extraction from in-gel digestion sample for mass spectrometric analysis Total chemical synthesis of the integral membrane protein influenza A virus M2: role of its C-terminal domain in tetramer assembly 1 Oligomerization of the integrin aIIbb3: roles of the transmembrane and cytoplasmic domains On the analysis of protein self-association by sedimentation velocity analytical ultracentrifugation Modern applications of analytical ultracentrifugation Strong hydrophobic nature of cysteine residues in proteins PEP-FOLD: an online resource for de novo peptide structure prediction Novel method for evaluation of the oligomeric structure of membrane proteins Oligomerization of a peptide derived from the transmembrane region of the sodium pump gamma subunit: effect of the pathological mutation G41R NPS@: network protein sequence analysis The small hydrophobic protein of the human respiratory syncytial virus forms pentameric ion channels Supplementary data associated with this article can be found, in the online version, at http://dx.doi.org/10.1016/j.pep.2012.07.005.