key: cord-0825003-4yxda0mf authors: Tian, Xinsheng; Feng, Youjun; Zhao, Tiezhu; Peng, Hao; Yan, Jinghua; Qi, Jianxun; Jiang, Fan; Tian, Kegong; Gao, Feng title: Molecular cloning, expression, purification and crystallographic analysis of PRRSV 3CL protease date: 2007-07-28 journal: Acta Crystallographica Section F Structural Biology and Crystallization Communications DOI: 10.1107/s1744309107033234 sha: 2c95e1b07dbd856d0007121257e8435472360888 doc_id: 825003 cord_uid: 4yxda0mf 3CL protease, a viral chymotrypsin-like proteolytic enzyme, plays a pivotal role in the transcription and replication machinery of many RNA viruses, including porcine reproductive and respiratory syndrome virus (PRRSV). In this study, the full-length 3CL protease from PRRSV was cloned and overexpressed in Escherichia coli. Crystallization experiments yielded crystals that diffracted to 2.1 Å resolution and belong to space group C2, with unit-cell parameters a = 112.31, b = 48.34, c = 42.88 Å, β = 109.83°. The Matthews coefficient and the solvent content were calculated to be 2.49 Å(3) Da(−1) and 50.61%, respectively, for one molecule in the asymmetric unit. Porcine reproductive and respiratory syndrome (PRRS), also called 'blue ear' disease, is manifested clinically by severe reproductive failure in sows and respiratory distress in pigs of various ages (Paton et al., 1991) . This disease has become endemic in many countries and has caused great economic losses in the swine industry worldwide (Tian et al., 2007; Dea et al., 2000) . Viruses of the order Nidovirales regulate their genome expression by synthesizing two multidomain precursor polyproteins that are subsequently cleaved into functional viral proteins mainly by the 3CL protease (den Boon et al., 1991; Birtley et al., 2005; Snijder et al., 1996; van Aken et al., 2006) . The functional importance of the 3CL protease in the viral life cycle makes it an attractive target for the development of small-molecule drugs. Several crystal structures of 3CL protease have been solved, e.g. those of human coronavirus 229E (hCoV-229E; Anand et al., 2003) , severe acute respiratory syndrome-related coronavirus (SARS-CoV; Yang et al., 2003) , equine arteritis virus (EAV; Barrette-Ng et al., 2002) and transmissible gastroenteritis virus (TGEV; Anand et al., 2002) . Despite the common structure of the 3CL proteases determined to date, consisting of two -barrels and a unique C-terminal domain, there are many significant differences between them, especially in the linker between the C-terminal domain and the C-terminal -barrel ( Barrette-Ng et al., 2002) . We have initiated a study aimed at determining the threedimensional structure of 3CL proteases in order to understand their function from a structural perspective and to provide a platform for the screening of new drugs. Here, we report the crystallization of the 3CL protease from PRRSV and its preliminary crystallographic analysis. Viral genomic RNA was extracted from marc-145 cell cultures which had been inoculated with the PRRSV isolate (gi|119068009) using the RNeasy Plus Mini Kit (Qiagen, Germany). The reverse transcription (RT) was undertaken using the primers 3CL-F1 (5 0 -CACATGCTTGCTGGTGTTTA-3 0 ) and 3CL-R1 (5 0 -GCCAC-GGTATCGGCAAAAGC-3 0 ). After the single-tube RT reaction, 1 ml of PCR product was used as a template to amplify the coding sequence (CDS) of PRRSV 3CL protease with specific primers 3CL-F2 (5 0 -CCCGGGCGCTTTCAGAACTCAAAA-3 0 ) and 3CL-R2 (5 0 -CCGCTCGAGTTATTCCAGTTCGGGTTTGGC-3 0 ). The amplified DNA fragment encoding the PRRSV 3CL protease was inserted into the GST-fusion expression vector pGEX-6p-1 (GE Healthcare) with SmaI/XhoI sites (introduced by PCR primers). The resultant positive clone, designated pGEX::3CL, was confirmed by double-enzyme digestion and verified by direct DNA sequencing. For protein expression, the plasmid was transformed into Escherichia coli strain BL21 (DE3) competent cells and the single colony was inoculated into Luria-Bertani (LB) medium with 50 mg l À1 ampicillin (Sigma, USA) at 310 K for overnight growth. The overnight culture was then transferred into a flask containing 2 l fresh LB medium. When the culture density (OD 600 ) reached 0.8, the culture was induced with 0.2 mM isopropyl -d-thiogalactopyranoside (IPTG; Sigma, USA) and kept for $3.5 h at 289 K until collection of the bacteria. The harvested culture was centrifuged at 5000 rev min À1 for 12 min at 277 K and the pellet was resuspended in chilled PBS (140 mM NaCl, 2.7 mM KCl, 10 mM Na 2 HPO 4 , 1.8 mM KH 2 PO 4 pH 7.3) and homogenized by sonication. The lysate was centrifuged at 16 000 rev min À1 for 20 min at 277 K and subsequently filtered through a 0.22 mm membrane for clarification. The supernatant was then loaded onto a glutathione-Sepharose 4B column (GE Healthcare). The protein-loaded column was washed with ten bed volumes of PBS and ten bed volumes of cleavage buffer (50 mM Tris-HCl, 150 mM NaCl, 1 mM EDTA, 1 mM DTT pH 7.5). To obtain GST-free protein, GST-3C rhinovirus protease (kindly provided by Drs J. Heath and K. Hudson) was added to the resin at a concentration of 160 units per millilitre of bed volume. After the mixture had been incubated with gentle agitation overnight at 277 K, the target protein was eluted with 10 ml cleavage buffer (Fig. 1) . The harvested protein was then concentrated to a suitable concentration by ultrafiltration (10 kDa cutoff) and loaded onto a Superdex 75 column (GE Healthcare) with an Akta Purifier System (GE Healthcare) at a flow rate of 0.2 ml min À1 . The fractions containing the protein peak were collected and analyzed using 15% SDS-PAGE. For crystallization, the purified protein was concen- trated to approximately 20 mg ml À1 and the buffer was exchanged to 10 mM Tris-HCl, 10 mM NaCl, 1 mM EDTA, 1 mM DTT pH 7.5. The purified 3CL protease after gel filtration was dialyzed against cross-linking buffer (50 mM HEPES pH 8.3, 100 mM NaCl) and concentrated to approximately 5 mg ml À1 by ultrafiltration (10 kDa cutoff). Subsequently, various concentrations (0, 0.5, 1.0, 2.0 mM) of bis-succinimidylsuccinate (EGS; Pierce) were added to the protein sample; the GST protein was used as a positive control (Ma et al., 2005) . The reactions were incubated for 1 h at room temperature and quenched with 50 mM glycine. Finally, the cross-linked samples were analyzed by 15% SDS-PAGE (Fig. 2) . Crystallization trials were performed with Crystal Screens I and II (Hampton Research) at 291 K using the hanging-drop vapourdiffusion method. 1 ml protein solution (at a concentration of 5, 10 or 15 mg ml À1 ) was mixed with 1 ml reservoir solution and equilibrated against 200 ml reservoir solution. X-ray diffraction data were collected in-house on a Rigaku MicroMax007 rotating-anode X-ray generator operated at 40 kV and 20 mA (Cu K; = 1.5418 Å ) equipped with an R-AXIS VII** image-plate detector. The crystal was mounted in a nylon loop and flash-cooled in a cold nitrogen-gas stream at 100 K using an Oxford Cryosystem. The cryoprotectant solution contained 15% glycerol and 50% reservoir solution. Data were indexed, integrated and scaled using DENZO and SCALEPACK as implemented in HKL-2000 (Otwinowski & Minor, 1997) . PRRSV 3CL protease was partially generated in soluble form when expressed in E. coli. Induction at 289 K proved to increase the yield of soluble GST-3CL protease (data not shown). After removing the N-terminal GST tag of the fusion protein, purified 3CL protease was obtained with nine additional residues (GPLGSPEFP) at the N-terminus. With these nine extra residues, the protein product is 213 amino acids in length, corresponding to a molecular weight of about 22.0 kDa. 15% SDS-PAGE showed that the peak for the expressed protein occurs at the position expected for its size (Fig. 1) . The recombinant PRRSV 3CL protease was also confirmed by peptide mass fingerprinting (data not shown). A subsequent chemical crosslinking experiment clearly indicated that the PRRSV 3CL protease was in a monomeric form (Fig. 2) , in comparison to the dimeric form of GST. Crystallization of 3CL protease proved to be easy under several conditions, indicating its stable tightly packed structure. A crystal grown (within 8 d; Fig. 3a ) from a condition consisting of 0.1 M sodium cacodylate trihydrate pH 6.5, 1.4 M sodium acetate trihydrate diffracted X-rays to 2.1 Å (Fig. 3b) and belongs to the monoclinic space group C2, with unit-cell parameters a = 112.31, b = 48.34, c = 42.88 Å , = 109.83 . Data-collection statistics are given in Table 1 . There is only one molecule per asymmetric unit. The Matthews coefficient and the solvent content were calculated to be 2.49 Å 3 Da À1 and 50.61% (Matthews, 1968) , respectively. Initial molecular-replacement calculations were performed using the program AMoRe (Navaza, 1994) using the crystal structure of equine arteritis virus (EAV) 3CL protease as a search model (PDB code 1mbm), but no distinct peaks were obtained. This suggests that there are many differences in the three-dimensional structures of the 3CL proteases from PRRSV and EAV, despite their significant sequence homology (36% identity at the amino-acid level). ) 26.1 † R merge = P hkl P i jI i À hIij= P hkl P i I i , where I i is the intensity of an individual measurement of a reflection and hIi is the mean value for all equivalent measurements of this reflection. Proc. Natl Acad. Sci. USA