key: cord-261388-d56ci0hl authors: Tibbles, K.W; Cavanagh, D; Brown, T.D.K title: Activity of a purified His-tagged 3C-like proteinase from the coronavirus infectious bronchitis virus date: 1999-05-28 journal: Virus Res DOI: 10.1016/s0168-1702(99)00011-8 sha: doc_id: 261388 cord_uid: d56ci0hl Previous studies in vitro of the processing of cloned polyprotein fragments from the coronavirus infectious bronchitis virus (IBV) large open reading frame (ORF1), confirmed the activity of a predicted 3C-like proteinase (3CLP) domain and suggested that the proteinase is released autocatalytically from the polyprotein in the form of a 35 kDa protein, 3CLpro, capable of further cleavages in trans. In order to identify such cleavages within the ORF1 polyprotein mediated by 3CLpro, the proteinase was expressed in bacteria, purified and used in trans cleavage assays with polyprotein fragments lacking the 3CLP domain as targets. The proteinase was expressed as a polyprotein fragment which was able to process during expression in bacterial cells, releasing mature 3CLpro. A histidine (His(6)) tag was introduced close to the C-terminus of the proteinase to aid purification. Processing demonstrated by the tagged proteinase was indistinguishable from that of the wild-type enzyme indicating that the site chosen for the tag was permissive. From these studies we were able to demonstrate trans cleavages consistent with the use of most of the previously predicted or identified sites within the open reading frame of gene 1. This tentatively completes the processing map for the ORF1 region with respect to 3CLpro. The coronavirus infectious bronchitis virus (IBV) encodes the major portion of its non-struc-tural proteins within two large open reading frames, ORFs 1a and 1b, situated at the 5' end and occupying the major portion of the 27.6 kb single-stranded positive-sense RNA genome . The ORFs overlap slightly with ORF1b accessed via programmed ribosomal frameshifting (Brierley et al., 1989) resulting in ORF1a and ORF1a/1b (a fusion) trans-lation products. The large size of the ORFs suggests that the translation products are polyproteins that are processed to generate a set of mature non-structural proteins. In common with all other coronaviruses so far examined, IBV encodes a 3C-like proteinase domain which is located towards the C-terminus of the ORF1a polyprotein and approximately centrally in the ORF1a/1b fusion product. The 3CLP domain was first identified as a possible component of IBV polyproteins (Gorbalenya et al., 1989) due to sequence similarity to picornavirus 3C proteinases (reviewed in Ryan and Flint, 1997) . In the same study, predictions were made as to likely target dipeptide cleavage sites for a 3C-like proteinase within the ORF1 polyproteins, the large number suggested implicating this proteinase in a pivotal role in the maturation of the polyproteins. Similar domains were subsequently identified in mouse hepatitis virus (MHV) (Lee et al., 1991) and human coronavirus (HCV 229E) (Herold et al., 1993) . There are no cellular counterparts for the activities of the 3C-like proteinases, therefore they are essential to the successful maturation of their respective polyproteins and subsequent replication events. This, and the comparatively high degree of conservation (at functional level), makes the 3Clike proteinases attractive targets for antiviral intervention. Given the essential role played by the 3C-like proteinase of the coronaviruses in maturation of the 'polymerase' polyproteins, there has been considerable interest in characterising the enzyme, with initial emphasis on defining the processing events involving the proteinase. Such studies are already at an advanced stage for both IBV (Liu et al., 1994 and HCV 229E ORF1b regions (Grotzinger et al., 1996; Heusipp et al., 1997a,b) . 3C-like proteinase activities have been confirmed recently for three representatives of the coronavirus group, MHV (Lu et al., 1995) , HCV 229E (Ziebuhr et al., 1995) and IBV (Tibbles et al., 1996) and the protein has been identified in vivo during viral infection in the case of HCV (Ziebuhr et al., 1995) and MHV (Lu et al., 1996) . In addition, the proteinases from these two viruses have been purified and partially characterised (Seybert et al., 1997; Ziebuhr et al., 1997) . The identification of the IBV 3CLpro was first realised in vitro when a cloned fragment of the polyprotein was expressed in reticulocyte lysate (Tibbles et al., 1996) . In the presence of membranes, upon which activity was vitally dependent, processing of the region containing the protease domain was observed with the release of a 35 kDa protein (p35). This product was capable of trans processing, but was inefficient and of limited use for mapping studies. We undertook the identification of the remaining IBV 3CLpro cleavage targets within the ORF1 polyprotein using bacterial expression to obtain significant quantities of 3CLpro. The bacterial expression system described utilises the minimum processing unit identified in our previous studies which established that, in addition to the 3CLP domain, MP2 protein sequence between the Q/S 4 cleavage target and a point delimited by an NcoI restriction site (ntd position 10118) was that minimally required for processing. A BamHI fragment released from pKT205 (see below) was introduced into BamHI digested pET3xa (Novagen) to yield pET205 ( Fig. 1 ). This construct would allow expression to be monitored through the synthesis of the strongly expressed T7 gene 10 protein, which composes the major insoluble fraction component upon extraction. Expression beyond the NcoI site was restricted by digestion of pET205 with NcoI followed by end repair and religation. This manipulation introduces a frame-shift in the sequence which results in a translation stop signal being encountered. This plasmid was then used to create an internal deletion between the vector initiator ATG triplet and the IBV sequence by digestion with NheI and BamHI. The newly generated plasmid, pETDN205DC Nco , was modified to include a His-6 tag approximately 35 aminoacid residues from the C-terminus of 3CLpro to aid in purification. A mutation which replaced the amino-acid sequence VKNSQW (single letter code) with six histidine residues near the C-terminus of the 3CLP domain in pKT205 (ntds 9679- Fig. 1 . Schematic representation of the IBV ORF1a/1b polyprotein with positions of predicted cleavage dipeptide targets shown as solid vertical bars, those with arrowheads have been experimentally confirmed. Predicted catalytic domains are also indicated (MP1/2; membrane proteins, 3CLP; 3C-like proteinase, POL; polymerase, Zn; zinc finger, HEL; helicase). The derivation of the eukaryotic and bacterial expression clones and the principal cloning vectors used are depicted. The principal restriction sites used in cloning manipulations (adjacent solid bars), promoters (T7, SP6 and Ptac) and translation (Tln) start sites for the plasmids are also shown. 9696) was made, giving rise to pKT205His. The positioning of this tag sequence was determined by consideration of the possible amino-acid sequences involved in cleavage at the Q/S 4 site and potential for interaction with the affinity matrix based on predicted hydrophilicity (Kyte and Doolittle, 1982) of the proteinase sequence. This modified sequence was transferred to the 205bearing pET plasmid via replacement of a NsiI fragment. The processing properties of the modified proteinase were tested extensively, using a variety of systems, and were found to be indistinguishable from wild-type (data not shown). Therefore the appropriate pET plasmids (Studier et al., 1990) were introduced into E.coli BL21 pLysS cells (Studier, 1991) . Cultures were grown for approximately 3 h until the OD 600 was between 0.6 and 0.9 and expression induced by the introduction of IPTG (at 100 mg/ml) into the growth medium. The cells were harvested after a further 3 h. Cells were lysed, clarified and the supernatant applied to a chelating agarose column charged with cobalt chloride. The use of cobalt instead of the usual nickel resulted in lower binding of background material (data not shown). Alternatively, cultures were induced after 1.5 -2 h (OD 600 0.3 approx.) and rifampicin was added (to 200 mg/ml) after a further hour and incubation continued for 3 h. This resulted in a substantial inhibition of cellular protein expression without affecting ongoing T7-dependent expression. Although this resulted in a lower overall protein yield, there was a higher proportion of 3CLpro and background binding for the metal chelate column selection was significantly reduced. Typical results are shown in Fig. 2 where the appearance of a novel 35 kDa protein was observed after induction (compare lanes 1 and 2, panel A). Clarification of the crude lysate (lane 3) by centrifugation revealed that the major portion of the protein of interest (i.e. 35 kDa) remained in the supernatant (lane 5, compare to lane 4, pellet fraction). Comparison of the total applied protein and the unbound fraction (lanes 5 and 6 respectively) revealed that over 90% of the novel band seen after induction was retained on the column. This putative 3CLproH6 bound with relatively low affinity and was seen to elute at 60mM imidazole (lane 8), conditions normally associated with removing background binding. No observable protein remained to be eluted with 1M imidazole (lane 9). The background contaminating proteins could be significantly reduced by expression in the presence of rifampicin. While this resulted in a reduction in the total protein yield of some 80%, the proportion that was 3CLproH6 was greater and resulted in less background in the purified product (panel B, compare products under normal conditions, lane 1, with those obtained in the presence of rifampicin, lane 2, after single step purification on a cobalt column). Only fractions seen to contain the putative 3CLproH6 were found to cleave target polypeptide targets in a trans assay which supports the identity of the induced product as 3CLproH6 (data not shown). From comparison with known concentration protein standards (BSA) we estimated the yield to be approximately 2-4 mg/l of culture (0.5 -1 mg/l in the presence of rifampicin), which compares favourably with that obtained by Ziebuhr et al., 1997 . Different regions of the ORF1a/1b polyprotein were expressed in reticulocyte lysate (TnT, Promega) or bacterial cell extract (S30, Promega) in the presence of [ 35 S] methionine using the following expression plasmids (see Fig. 1 ). Construction of the pKT205 and pKTBP5 plasmids (encoding IBV sequence from nucleotides (ntd) 8693 -10927 and 10752 -12600 respectively) has been described elsewhere (Liu et al., 1994) . These encode predicted cleavage sites Q/S 3 to Q/G 6 and Q/S 7 to Q/S 9 , respectively, within the C-terminal region of ORF1a. IBV sequence between ntd 16785 and 23060 (Boursnell et al., 1987) , amplified by RT-PCR using a first-strand cDNA synthesis kit (Pharmacia) and Pfu polymerase (Stratagene), was cloned into pCRScript (Stratagene). This plasmid, pCRF4, was digested with EspI and PstI (following end repair of EspI site with DNA polymerase) and the fragment introduced into EcoRV-PstI digested pKT0/NS1 vector to produce pKT1bC (encoding nucleotides 16841 -20926). The polyprotein fragment encoded contains the three predicted 3CLpro cleavage sites Q/G 11 , Q/S 12 and Q/S 13 from the C-terminal region of ORF1b. Cleavage dipeptides were mutated (to G/S or A/S) by oligonucleotide-directed mutagenesis as described previously (Tibbles et al., 1996) . This region was transferred as a SpeI fragment to pGEX4T2 (Pharmacia) and deletions made to express portions containing each target dipeptide individually. The N-terminal half of the ORF1a region (up to nucleotide position 6500 approx.) is encoded by the plasmid pKT1a3, constructed as described previously (Liu et al., 1995) . Analysis of the Q/G 5 site was made using a KpnI deletion of the pET205 plasmid, pET205D Kpn . A HincII fragment (ntd 10438-10902) bearing the Q/G 6 target from pKT205 was introduced into pGEX4T2 to yield pGEXMP2C (resulting in the production of a fusion with glutathione-S-transferrase (GST), Smith and Johnson, 1988) . The region encoding the MP1 region and Q/G 2 was derived from PCR amplification and a fragment (ntds 6919-8693) cloned upstream of the 205 sequence in pKT205/ BP5. Purified proteinase was added in varying amounts to the radiolabelled target fragments and incubated for up to 16 h at room temperature. Cleavage patterns were analysed following polyacrylamide gel electrophoresis and an example of these analyses is shown in Fig. 3 . Complete digestion conditions for the fragment corresponding to the BP5 region of the polyprotein revealed a cleavage pattern that was consistent with the pattern expected from the cleavage predictions made by Gorbalenya et al., 1989 . This target contained the predicted cleavage sites of Q/S 7 , Q/S 8 and Q/S 9 , all of which, along with an additional unpredicted Q/N site, have been experimentally confirmed (Liu et al., 1997; Ng and Liu, 1998) . Cleavage within BP5 would be expected to yield processed products of approximate sizes of 6, 24, 10, 17 and 3 kDa (as they lie on the polyprotein) assuming cleavage at Q/S 7 , Q/N, Q/S 8 and Q/S 9 . In Fig. 3 the effect of processing by the addition of 3CLproh6 can be seen. The uncleaved target (lane 1) consisted of two species (of approximately 60 and 70 kDa) because the sequence contained the frame-shift signal, followed by approximately 300 nucleotides from ORF 1b. Only the two largest cleavage products were resolved in this experiment (lane 2) which correspond to cleavage reinforced by comparing them with those produced after incorporation of [ 35 S] cysteine. The relative intensity of the two bands is reversed reflecting the inverse ratio of the two amino acids in the 24 and 17 kDa products (data not shown). No cleavage was apparent in the absence of 3CLproH6 (data not shown). Cleavage at the predicted Q/S 7 site was tested by mutation of the cleavage dipeptide to an uncleavable form. Failure to cleave at the mutated Q/S 7 site should result in the extension of the N-terminus of the 24 kDa product by 6 kDa resulting in the generation of a new cleavage product of approximately 30 kDa. Comparison of the cleaved wild-type 24 kDa product (lane 2) and the product from the mutated polyprotein fragment (lane 3) revealed a decrease in the mobility of the more intensively labelled product (equating to approximately 30 kDa), consistent with failure to cleave at the mutated Q/S 7 site. Polyprotein fragments constituting the remaining areas of the ORF1 region were synthesised and similarly analysed for cleavage by purified 3CLproH6. The results of all of these trans assay analyses are summarised in Table 1 . We found that all so far identified target dipeptides are utilised by the 3CLproH6 enzyme in vitro. It was at Q/S 7 and Q/N for the largest product of 24 kDa and at Q/S 8 and Q/S 9 for the smaller of 17 kDa. The identities of these two products were noted that under conditions expected to result in complete cleavage, not all target sites were digested to completion. These tended to be in ORF1b. Only three of the predicted sites fail to be cleaved, namely Q/G 2 , Q/G 5 and Q/G 6 and no novel target sites were detected. The first of the uncleaved sites (Q/G 2 ) lies upstream of both the 3CLP and MP1 domains while the latter pair fall within the MP2 region. Either these sites are not utilised by 3CLpro or the conformation of the target fragments bearing them in our assays do not facilitate their recognition by the proteinase. However in the case of the Q/G 6 -bearing GST fusion, cleavage consistent with use of the thrombin site within the fusion junction region was observed (data not shown), which suggests that no gross conformational abnormality exists that might preclude proteolysis. We have used bacterial expression of viral protein, comprising the IBV ORF1a polyprotein fragment that gives rise to 3CLpro via autocatalytic processing in vitro (Tibbles et al., 1996) , to obtain significant amounts of highly purified active proteinase for trans processing studies. Our rationale for modifying the starting pET plasmid containing the 205 sequence was as follows. Expression of hydrophobic proteins from picornaviruses with deleterious effects on bacterial cells has been reported (Lama and Carrasco, 1996) . Therefore termination of expression at the NcoI site within MP2 would still allow processing, according to our in vitro data (Tibbles et al., 1996) , while at the same time minimising the possible toxicity of the highly hydrophobic MP2 region. The gene 10 sequence was deleted in order to promote the production of a soluble polyprotein fragment, equivalent to that which is able to mature in vitro, and so ensure processing would proceed. Evidence suggests that in bacteria folding is post-translational (Netzer and Hartl, 1997) and so there was potential for the proteinase to become inactive due to the characteristic sequestration of pET fusion proteins into inclusion bodies. In the event this latter measure appeared unnecessary as soluble processed 3CLpro was produced by the cells expressing fusion proteins (data not shown). This suggests that folding and processing were completed if not before completion of translation then before the sequestration into inclusion bodies. Thus by whatever means the proteinase matures in its usual eukaryotic context seems to be facilitated in the bacterial cell, though not necessarily by analogous interactions. The precise nature of the interaction between MP2 and cellular components (membranes) necessary for maturation of 3CLpro and its role in the mechanism of 3CLpro processing (maturation or release) remain to be determined. We do not know as yet whether the membrane requirement for maturation of 3CLpro also reflects a need for other associated cellular components such as chaperone functions. The fact that processing is able to proceed in an heterologous environment offers a possible means of investigating this further. In order to simplify purification of 3CLpro, we introduced a histidine (His 6 ) tag which facilitates separation on a suitable metal chelation matrix. The tag could not be placed at the extreme N or C-termini of the protein as is usual since the cleavage dipeptide and surrounding sequences are crucial for correct processing (Ryan and Flint, 1997) . We introduced the tag into the nearest region of predicted hydrophilicity to the C-terminal processing site Q/S4 so as to minimise interference with the sequences involved in processing and allow interaction of the exposed histidine residues with the purification matrix. The tagged proteinase (3CLproH6) appears to be indistinguishable from wild-type and is readily purified to high purity. However, the affinity between modified proteinase and matrix is low and similar to background proteins suggesting that the tag may not be fully accessible. Our observation seems to conflict with the interpretation of a mutational analysis of HCV 229E 3CLpro by Ziebuhr et al., 1997 which concluded that the C-terminus of 3CLpro was important to function and intolerant of mutation. The His 6 -tag mutation here (a substitution) was well tolerated and would therefore appear to indicate a so called 'permissive site' in the C-terminus of the protein. We have shown by trans cleavage analysis that processing occurs at the majority of the predicted 3CLpro cleavage dipeptides within the IBV ORF1 polyprotein and that no further cleavages are apparent. Our data reinforce recently reported in vivo studies and also partially overcome a potential shortcoming of these approaches where some products may not be consistently detectable, due to poor reactivity with antisera or short half life for example. This analysis tentatively completes the processing map of the ORF1 region with respect to 3CLpro. There appear to be no target sites for 3CLpro in the N-terminal half of the ORF1a polyprotein (covered by the 1a3 clone) so that processing of this region and possibly regions upstream is probably achieved through the PLP domain (Liu et al., 1995) , alone or in combination with other as yet unidentified proteinase activities. Those target dipeptides that were not cleaved in our assays are associated with the hydrophobic domains. Cleavage of the MP1 region downstream of 3CLpro may be carried out by the PLP domain or an alternative activity, although failure for 3CLpro to cleave Q/G 2 may reflect a shortcoming in the presentation of the targets. The data presented here also suggest that there is no processing between Q/S 3 (the C-terminus of the 3CLP domain) and Q/S 7 so that the MP2 product containing uncleaved Q/G 5 and Q/G 6 dipeptides would be expected to be approximately 42 kDa. Notably these three hydrophobic region targets are not represented in either MHV or HCV 229E, although, along with IBV itself, these viruses bear a potential target at a position equivalent to just downstream of the IBV Q/G 6 site. These possibilities and the assembly of processed products into polymerase complexes may be addressed using specific antisera raised to amino or carboxyl terminal regions of processing products indicated by the present studies. Completion of the sequence of the genome of the coronavirus avian infectious bronchitis virus Characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an RNA pseudoknot The coronavirus non-structural proteins Coronavirus genome: prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis Characterization of a 105-kDa polypeptide encoded in gene 1 of the human coronavirus HCV 229E Nucleotide sequence of the human coronavirus 229E RNA polymerase locus Identification and subcellular localization of a 41 kDa, polyprotein 1ab processing product in human coronavirus 229E-infected cells Identification of an ATPase activity associated with a 71-kilodalton polypeptide encoded in gene 1 of the human coronavirus 229E A simple method for displaying the hydropathic character of a protein Screening for membrane-permeabilizing mutants of the poliovirus protein 3AB The complete sequence (22 kilobases) of murine coronavirus gene 1 encoding the putative proteases and RNA polymerase A 100-kilodalton polypeptide encoded by open reading frame (ORF) 1b of the coronavirus infectious bronchitis virus is processed by ORF 1a products Proteolytic mapping of the coronavirus infectious bronchitis virus 1b polyprotein: Evidence for the presence of four cleavage sites of the 3C-like proteinase and identification of two novel cleavage products Identification, expression, and processing of an 87-kDa polypeptide encoded by ORF 1a of the coronavirus infectious bronchitis virus Proteolytic processing of the coronavirus infectious bronchitis virus 1a polyprotein: identification of a 10-kilodalton polypeptide and determination of its cleavage sites Intracellular and in vitro-translated 27-kDa proteins contain the 3C-like proteinase activity of the coronavirus MHV-A59 Identification and characterization of a serine-like proteinase of the murine coronavirus MHV-A59 Recombination of protein domains facilitated by co-translational folding in eukaryotes Identification of a 24-kDa polypeptide processed from the coronavirus infectious bronchitis virus 1a polyprotein by the 3C-like proteinase and determination of its cleavage sites Virus-encoded proteinases of the picornavirus super-group Expression and characterization of a recombinant murine coronavirus 3C-like proteinase Single-step purification of polypeptides expressed in Escherichia coli as fusions with glutathione S-transferase Use of bacteriophage T7 lysozyme to improve an inducible T7 expression system Use of T7 RNA polymerase to direct expression of cloned genes Characterization in vitro of an autocatalytic processing activity associated with the predicted 3C-like proteinase domain of the coronavirus avian infectious bronchitis virus A region of the infectious bronchitis virus 1a polyprotein encoding the 3C-like protease domain is subject to rapid turnover when expressed in rabbit reticulocyte lysate Characterization of a human coronavirus (strain 229E) 3C-like proteinase activity Biosynthesis, purification, and characterization of the human coronavirus 229E 3C-like proteinase We thank Ian Brierley for critical reading of the manuscript. This work was supported by the Wellcome Trust, the Biotechnology and Biological Sciences Research Council and the Ministry of Agriculture, Fisheries and Food, UK.