key: cord-0722087-7y5i2w8x authors: Das, Dipankar; Suresh, Mavanur R. title: Copious production of SARS-CoV nucleocapsid protein employing codon optimized synthetic gene date: 2006-08-09 journal: J Virol Methods DOI: 10.1016/j.jviromet.2006.06.029 sha: a2a391086724c20993fec522863d63007a362744 doc_id: 722087 cord_uid: 7y5i2w8x The severe acute respiratory syndrome coronavirus (SARS-CoV) nucleocapsid protein (NP) is one of the predominant antigenic protein and the most abundant shed antigen throughout the SARS-CoV infection. This feature makes it a suitable molecular target for diagnostic applications. In this study the full length codon optimized NP gene and its subfragment gene segment was cloned in a bacterial expression vector. The full length NP could be expressed in E. coli at very high level within inclusion bodies. The inclusion bodies were successfully solubilized, purified under denaturing conditions employing IMAC column and refolded. The non-glycosylated NP was used to immunize mice for hybridoma development. The polyclonal antiserum from animals immunized with this recombinant NP protein was found to specifically recognize the NP and its subfragments, thus demonstrating the immunogenic nature of the recombinant protein. The NP antigen or a subfragment could be useful for developing a sensitive serum diagnostic assay to monitor SARS-CoV outbreaks by detecting the early human anti-SARS antibodies. In addition, the availability of the NP fragments could facilitate epitope mapping of anti-NP monoclonals for identifying suitable sandwich pairs. Severe acute respiratory syndrome (SARS), a new infectious disease caused by SARS-CoV, was first recognized in China in [2002] [2003] . Similar outbreaks occurred in Hong Kong, Singapore and Toronto making them the initial hot zone of SARS. According to World Health Organization, the outbreak of SARS epidemic in 2002-2003 infected over 8400 people globally resulting in deaths of over 900 individuals. Coronaviruses are the RNA viruses containing positive-sense single stranded RNA genome of 27-32 kb. Analysis of the nucleotide sequences of the novel SARS-CoV showed that the viral genome is nearly 30 kb in length and contains 14 potential open reading frames flanked by 5 and 3 untranslational region. SARS-CoV contains four major structural gene open reading frames including spike (S), membrane (M), envelope (E) and nucleocapsid protein (NP) and a set of accessory proteins whose number and sequence vary among different coronaviruses. The continual lack of a rapid sensitive antigen test to assist in the diagnosis of suspected cases and in turn of probable cases of SARS makes this area a pri- * Corresponding author. ority for further research efforts to develop inexpensive point of care rapid diagnostics. The three major diagnostic methods that are currently available include viral RNA detection using RT-PCR (Jiang et al., 2004) , detection of antibody by indirect fluorescence assay and NP or culture extract of SARS-CoV-based enzyme linked immunosorbent assay . The NP is the most predominant virus derived protein throughout the infection, because the NP mRNA levels are amplified 3-10 times higher at 12 h post-infection (Hiscox et al., 1995) compared to other structural genes. This feature makes it a suitable candidate for developing monoclonal antibodies for SARS NP antigen diagnostics as well as detection of patient anti-NP antibodies. In the literature many recombinant viral proteins have been expressed with immunodominant epitopes and successfully used as antigens for diagnostics of infection (Konishi et al., 1996; Sohn et al., 1994; Zoller et al., 1993) . Rapid diagnosis of any disease can lead to early therapeutic intervention. In many viral diseases, virus shedding is greatest during the early symptomatic phase. The detection of viral RNA by RT-PCR is very sensitive and expensive. The whole process of extraction of mRNA from different specimens can be labor intensive and relies on the special technical expertise (Tan et al., 2004) . In addition, false positive results may also result from cross-contamination. ELISA based sandwich antigen detection test by dual monoclonal antibodies are known to have high specificity and reproducibility. The results of RT-PCR can be complemented and improved by using antigen detection capture ELISA method. An early capture ELISA was achieved with a polyclonal antibody and a mouse monoclonal antibody which detects at least two spatially separated epitopes on the NP antigen . Polyclonal antibody has inherent specificity limitations and hence the availability of suitable monoclonal antibodies is essential for the development of an immunological diagnostic tool. Such antigen capture assays by ELISA has demonstrated that 52% patients have detectable levels of NP from nasopharyngeal samples and 55% patients showed NP positive in their stool samples (Lau et al., 2004) . In addition NP can be detected from nasal, urinary and fecal samples. It has been shown that nearly 89% of SARS patients have anti-NP antibodies during infection using both native and bacterially produced NP antigen in an immunoassay . It has also been reported that anti-NP monoclonal antibodies has been used to establish a sensitive antigen capture ELISA for the detection of shed antigen from the SARS infected patients serum samples (Che et al., 2004) . Several other groups (Chang et al., 2004; He et al., 2004; Lin et al., 2003) have also reported the detection of anti-NP antibodies in SARS patients using NP or its fragments as a capture antigen. However, detection of pathogen specific antigens is desirable since it provides information's on current infective states unlike detection of anti-pathogen antibodies which can persist in the body long after clearance of any infection. Our initial objective was cloning and expression of codon optimized NP gene and its subfragments in E. coli to develop monoclonal and bispecific antibodies by hybridoma technology together with epitope mapping with the NP fragments. Here we report the successful cloning and abundant expression of the full length NP and its subfragment domains employing a prokaryotic codon optimized synthetic NP gene. We exploited the common E. coli bacterial expression system for the production of NP. However, since it is a prokaryotic based system, heterologously expressed eukaryotic proteins are not post-translationally modified properly. Further, it is often difficult to facilitate the secretion of large amounts of expressed protein into the culture medium. In addition, proteins expressed in large amounts tend to aggregate, forming inclusion bodies which present an advantage for the purification of expressed protein. To enhance the expression level of a foreign gene in a particular expression system it is very important to adjust the usage of codon frequency of the gene to match that of the host expression platform. Appropriate codon usage has a significant impact on heterologous gene expression since codon preference among different species could be widely different. Rarely used codons in a foreign gene can lead to poorly translated mRNA, decreased mRNA stability, premature translation and even misincorporation of amino acids. This can be mitigated by supplementing the gene expression host with the rare codon tRNAs that are otherwise infrequent in the host organism. Alternatively, foreign genes can be optimized for codon usage appropriate to the host translational machinery. Different bioinformatics tools are available for the optimization of codon usage considering many factors such as RNA secondary structure, GC content, repetitive codons and rare codons manifested in commonly used prokaryotic and eukaryotic hosts (Grote et al., 2005) . The NP nucleotide sequences of SARS-CoV were codon optimized for E. coli expression and chemically synthesized from GENEART, Germany. The codon optimized NP gene and fragments (NP1.1, aa 1-140; NP1.2, aa 141-280; NP1.3, aa 281-422) were PCR amplified and cloned in the correct reading frame in pBM802 vector with the His 6 tag at the C-terminal for high level expression of proteins within inclusion bodies of E. coli. The recombinant clones upon analysis by restriction digestion fragment mapping showed the right size clones and were selected for protein expression. Similarly, the plasmids containing NP subfragments were also isolated for expression. Results showed that all the selected NP full length clones expressed the target protein of approximately 46 kDa at different levels when analysed by SDS-PAGE and one clone with the best expression is shown in Fig. 1A along with the Western blot when probed with anti-His 6 MAb (Fig. 1B) . All the NP subfragments were also expressing the right size protein when analysed by SDS-PAGE (Fig. 1C) and confirmed by Western blot when probed with anti-His 6 MAb (Fig. 1D) . The expression of the recombinant NP and its three subfragments appeared almost exclusively within the inclusion bodies thus facilitating the purification. The best NP clone was selected for medium scale bacterial expression (2-4 L) and purification. The final yield of purified full length NP antigen was estimated by Bradford protein assay to be approximately 30-55 mg/L of culture which is a log fold higher than a previous report using the native viral NP gene sequence which provided a yield of 3 mg/L (Lau et al., 2004) . Isolation and purification of inclusion bodies in this study involved four different steps which includes isolation of inclusion bodies from E. coli cells, solubilization, affinity purification and refolding of the purified protein. The final purification exploited immobilized metal-affinity chromatography (IMAC) under denaturing conditions to adsorb the His-tagged protein and subsequently elute the pure NP. Most recombinant proteins are expressed in E. coli within inclusion bodies and different refolding methods have been reported to renature proteins from inclusion bodies (Das et al., 2004) . The protein concentration plays a crucial role in refolding conditions and it was observed that when the concentration was 100 g/mL or above, aggregation was evident. We observed that during extraction of inclusion bodies, IMAC purification and upon storage at 4 • C, NP exhibited some degradation which was also reported previously (Tang et al., 2005) . This is due to the low stability of NP of SARS-CoV since it is the only SARS-CoV protein which contains no Cys residues (Cao et al., 2005) . It is also reported that SARS virus looses its infectious capacity at 56 • C and the low stability of the NP may be at least one of the important determinants for the thermosensitivity of SARS virus (Duan et al., 2003) . The refolded NP antigen was used to immunize mice to develop several monoclonal antibodies. The polyclonal antibodies from mouse serum were strongly reacting with the NP recombinant antigen and its three subfragments in Western blot. The three different NP subfragments were also expressed in high levels and could be useful for epitope mapping of different monoclonals currently under development. These NP fragments will be useful to select a pair of monoclonal antibodies with nonoverlapping specificities for efficient sandwich formation. In addition, the NP antigen or a subfragment based ELISA could be the basis of developing a sensitive assay to monitor SARS-CoV outbreaks by detecting the early human anti-SARS antibodies. In conclusion, we have successfully generated significant amounts of the recombinant NP antigen and this method could provide a safe and valuable inexpensive resource for early SARS diagnostics. Yeast expression and characterization of SARS-CoV N protein Immunofluorescence assay for serologic diagnosis of SARS Antibody detection of SARS-CoV spike and nucleocapsid protein Sensitive and specific monoclonal antibody-based capture enzyme immunoassay for detection of nucleocapsid antigen in sera from patients with severe acute respiratory syndrome Development of a biotin mimic tagged ScFv antibody against western equine encephalitis virus: bacterial expression and refolding Stability of SARS coronavirus in human specimens and environment and its sensitivity to heating and UV irradiation JCat: a novel tool to adapt codon usage of a target gene to its potential expression host Development of a Western blot assay for detection of antibodies against coronavirus causing severe acute respiratory syndrome Quantification of individual subgenomic mRNA species during replication of the coronavirus transmissible gastroenteritis virus Sensitive and quantitative detection of severe acute respiratory syndrome coronavirus infection by real-time nested polymerase chain reaction Enzyme-linked immunosorbent assay using recombinant antigens for serodiagnosis of Japanese encephalitis Detection of severe acute respiratory syndrome (SARS) coronavirus nucleocapsid protein in sars patients by enzyme-linked immunosorbent assay Antibody response of patients with severe acute respiratory syndrome (SARS) targets the viral nucleocapsid Identification of an epitope of SARS-coronavirus nucleocapsid protein Diagnosis of severe acute respiratory syndrome (SARS) by detection of SARS coronavirus nucleocapsid antibodies in an antigen-capturing enzymelinked immunosorbent assay Overexpression and simple purification of human immunodeficiency virus-1 gag epitope derived from a recombinant antigen in E. coli and its use in ELISA Profiles of antibody responses against severe acute respiratory syndrome coronavirus recombinant proteins and their potential use as diagnostic markers Biochemical and immunological studies of nucleocapsid proteins of severe acute respiratory syndrome and 229E human coronaviruses Low stability of nucleocapsid protein in SARS virus A novel mu-capture enzyme-linked immunosorbent assay based on recombinant proteins for sensitive and specific diagnosis of hemorrhagic fever with renal syndrome This work was supported by the NIH grant # 5U01AI061233-02. Thanks to Dr. Sandra Marcus for providing the French press facility, Department of Chemistry, University of Alberta, Edmonton, Alta., Canada.