key: cord-0005458-omwjjy7z authors: Yeo, Sang-Geon; Hernandez, Mercedes; Krell, Peter J.; Nagy, Éva É title: Cloning and Sequence Analysis of the Spike Gene of Porcine Epidemic Diarrhea Virus Chinju99 date: 2003 journal: Virus Genes DOI: 10.1023/a:1024443112717 sha: 2d20eaea468ae629661015666c551953e8dc0273 doc_id: 5458 cord_uid: omwjjy7z The spike (S) gene of the porcine epidemic diarrhea virus (PEDV) Chinju99 which was previously isolated in Chinju, Korea was cloned and sequenced to aid in the development of genetically engineered vaccines and diagnostic reagents against PEDV. The nucleotide sequence encoding the entire S gene open reading frame (ORF) of Chinju99 was 4152 bases long encoding 1383 amino acids. It consisted of 1001 adenine (24.1%), 849 cytosine (20.4%), 877 guanine (21.1%) and 1425 thymine (34.3%) residues. The Chinju99 S ORF nucleotide sequence was 94.5% homologous with that of the Br1/87 and CV777 strains, respectively. The Chinju99 S protein had 92.8% amino acid identity with that of Br1/87 and CV777, respectively. The amino acid sequence contained 27 potential sites for asparagine (N)-linked glycosylation and there was a stretch of highly hydrophobic residues at position 1325–1350. Since porcine epidemic diarrhea (PED) was first reported in Belgium [1] and the UK [2] in 1978, it has occurred in many countries and causes economic losses in pig farming [3±6] . Pigs in all age groups are susceptible to the disease and show acute diarrhea and dehydration, leading to death with high mortality, especially up to 90% in 1±2 weeks old piglets [7±9] . PED is clinically and pathologically similar to transmissible gastroenteritis (TGE), which obscures the differentiation of the two diseases. Immunity is, therefore, important for the prevention and control of the disease. Moreover, maternal antibodies derived from immunized sows are the sole source for immunity to the disease in highly susceptible neonates during the first few weeks after birth [7, 9] . However, no effective vaccines have been developed for the mucosal immunity in the gut of pigs where the etiologic agent causes severe enteritis [9] . The etiologic agent, porcine epidemic diarrhea virus (PEDV), belongs to the family Coronaviridae of the order Nidovirales and has a genome of positive sense, ss-RNA [7, 10] . The subgenomic mRNAs, which are transcribed from the genome, produce viral proteins such as the spike (S, 180±220 kDa), membrane (M, 27±32 kDa) and nucleocapsid (N, 55±58 kDa) proteins [10±12] . Recently, Pensaert [9] suggested a molecular mass of 85±135 kDa for the S protein. Among the proteins, S, a glycoprotein peplomer on the viral surface, plays an important role in the attachment of viral particles to the receptors of host cells with subsequent penetration into the cells by membrane fusion. The S glycoprotein also stimulates induction of neutralizing antibodies in the host [13] . The S gene is, therefore, an important target for cloning and expression in the development of genetically engineered vaccines. Duarte et al. [14] reported a sequence of 1741 nucleotides in the region between the S and N genes of the British, Br1/87 and Belgian, CV777 strains of PEDV. Duarte and Laude [13] also reported a sequence of 4162 nucleotides including the coding region of the Br1/87 S gene. More recently, the complete S gene sequence has been reported within the complete genome sequence of PEDV CV777 [15] . PED occurs frequently in Korea since the virus was first isolated there [6] , and developmental efforts should target accurate diagnosis and control of the disease. Studies on the nucleotide sequences of the S gene of Korean PEDV isolates have not yet been reported. In the present study, DNA clones of the full-length S gene open reading frame (ORF) of PEDV isolated in Chinju, Korea (PEDV Chinju99) were constructed. The nucleotide and deduced amino acid sequences of the S gene were determined, and further analyzed and aligned with those of other PEDVs to aid in the production of genetically engineered vaccines and diagnostic reagents. Virus PEDV Chinju99, first isolated by the Virology Laboratory of Gyeongsang National University College of Veterinary Medicine, Chinju, Korea from the intestinal tissues of piglets suffering from severe diarrhea (data not shown), was used for cloning of the S gene. The virus was propagated in Vero cells grown in minimal essential medium (MEM) containing streptomycin (100 mg/ml), penicillin (100 U/ml) and trypsin (10 mg/ml) in a 5% CO 2 incubator at 37 C following the methods of Hofmann and Wyler [16] . The medium was removed at 15 h post-infection at the early stage of cytopathic effects such as rounding, degeneration and syncytia formation. Cells were lysed by Trizol 1 reagent (Invitrogen, USA) at 2 ml per tissue culture flask (25 cm 2 ) and homogenized by passing the cell lysate several times through a pipette. Viral RNA was extracted from the homogenate following the manufacturer's suggestions and dissolved in diethyl pyrocarbonate-treated distilled water. Pairs of sense and antisense primers were designed and aligned based on the nucleotide sequence of the Br1/87 S gene [13] from the GenBank data base (National Center for Biotechnology Information, USA). These primers were used to generate cDNA for the S gene. The nucleotide sequences and relative position of the primers are shown in Table 1 and Fig. 1 , respectively. Synthesis of the first-strand cDNA for the S gene was carried out by reverse transcription (RT) using The ds-cDNA for the S gene was synthesized by polymerase chain reaction (PCR) using a reagent kit (Perkin-Elmer, USA). A 10 ml portion of the first-strand cDNA template was added to 5 ml of 10X PCR buffer, 4 ml of 25 mM MgCl 2 , 1 ml of 10 mM dNTP mixture, 1 ml of each 100 pM sense and antisense primers, 1 ml of Taq DNA polymerase (5 U/ml) and brought to 50 ml with distilled water. The PCR was carried out in a thermocycler (Perkin-Elmer) following the program of 2 min at 94 C and 30 cycles of 1 min at 94 C, 1 min at 45±50 C depending on the primers and 1 min at 72 C, and a final extension at 72 C for 5 min. The PCR products were resolved by electrophoresis in 1% agarose gels. The PCR-generated S gene ds-cDNAs were bluntended with Klenow enzyme (2 U) and 1 ml of 0.5 mM dNTPs (Invitrogen) in 20-ml reaction volume and cloned into the SmaI site of pUC19 plasmid DNA by ligation using T4 DNA ligase (1 U) (Invitrogen) [17] . The recombinant plasmid DNAs were transformed into competent Escherichia coli DH5a cells by heat shock for 45 s at 42 C. After adding SOC medium (0.5% yeast extract, 2% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl 2 , 20 mM MgSO 4 , 20 mM glucose), the tube was shaken for 1 h at 220 rpm, 37 C. The transformed cells were plated onto Luria Bertani (LB) agar (Invitrogen) containing ampicillin (50 mg/ml), X-gal (40 mg/ml) and isopropylthio-bgalactoside (20 mg/ml) (Invitrogen) and incubated at 37 C, overnight. Transformed colonies were cultured in LB broth with ampicillin (50 mg/ml) by shaking at 220 rpm, at 37 C, overnight, and were subjected to DNA extraction by alkaline-lysis, restriction enzyme digestion and electrophoresis through 1% agarose gels for identification of recombinant DNA clones. Nucleotide sequencing was done for the S generecombinant DNA clones using Dye Terminator Cycle Sequencing kit (Perkin-Elmer) by the automatic sequencer (ABI prism 377, Advanced Biotechnologies, USA). The sequences of nucleotides and deduced amino acids were analyzed by ClustalW, version 1.82 using In the production of ds-cDNA of the Chinju99 S gene, three overlapping DNA fragments were amplified by RT-PCR using appropriate pairs of sense (SF) and antisense (SR) primers. The DNAs, designated as Sfrag1 (1.5 kb), Sfrag2 (1.8 kb) and Sfrag3 (1.4 kb), were each cloned into pUC19 vector DNAs (Fig. 1) , and sequenced. The nucleotide sequence encoding the entire Chinju99 S gene was 4162 bases in length and contained a single 4152 bases long ORF starting with an initiator ATG at nucleotide 11 and ending with a terminator TGA at 4160. The sequence GUAAAC, found at 8 nucleotides upstream of the ATG, was considered part of the leader sequence preceding the ORF. The coding region of the gene had 229 nucleotide mismatches compared to Br1/87 and CV777, respectively (Fig. 2) . It consisted of 1001 adenine (24.1%), 849 cytosine (20.4%), 877 guanine (21.1%) and 1425 thymine (34.3%) nucleotides, and a GC content of 41.5%. The gene showed 94.5% nucleotide sequence homology to that of Br1/87 and CV777, respectively. The putative translation product of the PEDV Chinju99 S gene consisted of 1383 amino acids starting at the ATG at nucleotide 11 and ending at the terminator TGA at nucleotide 4160. There were 27 potential asparagine (N)-linked glycosylation sites recognized in the protein. The Chinju99 S protein, had 100 and 99 amino acid mismatches compared to those of Br1/87 and CV777, respectively (Fig. 3) . There was also a stretch of highly hydrophobic residues at position 1325±1348 (>1.6 on the Kyte±Doolittle scale) (Fig. 4) . The Chinju99 S protein showed 92.8% amino acid sequence identity with that of Br1/87 and CV777, respectively. The S gene of the Korean Chinju99 isolate of PEDV was cloned as a series of three overlapping cDNA clones, and the resulting sequence data revealed a single large ORF of 4152 nucleotides which encodes 1383 amino acids to the terminator TGA. Previous studies showed the same lengths for the nucleotide and deduced amino acid sequences of the S gene in Br1/87 [13] and CV777. A single ORF of 4149 nucleotides was identified with the potential to produce a coronavirus S protein [18] . The Chinju99 S gene also had a sequence GUAAAC at 8 nucleotides upstream of the initiator ATG as previously recognized in Br1/87 [13] . This is a common hexameric motif in coronaviruses and similar to the motifs XUA(A/G)AC found adjacent to other PEDV ORFs [12] . The hexameric motifs were suggested as the site for transcription of the subgenomic mRNAs [18] . The Chinju99 S protein contained 27 potential N-linked glycosylation sites and a region of hydrophobic residues at position 1325±1348 was assumed to function as a membrane anchor. Similarly, the Br1/87 S protein [13] had 29 potential N-linked glycosylation sites and a hydrophobic stretch at 1322±1337. Consequently, the conformational features of the Chinju99 S protein are probably highly conserved with those of Br1/87. Nevertheless, the Chinju99 S gene showed 94.5% nucleotide sequence homology and 92.8% amino acid sequence identity with those of both Br1/87 and CV777, respectively, although the ratios of adenine (24.0±24.1%), cytosine (20.4%), guanine (21.1%) and thymine (34.3±34.4%) were identical or very similar among the three strains. Accordingly, the Chinju99 S protein had only minimal differences in overall structure compared to that of Br1/87 and CV777. Nevertheless the minor differences in the structural features of the S protein could help in elucidating aspects related to both molecular pathogenesis and antigenic structures of PEDV isolates in relation to prevention and control of the disease. This is the first published report on the nucleotide sequences of the S gene of any Korean PEDV isolate. In the present study, the complete nucleotide sequence of the Chinju99 S gene was determined, and its amino acid composition was analyzed in comparison to other PEDVs. The nucleotide sequence of the Chinju99 S gene can form the basis for further studies on the development of genetically engineered vaccines and diagnostic reagents for PEDV isolates in Korea. Hagan and Bruner's Microbiology and Infectious Diseases of Domestic Animals Diseases of Swine Short Protocols in Molecular Biology This study was supported by a grant (No. 981-0613-065-2) from the Korea Science and Engineering Foundation (KOSEF), Ministry of Science, Korea.