key: cord-0885048-8zwisqc0
authors: Chen, Jian-Fei; Sun, Dong-Bo; Wang, Cheng-Bao; Shi, Hong-Yan; Cui, Xiao-Chen; Liu, Sheng-Wang; Qiu, Hua-Ji; Feng, Li
title: Molecular characterization and phylogenetic analysis of membrane protein genes of porcine epidemic diarrhea virus isolates in China
date: 2008-01-24
journal: Virus Genes
DOI: 10.1007/s11262-007-0196-7
sha: 65aa3582228494a073cd4cd7f8879ee14ca79a1c
doc_id: 885048
cord_uid: 8zwisqc0

Six porcine epidemic diarrhea viruses (PEDVs) were isolated from the fecal samples of piglets infected with PEDV in 2006 in China. The membrane (M) protein genes of six PEDV isolates were amplified by reverse transcriptase-polymerase chain reaction (RT-PCR), then cloned, sequenced, and compared with each other as well as those ten PEDV reference strains. The M protein genes of six Chinese PEDV isolates consisted of 692 nucleotides containing a single open reading frame (ORF) of 681 nucleotides, which encoded a 226aa-long peptide. The conserved intergenic motif (ATAAAC), as previously recognized in Br1/87, was found in the 5 nucleotides upstream of the initiator ATG of M protein genes of six Chinese PEDV isolates. The hexamer motif was also found in CV777, JMe2, LZC, and QH. The M protein of six isolates had three main transmembrane domains (aa20–38, aa43–65, aa75–97). The M protein of one isolate, CH/IMT/06, had one potential glycosylation site, but those of the other five isolates had two. The glycosylation sequence Asn-Phe-Thr was highly conserved in the M proteins of six PEDV isolates. The six PEDV isolates showed nucleotide sequence homology between 98.8 and 100% and deduced amino acid sequence homology between 98.2 and 100% with each other. The nucleotide and amino acid identity of M protein genes between the six PEDV isolates and ten reference PEDV strains varied from 97.2 to 99.4% and 96.9 to 100%, respectively. On the basis of the phylogenetic relationship of M protein genes, six Chinese PEDV isolates composed of a separate cluster including one Chinese strain JS-2004-02, however, not including the Chinese strain LJB/03. These results demonstrated that there was a new genotype of PEDV prevailing in China.

Porcine epidemic diarrhea (PED), caused by porcine epidemic diarrhea virus (PEDV), is an acute and highly contagious enteric disease in swine. The disease is characterized by severe enteritis and watery diarrhea, leading to death with a mortality rate of up to 80%. PED was first reported in England in 1971 [1] , since then it has been reported in many other swine-producing countries [2] . In China, PED was first reported in 1976 [3] . So far, it has been prevailing in many provinces and become one of the most important viral diarrhea diseases in China, resulting in heavy economic losses in swine industry.

PEDV was first reported in Belgium and the United Kingdom in 1978 [4, 5] . It is a member of Group 1a, genus Coronavirus, family Coronaviridae within the Order Nidovirales. PEDV is an enveloped, single-stranded, positivesense RNA virus. The genome of it is approximately 28 kb in length and encodes four mainly structural proteins, which are the spike (S) protein, the membrane (M) protein, the envelope (E) protein, and the nucleocapsid (N) protein [6] [7] [8] .

PEDV occurs frequently in swine-raising farms in spite of using the vaccine strategy at present, damage caused by PEDV infection is continuous and serious in many provinces in China. According to an accurate analysis on genome of prevalent PEDVs, we can prevent outbreaks of the PEDV-induced diarrhea more effectively and choose the correct PEDV vaccine strain. By the phylogenetic analysis of the partial spike glycoprotein genes, the Korean PEDV isolates were divided into three groups (G1, G2, G3), which had three subgroups (G1-1, G1-2, G1-3) [9] . Like S glycoprotein gene, M protein gene is also an important component in the phylogenetic analysis of the epidemiological situation of PEDV in the field. On the basis of sequence analysis of M protein gene, Korean PEDV isolates were shown to be different from foreign PEDV strains [10] . Therefore, the purpose of the present study was to investigate the diversity among the Chinese PEDV isolates and was to find out more prevalent PEDVs in China by the sequence analysis and the phylogenetic relationship of the M protein genes with Chinese PEDV isolates and reference PEDV strains.

Porcine fecal samples had been taken from piglets showing watery diarrhea and dehydration on six different swineraising farms of five provinces in China. These fecal samples had been confirmed positive for PEDV by Anigen Rapid PED Ag Test Kit (Animal Genetics Inc., Korea). PEDV positive fecal samples were diluted with phosphatebuffered saline (PBS; 0.1 M, pH 7.2) to be 10% (V/V) suspensions. The suspensions were vortexed and clarified by centrifugation for 10 min at 5,000 rpm. The supernatants were collected for the amplification of M protein gene by RT-PCR. The isolates, used for nucleotide sequence analysis, amino acid sequence analysis and phylogenetic analysis in this study, were described in Table 1 .

The supernatant (200 ll) containing PEDV was mixed with 400 ll TRIzol Reagent (Invitrogen Corp., Carlsbad, CA) and RNA was isolated according to the manufacturer's instructions. The RNA was air-dried for 10 min and redissolved in 15 ll diethyl-pyrocarbonate (DEPC)-treated deionized water.

Published primers [11] designed based on the published sequence of M gene were used for generating the full M genes of PEDV isolates. Briefly, P1 (5 0 -CCCCAGTACTG TTATTGACGTATAAAC-3 0 ) and P2 (5 0 -GTTTAGACT AAATGAAGCACTTTC-3 0 ) were used for PCR to amplify the M gene. The size of amplified products was 715 bp. P3 (5 0 -GCCATAAAGTTTCTGTTTAGACTAA-3 0 ) was used as the primer for the synthesis of complementary DNA.

For reverse transcription, 15 ll of extracted viral RNA was mixed with 1 ll P3 and incubated at 70°C for 10 min followed by 2 min on ice. After adding a reaction mixture consisting of 5 ll 59 First Strand Buffer (Fermentas International Inc., Ltd., USA), 2 ll 2.5 mM dNTPs (Ta-KaRa Biotechnology Co., Ltd., Dalian, China), 100 units of M-MLV reverse transcriptase (Fermentas International Inc., Ltd., USA), 20 units RNAsin (TaKaRa Biotechnology Co., Ltd., Dalian, China), the mixture was incubated at 37°C for 2 h. The reaction was terminated by heating at 98°C for 6 min and chilling on ice. The cDNA was either stored at -70°C or amplified immediately.

In PCR, P1 and P2 were used to amplify the M genes of six isolates. Exactly, 1 ll cDNA was mixed with a reaction The amplification was performed with 30 cycles of denaturation at 95°C for 1 min, annealing at 50°C for 1 min, and extension at 72°C for 1 min, followed by a final extension step of 72°C for 10 min. PCR products were analyzed on 1% agarose gels and were purified using AxyPrep TM Gel Extraction Kit (Axygen Scientific, Inc., USA) according to the manufacturer's instructions.

Purified PCR products were cloned into a T-tailed vector, pMD18-T (TaKaRa Biotechnology Co., Ltd., Dalian, China) and transformed using JM109 competent cells (TaKaRa Biotechnology Co., Ltd., Dalian, China) according to the manufacturer's instructions. Exactly, for cloning of cDNA, 9 ll of purified PCR products, 1 ll of pMD18-T (50 ng/ll), and 10 ll of 29 Solution I were mixed gently and incubated for 6 h at 16°C. The ligation-reaction mixture was then subjected to the transformation protocol, which renders cells competent through heat-shock. For transformation, a tube of 100 ll JM109 competent cells was thawed on ice following which 20 ll of ligationreaction mixture was added to the tube, mixed gently and incubated on ice for 30 min. The tube was heated in a 42°C water bath for 60 s and incubated on ice for 2 min immediately, then 800 ll room temperature SOC medium was added to the tube and mixed gently, incubated at 37°C for 1 h with vigorous shaking. 200 ll of each transformation mixture was immediately plated onto LB agar plate containing ampicillin. The plate was incubated at room temperature until the transformation mixture had been absorbed into the agar, following which it was inverted and then incubated at 37°C overnight. Colonies grown in LB agar plate were cultured in LB medium with vigorous shaking at 37°C overnight, and DNA was extracted using the AxyPrep TM Plasmid Miniprep Kit (Axygen Scientific, Inc., USA). PCR and restriction enzyme digestion, with enzymes such as EcoRI, followed by electrophoresis through a 1.0% agarose gels was employed for identification of recombinant DNA clones. Three independent DNA clones of each isolate were sequenced by using M13 sequencing primers by Shanghai Sangon Biological Engineering Technology & Services CO., Ltd. (China). All sequencing reactions were performed in duplicate and all sequences were confirmed by sequencing both strands.

Nucleotide and deduced amino acid sequences of six PEDV isolates were aligned, edited and analyzed with DNAMAN software (Version 5.2.2, Lynnon BioSoft. Inc., USA), and MegAlign software of Lasergene (version 7.1.0, DNASTAR Inc., USA). A phylogenetic tree was then generated using an alignment of M glycoprotein gene nucleotide sequences with reference PEDV strains by Jotun Hein Method in MegAlign software. The reference strains used for sequence alignment, sequence analysis, and phylogenetic analysis with Chinese PEDV isolates were described in Table 2 .

The M protein genes of six Chinese PEDV isolates consisted of 692 nucleotides containing a single open reading frame (ORF) of 681 nucleotides, which encoded a 226aalong peptide. There were no nucleotides deletion or insertion in the ORFs of six isolates. Six isolates had a conserved intergenic motif (ATAAAC) of 5 nucleotides upstream of the initiator ATG, as previously recognized in Br1/87 [11] . The hexamer motif was also found in CV777, JMe2, LZC, and QH. There was a five-nucleotide conserved sequence (GAAAT) between the conserved intergenic motif and the initiator ATG of M protein genes of six isolates. The five-nucleotide conserved sequence was found in the corresponding regions in CV777, JMe2, LZC, and QH, too. Compared to PEDV CV777, CH/JSX/06 had 4 nucleotide mutations, CH/IMT/06 had 6 nucleotide mutations, CH/HLJH/06 had 7 nucleotide mutations, CH/ IMB/06, CH/HNCH/06, and CH/SHH/06 had 8 nucleotide mutations (Fig. 1) .

The nucleotide changes would lead to changes in the predict amino acid sequences of PEDV isolates. Compared to PEDV CV777, CH/IMB/06, CH/HNCH/06, and CH/ SHH/06 had 1 amino acid change (from E to Q at 13), CH/ IMT/06 had 2 amino acid changes (from S to F at 5, from N to D at 167), CH/HLJH/06 had 2 amino acid changes (from V to A at 42, from A to S at 214). Although CH/JSX/06 had nucleotide changes, it had no amino acid change (Fig. 2) . The M proteins of six Chinese PEDV isolates were 226 amino acid residues in length. The M proteins of five isolates had two potential N-linked glycosylation sites in theirs exposed amino-terminal domain, excepting for that of CH/ IMT/06, which had only one N-linked glycosylation site because 1 nucleotide change (from C to T at 14) (Fig. 1 ) Fig. 1 Comparison of the nucleotide sequences of the membrane glycoprotein genes of Chinese PEDV isolates and PEDV reference strains by DNAMAN software. The dashes (-) indicate regions where the sequences are identical to those of CV777. The dots (Á) indicate unavailable sequence. The asterisk (*) indicates the numbers are calculated from the start codon. The conserved intergenic motif is in shade, the start codon and the stop codon is underlined, respectively destroying N-linked glycosylation site through amino acid change (from S to F at 5) (Fig. 2) (predicted by http://www.cbs.tdu.dk/CBS Prediction Servers/NetNGlyc 1.0 Server). The glycosylation sequence Asn-Phe-Thr (NFT) was highly conserved in the M proteins of six isolates. The M proteins of six isolates were highly hydrophobic, and had four hydrophobic domains alternating with short hydrophilic regions in the amino-terminal half of the protein. The locations of three main transmembrane regions were aa20-38, aa43-65 and aa75-97, respectively, but the location of the fourth hydrophobic domain was not shown (predicted by http://www.cbs.tdu.dk/CBS Prediction Servers/TMHMM Server v. 2.0). In the center of the M protein, directly adjacent to the third hydrophobic domain, was a stretch of eight amino acids (SWWSFNPE) (Fig. 2) , which was well conserved in the coronavirus M proteins [12] .

Nucleotide and deduced amino acid sequence homology results are described in Table 3 

The Chinese PEDV isolates and ten PEDV reference strains were divided into three clusters based on the phylogenetic relationship of the nucleotide sequences of M protein genes (Fig. 3) . One cluster (I) comprised three Korean strains (KPEDV-9, KPEDV-9F, Chinju99), two Chinese strains (LJB/03, QH), and one Japanese strain JMe2. The second cluster (II) consisted of CV777, Br1/87 and one Chinese strain LZC. The third cluster (III) was composed of six Chinese isolates and one Chinese strain, JS-2004-02. 

The M protein is a multispanning membrane protein with a small, amino-terminal domain located on the exterior of the virion, or, intracellularly, in the lumen of the endoplasmic reticulum (ER). The ectodomain is followed by three transmembrane segments and then a large carboxy terminus comprising the major part of the molecule. This latter domain is situated in the interior of the virion or on the cytoplasmic face of intracellular membranes. The M protein is the most abundant viral constituent and is almost invariably glycosylated. Intriguingly, whereas the group 1 and group 3 coronaviruses-with transmissible gastroenteritis virus (TGEV) and infectious bronchitis virus (IBV) as important representatives, respectively-all contain M proteins with only N-linked sugars, the M proteins of group 2 coronaviruses such as mouse hepatitis virus (MHV) are O-glycosylated [13] , An exception is MHV-2, the M protein of which carries both O-and N-linked sugars [14] . The M protein plays a central role in coronavirus assembly. Together with the minor E protein it is responsible for the assembly of the coronavirus envelope [15] [16] [17] . In addition, the M protein directs the incorporation of the S protein [18, 19] and the nucleocapsid [20] into the budding particle.

In the present study, the complete nucleotide and deduced amino acid sequences of M protein genes of six Chinese PEDV isolates were determined and compared with ten PEDV reference strains in order to find genetic diversity among prevailing PEDVs in China. Although PEDV is known to have only one serotype at present, the genome of PEDV is widely to show genetic diversity, according to the analysis results using molecular biological methods [21] [22] [23] . The M protein genes of four Chinese strains (LJB/03, JS-2004-2, LZC, and QH) have been sequenced and reported. Although the phylogenetic analysis based on M protein gene of LJB/03 was reported [24] , it could not reflect the phylogenetic relationship among the PEDV field strains in China. Although the bi-combined attenuated vaccine against TGEV and PEDV infection is authorized to use in swine farms, PED still occurs in the immunized swine herds in China. So it is necessary to further investigate the prevalence status of PEDV based on the phylogenetic analysis of M protein genes of PEDV field strains in China.

There are no insertion and deletion in the M protein genes of six Chinese PEDV isolates except point mutations. These point mutations may lead to genetic diversity among these isolates. Our findings show that Chinese PEDV isolates are genetically diverse in their M protein genes, both among themselves and as compared with reference strains. The hexamer motif (ATAAAC), which locates at the 5 0 end of M protein gene, are conserved among the six isolates. They are identical to those found in the corresponding sites in strains CV777, Br1/87, JMe2, LZC, and QH. The M proteins of six isolates have four hydrophobic domains alternating with short hydrophilic regions in the amino-terminal half of the protein, and the forth hydrophobic domain may has functions as a cleavable signal peptide. In the center of M proteins of six isolates, directly adjacent to the third hydrophobic domain, are also the stretches of eight amino acids (SWWSFNPE) that are well conserved as mentioned by de Haan [12] . The coronavirus M protein is almost invariably glycosylated in its exposed amino-terminal domain. N-linked glycosylation is initiated in the endoplasmic reticulum by the cotranslational linkage of a large oligosaccharide structure to the polypeptide at asparagine residues within the consensus sequence NXS/T (where X is any amino acid). By prediction, five isolates have two N-linked glycosylation sites except CH/IMT/06 that has one N-link glycosylation site. The M proteins of group1 and 3 coronaviruses and SARS-CoV all contain with only N-linked sugars. The glycosylation status of the MHV M protein was found to influence the ability of the virus to replicate in the liver but not in the brain [25] , yet that of PEDV M protein has not been reported.

Sequence homology of M protein genes indicated that six Chinese PEDV isolates were highly homologous to CV777, Br1/87, Chinju99, JMe2, and JS-2004-2 rather than to QH, LZC, LJB/03, KPEDV-9, and KPEDV-9F. Although CH/SHH/06, CH/HNCH/06, CH/IMB/06 were isolated in different provinces, they had identical nucleotide sequences. It showed that the three isolates maybe originated from the same ancestor. CH/HLJH/06 was highly homologous to JS-2004-2, but they were isolated in different provinces at different time. Although CH/IMB/06 and CH/IMT/06 were isolated in the same province, the sequence homology between them was lower. These results indicated that the Chinese PEDV isolates had genetic diversity with respect to the M protein genes. By phylogenetic analysis, the six Chinese PEDV isolates with JS-2004-2 formed a separate group, which excluded European strains, Korean strains, Japanese strain and three Chinese strains (QH, LZC, LJB/03). The phylogenetic relationship indicated that Chinese PEDV isolates were different from foreign PEDV strains and a new genotype PEDV was prevailing in China. We expect to prevent outbreaks of the PEDV-induced diarrhea more effectively, according to choosing the more efficient PEDV vaccine strain by accurate analysis on genome of prevalent PEDVs in China. Moreover, genetic variation existence in M protein genes of Chinese PEDV isolates and more prevalent PEDVs discovery in China will form the basis for further functional exploration of PEDV.

Pig Farming

Disease of Swine

Veterinary Virology

The Coronaviridae