key: cord-0952041-jdiqvgbg authors: Lara-Romero, Rocío; Gómez-Núñez, Luis; Cerriteño-Sánchez, José Luis; Márquez-Valdelamar, Laura; Mendoza-Elvira, Susana; Ramírez-Mendoza, Humberto; Rivera-Benítez, José Francisco title: Molecular characterization of the spike gene of the porcine epidemic diarrhea virus in Mexico, 2013–2016 date: 2017-12-14 journal: Virus Genes DOI: 10.1007/s11262-017-1528-x sha: 9e31c7e67f3b134791cbe0adf9f4c636540a2656 doc_id: 952041 cord_uid: jdiqvgbg In Mexico, the first outbreaks suggestive of the circulation of the porcine epidemic diarrhea virus (PEDV) were identified at the beginning of July 2013. To identify the molecular characteristics of the PEDV Spike (S) gene in Mexico, 116 samples of the intestine and diarrhea of piglets with clinical signs of porcine epidemic diarrhea (PED) were obtained. Samples were collected from 14 farms located in six states of Mexico (Jalisco, Puebla, Sonora, Veracruz, Guanajuato, and Michoacán) from 2013 to 2016. To identify PEDV, we used real-time RT-PCR to discriminate between non-INDEL and INDEL strains. We chose samples according to state and year to characterize the S gene. After amplification of the S gene, the obtained products were sequenced and assembled. The complete amino acid sequences of the spike protein were used to perform an epitope analysis, which was used to determine null mutations in regions SS2, SS6, and 2C10 compared to the sequences of G2. A phylogenetic analysis determined the circulation of G2b and INDEL strains in Mexico. However, several mutations were recorded in the collagenase equivalent (COE) region that were related to the change in polarity and charge of the amino acid residues. The PEDV strain circulating in Jalisco in 2016 has an insertion of three amino acids ((232)LGL(234)) and one change in the antigenic site of the COE region, and strains from the years 2015 and 2016 changed the index of the surface probability, which could be related to the re-emergence of disease outbreaks. Porcine epidemic diarrhea (PED) is characterized by severe watery diarrhea, vomiting, and dehydration in all swine productive stages. In fattening pigs, an increase in the mortality rate of 2.5% has been registered [1] ; however, mortality can reach 100% in piglets [2] . PED was first described on a pig fattening farm in England in 1971 [3] . From there, it disseminated to several countries in Europe with sporadic outbreaks, such as Italy in 2006 [4] . However, from 2014 to 2016, several emerging cases have been reported in other countries, including Germany, Belgium, Ukraine, France, Italy, and Austria [5] [6] [7] [8] [9] [10] . In Asia, the first outbreak was reported in Japan in 1982; since then, PED has been considered a re-emerging disease [11] . The first outbreak in the Americas appeared in April 2013 in the USA, which disseminated rapidly throughout the country and caused severe economic losses [2] . In Canada, the first outbreak of PED was identified in January 2014, however in 2017, re-emerging outbreaks occurred in the province of Manitoba [ . ORF1a and ORF1b are located in region 5′; they occupy two-thirds of the genome and encode two polyproteins (pp1a and pp1b). The last third of the genome encodes four structural proteins: the spike protein (S), the envelope protein (E), the membrane protein (M), the nucleocapsid protein (N), and a non-structural protein encoded by ORF3 [17] . Similar to other coronaviruses, the S protein of PEDV plays a fundamental role in the interaction with the cellular receptor. Additionally, the S protein induces neutralizing antibodies in the natural host. This protein contains two domains, S1, which mediates the binding of the virus to the cellular receptor, and S2, which is involved in the fusion process of the virus to the cell. Within these two domains, there are four neutralizing epitopes: three of them are located in domain S1 (COE residues 499-638, SS2 residues 748-755, and SS6 residues 764-771), and the fourth is in domain S2 (2C10 residues 1368-1374) [18] [19] [20] . Furthermore, the S1 region contains the N-terminal domain (NTD) and the C-terminal domain (CTD) [21] . Phylogenetic analyses have grouped PEDV into two different genogroups (G1 and G2). Molecular analyses have identified the circulation of new viral strains; some of them are insertions or deletions of the S gene, called INDEL strains, and they are related to a diminished virulence of PEDV [22] . The objective of the present study was to perform the molecular characterization of the S gene of PEDV to identify the circulating strains from 2013 to 2016 in Mexico. We obtained 116 intestinal and diarrheal samples from piglets with clinical signs suggestive of PEDV infection (vomiting, watery diarrhea, dehydration, and anorexia) from 14 farms of multi-site intensive production systems located in six states of Mexico (Jalisco, Puebla, Sonora, Veracruz, Guanajuato, and Michoacán) from 2013 to 2016. The samples were collected in liquid nitrogen and then stored at −70 °C. RNA extraction of the samples was performed with commercial RNeasy Mini Kit (Qiagen) kits following the manufacturer's protocol, and the real-time RT-PCR test described by Wang et al. [23] was performed to amplify a fragment of domain S1 of the S gene. This process, aside from determining the presence of genetic material of PEDV, also indicates whether the positive samples correspond to INDEL or non-INDEL strains. The samples corresponding to the years of 2013 and 2014 were analyzed with a differential test for transmissible gastroenteritis virus, porcine rotavirus A, and porcine deltacoronavirus (PDCoV), detecting positivity only for PDCoV in 35% of the samples analyzed (data not shown). Of the total samples positive in the real-time RT-PCR, we selected those presenting the highest amounts of genomic equivalents according to state and year of sampling. The cDNA was synthesized from extracted RNA using Rever-tAid First Strand cDNA Synthesis Kit (Thermo Fisher Scientific) following the manufacturer's protocol using random hexamer primers. Amplification of the S gene was performed through seven PCR reactions using the primers depicted in Table 1 . For the amplification, we used GoTaq Green Master Mix (Promega); the amplification conditions were as follows: 95 °C for 10 min, 35 cycles of 95 °C for 30 s, annealing temperature dependent on primers, 72 °C for 60 s and a final extension of 72 °C for 10 min. The amplified products were purified by means of commercial columns, QIAquick Gel extraction kit (Qiagen), following the manufacturer's protocol. The purified products were sequenced in both directions using an ABI Prism 3100 xL Genetic Analyzer with the BigDye ® Terminator v3.1 Kit (Applied Biosystems, Foster City, CA) in the Instituto de Biología of the Universidad Nacional Autónoma de México. The obtained sequences were compared to the GenBank database using the NCBI's BLAST tool (https://blast.ncbi. nlm.nih.gov/Blast.cgi). The amino acid sequences were aligned and analyzed with ClustalW using MEGA 7.0.21 software. The antigenic index was calculated using the Jameson-Wolf algorithm [24] , the surface probability profiles were obtained by the Emini method [25] , and the hydrophilicity analysis was performed using the Kyte-Doolittle method [26] . With the amino acid sequences obtained from the present study and sequences of PEDV reported in the Americas, Asia, and Europe, a phylogenetic tree was constructed using the maximum likelihood based on the JTT matrix-based model and Gamma distribution with 1000 bootstrap replicates using the MEGA 7.0.21 software. To accomplish the phylogenetic analysis, we used the S gene sequences from the different PEDV genogroups with the following access numbers to GenBank for G2: . All the sequences obtained in the present study were also deposited in GenBank (Table 2) . Of the 116 obtained samples, 103 (88.8%) were positive for PEDV based on the real-time RT-PCR test. Of the positive samples, 87.4% corresponded to samples of the high virulence PEDV (G2 non-INDEL) and 12.6% to INDEL PEDV strains. We selected 10 samples for amplification and sequencing, which represents 74% of the farms sampled (10/14). The obtained sequences correspond to the six analyzed states of Mexico (Table 2) . Alignment was performed using the previously reported sequences pertaining to G1a, G1b, G2a, G2b, and INDEL to compare the sequences obtained in this study. A neutralizing epitope analysis was performed through the alignment of amino acids. In the COE region (Fig. 1) , seven of the ten analyzed USA/Colorado/2013 strains possessed mutations (Table 3) ; however, epitopes SS2, SS6, and 2C10 (Fig. 2) remain conserved. The insertion of three amino acids ( 232 LGL 234 ) was detected in strains PEDV/MEX/JAL/01/2016 and PEDV/ MEX/JAL/03/2016 (Fig. 3) . The alignment identified changes in specific residues according to the genogroups. The Spike protein was analyzed by means of the Jameson-Wolf method that predicts potential antigenicity sites; these sites are the result of changes in polarity and charge in the amino acid sequence of the different assessed strains. The prototype strains we used were the parental strain (Fig. 4) . The surface probability analysis identified changes in the NTD region of the S1 domain in the Mexican strains from 2015 and 2016 compared to the strains of previous years that were analyzed in the present study. The hydrophilicity analysis reported a hydrophobic region we identified in the insertion region of the PEDV/ MEX/JAL/01/2016 strain (Fig. 5) . The Mexican strain previously reported and eight strains of the present study were grouped in clade G2b, which is composed of the ancestral strain AH2012 and the strains of the Americas (Fig. 6 ). The first outbreaks of PED in Mexico occurred in July 2013. Since then, the disease has caused severe economic losses to the Mexican swine industry because of its high mortality, which reached 100% in piglets in the first outbreaks; in the re-emergence of outbreaks, the reported mortality has been 75% in newborn piglets. Until 2017, in Mexico, there was no vaccine available, only the feedback immunization strategy was implemented. Currently in Mexico, is authorized the killed virus vaccine based with the USA/Colorado/2013 (KF272920) strain. The PEDV/MEX/MICH/01/2013 strain originated in the state of Michoacán, where the first cases of the disease occurred; the S gene of that strain differs by only four nucleotides compared to the USA/Colorado/2013 strain, which was the first complete sequence of PEDV reported in the USA [27] , indicating that the first PED outbreaks in México were caused by a related-highly virulent strain. Since 2013, the circulation of the first INDEL strain (OH851) was identified, that was closely related to the strain CH/HBQX/10 [22, 28] . The first Mexican INDEL strain was identified in Veracruz, Mexico and was named PEDV/MEX/ VER/01/2014; it had a 99.8% identity with strain OH851, which is related to the reduced severity of the clinical disease in the piglets [22] . In PEDV, the receptor-binding domain (RBD) is located in the C-terminal end of region S1 of the Spike protein [29] ; this region, named COE, is located in residues 499-638 of the strains of gene group 1 and INDEL and in G2 strains of high virulence, and in no-INDEL in residues 502-641. The COE region has been identified as a neutralizing epitope and as the region that recognizes the porcine amino peptidase N (pAPN), allowing entrance of the virus to the susceptible potential antigenicity sites, demonstrated a very evident change that corresponds to an additional antigenic peak compared to diverse PEDV strains. The 2016 strains from Jalisco were obtained from a farm where a PEDV reemerging outbreak was reported with severe clinical signs of diarrhea in growing and fattening pigs. Additionally, in these strains, a three amino acid insertion ( 232 LGL 234 ) was found, which has not been reported previously at a worldwide level. This insertion contains two leucines; therefore, this region of the protein increases its hydrophobicity, as observed in the hydrophilicity analysis. The hydrophobicity, together with the changes in surface index in the strains of years 2015-2016 and mutations in the COE region, could be related to re-emerging outbreaks, tropism, and the presentation of the clinical PED signs [30] . Changes in the S gene have been related to recurring cases, as in Japan, where a co-infection of PEDV strains with the complete S gene and strains with deletions of amino acids 194-216 in region S1 were identified [31] . However, the neutralizing regions SS2 and 2C10 remain highly conserved, as reported previously in studies performed with strains from Japan in 2014 and China in 2015 [11, 32] . Region SS6 does not show mutations in relation to strains of G2b; however, there are changes compared to G1. These analyses of neutralizing epitopes together with the analysis of antigenicity sites provide basic information on the antigenic structure of the Spike protein. The phylogenetic analysis showed that the North American strains are located in clade G2b, with a common ancestor, strain AH2012, as reported previously, whereas the study of the complete genome of the North American strains reported until 2014 places them in three clades (North American clade I and II and USA INDEL), with two strains from Mexico in clades I and II [28] . Regarding the circulating strains of G2b in Mexico, the most distant with respect to time and Fig. 6 Phylogenetic analysis of the amino acid sequences of the Spike protein of PEDV. The evolutionary history was inferred using the Maximum Likelihood method based on the JTT matrix-based model. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories). The tree is drawn to scale, with branch lengths corresponding to the number of substitutions per site. The analysis involved 34 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 1354 positions in the final dataset. Circles indicate the sequences of the present study (Color figure online) substitutions of amino acids are PEDV/MEX/MICH/2013 and PEDV/MEX/JAL/01/2016, with a 99% identity. However, in 2015 in China, the S gene of diverse strains was assessed, yielding a homology between 96.6 and 100% [32] . This result indicates that the strains from this study have not undergone a high rate of mutations, although the antigenic sites have changed. Trujillo-Ortega et al. reported an analysis of a fragment of the S gene of four strains from a PED outbreak in Mexico in 2014 [14] . These strains seem to have a common ancestor, strain MEX/104/2013 (KJ645708), reported by Vlasova [28] , and a 99% similarity with strains of the USA from 2013 and 2014, and even with a strain from South Korea from 2013. The PEDV/MEX/PUE/01/2016 strain was grouped outside the clade of G2b, due to probable recombination events, as described by Song et al. with strains CH/S and DR13 [33] , so this recombination will be confirmed in the future analysis of complete genome sequences. However, in the present study, the phylogenetic analysis of the S protein demonstrated that the strains circulating in Mexico have different ancestors. In conclusion, in this work, ten PEDV strains were identified from outbreaks in six regions of Mexico from 2013 to 2016. Of the four analyzed neutralizing epitopes, the Mexican strains presented mutations only in the COE region. In the 2016 strains of Jalisco, we identified a new insertion of three amino acids in PEDV, which has not been reported before at a worldwide level. In addition, the present mutations result in changes in the surface index and the antigenic sites of the COE region; therefore, the clinical presentation of the disease and the virulence can change with respect to the other circulating strains. Lastly, the phylogenetic analysis determined the circulation of PEDV INDEL strains of high virulence in the porcine population of Mexico. The results of this study help identify the molecular characteristics of PEDV, which will lead to improving disease control measures. Letter to the editor Porcine Epidemic Diarrhea Virus The authors thank DVM. Flor Solís Galicia and MC. Jazmín De la Luz for his support in sample processing. Conflict of interest All authors in this paper declare they have no conflict of interest.Ethical approval All applicable international, national, and/or institutional guidelines for the care and use of animals were followed.