key: cord-0900493-3lp54mmu authors: Kuo, Shu-Ming; Wang, Ching-Ho; Hou, Ming-Hon; Huang, Yuan-Pin; Kao, Hsiao-Wei; Su, Hong-Lin title: Evolution of infectious bronchitis virus in Taiwan: Characterisation of RNA recombination in the nucleocapsid gene date: 2010-08-26 journal: Vet Microbiol DOI: 10.1016/j.vetmic.2010.02.027 sha: d9e96285563f38d052d6c129e247023bc51901cb doc_id: 900493 cord_uid: 3lp54mmu Avian infectious bronchitis virus (IBV) belongs to the Coronaviridae family and causes significant economic loss in Taiwan (TW), even in flocks that have been extensively immunised with Massachusetts (Mass)-serotype vaccines. Phylogenetic analysis of all non-structural and most structural genes shows that TW IBV is genetically distinct from the US strain and more similar to Chinese (CH) IBV. In contrast, the nucleocapsid (N) gene of TW IBV presents phylogenetic incongruence. RNA recombination at the 5′ end of the N gene between TW and US IBV is shown to be responsible for this discordance. Surprisingly, the recombinant N gene is found in all of tested TW IBV isolates, suggesting that a recombination event gave origin to a founder lineage. Our data indicate that RNA recombination in the recombinant 5′ end of the N gene may have caused the emergence of the current IBV population in Taiwan. Infectious bronchitis virus (IBV) is highly contagious and pathogenic in chickens. Although the current Massachusetts (Mass)-and Connecticut-serotype vaccines usually offer sufficient protection, they are less effective in certain areas, such as China (CH) and Taiwan (TW) Wang and Khan, 2000; Yu et al., 2001) . Frequent point mutations and evolutionary fitness in the hypervariable regions (HVRs) of the spike 1 (S1) gene contribute to most of the antigenic determinants of IBV. Therefore, the S1 gene is often referred to as the index of genetic diversity and of viral evolution, due to its high variability and close genotype/serotype correlation (Dolz et al., 2008; Liu et al., 2006; Wang and Huang, 2000; Wang and Tsai, 1996) . In addition to point mutation, RNA recombination is also involved in the generation of new antigenic variants. RNA recombination in a coronavirus was first demonstrated experimentally in mouse hepatitis virus (MHV) (Lai et al., 1985; Makino et al., 1986) , and a high frequency of recombination was found both in vitro and in vivo after MHV infection (Keck et al., 1988; Makino et al., 1986) . Recombination sites were detected throughout nearly the entire genome, and many recombinants with multiple crossover sites were found (Liao and Lai, 1992) . The high frequency of RNA recombination in coronaviruses is likely caused by the unique mechanism of RNA synthesis, which involves discontinuous transcription and polymerase jumping (Cavanagh, 2007; Lai et al., 1985; Liao and Lai, 1992; Makino et al., 1986) . Putative IBV genetic recombination in the S gene has been documented in different field isolates, including a Japanese strain (KB8523), European Avian infectious bronchitis virus (IBV) belongs to the Coronaviridae family and causes significant economic loss in Taiwan (TW), even in flocks that have been extensively immunised with Massachusetts (Mass)-serotype vaccines. Phylogenetic analysis of all non-structural and most structural genes shows that TW IBV is genetically distinct from the US strain and more similar to Chinese (CH) IBV. In contrast, the nucleocapsid (N) gene of TW IBV presents phylogenetic incongruence. RNA recombination at the 5 0 end of the N gene between TW and US IBV is shown to be responsible for this discordance. Surprisingly, the recombinant N gene is found in all of tested TW IBV isolates, suggesting that a recombination event gave origin to a founder lineage. Our data indicate that RNA recombination in the recombinant 5 0 end of the N gene may have caused the emergence of the current IBV population in Taiwan. ß 2010 Elsevier B.V. All rights reserved. isolates (Dutch D207 and British 6/82), Texas local strains (PP14 and SE17) and a New York isolate (CU-T2) (Jia et al., 1995; Kusters et al., 1990; Wang et al., 1993) , based on sequence comparison and phylogenetic incongruence. Additionally, half of the nucleocapsid (N) gene of CU-T2 was further shown to be replaced by that of the Holland 52 (H52) vaccine strain, indicating that genomic RNA recombination in IBV may occur in multiple genes (Jia et al., 1995) . In Taiwan, phylogenetic analysis revealed that few TW IBV strains, such as 2992/02 and 3374/05, may undergo genetic recombination with CH IBV (Chen et al., 2009; Huang et al., 2004) . This experimental evidence and the sporadic detection of recombinant variants in nature support the notion that recombination occurs in coronaviruses (Kottier et al., 1995) . However, available evidence remains insufficient to allow the conclusion that coronavirus evolution can be driven by interspecies recombination. TW IBV, first isolated in 1964 (Lu et al., 1993) , can be classified into two individual serological groups, TW I and TW II, based on the sequences of the HVRs of the S1 proteins (Wang and Huang, 2000) . The Mass-serotype vaccine has shown a modest protective effect on the IBV outbreak in Taiwan (Huang and Wang, 2006; Lin et al., 2005) , possibly due to the considerable variation in S1 sequences between TW and US strains. In this study, we analyse TW IBVs and show that they are genetically closer to CH isolates than to US strains. The N gene of TW IBV is the only exception and shows phylogenetic incongruence. We further demonstrate that RNA recombination has contributed to this genetic discordance. The recombinant N gene is detected in all the analysed TW IBVs, suggesting that RNA recombination drives the evolution of this coronavirus at the population level. TW IBV strains were isolated and propagated through the allantoic sac route by inoculating 9-11-day-old specific pathogens free (SPF) embryonated eggs from the Animal Health Research Institute, Council of Agriculture in Taiwan (Gelb and Jackwood, 1998) . TW IBVs had been purified by three consecutive limiting dilutions (Wang and Huang, 2000; Wang and Tsai, 1996) . Sequences of both US and CH IBVs were obtained from Genbank maintained by NCBI. We listed the tested IBVs and their Genbank numbers in supplementary Table 1 , including 29 TW strains, 15 US strains and 30 CH strains. Inquired from the Genbank up to 2009, we found 12 TW and 15 US IBVs are decoded in both S1 and N genes and we included all these up-to-date IBVs in this study. Among the 12 TW strains, 8 IBVs in the same clade of ME tree (supplementary Fig. S1 ) were chosen for the analysis of N gene in Figs. 3 and 4. Both TW2992/02 and TW3374/05 were excluded because of their phylogenetic discordance in S1 (Fig. S1 ). Both 2994/02 and 3382/06 was excluded due to the discordance in N (Fig. S2) . The length of the tested genes is all intact and no 3 0 -UTR deletion was found in these IBV strains. Total RNA was extracted from TW2575/98-infected allantoic fluid using TRIzol TM C&T (Protech, Taiwan), according to the manufacturer's protocol. RNA (0.5 mg) was reverse transcribed into cDNA using SuperScript TM III reverse transcriptase (RT) (Invitrogen, USA). PCR was performed with a proofreading DNA polymerase (KOD-Plus, Toyobo, Japan). Generally, the PCR conditions included 28 cycles of 94 8C for 30 s, 62 8C for 30 s, and 68 8C for 1 min, on a PCR machine (ASTEC 818, Japan). Consecutive viral genomic fragments were cloned into the pCRII-TOPO vector (Invitrogen, USA) by TA cloning and were subjected to sequencing. The 5 0 -untranslated regions (UTR) of IBV 2575/98 was analysed using a SMART TM RACE cDNA amplification kit (Clontech, USA). First, cDNA was prepared from 1 mg of total RNA from allantoic fluid, using the PowerScript TM RT and a unique 3 0 reverse primer (5 0 -TAC TTG CAA GAC AAG TTC C-3 0 , nt 1247-1265 of the viral genome). After the RT reached the 5 0 end of the mRNA template, it added 3-5 dC residues to the newly synthesised cDNA. The SMART II A 5 0 oligonucleotide (5 0 -AAGCAGTGGTAT-CAACGCAGAGTACGCGGG-3 0 ) annealed to the dC tag and served as an additional extending primer for RT. Consequently, the 5 0 single-stranded cDNA was used as the PCR template with the nested primer (5 0 -AAG-CAGTGGTATCAACGCAGAGT-3 0 ) and a new reverse primer, 5 0 -CCAACAAGCCTGCGCATTGCCATAG-3 0 (nt 996-1020 of the viral genome). Amplified DNA products were cloned into the pGEM-T Easy Vector (Promega, USA) and sequenced. Phylogenetic trees were constructed by Bayesian inference (BI), maximum likelihood (ML) and minimum evolution (ME) methods. The best-fit models and parameters for initial settings of the phylogenetic programs were selected by Modeltest 3.7 for BI and ML algorithms on the basis of the Akaike information criterion (Posada and Crandall, 1998) . Estimates of evolutionary divergence over all sequence pairs were computed by averaging the number of base substitutions per site in MEGA 4.0. All positions containing gaps and missing data were eliminated from the dataset. Standard error estimates were obtained by a bootstrap procedure from 500 replicates. For the S1 data set in Fig. 1A , the best-fit model of substitution was a GTR model (Rodriguez et al., 1990) with a gamma substitution parameter ( MrBayes 3.1.2 was used for BI analysis. Random starting trees were used. A total of two million generations of Markov chains were run. Trees were saved every 100 generations, resulting in 20,000 trees in the initial samples. The stationary phase of log-likelihood was reached within 500,000 generations. Thus the burn-in (number of initial trees that were discarded) was set to 5000. A majority-rule consensus tree was generated from the remaining samples (15,000 trees), and the percentage of samples recovering any particular clade represented the clade's posterior probability. We conducted 500 nonparametric bootstrap replicates for ML analysis using GARLI v0.951. We used the heuristic search option with substitution models set to the best-fit model chosen by Modeltest 3.7. ME analysis was performed using MEGA 4.0 with 500 replicates. Twelve IBV strains, including TW2575/98, TW1171/92, TW2296/95, TP/64, Mass 41, Cal99, Beaudette, CU-T2, CH-BJ, CH-S14, CH-LX4 and CH-LDT3, were subjected to the Recombination Detection Program (RDP) 3.0 (Martin and Rybicki, 2000) . Genes S and N were aligned by ClustalW and analysed by seven algorithms in RDP 3.0, including RDP, Bootscan, GENECONV, MaxChi, Chimaera, SiScan, and 3Seq. Based on the results automatically calculated by RDP, TW2575/98, TW2296/ 95, CU-T2 and CH-LX4 were chosen as representative strains for which data is shown below. Data were further subjected to the Genetic Algorithms for Recombination Detection (GARD) program to confirm the putative single breakpoint position (Kosakovsky Pond et al., 2006) . To elucidate the phylogenetic relationships of TW IBV, 74 IBVs were analysed in this study, including 29 TW strains, 30 CH strains and 15 US strains. Minimum evolution (ME) test showed that 25 TW IBVs form a unique branch independent of the US and CH groups (Fig. S1 ). Some exceptional strains showing the phylogenetic incongruence were noticed, such as the TW2992/02 (Chen et al., 2009; Huang et al., 2004) , TW2994/02 (Huang et al., 2004) , TW3374/05 (Chen et al., 2009) , China CK/CH/ LDL97I and China CK/CH/LDL98I (Chen et al., 2009) . In their respective clades, 11 TW strains, 9 CH strains, 9 US strains were chosen for the analyses of nucleotide-sequence identity, maximum likelihood (ML) and Bayesian inference (BI). The nucleotide-sequence identity between TW S1 genes and those of US or CH isolates was similar (77-82%). The phylogenetic tree shown in Fig. 1A , which were constructed by BI, shows that TW IBV is a branch independent of the US and CH groups. Analysis of ML reproduces similar tree topology and the bootstrap values are illustrated together with the posterior probability (Fig. 1A) . Interestingly, the evolutionary divergence and ME methods indicated that TW IBVs are more closely related to the CH cluster than to the US group (Table 1 , Fig. 1B and Fig. S1 ), supported by less difference between TW-CH strains (Table 1 ) and the high bootstrap value (98%) for the branch separating the TW-CH groups from US group (Fig. 1B) , respectively. A virulent Taiwanese field isolate, TW2575/98, was fully sequenced (27,710 bps; DQ646405). The G/C content of the viral genome was 38.5%. ORFs 1a, 1b, E and M of TW2575/98 were compared to those of other IBV genomes (Fig. 2) . Phylogenetic trees of the 1a and 1b genes of TW IBVs consistently show a closer relationship to the CH strains than to the US strains ( Fig. 2A and B) . Analysis of the 5 0 -UTR and the ORF 1-encoded proteins (including the papain-like proteinase, helicase, RNA-dependent RNA polymerase, p58 and HD2 protein) using the neighbourjoining algorithm reproduces the same topology (data not shown). Similar results are also seen in the trees based on the full-length E and M genes of IBV, as shown in Fig. 2C and D, further illustrating that TW IBV is genetically closer to the CH virus. In contrast to the above results, the N gene of TW IBV showed surprising phylogenetic incongruence and was more closely related to that of US isolates according to the ME algorithm (Fig. S2, Fig. 3A and B) and the evolutionary divergence analysis (Table 1) . A fragment containing nt 1-600 and its encoded proteins ( Fig. 3C and D) also reflected the same phylogenetic relationship as the full-length N region. Interestingly, results based on the 3 0 -terminus of the TW IBV N gene and its encoded protein ( Fig. 3E and F) were inconsistent with those from the full-length N gene, and were similar to those for the genes described above and shown in Figs. 1 and 2 . To further examine this phenomenon, unrooted phylogenetic trees produced by BI clearly illustrate that the TW N gene is phylogenetically closer to US IBV than to CH IBV (Fig. 4A, C and E) . Crucially, Fig. 4C illustrates that the 5 0termini of the N genes of TW and US IBV belong to the same monophyletic group, rather than being separate isolated branches. The K a /K s ratio of TW2575/98 N versus all IBVs analysed in this study is 0.06-0.17, indicating the purifying selection of N protein during evolution. To further clarify whether the phylogenetic incongruence is caused by different evolutionary rates in different parts of the N gene, a ML test that are not biased by evolutionary rate variation was applied to recheck the phylogenetic relationships (Holmes and Rambaut, 2004) . No major difference was observed between the results from the BI and those from the ML test (Fig. 4B , D and F), confirming that RNA recombination probably occurred between TW and US IBVs in the 5 0terminal region of the N gene and that the phylogenetic incongruence was not caused by point mutation or variation in local evolutionary rates. To further test our hypothesis, genes 5 and N of 12 IBVs were subjected to the Recombination Detection Program (RDP) (Martin and Rybicki, 2000) and the Genetic Algorithms for Recombination Detection (GARD) program (Kosakovsky Pond et al., 2006) to examine the interspecies recombination. RDP analysis and three GARD algorithms all consistently showed high significance (p value = 1.9 Â 10 À3 to 3.5 Â 10 À9 ) to support the hypothesis of interspecies recombination with the breakpoint at nt 278 of gene 5. RDP-bundled algorithms also revealed one candidate crossover region between the TW and US genes, located between nt 278 and 757 of the queried sequences, covering the gene 5 and the 5 0 -terminus of the N gene (Fig. 5A ). For instance, the p values of the RDP and Bootscan algorithms were 5.1 Â 10 À4 and 1.0 Â 10 À5 , respectively. Alignment of the first 150 base pairs (bp) of N clearly indicated high identity between the TW and US strains (Fig. 5B) . Moreover, the deduced protein sequences also showed that the CH IBVs were significantly different from the TW and US strains ( Fig. 5C and Fig. 6 ). In addition, all the analysed TW IBVs shared the recombinant N gene, including one of the earliest isolates found in Taipei in 1964, TP/64 (Figs. 3 and 4) . This finding suggests that the recombination in the N gene produced a novel IBV founder distinct from IBV lineages present before 1964. The recombination event may have provided replication benefits, enabling the offspring of this founder to become dominant. The variation in the S1 region often represents viral genetic diversity and strain evolution (Dolz et al., 2008; Liu et al., 2006; Wang and Huang, 2000; Wang and Tsai, 1996) . The ratio of K a /K s can be an indicator of selective pressure acting on an open reading frame. The K a /K s of nucleotides in the S1 region was 0.35 AE 0.12% among evaluated TW IBVs. Although previous studies of IBV S1 genes have argued that TW IBV is an independent branch of the US group (Huang et al., 2004) , the relationship between TW IBV and CH isolates remains unclear. In this study, we have shown that Taiwanese IBV features a recombinant NTD in the N locus and has evolved into a unique population. The interspecies recombination is evidenced by the incongruent phylogenetic relationship inferred for this gene region, high consensus in the N-terminus between TW and US strains, rare synonymous mutations in the conserved N protein, and wellsupported results from RDP analyses. A fast rate of mutation caused by an error-prone RNAdependent RNA polymerase is a major factor driving the evolution of RNA viruses. Nevertheless, as the restraint of error threshold of lethal mutations, the coronavirus is speculated to have a relatively low mutation rate due to its large genome size (Eigen, 1987; Maynard Smith and Szathmá ry, 1995) . To increase viral adaptability and fitness, coronavirus has evolved a unique discontinuous transcription system and 'copy-choice' genomic synthesis to achieve high efficiency of RNA recombination (Cavanagh, 2007; Lai et al., 1985; Liao and Lai, 1992; Makino et al., 1986) . However, only sporadic recombinant variants have been detected in nature. We here show that the recombinant N gene was detected in all the isolated TW IBVs, supporting the participation of recombination in viral evolution and showing that a recombinant RNA virion can emerge as a local dominant strain. Since the emergence of severe acute respiratory syndrome coronavirus (SARS-CoV) in humans, coronavirus has become a new focus of pathogenic research. In the present, little is known about the original strain and the natural host of the SARS-CoV. Plausibly, the SARS-CoV may have originated, like TW IBV, through RNA recombination in an unidentified host and subsequently emerged as a new virus in human beings by crossing the host boundary (Holmes and Rambaut, 2004; Hon et al., 2008; Magiorkinis et al., 2004; Stavrinides and Guttman, 2004; Zhang et al., 2005) . However, present evidence is far from supporting this conclusion, and extensive sampling of the coronavirus genome from candidate hosts is required to solve this important issue. Although studies of MHV have demonstrated that RNA recombination can occur throughout the genome, genetic crossovers in coronaviruses are usually detected after TAS sequences, referred to as putative hotspot sites (Keck et al., 1987; Kusters et al., 1990; Liao and Lai, 1992) . In this study, both the RDP/Bootscan and GARD analyses suggested the possibility of a putative 5 0 recombination crossover nick at nt 278 of gene 5 in the TW IBV genome. Alternatively, given the remarkable discrepancy between the first 150 nt of CH N and those of TW and US N (Fig. 5) , the start codon of N after the TAS sequence may also serve as a breakpoint candidate for genetic recombination within TW IBV genome. It is noteworthy that one of the earliest TW IBVs, TP/64, shares high sequence identity in the first 150 nt of N with other TW IBVs and US IBVs (Fig. 6 ). During the 1970s in Taiwan, flocks were extensively immunised with the Mass-serotype H120 vaccine (Lu et al., 1993) . In this study, our finding suggests that the recombination in the N gene produced a novel IBV founder distinct from IBV lineages present before 1964. However, we did not exclude the possibility that the vaccination may also contribute to the antigenic shift of TW IBVs. The recombination event may have provided replication benefits, enabling the offspring of this founder to become dominant. To increase fitness under the pressure of H120 vaccination, the TW IBV with N recombinant might have undergone positive selection to escape from immune surveillance or to gain a replication advantage within quasi species. Coronavirus avian infectious bronchitis virus Identification of Taiwan and China-like recombinant avian infectious bronchitis viruses in Taiwan Molecular epidemiology and evolution of avian infectious bronchitis virus in Spain over a fourteen-year period New concepts for dealing with the evolution of nucleic acids A Laboratory Manual for the Isolation and Identification of Avian Pathogens Viral evolution and the emergence of SARS coronavirus Evidence of the recombinant origin of a bat severe acute respiratory syndrome (SARS)-like coronavirus and its implications on the direct ancestor of SARS coronavirus S1 and N gene analysis of avian infectious bronchitis viruses in Taiwan Development of attenuated vaccines from Taiwanese infectious bronchitis virus strains A novel variant of avian infectious bronchitis virus resulting from recombination among three different strains RNA recombination of murine coronaviruses: recombination between fusion-positive mouse hepatitis virus A59 and fusion-negative mouse hepatitis virus 2 Multiple recombination sites at the 5 0 -end of murine coronavirus RNA GARD: a genetic algorithm for recombination detection Experimental evidence of recombination in coronavirus infectious bronchitis virus Sequence evidence for RNA recombination in field isolates of avian coronavirus infectious bronchitis virus Recombination between nonsegmented RNA genomes of murine coronaviruses RNA recombination in a coronavirus: recombination between viral genomic RNA and transfected RNA fragments Protective effect of vaccination in chicks with local infectious bronchitis viruses against field virus challenge Genetic diversity of avian infectious bronchitis coronavirus strains isolated in China between The incidence and virus isolation of infectious bronchitis in chickens in Taiwan Phylogenetic analysis of the full-length SARS-CoV sequences: evidence for phylogenetic discordance in three genomic regions High-frequency RNA recombination of murine coronaviruses RDP: detection of recombination amongst aligned sequences The Major Transitions of Evolution MODELTEST: testing the model of DNA substitution The general stochastic model of nucleotide substitution Mosaic evolution of the severe acute respiratory syndrome coronavirus Isolation, pathogenicity, and H120 protection efficacy of infectious bronchitis viruses isolated in Taiwan Relationship between serotypes and genotypes based on the hypervariable region of the S1 gene of infectious bronchitis virus Genetic grouping for the isolates of avian infectious bronchitis virus in Taiwan Evidence of natural recombination within the S1 gene of infectious bronchitis virus Molecular characterization of an infectious bronchitis virus strain isolated from an outbreak in vaccinated layers Characterization of three infectious bronchitis virus isolates from China associated with proventriculus in vaccinated chickens Testing the hypothesis of a recombinant origin of the SARS-associated coronavirus This work was supported by the Bureau of Animal and Plant Health Inspection and Quarantine, Council of Agriculture in Taiwan (grant number 95AS-13.2.1-BQ-BE). It is highly appreciated. We also thank Dr. Yi-Ling Lin at the Institute of Biomedical Sciences in Academia Sinica for critical reading of this manuscript and valuable comments. Supplementary data associated with this article can be found, in the online version, at doi:10.1016/j.vetmic. 2010.02.027.