key: cord-0823163-j2840g75 authors: Callison, Scott; Hilt, Deborah; Jackwood, Mark title: Using DNA Shuffling to Create Novel Infectious Bronchitis Virus S1 Genes: Implications for S1 Gene Recombination date: 2005 journal: Virus Genes DOI: 10.1007/s11262-004-2194-3 sha: 7de78e86d6284601e15735bd2276edb8e46f606b doc_id: 823163 cord_uid: j2840g75 We employed the staggered extension process (StEP) to shuffle the S1 genes from four infectious bronchitis virus (IBV) strains representing four unique serotypes. Upon creating a shuffled S1 gene library, we randomly selected 25 clones and analyzed them by DNA sequencing. In total, eleven clones contained novel S1 gene recombinants. Based on sequence data, each recombinant was unique and contained a full-length open reading frame. The average number of crossovers per recombinant was 5 and the average number of point mutations was 1.3, leading mostly to non-synonymous amino acid changes. No recombinant contained sequences from all four parental genes and no recombinant contained any sequence from the distantly related Delaware 072 strain. Our data suggests that recombination between distantly related IBV strains within the S1 gene probably does not readily occur. This finding is extremely important in light of the common industry vaccination practice of mixing different live-attenuated IBV strains. Infectious bronchitis virus (IBV) continues to cause disease in poultry flocks around the world. Although commercial poultry flocks are routinely vaccinated for IBV, outbreaks of infectious bronchitis still happen due to naturally occuring variant viruses that continue to arise. Currently, modified live and killed viruses that do not cross protect are used to vaccinate chickens. Better vaccines that are low in virulence, highly immunogenic and most importantly, cross protective between different IBV strains/serotypes are needed. To create better vaccines, a strategy that keeps pace with the rate of viral change in nature is needed. Infectious bronchitis virus is a Coronavirus that contains a large, single stranded, positive sense RNA genome that is 27.5 kb long. The genome encodes four major structural proteins: the membrane glycoprotein, the envelope protein, the nucleocapsid protein, and the spike glycoprotein, which contains conformationaly dependent virusneutralizing and serotype specific epitopes [1] . Unfortunately, the three-dimensional structure of the spike is not known and development of an efficacious recombinant vaccine has not been successful. The ability of IBV to rapidly change in nature is attributable to its method of replication. Infectious bronchitis virus undergoes genetic drift due to point mutations that occur during replication because the viral polymerase lacks proofreading ability. In addition, genetic shift occurs due to the recombination of two or more strains during replication. Both genetic drift and shift have been documented for IBV in the laboratory and in birds [2] [3] [4] [5] [6] [7] . DNA shuffling is the process of purposeful, but random recombination of parental genes into novel recombinant genes. Basically, genes (parental) with some level of similarity are recombined in a test tube by any number of techniques, such that the newly created genes contain genetic information from some or all of the parental genes. The new recombinant genes can be expressed as protein and then selected or screened for a desired property. Within the last 10 years, researchers have used DNA shuffling to create new genes that encode novel or improved proteins [8] [9] [10] [11] . As an example, Zhao et al. developed the Staggered Extension Process or StEP to shuffle different sequences encoding the subtilisin E protein and improved its half-life at 65°C by 50-fold [12] . Briefly, they promoted recombination by using a modified polymerase chain reaction in which the annealing and extension steps were combined and allowed only brief periods of time for polymerization. In this study, we use DNA shuffling to create unique spike glycoprotein genes from four different strains of IBV that represent four different serotypes using the StEP method. Examining S1 sequence recombinations following DNA shuffling ought to provide insights into coronaviral antigenic shift and thereby lead to predictions of future variant IBVs. In addition, it might be possible to create spike glycoproteins with multiple virus-neutralizing epitopes that could be used to develop a broadly-protective IBV vaccine. The virus strains used in this study were Massachusetts 41 [13] , Arkansas DPI [14] , Connecticut 46 [13] , and Delaware 072 [15] . All viral strains were propagated by allantoic sac inoculation of 9-11 day-old embryonating chicken eggs. After 48 h of incubation, the allantoic fluid was harvested and stored at )70°C [16] . RNA extraction, RT-PCR, and cloning of parental genes Viral RNA was extracted from allantoic fluid using the High Pure RNA extraction kit (Roche Diagnostics Corporation, Indianapolis, IN). The S1 gene was amplified by RT-PCR as previously described [17, 18] using the Titan One Step RT-PCR kit (Roche Diagnostics Corporation, Indianapolis, IN). The 1700 bp amplicon from each virus strain was gel purified on a 1% agarose gel using GenElute spin columns (Sigma-Aldrich Co, St. Louis, MO) and Microcon 30 columns (Millipore, Bedford, MA) per the manufacturers directions. The resultant DNA was cloned into TOPO XL per the manufacturers directions (Invitrogen Corp., Carlsbad, CA). Plasmid DNA was extracted from colonies using a plasmid miniprep kit (Qiagen, Inc., Valencia, CA). Plasmids containing inserts of the appropriate size were screened by DNA sequencing of the 5¢ and 3¢ ends. Clones containing S1 genes with appropriate 5¢ and 3¢ end sequences were fully sequenced (Seq-Wright, Houston, TX). Plasmid DNA from each parental S1 gene was purified, linearized by digestion with MIu I (New England Biolabs, Inc., Beverly, MA) and gel purified on a 1% agarose gel as stated earlier, for use as StEP template. The staggered extension process was performed as previously descried [12] . Briefly, a 100 ll PCR was assembled containing 0.05 picomoles of each linear parental template, 40 picomoles of each S1 primer, 50 ll of a commercial PCR buffer mixture, Premix 7 (Epicentre, Madison, WI), and water up to 100 ll. The reaction mixture was subjected to a thermocycle program consisting of 95°C, 5 min; 100 cycles of 95°C, 30 s, 55°C, 2 s; and 4°C hold until the products could be analyzed. StEP products were electrophoresed on a 1% agarose gel. The diffuse band around 1700 bp was excised from the gel and purified as previously stated. The purified DNA was reamplified by PCR using 1 ll as template in the same reaction mixture as for the StEP. The thermocycle program was 95°C, 2 min; 25 cycles of 95°C, 30 s, 45°C, 30 s, 72°C, 90 s; and 4°C, hold. After the reaction, the 1700 bp amplicon was gel purified and cloned into TOPO XL according to the manufacturerÕs directions. Twenty-five colonies were randomly selected from the transformation reaction. Each clone was grown overnight in LB broth in the presence of kanamycin. Plasmid DNA was prepared from each clone as stated earlier and the inserts were sequenced (Seq-Wright, Houston, TX). Sequence data was analyzed using MacDNASIS Pro V3.5 (Hitachi Software Engineering Corp., San Bruno, CA) and Lasergene V3.12 (DNASTAR, Inc., Madison, WI). Creating a Novel IBV S1 Gene Library Using the StEP and Initial Screening of 25 Clones We used the S1 gene of the Massachusetts 41, Arkansas DPI, Connedticut 46, and Delaware 072 IBV strains, which represent four different serotypes, in the StEP and examined the reaction products after 20, 40, 60, 80, and 100 cycles on an agarose gel (Fig. 1 ). In total, 11 of the 25 clones examined contained recombinant S1 genes. Each of the recombinant genes had a unique sequence (Fig. 2 ) that maintained a full-length ORF from the ATG start site to the cleavage site. On average, each recombinant gene contained 5 crossovers. Seven of the recombinants (DS-6, DS-8, DS-9a, DS-9, DS-12, DS-21, DS-22) contained sequence from two of the parental genes, while four of the recombinants (DS-1, DS-2, DS-13, DS-16) contained sequences from three of the parental genes. None of the recombinants contained sequences from all four of the parental genes. There was very little Arkansas DPI parental sequence and no Delaware 072 parental sequence in any of the recombinants. The nucleotide similarity for each of the 11 recombinants, when compared with the Massachusetts 41 strain, ranged from 91.3% to 99.6%. The nucleotide similarity for each recombinant when compared with the Connecticut 46 strain, ranged from 90.4% to 98.8%. The nucleotide similarity for each recombinant when compared with the Arkansas DPI strain, ranged from 75.7% to 82.8% (Table 1) . The amino acid similarity for each of the 11 recombinants, when compared with the Massachusetts 41 strain, ranged from 89.9% to 99.3%. The amino acid similarity for each recombinant when com-pared with the Connecticut 46 strain, ranged from 87.6% to 97.4%. The amino acid similarity for each recombinant when compared with the Arkansas DPI strain, ranged from 75.1% to 81.4% (Table 1) . The cleavage site sequence and the number of potential glycosylation sites were analyzed for each recombinant ( Table 2) . Each of the recombinants contained a cleavage sequence of the type X 1 RX 2 RR, where X 1 represents Arg, His, or Tyr and X 2 represents Ser, Phe, Ile, Arg, or His [22, 23] . The number of potential N-linked glycosylation sites ranged from 17 to 18. When secondary structure predictions of the amino acid sequence for each recombinant were performed using the Chou and Fasman algorithm [24] , and compared to the parental amino acid sequences, each recombinant was found to have a different secondary structure prediction (data not shown). The DS-1, DS-2, DS-13, and DS-16 recombinants contained parental sequence from Massachusetts 41, Connecticut 46, and Arkansas DPI (Fig. 3) . The recombinant DS-1 had 6 crossovers and no point mutations. The recombinant DS-2 had four crossovers and four point mutations, of which three were non-synonymous changes. The recombinant DS-13 had 7 crossovers and 1-point mutation that was synonymous. The recombinant DS-16 had four crossovers and two point mutations, of which one was a non-synonymous change ( Table 2 ). The StEP was used to produce recombinant S1 genes from four different strains of IBV, representing four different serotypes. We cloned the putative recombinant S1 genes and randomly selected 25 clones from the library for sequence analysis. Eleven of the clones contained recombinant S1 genes. All eleven recombinant clones (44% of total clones) contained a unique S1 sequence. This percentage might be improved by performing the StEP at a lower temperature Fig. 2 . Comparison of recombinant S1 gene sequences. Dark grey = Massachusetts 41 parental sequence; Light grey = Connecticut 46 parental sequence; Black = Arkansas DPI parental sequence. Shown at the bottom is a generic S1 gene with regions and their amino acid positions known to be involved in the formation of neutralizing and serotype specific epitopes (light grain) [19] [20] [21] . and/or by decreasing the amount of time for annealing/extension. Lowering the temperature should increase the chance for recombination by slowing down the DNA polymerase and allowing for less similar sequences to hybridize, while decreasing the annealing/extension time would produce shorter fragments during each PCR cycle. Those changes should increase the number of unique recombinants, and might produce recombinants that contain sequences from all of the parental genes. It was interesting to note that there was very little Arkansas DPI sequence and no Delaware 072 sequence in the recombinant genes. This may be due to the lower level of similarity between these two parental genes as compared to the Massachusetts 41 and Connecticut 46 S1 genes ( Table 1) . In fact, it may be impossible to recombine the Delaware 072 gene by the StEP because its S1 gene nucleotide similarity to the Massachusetts 41, Conneticut 46, and Arkansas DPI strains is extremely low (52.3%, 50.7%, and 48.7%, respectively). It is difficult to shuffle genes by the StEP with similarities below 70% (F.H. Arnold, California Institute of Technology, personal communication). Creating novel IBV S1 genes Previous research indicates that most natural recombination of infectious bronchitis viruses occurs in hot spots termed intergenic consensus sequences [5] . These areas are regions of similarity outside of the coding sequence for a gene. This normally leads to recombination of whole genes, not sections of genes, although some exceptions have been reported [2, 3] . Our research has shown that even under near ideal conditions for recombination, recombinants from very different sequences within the S1 gene could not be produced. The Delaware 072 virus is unique in that it is less than 60% similar in the S1 gene to all other IBV isolates. Thus, it would seem logical to assume that natural Delaware 072 recombinants containing crossovers within the S1 gene coding sequence with other IBV types would be rare. Indeed, a recent report details a new serotype of IBV, designated GA98, which appears to have evolved from the Delaware 072 strain by antigenic drift and not by antigenic shift [6] . It appears that recombination at intergenic consensus regions, which replaces whole genes in IBV, is more likely to occur when the S1 genes of two strains are less than 70-75%, whereas antigenic shift could occur to produce variants within the S1 gene from the recombination of IBV strains with a S1 gene similarity above 70-75%. This finding is extremely important in light of the common practice of simultaneously vaccinating with different live attenuated strains of IBV to achieve broad protection in commercial chicken flocks. However, these percentages have been determined from our data and should be considered an estimation only, because in vivo RNA recombination for IBV may differ from the in vitro DNA recombination reported here. Although we did not produce any recombinants with sequences from all four parental S1 genes, we did produce several that contained sequences from three of the parental S1 genes. In particular, recombinants DS-2 and DS-16 contained large S1 gene segments from parental strains, Massachusetts 41, Connecticut 46, and Arkansas DPI. The relatively low S1 amino acid sequence similarity for the DS-2 and DS-16 clones to any of the parental clones suggests that they may induce unique antibodies and makes them good candidates for further serologic characterization. Interestingly, the crossover sites for DS-2 ensure that different parental sequences reside within regions known to play key roles in the formation of virus neutralizing epitopes [19] [20] [21] 25] . It is impossible to predict the antigenic outcome of a mixture of parental sequences from serologically distinct viruses within the S1 gene. Virus neutralizing epitopes for IBV are conformationally dependent [19] [20] [21] and it is not known whether hypervariable regions form neutralizing epitopes within themselves or with other hypervariable or constant regions. Furthermore, due to the known association of non-contiguous amino acids in the functional spike glycoprotein and the non-conservative point mutations within our recombinant S1 genes, the biological functionality and serological character of these genes remains to be determined. Diseases of Poultry Proc 60th Ann A Laboratory Manual for the Isolation and Identification of Avian Pathogens