key: cord-0705648-490n4rxk authors: Kusters, J. G.; Niesters, H.G.M.; Lenstra, J. A.; Horzinek, M. C.; van der Zeijst, B.A.M. title: Phylogeny of antigenic variants of avian coronavirus IBV date: 1989-03-31 journal: Virology DOI: 10.1016/0042-6822(89)90058-5 sha: e1f210b5c1a4d6f1dfb426c27cb8ec6d5d588635 doc_id: 705648 cord_uid: 490n4rxk Abstract The sequences of the peplomeric S1 protein of four serologically distinct strains of the infectious bronchitis virus (IBV), an avian coronavirus, have been determined. The S1 protein is thought to contain the serotype-specific neutralization epitopes and to be the main target of antigenic variation. An alignment with sequences of three strains published previously showed that from the 545 amino acid residues only 243 have been conserved. Clustering of substitutions suggests that most serotype determinants are located within the first 300 amino acid residues of S1. A phylogenetic tree of the S1 sequences showed very variable rates of divergence. Differences in topology with a tree based on RNAse-T1 fingerprint data indicate that some of the IBV strains have arisen by genetic recombination. Avian infectious bronchitis (16) is a worldwide disease, which is caused by a coronavirus and results in a highly contagious respiratory affliction of young chickens or in a decrease in egg production (35). Initially, IB was effectively controlled by vaccination with live attenuated IB virus (IBV). However, this did not prevent outbreaks caused by variant viruses (10, 13). RNase Tl fingerprinting analysis showed that field strains isolated from such outbreaks are related to vaccine strains (24) suggesting that the new strains have originated from vaccine virus by antigenic variation. As in other RNA viruses, antigenic variation is probably facilitated by the relatively high error rate of transcription (1 Om3) during the transcription of the RNA template and the absence of a proofreading mechanism (18, 33) . It is generally assumed that the serotype of IBV is determined by the glycoprotein E2, which is the structural component of the peplomers, the typical clubshaped structures projecting from the surface of the virus. E2 is processed proteolytically to two noncovalently bound peptide chains, Sl and S2 (6, 34) . S2 contains the C-terminal half of the sequence, including the transmembrane anchor and two long a-helices that form the stalk of the peplomer (14). Sl forms the top part of the peplomer and presumably carries the serotypical determinants. This assumption is based on the findings that all strongly neutralizing monoclonal antibodies recognize Sl , and that immunization with purified Sl, but not with virus lacking Sl , induces neutralizing antibodies (5, 7, 29) . To investigate the serotypic variation of IBV at the molecular level, we determined the Sl sequences of four IBV strains (73, 24): H 120, an attenuated vaccine strain of serotype A; D207, the reference strain of serotype B; D1466, a vaccine strain of serotype C; and V1397, a recent Dutch isolate from serotype A/C. These sequences were compared with the sequence of three strains published previously (3, 4, 3) : M41, a pathogenic strain from serotype A used in a killed-virus vaccine; M42, a nonpathogenic laboratory strain of serotype A; and 6/82, a recent British field isolate of serotype B (10). Virus strains were obtained from the Poultry Health Institute, Doorn, The Netherlands. Details on the isolation and passage history have been described (24). Strains were passaged once in the allantoic cavity of 1 O-day-old chicken embryos. Virus stocks were stored at -70". Virus growth, isolation of genomic RNA, cDNA synthesis, cloning, and sequencing were carried out essentially as described previously (31). With V1397, cDNA synthesis was primed using random calf thymus pentanucleotides (Pharmacia). By screening of colonies with probes containing Sl sequences of the M41 strain, Sl clones of D207 and H120 were detected. Partial sequencing of a D1466 clone with a large insert yielded Sl sequences that could be used as a probe to obtain D1466 as well as V1397 clones. Most sequences were based on two or more independent cDNA clones. Only with strain D207, three nu- : L: (4) . D207* represents a cDNA variant of strain D207 (Glu-48 as main variant was found by sequencing independent clones, Lys-1 17 by direct RNA sequencing). Dashes were introduced to align the sequences. Potential glycosylation sites (NXS or NXT. X # P) are underlined. Conserved cysteine residues are in boldface. cleotide differences between cDNA clones were observed, two of them leading to amino acid substitutions. During the sequence determination of strain M41, one difference between two cDNA clones was observed. From the nucleotide sequences (submitted to the EMBL, GenBank, and DDBJ nucleotide se-quence databases), amino acid sequences have been deduced. In Fig. 1 , the amino acid sequences are listed together with the Sl sequences published previously. As observed earlier (37) five differences with M41 not reported by Cavanagh eta/. (9). The length of the Sl protein varies between 535 and 538 amino acids, including the signal peptide and the arginine-rich cleavage site between Sl and S2 (6) . The sequences could be aligned by assuming deletions/insertions at 14 positions. Two Sl proteins can have different amino acids in up to 49% of the positions of the sequence (Table 1 ). However, from the 17 to 19 cysteine residues, 16 have been completely conserved, as are most of the glycosylation sites. Presumably, the sequence variability is the combined result of the accumulation of neutral substitutions and the positive selection of antigenic variants; the relative high frequency of nonsilent mutations in several parts of the S 1 sequence (not shown) suggests a positive selection. To localize the most variable regions, the number of different amino acid residues found on all 545 positions is plotted in Fig. 2 . On 243 positions, the amino acid residues are conserved in all strains. Although there are no clearly defined hypervariable regions-as forexample, in the HIV envelope protein (30) the VP1 protein of foot-and mouth-disease virus (2, 1 I), and the rotavirus VP7 protein (IS, 17)-there are relatively many replacements in the regions 50-l 70 and 250-3 10. Insertions/deletions were mainly found in the region 120-170. These observations suggest a localization of most of the serotypic and antigenic determinants in the Nterminal half of the Sl subunit. A more accurate definition of the antigenic determinants may be derived from a comparison of similar sequences of strains with different serological properties. It has been suggested (9, 37) that in serotype-A strain the clustered substitutions in two regions, HVR 1 (56- 69 in the numbering of Fig. 1 ) and HVR 2 (1 17-133) coincide with neutralization epitopes. Indeed, a mutation in HVR 1 prevented neutralization by two different monoclonal antibodies (9). From the six differences between the serotype-B strains D207 and H6/82, three are in a region corresponding to HVR 2. The epitopes of IBV recognized by neutralizing monoclonal antibodies against Sl are conformation dependent (unpublished data). As described elsewhere (25; Kusters et al., unpublished results) , the 30 N-terminal residues of the S2 subunit contain several overlapping conformation-independent epitopes that evoke a weak neutralizing response. Since these epitopes cross-react with antisera against different serotypes, they are not relevant for the serotype of the virus. An alignment of the Sl nucleotide sequences was used to calculate a distance matrix, from which the most likely phylogenetic tree was inferred by a program distributed as part of the PHYLIP package 2.6 (15). The topology and branch lengths of this tree, shown in Table 1 . The figures indicate percentages of replaced nucleotides. The tree is unrooted. i.e., the position of the hypothetical ancestral IBV sequence is unknown. (B) Tree representing genomic relatedness. Percentages are estimated from RNase-Tl fingerprint analysis (I). The serotype is indicated between parentheses. The data are from Kusters et al. (24) . except for M42 (72; J.G.K., unpublished). placed apart in the Tl tree (<95% identity) but have closely related Sl sequences (297.5% nucleotide identity). Second, the considerable divergence of the Sl sequences of strains D1466 and D207 is not reflected in the Tl tree, nor in a tree (not shown) based on sequences of the El genes (8). Third, V1397 is related to H120 in the Tl tree (299.5% overall sequence similarity), but to D1466 in the Sl tree. Theoretically, the first two of these discrepancies might be resolved by assuming extreme variations of the evolutionary rate within the viral genome. However, this would not explain the similar Sl sequences of V1397 and D1466 vs the common RNase-Tl spots of V1397 and H120. Instead, we propose that genetic recombination has played a role in the generation of antigenie variants. For instance, V1397 may have acquired a D1466-like peplomer gene. In the murine coronavirus MHV, recombination occurs at a rather high frequency (19,X, 26) and may alter the serotype of the virus (27). For IBV, conditions that favor recombination are created in the field by vaccination of chickens with more than one attenuated IBV strain. Thus, infections of cells with two different strains, leading to the formation of recombinants, may very well have occurred. RNA recombination has been well documented for picornaviruses (21-23, 28, 32) . It would be interesting to test our hypothesis that recombination also plays a role in the generation of new IBV variants. Such a test might be based on the localization of the recombination site or on in vitro recombination experiments. We thank Dr. G. Koch for communicating unpublished data. The help of Dr. J. A. M. Leunissen (University of Nijmegen) with the computer programs and the use of the facilities of the Dutch CAOS/CAMM Center, under Grants SON-l l-20-700 and STWNCH44.0703, are gratefully acknowledged. This work was supported in part by a research grant from Duphar B.V., Weesp, The Netherlands (H.G.M.N.). Modern Approaches to Vaccines Ce/l47 Proc. Nat/. Acad. Sci. USA