key: cord-1010289-5i3g2zb7 authors: Brierley, Ian; Digard, Paul; Inglis, Stephen C. title: Characterization of an efficient coronavirus ribosomal frameshifting signal: Requirement for an RNA pseudoknot date: 1989-05-19 journal: Cell DOI: 10.1016/0092-8674(89)90124-4 sha: ceb567e504b717d421b4f7a191945236d885acd8 doc_id: 1010289 cord_uid: 5i3g2zb7 Abstract The genomic RNA of the coronavirus IBV contains an efficient ribosomal frameshifting signal at the junction of two overlapping open reading frames. We have defined by deletion analysis an 86 nucleotide sequence encompassing the overlap region which is sufficient to allow frameshifting in a heterologous context. The upstream boundary of the signal consists of the sequence UUUAAAC, which is the likely site of ribosomal slippage. We show by creation of complementary nucleotide changes that the RNA downstream of this “slippery” sequence folds into a tertiary structure termed a pseudoknot, the formation of which is essential for efficient frameshifting. Ribosomal frameshifting is a recently described mechanism of translational regulation in which a directed change of translational reading frame allows the synthesis of a single protein from two (or more) overlapping genes (see Roth, 1981, Craigen and Caskey, 1987 , for reviews). So far, almost all the examples of this kind of control in higher eukaryotes have come from retroviruses, where frameshifting appears to be a mechanism for the regulation of expression of the viral RNA-dependent DNA polymerase. One termination codon in Rous sarcoma virus (RSV) (Jacks and Varmus, 1985) and human immunodeficiency virus type 1 (HIV-l) (Jacks et al., 1988a) , and two in mouse mammary tumor virus (MMTV) (Moore et al., 1987; Jacks et al., 1987) are suppressed by (-1) ribosomal frameshifts into alternative overlapping open reading frames (ORFs), generating the gag/p01 polyproteins from which the viral polymerases are subsequently derived. In addition, frameshifting appears to be necessary for the expression of the reverse transcriptase enzymes of a number of retrotransposons, such as yeast Tyl (Mellor et al., 1985; Wilson et al., 1986; Clare et al., 1988) and Ty912 (Clare and Farabaugh, 1965) and the Drosophila 17.6 (Saigo et al., 1984) and gypsy (Marlor et al., 1986) elements. Recently, however, we described the first non-retroviral, higher eukaryotic example of the phenomenon, in the avian coronavirus infectious bronchitis virus (IBV) (Brierley et al., 1987) . IBV, the type species of the Coronaviridae (Siddell et al., 1983) , has a large, single-stranded, continuous RNA genome of positive polarity (Schochetman et al., 1977; Stern and Kennedy, 1980) . Three-quarters of the genomic coding capacity is contained in two large ORFs, Fl and F2, which potentially encode 440 kd and 300 kd polypeptides respectively (Boursnell et al., 1987) . The ORFs of Fl and F2 overlap by 42 nucleotides, with F2 in the -1 frame with respect to Fl (see Figure 1 ). Using in vitro transcription and translation, we demonstrated that F2 is produced as a fusion protein with Fl as a result of a highly efficient (25%-300/o) (-1) ribosomal frameshift that suppresses the Fl termination codon (Brierley et al., 1987) . The role of this translational mechanism in IBV replication is so far unknown, but since the Fl and F2 ORF are assumed to encode, at least in part, components of the virus-specific RNA-dependent polymerase, and since the predicted amino acid sequence of F2 contains regions in common with other positive strand virus replicases (Boursnell et al., 1987; Hodgman, 1988) , it may be that the overall strategy is similar to that understood for retroviruses. There is considerable interest in the precise mechanism by which ribosomal frameshifting operates, and work from several groups has shown that the specificity for the event resides in the nucleotide sequence of the RNA around the site at which frameshifting occurs, since it has proved possible to induce frameshifting in a heterologous context by inserting cloned DNA corresponding to the frameshift site into unrelated genes (Brierley et al., 1987; Jacks et al., 1988b) . For RSV and HIV, the site at which frameshifting occurs has been identified by a combination of site-directed mutagenesis and amino acid sequence analysis (Jacks et al., 1988a (Jacks et al., , 1988b . These authors, from a comparative analysis of a large number of retroviruses thought to utilize frameshifting as a means of controlling gene expression, have suggested that certain heptanucleotide RNA sequences, initiating with two homopolymeric triplets, can allow tRNA slippage during translation, leading to a (-1) frameshift. In addition to these "slippery" sequences, potential RNA stem-loop structures located downstream of most retroviral shift sites have been proposed to make contributions to the frameshifting process (Rice et al., 1985; Moore et al., 1987; Jacks et al., 1987 Jacks et al., ,1988a , and indeed, the presence of a stem-loop downstream of the RSV frameshift site has been shown to be essential (Jacks et al., 1988b) . How the stem-loop influences frameshifting is not known, but it has been suggested (Jacks et al., 1988b ) that ribosomes may slow or stall at the stem-loop, increasing the likelihood of a tRNA slippage event. In this paper we describe analysis of the RNA sequences responsible for frameshifting in IBV, and define the minimal amount of information from the Fl/F2 overlap that is sufficient to induce frameshifting at high efficiency. The upstream limit of this essential region has the characteristics of a "slippery" sequence as defined previously (Jacks et al., 1988b) , and mutational analysis of this sequence is entirely consistent with frameshifting occurring at this site. The remainder of the essential region consists of about 80 nucleotides immediately downstream of the retk. lysate "slippery" sequence, and we present evidence that efficient frameshifting depends on the formation, by these sequences, of a tertiary RNA structure in the form of a "pseudoknot" (Studnicka et al., 1978; Pleij et al., 1985) . In Vitro and In Vivo A ribosomal frameshifting mechanism for the expression of the IBV F2 ORF was first suggested by Boursnell and colleagues (1987) . As no products encoded by Fl and F2 have been detected in IBV-infected cells, we tested this hypothesis by in vitro transcription and translation of a cDNA containing the Fl/F2 overlap region cloned within a marker gene (PB2 of influenza virus A/PR8/34) under the control of the bacteriophage SP6 promoter (Brierley et al., 1987) . Translation of synthetic mRNA derived from this plasmid (pFS1) in vitro yielded products whose size and pattern of immunoprecipitation (with specific antisera raised against defined portions of the marker protein) were entirely consistent with an efficient (-1) ribosomal frameshifting event at the end of the Fl ORF, resulting in the production of an Fl-F2 frans-frame protein (Brierley et al., 1987) . These earlier studies, however, did not test if the In Vrtro and In Vivo (A) Dragram of plasmtd pFS7 showing the predicted sizes of protein products that would be expected following ribosomal frameshrftrng within the FllF2 overlap durmg translation of a mRNA derived from Smal-digested pFS7 template. (8) Left-hand panel: reticulocyte lysate and Xenopus oocyte translatron products synthesized in response to RNA transcribed from Smal-digested pFS7 RNA was translated and products were labeled with [%]methionine as described in Experimental Procedures. Polypepttdes were separated on a 10% SDS-polyacrylamide gel and detected by autoradrography. Right-hand panel. reticulocyte lysate translation products directed by pFS7-derived mRNA were analyzed directly (total) or immunoprecipitated wtth antiserum raised agatnst the N-or C-terminus of the PBl protein, or wrth a control antiserum (a polyclonal antiserum to the influenza PA protein) as indicated. Polypeptides were labeled with [%]methionine, separated on 10% SDS-polyacrylamide gels, and detected by autoradiography. frameshift signal was functional in vivo. To address this question, we wished to test the capacity of our synthetic mRNA to signal frameshifting in Xenopus oocytes after microinjection. However, since the influenza Pf32 protein is relatively unstable in Xenopus oocytes, we constructed an alternative recombinant plasmid, pFS7, in which the IBV Fl/F2 overlap was cloned within the PBl gene of influenza virus, which in turn was flanked by the 5' and 3' noncoding regions of the Xenopus @globin gene, downstream of an SP6 promoter (Krieg and Melton, 1984) (Figure 1A) . Linearization of this plasmid with Smal, followed by transcription, resulted in the production of a capped and polyadenylated 2.8 kb mRNA, designed such that on translation, ribosomes terminating at the Fl ORF stop codon would produce a 45 kd product (the "stopped" product) and those that frameshifted, a 95 kd product (see Figure 1A) . Since the region of IBV cDNA included in pFS7 was considerably shorter than that in pFS1 (230 bp, as opposed to 497 bp,), we first tested that the pFS7-derived mRNA contained a functional frameshift signal by translating it in the reticulocyte lysate system. As can be seen in Figure lB , frameshifting occurred at wild-type (25%) efficiency. The predicted 45 kd "stopped" product migrates as a somewhat larger species (about 50 kd) on SDS-poly-nGc ,, pFS7.14 c pFS7.17 The diagram shows part of the sequence of the RNA around the frameshift site, including the suspected 'slippery" sequence UUU-AAAC (boxed) and the predicted stem-loop structure downstream. Shaded areas represent regions of the influenza PBl gene, and unshaded areas, IBV-derived sequences, except for a linker sequence (boxed) that was derived from phage ml3 during the cloning procedure (see Experimental Procedures). (B) Reticulocyte lysate translation products synthesized in response to mRNAs derived from Smaldigested pFS7 or mutant templates. Polypeptides were labeled with [%]methionine, separated on a 10% SDS-polyacrylamide gel, and detected by autoradiography. acrylamide gels, probably a consequence of the highly basic nature of the Pl31 protein. As expected, antiserum asl, raised against the N-terminus of PBl, immunoprecipitated both the 45 kd and 95 kd products, indicating common N-termini, and the C-terminal antiserum, as4, only the 95 kd product. When the same mRNA was microinjetted into Xenopus oocytes, frameshifting was also observed, with the stopped and frameshifted products clearly visible against the background of labeled oocyte proteins (see Figure 1B ). The efficiency of frameshifting in vivo was similar to that seen in the reticulocyte lysate. Signal Retroviral frameshifting appears to involve at least two components, a "slippery" sequence, which is the site of tRNA slippage, positioned upstream of a potential stemloop structure in the mRNA (Jacks et al., 1988b) . Analysis of the IBV FllF2 overlap reveals a similar sequence motif, UUUAAAC, some 6 bp upstream of a potential stem-loop (see Figure 2 and Brierley et al., 1987) . Based on the original hypothesis of Jacks and Varmus (1985) on the mechanism of frameshifting in RSV, we suspected that the site of frameshifting in IBV would be the UUUA sequence, with a tRNA slipping from the UUA codon back to the UUU codon in the -1 frame during the frameshifting process. This possibility was tested by site-directed mutagenesis using pFS7, which contains the bacteriophage fl intergenie region in a nonessential region of the vector (see Experimental Procedures), and which can thus be converted to single-stranded DNA (Dotto et al., 1981) . We first created a deletion from within the upstream PBl sequences up to, but not including, the UUUAAAC motif of the IBV frameshift region (pFS7.3) and another from the same point to just beyond the UUUAAAC motif (pFS7.4) ( Figure 2 ). In each case the deletions were designed to preserve the upstream and downstream open reading frames present in the original plasmid. Synthetic mRNA was transcribed from these plasmids and tested for frameshifting by in vitro translation ( Figure 2 ). The results confirmed the suspicion that the UUUAAAC motif was important, since frameshifting occurred with wild-type efficiency in pFS7.3 but was totally abolished with pFS7.4. To investigate in more detail the role of this sequence, we made single base substitutions at certain positions within this region. In the first mutagenic construct, pFS7.1, UUUA was changed to CUUA, on the basis that a UUA to CUU tRNALeU slippage event ought to be unfavorable. As can be seen in Figure 2 , with this mutant, frameshifting in the reticulocyte lysate system was reduced approximately lo-fold, supporting the idea that a tRNA slips at this site. However, as frameshifting was not completely abolished, we tested whether an additional site of slippage was involved, at the adjacent AAC codon. To avoid perturbing the UUUA slip site, we altered the second nucleotide of the AAC codon to either U (pFS7.14) or C (pFS7.17) and tested for frameshifting ( Figure 2 ). Frameshifting was not observed with pFS7.14 (UUU-AUAC), and was greatly reduced (to about 0.5%) with pFS7.17 (UUUACAC). The great reduction in frameshifting efficiency arising from these changes suggests that the tRNAs specified by the mutant sequences, i.e., tRNATyr (for pFS7.14) and tRNAHiS (for pFs7.17) are unable to shift into the -1 frame. This is not unreasonable, since such a shift would require the tRNATyr anticodon, AUG, to base pair with AUA in the mutant mRNA, and the tRNAHIS anticodon, GUG, to pair with ACA (the anticodon sequences shown are based on standard Watson-Crick base pairing). In the pFS7.1 mutant, where low-level frameshifting was seen, the tRNALeU anticodon, AAU, could form one base pair with the mutant codon CUU; this may be sufficient to allow a low efficiency slip. Recently, Jacks and colleagues (1988b) have proposed that in RSV, a double tRNA-slippage event occurs at the frameshift site, in which two adjacent ribosome-bound tRNAs slip into the -1 frame at the sequence AAAUUUA. Our results with IBV are consistent with this simultaneous slippage model, with the adjacent tRNALeU and tRNAAS" slipping back by one nucleotide in the 5' direction at the UUUAAAC site in I BV. Requirement for Downstream Sequences We next created a series of deletions downstream of the "slippery" sequence to define the 3'limit of the frameshifting signal (Figure 3 ). Once again deletions were made by oligonucleotide-directed mutagenesis and were designed to preserve the original open reading frame. We expected on the basis of previous work that this analysis would reveal a requirement for the nucleotides predicted to form a stem-loop structure just downstream of the slippery sequence, and indeed, pFS7.6, in which one side of the predicted stem-loop was removed, displayed virtually no frameshifting. This reduction was not simply due to the removal of the Fl stop codon in pFS7.6, since alteration of the stop codon from UGA to UGU by mutagenesis in a separate construct, pFS7.2, resulted in a mRNA with slightly increased frameshifting efficiency (Figure 3) . In this case a new termination codon is used which is located some 36 nucleotides downstream. Thus it seems that precise positioning of a terminator is not a prerequisite for frameshifting, in agreement with the situation for RSV (Jacks et al., 1988b) . Surprisingly, pFS7.5, which pre- (pFS7M) and in six mutant constructs prepared by site-directed mutagenesis. The mutations in constructs pFS7.18, pFS7.19, and pFS7.20 were in the first stem (PKlIPK3) and in constructs pFs7.12. pFS7.15. and pFS7.16, the putative second stem (PK2/PK4) was altered. (6) Reticulocyte lysate translation products synthesized in response to mRNAs derived from Smal-digested pFS7 or mutant templates. Polypeptides were labeled and analyzed as described in the legend to Figure 2. serves the potential for stem-loop formation, also showed crete base pairing interactions within the mRNA (as greatly reduced frameshifting, although in this case a reshown in Figure 4 ) one which would form the original sidual amount, about 20/o-3%, was still observed. In light stem-loop (between PKl and PK3), and a second beof this observation, we considered the possibility that an tween the loop of this structure (PK2) and the AGCCUadditional interaction may occur between the stem-loop containing region downstream (PK4). As a first step, we structure and nucleotides downstream to create a tertiary sought direct evidence for the PKl-PK3 interaction, to structure in the mRNA. Structures of this kind have been confirm the existence of the first stem-loop, by creating proposed to occur in a number of positive-stranded viral complementary mutagenic changes within the PKl and RNAs (see Pleij et al., 1985; Clarke et al., 1987) and have PK3 regions (see Figure 4 ). In pFS7.18, we sought to been termed "pseudoknots." The formation of a pseu-destabilize the stem by replacing six nucleotides of the doknot necessitates the base pairing of nucleotides in a PKl sequence at the base of the stem (GGGGUA) with hairpin loop with nucleotides downstream, generating an their complementary nucleotides. In pFS7.19, the six extended quasi-continuous double helix. Close examina-nucleotides of the PK3 sequence at the base of the stem tion of the IBV Fl/F2 mRNA sequence revealed a stretch (CCCCAU) were replaced by complementary nucleotides. of 7 nucleotides (underlined) about 30 nucleotides down-Finally, in pFS7.20, both mutagenic changes were made, stream of the predicted stem-loop which could potentially to create a double mutant in which the changes whould base pair with a stretch of nucleotides in the single-strand recreate base pairing and restabilize the stem. In both the region of the hairpin loop. As can be seen in Figure 3 , single mutants (pFs7.18 and 7.19) frameshifting was abolframeshifting on pFS7.10-derived mRNA, which retains ished, yet the frameshifting efficiency returned to near this sequence, occurred with wild-type efficiency. How-wild-type levels in the double mutant, pseudo-wild-type ever, when most of this sequence was deleted, in con-construct, pFS7.20 (Figure 4) . This result provides strong struct pFS7.11, the frameshifting efficiency was dramati-support for the PKl-PK3 interaction, and demonstrates cally reduced, which indicates that the nucleotides from that the stem-loop plays an important role in the framethis region are crucial for efficient frameshifting. shifting process. The Frameshifting Signal Contains a Tertiary RNA Structure The formation of a pseudoknotted RNA structure downstream of the UUUAAAC sequence would require two dis-We then went on to investigate the proposed PK2-PK4 interaction in a similar manner. In this case, we mutated the relevant regions by reversing a block of five nucleotides within each region (see Figure 4) . Thus in pFS7.15, the PK2 sequence was changed from AGGCU to UCGGA, In this structure, S2 is stacked upon Sl such that a right-handed, quasicontinuous double-helix of 16 bp is formed. An artist's impression of the three-dimensional organization of this structure is shown (C), assuming that 9 bp form in Sl, 7 bp form in S2, and one turn of the helix contains 11 bp (Arnott et al., 1972) . In the resulting pseudoknot. Li (2 nucleotides in length) crosses the deep groove and L2 (32 nucleotides rn length) the shallow groove. The "fold" program of Jacobson et al. (1984) did not predict any significant RNA secondary structures within L2. These diagrams are based on those presented by Rietveld et al. (1983) and Pleij et al. (1985) . and in pFS7.12, the PK4 sequence was changed from AGCCU to UCCGA. In both mutated constructs, where the potential for base pairing is dramatically reduced, the frameshifting efficiency was lowered some lo-fold to about 2%. However, as was the case with the PKl-PK3 interaction, the creation of the double mutant, pFS7.16, in which the changes are complementary, restored the efficiency of frameshifting to approximately wild-type level. In these experiments, it was consistently observed that the "stopped" product from pFS7.15 and pFS7.16 had a slightly lower gel mobility than that produced by the wild-type and pFS7.12 constructs. This altered gel mobility is most likely due to the two amino acid substitutions (serine and aspartic acid replacing arginine and leucine) arising from mutation of the PK2 sequence in these plasmids. These experiments provide strong evidence that a tertiary RNA structure, a pseudoknot, forms at the frameshift site. A model for such a structure is shown in Figure 5 . The Slippery Sequence and Pseudoknot Are Precisely Positioned Jacks and colleagues (1988b) have proposed that RNA secondary structure downstream of the "slippery" site may slow or stall ribosomes during translation, such that the ribosomal peptidyl (P) and acceptor (A) sites remain in the vicinity of the slippery sequence slightly longer, promoting the slippage event. A prediction of such a model is that the correct spacing distance must be maintained between the slippery sequence and the RNA structure concerned. We tested this by constructing two mutants in which the spacing was increased (pFS7.13) or decreased (pFS7.22) by three nucleotides (Figure 6 ). In pFS7.13, in which a tyrosine codon (UAC) was inserted between the slippery sequence and pseudoknot, frameshifting was abolished. In pFS7.22, deletion of the glycine codon (GGG) greatly reduced frameshifting (to about 2%). Thus alteration of the relative position of the UUUAAAC sequence by as little as three nucleotides with respect to the pseudoknot causes a dramatic reduction in frameshifting efficiency. The genomic RNA of the coronavirus IBV contains a sequence at the junction of the Fl and F2 ORF that can direct ribosomal frameshifting both in vitro and in vivo in Xenopus oocytes. We have defined, by deletion analysis, a stretch of approximately 86 nucleotides within this region that contains all the necessary information for high efficiency frameshifting in a heterologous context. This minimal frameshift signal includes, at its 5'end, the heptamerit sequence UUUAAAC, which appears to be analagous to the "slippery sequences" described by Jacks et al. (1988a Jacks et al. ( , 1988b required for frameshifting in retroviral systems. In addition, frameshifting is dependent upon the formation of a tertiary mRNA structure downstream of the slippery sequence. This represents direct evidence that tertiary folding in RNA can influence its translation and could have implications for a much wider range of biological processes involving RNA. The frameshifting system thus provides an excellent opportunity to study in more detail this novel kind of RNA structure. The Site of Frameshifting in Coronavirus IBV Our results suggest strongly that frameshifting occurs at the UUUAAAC sequence. A deletion that removed this region abolished frameshifting, and point mutations within the region were severely inhibitory. This sequence corresponds in organization to the "slippery" sequences present in the gene overlaps of demonstrated and suspected retroviral frameshift sites (Jacks and Varmus, 1988b) . In the three retroviral overlaps where the site of slippage has been established by protein sequencing, the shift occurs r$onavirus Ribosomal Frameshifting Signal at the downstream codon in the slippery sequence, i.e., at the leucine codon in RSV (AAAUUUA) and HIV-1 (UUUUUUA), and at the asparagine codon in MMTV gag/pro (AAAAAAC] (Hizi et al., 1987) . It seems likely therefore that the IBV shift occurs at the asparagine codon (UUUAAAC), although confirmation will require protein sequencing at the junction of the frameshifted product. Jacks et al. (1988b) have proposed a model for ribosomal frameshifting in RSV which involves simultaneous slippage on the mRNA of tRNAs occupying both the P and A sites of the ribosome. They postulate that such slippage would be possible only if base pairing could subsequently be maintained between the first two positions of each anticodon loop and the new mRNA codon. Our data for IBV frameshifting are consistent with such a model. Following slippage, base pairing could be maintained between the first two nucleotides of each anticodon loop and the mRNA (with the post-slippage anticodons AAU and UUG pairing with the mRNA codons UUU and AAA, respectively, after slippage). If this model is correct, it might be expected that reducing the capacity to form base pairs in the "slipped" position would affect frameshifting dramatically, and this indeed was observed. Two single site mutations (UUACAC, pFS7.14 and UUUAUAC, pFS7.17), which would prevent tRNA-mRNA pairing in the A site following slippage, greatly reduced or abolished frameshifting (05% or less). In mutant pFS7.1 (CUUAAAC), in which complementarity in the "slipped" position would be reduced from 2 to 1 bp in the P site, frameshifting was again markedly reduced, although in this case, some activity was retained (efficiency of 2%). This difference in frameshifting efficiency between the A and P site mutants may reflect the number of base pairs that can form following slippage at each site. Alternatively, the tRNA reading the A site may play a more important role in frameshifting than that occupying the P site, as suggested by Jacks and colleagues (1988b) for RSV and other retroviral frameshift sites. Evidence for Tertiary RNA Folding in the IBV Frameshift Signal Our deletion analysis of the frameshift signal of IBV indicates the requirement not only for the UUUAAAC "slippery sequence" at the 5' end, but also of a stretch of nucleotides downstream. We previously suggested that a stem-loop structure predicted from the primary sequence of this downstream region might be an important element of the frameshift signal (Brierley et al., 1987) and the results reported here are consistent with this. However, our experiments indicate that this stem-loop forms part of a more complex tertiary structure, since base pairing was apparent between nucleotides of the predicted loop and a region downstream. Until direct structural information is obtained, the precise three-dimensional organization of the RNA remains uncertain. However, we believe that the RNA folds to form a structure similar in principle to the pseudoknots predicted to occur in a number of plant viral RNAs (see Pleij et al., 1985) foot and mouth disease virus (FMDV) RNA (Clarke et al., 1987) E. coli 5s RNA (Goringer and Wagner, 1986 ) E. coli 16s rRNA (Maly and Brimacombe, 1983; Pleij et al., 1985) and in the E. coli a mRNA leader sequence (Deckman and Draper, 1987) . A similar kind of structure has also been described for a short synthetic RNA (Puglisi et al., 1988) . The construction of the proposed IBV pseudoknot is outlined in Figure 5 . A stretch of nucleotides in the loop of the hairpin and bordering the stem region form normal Watson-Crick base pairs with nucleotides of the complementary sequence located downstream in the RNA. Two separate double-helical regions form two stem structures (marked Sl and S2), which stack coaxially to generate an elongated double-stranded helix. It has been assumed (Pleij et al., 1985) that in interactions of this kind, the stems are stacked on top of each other such that a quasi-continuous, right-handed double helix is formed, comparable to A-RNA. Such an assumption is valid if the single stranded connecting loops Ll and L2 pose no sterical constraints upon this structure (for a detailed discussion, see Pleij et al., 1985) . The end product of the folding is a pseudoknot (Studnicka et al., 1978) in which the connecting loops Ll and L2 span the deep and shallow grooves of the doublehelix, respectively. The IBV stem-loop predicted by the computer-based RNA folding program of Jacobson et al. (1984) (Figure 3 ) has 11 bp in the stem and 5 unpaired nucleotides in the loop. As it stands, it seems unlikely that this precise structure could form a pseudoknot, since all the loop nucleotides are predicted to base pair with elements downstream, and no unpaired nucleotides remain to cross the deep groove of the double-stranded helix. To account for the results of our mutagenic analysis of the region, we suggest that an alternative stem-loop forms (shown in Figure 4 ) which, because a number of previously paired stem nucleotides now form part of the single-stranded loop, is capable of the tertiary interaction. In this model, the weak UG pair and the GC pair at the top of the stem are included in the loop (as is the previously unpaired G). This would potentially allow seven of the nine loop nucleotides to base pair with the downstream region to form S2, leaving two unpaired loop nucleotides to span the deep groove in the pseudoknot. The bridging of 7 bp in the deep groove by two loop nucleotides is sterically feasible (see Pleij, 1985) . An artist's impression of the three-dimensional folding of the proposed IBV pseudoknot is shown in Figure 5 . A quasi-continuous double helix of 16 bp In length is formed. As is the case in a number of the plant viral pseudoknots, loop 2 is considerably longer than loop 1. As only 7 bp can form between the loop and the downstream sequence, it is unlikely that a real knot could form in the RNA, since a full turn of an A-RNA duplex contains 11 bp (Arnott et al., 1972) . Our results therefore suggest strongly that an RNA pseudoknot is an essential element of the IBV frameshifting signal, although its precise organization remains to be elucidated. However through direct structural analysis of artificially synthesised wild-type and mutated mRNAs in combination with functional analysis by in vitro translation, it should be possible to investigate in considerable detail this novel RNA building principle. The Role of Tertiary RNA Structure in Ribosomal Frameshifting How does the presence of a pseudoknot in IBV mRNA lead to efficient frameshifting? The answer to this question is not yet clear, but an obvious possibility is that the elongating ribosome may be required to unwind the pseudoknot, and this could slow or stall its passage along the mRNA sufficiently to allow tRNA slippage at the "slippery" site. In this case it might be expected that mutations which destabilize, but do not abolish entirely, secondary and tertiary structure in the downstream region would be partially inhibitory to frameshifting. Indeed, in the case of mutants where the first predicted stem (Sl) was allowed to form but not the second (S2), that is pFS7.5, pFS7.11, pFS7.12, and pFS7.15, frameshifting was detected, although at a much lower level. This raises the possibility, currently being tested, that intermediate levels of frameshifting would be observed if one or other of the two stems were only partially destabilized. Whatever model is advanced to explain how a pseudoknot promotes frameshifting must account for the precise spacing required between the knot and the "slippery" sequence, since deletion or insertion of as little as three bases was sufficient to reduce frameshifting dramatically. A possible explanation for this is that in these mutants, the ribosomal P and A sites are positioned on sequences that are refractory to slippage during a ribosomal pause. A second important question arising out of this work is whether tertiary structures of this kind are a more general feature of (-1) frameshift sites. A large number of retroviruses, related retrotransposons (Jacks et al., 1988b) and a luteovirus (Miller et al., 1988) are known or suspected to utilize (-1) frameshifting. We have examined the known or suspected sites of such frameshifting (those listed and referenced in Jacks et al., 19886) by predicting, with the aid of a computer program, the secondary structure of the RNA at the appropriate regions and then looking for potential tertiary base pairing interactions in the downstream sequences. We found that the "fold" program of Jacobson and colleagues (1984) predicted hairpin loops immediately downstream of the slippery sequences at all the sites except in the retrotransposon 17.6. We scanned the region downstream of the predicted stemloops for sequences that were complementary to the loop nucleotides, setting 50 nucleotides as the maximum distance scanned and 5 nucleotides as the minimum size of the complementary region. The result obtained was rather striking: 14 out of the 22 sequences examined appear to contain the potential for pseudoknot formation. The number of complementary nucleotides was between 5 and 8, and these were located between 3 and 32 nucleotides from the base of the upstream stem-loop. Shown in Figure 7 are the potential interactions at the RSV gaglpol, MMTV gag/pro and prolpol overlaps and some examples from cases where frameshifting is suspected to occur, with the complementary loop and downstream nucleotides highlighted. In those systems where frameshifting has been confirmed (RSV, MMTV, IBV, and HIV-l), only HIV-1 did not contain a suitable downstream sequence. Madhani et al. (1988) and Wilson et al. (1988) , however, have observed recently that frameshifting in HIV-1 appears to occur in the absence of any recognizable RNA structures at the frameshift site, so this may not be unexpected. In addition to HIV-l, seven of the suspected sites did not show potential for the formation of tertiary RNA structure within the limits of this simple examination. Nevertheless, the possibility remains that tertiary RNA interactions may be a common feature of (-1) frameshift sites. In the case of RSV, it has been shown (Jacks et al., 1988b ) that a deletion which left the stem-loop intact, but did not include downstream RSV sequences, had reduced frameshifting efficiency. However, when an additional 22 downstream nucleotides were included, frameshifting was restored to approximately wild-type efficiency. As can be seen in Figure 7 , the additional sequences include an eight nucleotide stretch (UGUAGCGC) with full complementarity to a region in the loop of the RSV hairpin, The figure shows the RNA secondary structures predicted by the "fold" program (Jacobson et al.. 1984 ) at a number of retroviral gene overlaps where frameshifting is known (RSV, MMTV) or suspected to occur (Visna. HTLVl, BLV). Nucleotides (ntds) that could participate in base pairing interactions between the loop and downstream regions are highlighted. The known or suspected "slippery" sites are boxed. The nucleotide sequences were taken from the references cited in Jacks et al. (1988b) which might therefore interact through base pairing. Introduction of single and compensatory base changes in this region, through site-directed mutagenesis, should resolve this question. Construction of Plasmid pFS7 Plasmid pFS1 (Brierley et al., 1987) was digested with Ddel and a 230 bp fragment containing the FllF2 overlap region isolated (IBV sequence information from position 12,286 to 12,509 bp; Boursnell et al., 1987) . Following end-filling with E. coli DNA polymerase I Klenow enzyme, the fragment was inserted into Sspl partially digested plasmid pSTl+ (Digard et al., submitted) to produce pFS6. pSTl+ comprises the influenza virus A/PR8/34 PBl gene (Young et al., 1983) inserted into the Bglll site of plasmid pSP64-T ( Krieg and Melton, 1984) . Thus pFS6 consists of pSP64-T/PBl with the IBV frameshift region inserted at position 1162 in the PBl gene, such that both the Fl and F2 ORF were in frame with the flanking PBl sequences, with Fl in frame 1 and F2 in frame 3. Plasmid pFS7 (see Figure 1 ) was constructed by inserting a 519 bp Rsal fragment from pEMBL8 (Dente et al., 1983) into the unique Pvull site of pFS6. This fragment included the intergenic region of filamentous bacteriophage 11 (Dotto et al., 1981) enabling singlestranded pFS7 DNA to be generated following infection of plasmid-carrying bacteria with helper bacteriophage R408 (Russel et al., 1986) . The orientation of this fragment was such that the noncoding strand of pFS7 was packaged into R408 phage particles. The influenza-IBVinfluenza junctions were confirmed by dideoxy nucleotide sequencing (Sanger et al., 1977) of single-stranded pFS7. Both pFS6 and pFS7 were designed such that ribosomes could translate the length of the PBl mRNA following a frameshift event within the inserted IBV region (see Figure 1 ). Plasmids were maintained in E. coli JMlOl (Yanisch-Perron et al., 1985) . Mutagenesis of pFS7 A procedure based on the method of Kunkel (1985) was employed to prepare site-specific mutations within the IBV frameshift region. Plasmid pFS7 (or mutant derivatives) was transformed into E. coli strain RZ1032 (dut-ung-) (Kunkel, 1985) and single-stranded, uracilcontaining, pFS7 DNA isolated following superinfection with bacteriophage R408. Phosphorylated mutagenic oligonucleotides (5 pmol) were annealed to this template (0.1-0.5 ng) in a 10 WI reaction mixture containing 10 mM Tris-HCI (pH 8) 10 mM MgCIz, 50 mM NaCI, and 1 mM dithiothreitol (DTT) at 55% for 5 min. After cooling at room temperature (10 min), the primer was extended by addition of dATR dCTP dGTR and dTTP (0.5 mM each); ATP (0.5 mM), E. coli DNA polymerase I Klenow enzyme (6 U, Boehringer Mannheim) and T4 DNA ligase (2.5 U, Bethesda Research Laboratories) to a final reaction volume of 20 nl after readjusting the concentration of Tris-HCI, MgClz, and DTT to 40 mM, 7.5 mM, and 3 mM, respectively. The extension-ligation reac-bon was for 2 hr at room temperature, and then for an addrtronal 2 hr after supplementing the reactron with addihonal Klenow (6 U) and Iigase (2.5 U). Portrons of the reaction were transformed Into E. coli JMlOl. and srngle-stranded templates prepared from transformants by R408 superinfectron Mutants were Identified by drdeoxy sequencing. The efficiency of mutagenesrs was between 20% and 70%. Preparation of DNA Template, Transcription and Translation of mRNA Plasmids for transcription were prepared by the alkalrne lysrs mrnipreparation method (Brrnboim and Doly, 1979) lrnearized by drgestron with Smal, and transcribed as described previously (Brierley et al., 1987) . Messenger RNAs were translated in rabbit reticulocyte lysates as described before (Bnerley et al., 1987) . Transcripts were translated in Xenopus oocytes as described by Colman (1984) . lndrvidual oocytes were micrornjected with approximately 50 ng mRNA, and labeled with [35S]methionine (10 uCi) from 2-6 hr following injection. Translabon products were analyzed on 10% SDS-polyacrylamide gels accordrng to standard procedures (Hames. 1981) . The relative abundance of nonframeshifted or frameshifted products on the gels was estimated by either scanmng densitometry or by measuring the radioactive content of excised gel slices. In each case, a correction was made to account for the drfferential methionine content of the products. Antisera against Defined Regions of the Influenza Virus PBl Protein Rabbit anhsera agatnst defined regions of the PBl protem were prepared by rmmunization with PBl fragments expressed as fusion proteins with f3-galactosidase (Stanley and Luzio, 1984) . Antrsera 1 (asl) and 4 (as4) covered the regions encoded by nucleotides 150-1110 and 1464-2256, respectively. An antiserum raised against a regron of the influenza A/PR8/34 PA protein by a simrlar procedure was used as a control. Radioimmune precipitation of reticulocyte translation products was performed as described (Brierley et al., 1987) . Optimised parameters for RNA double-helices A rapid alkaline extraction procedure for screening recombinant plasmid DNA Nucl Completion of the sequence of the genome of the coronavirus avian Infectious bronchitis virus An efficient ribosomal frameshifting signal in the polymerase-encoding region of the coronavirus IBV Nucleobde sequence of a yeast Ty element: evidence for an unusual mechanism of gene expression Efficient translational frameshifting occurs within a conserved sequence of the overlap between the two genes of a yeast Tyl transposon Potential secondary and tertiary structure rn the genomrc RNA of foot and mouth disease virus Translatron of eukaryotrc messenger RNA rn Xenopus oocytes Translational frameshiftrng where will it stop? S4-a mRNA translation regulatron complex. II. Secondary structures of the RNA regulatory site in the presence and absence of 54 pEMBL. a new famrly of single-stranded plasmids Functronal analysis of bacteriophage fl intergenrc region Does 5s RNA from E. coli have a pseudoknotted structure? An introduction to polyacrylamrde gel electrophoresis Characterization of mouse mammary tumor virus gag-pal gene products and the rrbosomal frameshift by protein sequencrng A new superfamily of repkcative proteins Expressron of the Rous sarcoma vrrus pal gene by ribosomal frameshifting Two effrcient rrbosomal frameshifting events are required for synthesis of mouse mammary tumor vrrus gag-related polyproteins Characterization of ribosomal frameshifting in HIV-l gag-pal expression Signals for ribosomal frameshiftmg in the Rous sarcoma virus gag-pal region Some sample computational methods to improve the foldmg of large RNAs Nucl Functional messenger RNAs are produced by SP6 in vitro transcrrption of cloned cDNAs Rapid and efficient site-specrfrc mutagenesis wrthout phenotyprc selection Signals for the expression of the HIV pal gene by ribosomal frameshifting Refined secondary structure models for the 16s and 23s rrbosomal RNA of Escherichia coli The Drosophila melanogaster gypsy transposable element encodes putative gene products homologous to retroviral proteins A retrovrrus-lrke strategy for expression of a fusion protein encoded by yeast transposon Tyl Sequence and orgamzatton of barley yellow dwarf vrrus genomic RNA Complete nucleobde sequence of a milk-transmitted mouse mammary tumor virus: two frameshift suppression events required for translation of gag and pal A new principle of RNA folding based on pseudoknotting A pseudoknotted RNA oligonucleotide The gag and pal genes of bovine leukemia virus: nucleotide sequence and analysis Three-dimensional models of the tRNA-like 3'termini of some plant viral RNAs Frameshift suppression An improved filamentous helper phage for generating single-stranded plasmid DNA Identification of the coding sequence for a reverse transcriptase-like enzyme in a transposable genetic element in Drosophila melanogaster DNA sequencing with chain-terminating inhibitors Presence of infectious polyadenylated RNA in the coronavirus avian infectious bronchitis virus The biology of coronaviruses Construction of a new family of high efficiency bacterial expression vectors: identification of cDNA clones coding for human liver proteins Coronavirus multiplication strategy. I. Identification and characterization of virus-specified RNA Computer method for predicting the secondary structure of single-stranded RNA Expression strategies of the yeast retrotransposon Ty: a short sequence directs ribosomal frameshifting HIV expression strategies: ribosomal frameshifting is directed by a short sequence in both mammalian and yeast systems Improved Ml3 phage cloning vectors and host strains: nucleotide sequences of the Ml3 mpl8 and pUC19 vectors Cloning and expression of influenza virus genes We are grateful to Bhavna Bilimoria and Nicola Rolley for expert assistance, to Mike Boursnell for useful discussion, and to Tom Atkinson and Professor Michael Smith for the gift of oligonucleotides. This work was supported by a grant from the Agricultural Research Council (LRG 45).The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" rn accordance with 18 U.S.C. Section 1734 solely to indicate this fact.