key: cord-0001381-v6lycc7k
authors: Yu, Chien-Hung; Teulade-Fichou, Marie-Paule; Olsthoorn, René C. L.
title: Stimulation of ribosomal frameshifting by RNA G-quadruplex structures
date: 2013-10-30
journal: Nucleic Acids Res
DOI: 10.1093/nar/gkt1022
sha: 66daba9fe4360f00714bd778402b6ac3489b2ec6
doc_id: 1381
cord_uid: v6lycc7k

Guanine-rich sequences can fold into four-stranded structures of stacked guanine-tetrads, so-called G-quadruplexes (G4). These unique motifs have been extensively studied at the DNA level; however, exploration of the biological roles of G4s at the RNA level is just emerging. Here we show that G4 RNAs, when introduced within coding regions, are capable of stimulating −1 ribosomal frameshifting (−1 FS) in vitro and in cultured cells. Systematic manipulation of the loop length between each G-tract revealed that the −1 FS efficiency positively correlates with G4 stability. Addition of a G4-stabilizing ligand, PhenDC3, resulted in higher −1 FS. Further, we demonstrated that the G4s can stimulate +1 FS and stop codon readthrough as well. These results suggest a potentially novel translational gene regulation mechanism mediated by G4 RNA.

G-quadruplex (G4) structures formed by guanine (G)-rich nucleic acid sequences are characterized by their four-stranded G-tracts in combination with multiple stacked G-quartets. As opposed to typical Watson-Crick base pair forming duplexes, G-quartets are constituted by noncanonical Hoogsteen hydrogen bonds between these G bases. Although both RNA and DNA can adopt G4s, structural analysis demonstrates that RNA G4s fold into parallel-stranded conformations independent of nucleotide sequences, the species of cations and the concentration of RNA molecules (1). DNA G4s, in contrast, show structural polymorphism according to various factors (2, 3). These polymorphic structures have been shown to correlate with biological functions (4, 5).
As opposed to extensive studies on DNA G4s, our knowledge of RNA G4s, especially their biological consequences, remains limited. Recent advances have demonstrated that RNA G4s are key players in various cellular functions, including telomere homeostasis, pre-mRNA processing (splicing and polyadenylation), mRNA targeting, RNA turnover and translation (6). Among these functional roles, translational control by RNA G4s located within 5′-untranslated regions (5′-UTRs) is the best studied. Several mechanisms related to translation initiation have been proposed to explain the roles of G4s in 5′-UTRs: (i) interference with cap binding by the eIF4F complex (7); (ii) steric hindrance of start codon recognition (8); (iii) impeding the scanning process of the ribosomal 40S subunit (9-11); (iv) assisting in formation of an internal ribosomal entry site for cap-independent translation initiation (12). Interestingly, a direct correlation between thermodynamic stability of RNA G4s in 5′-UTRs and their ability to repress translation has been shown (13), suggesting that RNA G4s can act as tunable roadblocks to control gene expression by affecting ribosome scanning (14).

−1 ribosomal frameshifting (−1 FS) is a translational recoding mechanism whereby translating ribosomes are forced to move one nucleotide (nt) backward, leading to the decoding of a second open reading frame (ORF) located in the −1 register with respect to the first ORF (15-17). Two elements within the mRNA are required to induce efficient −1 FS: a 7-nt slippery sequence where FS occurs (18), and a stimulatory structure, which can be a pseudoknot, a hairpin or an antisense oligonucleotide-forming duplex (19, 20), located 5-8 nt downstream of the slip site. Several models have been proposed to explain the mechanism of −1 FS (21-23).
One generally accepted feature is that the mechanical stability of the downstream structure is critical to −1 FS, but a simple correlation between stability and frameshifting efficiency is not evident (24-27).

Because stable RNA G4s in 5′-UTRs can impede scanning by the 40S ribosomal subunit, and stable structures are required to stall translating ribosomes to induce −1 FS, we hypothesized that G4 RNAs in the coding region can stall ribosomes, owing to their unusual stability, and thus promote −1 FS. While ribosomal stalling by G4s has recently been demonstrated in a bacterial system (28), we demonstrate here that natural and synthetic G4 RNA motifs are indeed efficient frameshifting signals in a mammalian system.

Frameshift construct and oligonucleotides

−1 FS was monitored using the pSF208 construct described earlier (29). Sets of complementary oligonucleotides (Sigma-Aldrich) were annealed, followed by ligation into SpeI and NcoI digested pSF208. To monitor +1 FS and stop codon readthrough (RT), pSF208 was digested by BglI/NcoI, followed by insertion of annealed synthetic dsDNA fragments. A list of oligonucleotides is available on request. All constructs were verified by automated dideoxy sequencing using chain terminator dyes (LGTC, Leiden).

Plasmid DNA was linearized by BamHI, followed by successive phenol/chloroform extraction and ethanol precipitation. SP6 RNA polymerase-directed transcriptions were carried out according to the manufacturer's protocol (Promega). After transcription, RNA samples were loaded on a 1% agarose gel to determine the quality and quantity. Appropriate dilutions of the transcription mixtures in RNase-free water were directly used for in vitro translation.
The translation mixtures (10 µl) contained 5 nM of mRNA, 4 µl of nuclease-treated rabbit reticulocyte lysate (RRL, Promega), 0.5 µl of 1 mM amino acids mix (Promega) without methionine, 0.25 µl of 35S-methionine (>1000 Ci (37.0 TBq)/mmol, EasyTag, Perkin Elmer), and indicated amounts of PhenDC3 (30) or TMPyP4 (31), and were incubated at 28 °C for 1 h. Translation reactions were terminated by adding an equal volume of 2× Laemmli buffer followed by heating to 80 °C for 5 min. Samples were separated on 13% sodium dodecyl sulphate (SDS)-polyacrylamide gels. Gels were dried and then exposed to phosphoimager screens (Molecular Dynamics). Band intensities of in-frame and recoding products (including −1 FS, +1 FS and RT) were measured by Molecular Imager FX (BioRad) or Typhoon 9400 scanner (GE Healthcare), and were quantified by Quantity One software (BioRad). Frameshifting efficiency was calculated as the amount of recoded product divided by the sum of in-frame and recoded products, corrected for the number of methionines (10 in the 0-frame product and 28 in the recoded products), and multiplied by 100.

Selected G4 constructs were tested in HEK293T cells using the dual luciferase reporter, pDUAL-HIV(0), as described earlier (29). In short, pDUAL-HIV(0) was digested by KpnI/BamHI, followed by insertion of complementary oligonucleotides. HEK293T cells were cultured in Dulbecco's modified Eagle's medium/high glucose/stable glutamine (PAA Laboratories) supplemented with 10% fetal calf serum, 100 U/ml penicillin and 100 µg/ml streptomycin. Cells were kept in a humidified atmosphere containing 5% CO2 at 37 °C on a regular subculturing regime. Cells were transfected with 300 ng of plasmid using 1 µl of Lipofectamine 2000 (Invitrogen) in a 24-well culture plate. Cells were lysed 20-24 h after transfection and luciferase activities were measured by GLOMAX multidetector (Promega) using the Dual-Luciferase Reporter Assay Kit (Promega).
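The in vitro efficiency calculation given above for the radiolabelled translation products (recoded product over total product, corrected for methionine content, times 100) can be sketched as follows. This is a minimal illustration, not the authors' analysis script. It assumes that "corrected for the number of methionines" means dividing each band intensity by the methionine count of its product (10 for the 0-frame product, 28 for the recoded product) before taking the ratio. The function name and the example intensities are hypothetical.

```python
def recoding_efficiency(in_frame_intensity, recoded_intensity,
                        met_in_frame=10, met_recoded=28):
    """Percent recoding from two band intensities (arbitrary units).

    Assumption: each intensity is divided by the number of methionines in
    that product, so that the 35S signal reflects the molar amount.
    """
    if in_frame_intensity < 0 or recoded_intensity < 0:
        raise ValueError("band intensities must be non-negative")
    in_frame = in_frame_intensity / met_in_frame
    recoded = recoded_intensity / met_recoded
    total = in_frame + recoded
    if total == 0:
        raise ValueError("both bands are empty")
    return 100.0 * recoded / total


# Hypothetical intensities, not data from the paper.
example = recoding_efficiency(in_frame_intensity=9300.0, recoded_intensity=1960.0)
print(f"{example:.1f}% recoding")
```

Without the methionine correction the same two bands would give a much higher apparent efficiency, because the longer recoded product carries almost three times as many labelled residues per molecule.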
Frameshifting efficiency was obtained by dividing the ratio of Renilla luciferase (RL) to firefly luciferase (FL) activity of the mutant by the RL/FL ratio of the in-frame control, multiplied by 100.

Endogenous RNA G4s derived from natural 5′-UTRs can induce −1 FS

To investigate whether RNA G4s can induce −1 FS, we first examined, as a proof of principle, two well-defined translation-suppressing RNA G4s located in the 5′-UTRs of NRAS (9) and Trf2 (32). As shown in Figure 1, both RNA G4s, when located 5 nt downstream of an efficient U3A3C slippery sequence, showed significant −1 FS compared with the corresponding negative controls in an in vitro translation assay (2.6-fold and 6-fold for NRAS and Trf2, respectively). These data indicate that the stable RNA G4s, which can interrupt 40S ribosomal subunit scanning, are able to act as roadblocks to stimulate frameshifting as well.

Figure 1. Natural RNA G4s can cause −1 frameshifting (−1 FS). The wild-type (wt) G4 forming sequences located in the 5′-UTR of NRAS (wt NRAS) and Trf2 (wt Trf2) were cloned 5 nt downstream of the UUUAAAC slippery sequence (underlined) in a frameshifting reporter construct (33). Mutants, mut NRAS and mut Trf2, that are unable to form a G4 were constructed by replacing the 5′ proximal G-tract by 3 As. SDS-PAGE analysis was used to resolve the 35 ...

Spacer length effect of G4-induced −1 FS

An optimal spacer length between the slippery sequence and the downstream stimulatory structure is crucial for efficient −1 FS (34, 35). Although generally between 5 and 8 nt, the optimal distance depends on the individual frameshifting signal. To determine the optimal distance for RNA G4s as a novel −1 FS signal, we increased the spacer length stepwise from 3 to 10 nt in the (G3U)4 background (Figure 2A). (G3U)4 has been reported as the most stable G4 structure both at the RNA and DNA level (36, 37) and thus may result in higher FS efficiency.
Almost 2-fold higher FS than with the Trf2 G4 was measured at the same spacer length of 5 nt (Figure 2B, SP5), while a control construct, in which the G4 structure is disrupted by four G-to-A mutations, showed only 0.4% FS (Figure 2B, NC). FS reached an optimum of ~7% at a spacer length of 6-8 nt (Figure 2B, SP6-SP8). The −1 FS efficiency was also tested in cultured mammalian cells using a dual luciferase reporter plasmid (33). The FS efficiency in vivo was related to an in-frame control whose activity was set at 100% (see 'Materials and Methods' section). Similar to the in vitro experiments, optimal FS was observed for the 6-nt spacer, with decreasing FS efficiencies for shorter and longer spacers (Supplementary Figure S1). In general, FS efficiencies measured in vivo are much lower than those obtained in vitro [see e.g. (29)].

−1 FS efficiency is positively correlated with the thermodynamic stability of G4 RNA

A typical feature of G4 RNAs is that the four G3 tracts are separated by three loops. The length of these loops is central to G4 stability and topology (36). To better characterize RNA G4s as frameshifting signals, we systematically investigated the effect of total loop length as well as loop orientation on −1 FS, using representative U-rich sequences in the loop regions. The lengths of the first, second and third loops separating the four G3 tracts are denoted as (x, y, z) (Figure 3). For example, a total loop length of 4 nt can result in three different constructs with loop orientations (1,1,2), (1,2,1) and (2,1,1). These three loop variants induced 3.5, 4.4 and 3.9% of −1 FS, respectively (Figure 3 and Supplementary Figure S2). For the six constructs with a total loop length of 5 nt, the −1 FS efficiency ranged from 2.3 to 3.2% (Figure 3 and Supplementary Figure S2). Constructs with a total loop length of 6 (2,2,2), 7 (2,2,3 and 1,5,1) and 10 (3,4,3) nt displayed a decreasing ability to promote −1 FS.
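The (x, y, z) loop notation can be made concrete with a short sketch that enumerates every ordered way of distributing a given total loop length over the three loops and builds the corresponding G4 sequence. It reproduces the three arrangements listed for a total of 4 nt and the count of six for a total of 5 nt. The function names are hypothetical, and the use of pure U loops is an assumption made for illustration; the text only states that representative U-rich loop sequences were used.

```python
def loop_arrangements(total_loop_length, n_loops=3, min_len=1):
    """All ordered loop-length tuples (x, y, z) summing to total_loop_length."""
    if n_loops == 1:
        return [(total_loop_length,)] if total_loop_length >= min_len else []
    arrangements = []
    # Leave at least min_len nucleotides for each of the remaining loops.
    upper = total_loop_length - min_len * (n_loops - 1)
    for first in range(min_len, upper + 1):
        for rest in loop_arrangements(total_loop_length - first, n_loops - 1, min_len):
            arrangements.append((first,) + rest)
    return arrangements


def g4_sequence(loops, tract="GGG", loop_base="U"):
    """Four G-tracts separated by three homopolymeric loops (assumed all-U)."""
    x, y, z = loops
    return tract + loop_base * x + tract + loop_base * y + tract + loop_base * z + tract


for total in (4, 5):
    for loops in loop_arrangements(total):
        print(total, loops, g4_sequence(loops))
```

With loops of (1, 1, 1) the same helper gives the 15-nt core of the (G3U)4 motif, GGGUGGGUGGGUGGG.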
In combination with previous data showing that the thermodynamic stability of RNA G4s is inversely correlated with total loop length (37), our results suggest that RNA G4s can induce −1 FS in a thermodynamic stability-dependent manner. We and others previously observed the same trend for frameshifting signals formed by perfect stem-loop structures (29) and antisense oligonucleotides (20, 21, 38).

We further increased the number of G3 repeats with the aim of inducing higher −1 FS due to their better ability to impede ribosomal scanning (13). However, with the exception of (G3U)5, we observed less efficient −1 FS on incrementing the number of G3U repeats (Supplementary Figure S3, left). These data are in contrast to a previous study applying G4 RNAs in the 5′-UTR as translational suppressors (13). This discrepancy may be due to the intrinsic differences of the readouts. To induce −1 FS, the frameshifting signal should be present at a precisely defined distance, whereas for inhibition of scanning, the precise location of the roadblock is not important. With an increasing number of G3 tracts, the possibility of forming a (G3U)4 at the right distance from the slippery sequence will decrease, thus actually resulting in a decrease in FS. For obstructing ribosomal scanning, distance is not an issue, and as melting of the first G-stretch still allows formation of a new G4 by downstream G-tracts, scanning inhibition is enhanced by increasing numbers of G-tracts. Moreover, decreasing or increasing the number of G-quartets results in less FS (Supplementary Figure S3, right), in agreement with their lower thermodynamic stability (3).

Ligands that bind G4 RNA can either enhance or decrease FS

Next we investigated the effect of G4 binding ligands on FS efficiency. We chose the G4 stabilizing ligand PhenDC3, which has been reported to increase the stability of a variety of DNA (30) and RNA (39) G4s, and the porphyrin TMPyP4, which is a known G4 destabilizing agent (31).
Addition of PhenDC3 resulted in a dose-dependent enhancement of FS of the (G3U)4 construct, reaching a 1.4-fold increase at a concentration of 2 mM (Figure 4 and Supplementary Figure S4). Addition of TMPyP4 at 2 mM, although affecting global translation, decreased FS ~3-fold (Supplementary Figure S5), while both TMPyP4 and PhenDC3 had no significant effects on frameshifting induced by a 12 base-pair hairpin (Supplementary Figure S5). These data verify that RNA quadruplex formation is responsible for the observed FS.

Although the mechanism of inducing −1 FS is distinct from +1 FS or stop codon RT, a common feature of these recoding events is the involvement of a 3′ stimulatory RNA structure (40, 41). To investigate if RNA G4s can induce +1 FS or stop codon RT, we replaced the well-characterized +1 FS stimulatory pseudoknot of mammalian antizyme (42) and the RT stimulatory stem-loop structure of Colorado tick fever virus segment 9 (43) with the most stable (G3U)4 G4 sequence (Figure 5). Interestingly, the RNA G4 could induce significant levels of +1 FS (3.0%) and stop codon RT (1.5%) against a background of 0.6 and 0.3%, respectively (Figure 5).

In our present study, we demonstrated that RNA G4s can act as translational recoding signals to induce both −1 and +1 FS as well as stop codon RT. This suggests a potentially novel translational gene regulation mechanism mediated by RNA G4s. Given the high number of potential G4 forming sequences in the human genome (45), it is likely that some are present within coding regions and are involved in translational recoding. Recently, some G4s present in bacterial mRNAs were reported to induce ribosomal stalling (28). Although the function of this stalling remains unknown, ribosomal frameshifting was suggested to be one of the possibilities. The highest level of −1 FS that we achieved with a G4 RNA is 7%, using (G3U)4 as stimulator.
Although this level is rather modest compared with levels obtained with pseudoknot-stimulated FS, which can reach >40% as in the case of the 'Infectious Bronchitis virus' FS pseudoknot (46), it is significantly higher than that of some natural FS signals, like the one present in the Influenza A virus PA gene, whose FS efficiency is 2% (47). An FS efficiency of 7% is comparable with that obtained with a hairpin of 8 bp with a calculated ΔG of −17.1 kcal/mol (29). Interestingly, the stability of (G3U)4 has been measured to be only −8.16 kcal/mol (37). So, how is this small structure of 15 nt capable of redirecting 7% of ribosomes into another reading frame? The answer probably lies in the peculiar topology of the G4 structure, comprising eight hydrogen bonds and four purine stacks per helical step, making it difficult for the ribosome to melt the first G-quartets that reside at the opening of the mRNA entrance tunnel. The stability of the first 3 or 4 bp has also been shown to be critical for shifting the ribosome by hairpins (29, 48) and antisense LNA oligonucleotides (38).

The (G3U)4 sequence was also capable of stimulating +1 FS as well as stop codon RT, with an efficiency of 3.0 and 1.5%, respectively. Although these values may seem low, in vitro RT frequencies reported for CTFV are between 3.6 and 6.7% (43) and for Alphaviruses are 6.4-7.6% against a background of 0.8-2.0% in the absence of a stimulatory structure (49). The +1 FS efficiency of the antizyme pseudoknot in the absence of spermidine is 2-3% (42), which is comparable with our G4-stimulated +1 FS.

We have also investigated the effect on −1 FS of several known G4 ligands. The bisquinolinium derivative PhenDC3, which is a general stabilizer of RNA G4s (39), was found to enhance −1 FS efficiency ~1.5-fold, whereas TMPyP4, known to destabilize certain RNA G4s (31), reduced FS ~3-fold.
Other DNA G4 stabilizing ligands like 2,4-Bis-[(E)-4-(dimethylaminostyryl)]-1-[4-(triethylammonio)butyl]pyridinium dibromide [Distyryl 1b (50)] and 4a,10a,16a-triazoniatrinaphthylene [TrisQ (51)] had no effect on −1 FS (data not shown). These ligands, however, have strong DNA sequence and/or structure preferences and may not stabilize (G3U)4 RNA.

One of the interesting aspects of G4s is their stabilization by potassium ions (52, 53). This would allow a potential natural G4-dependent FS signal to respond to changing cellular or environmental conditions. We have not investigated this possibility here, since the lysate used in our in vitro translation assays already contains a high level of potassium (57 mM, Promega Technical Manual 232), but it is conceivable that at even higher concentrations of K+ (>100 mM), FS may be enhanced. In addition to potassium, several proteins like FMRP (54) and RHAU or DHX36 (55) are known to bind RNA G4s and could play a regulatory role in this type of FS.

Previously, a G-rich sequence was reported to be involved in a +1 frameshift of herpes simplex virus (HSV) (56). However, in the case of HSV, the ribosomal slippage was thought to occur within the G-rich sequence itself at a frequency of ~1%, and is not stimulated by a downstream structure; it is therefore different from our G4-stimulated FS.

In conclusion, RNA G4s are capable of stimulating −1 and +1 FS as well as stop codon RT, thereby expanding the repertoire of RNA structures involved in translational recoding. Whether RNA G4s are present at natural recoding sites remains to be investigated.

Supplementary Data are available at NAR Online.
Funding for open access charge: Leiden Institute of Chemistry, Leiden University, Leiden, The Netherlands.

Conflict of interest statement. None declared.

Figure 5. The +1 FS sequence based on the antizyme gene is derived from P2lucAZ1wt (44) except for the frameshifting pseudoknot, which was replaced by (G3U)4 ('+1FS G4'). The sequence of the corresponding negative control ('+1FSmut') is indicated. The slip site is underlined. The stop codon RT construct is based on the Colorado tick fever virus segment 9 RT construct (43) except for the RT signal, which was replaced by (G3U)4 ('+1FS RT'). The RT stop codon is underlined and the sequence of the corresponding negative control ('RTmut') is indicated. An SDS-PAGE analysis of 35S-methionine-labeled translation products obtained by the ...

Monomorphic RNA G-quadruplex and polymorphic DNA G-quadruplex structures responding to cellular environmental factors
Loop-length-dependent folding of G-quadruplexes
Intramolecular DNA quadruplexes with different arrangements of short and long loops
The dynamic character of the G-quadruplex element in the c-MYC promoter and modification by TMPyP4
Structure of an unprecedented G-quadruplex scaffold in the human c-kit promoter
G-quadruplexes in RNA biology
G-quadruplexes: the beginning and end of UTRs
Molecular mechanisms of translational control
An RNA G-quadruplex in the 5' UTR of the NRAS proto-oncogene modulates translation
Inhibition of translation in living eukaryotic cells by an RNA G-quadruplex motif
5'-UTR G-quadruplex structures acting as translational repressors
An RNA G-quadruplex is essential for cap-independent translation initiation in human VEGF IRES
Predictable suppression of gene expression by 5'-UTR-based RNA quadruplexes
RNA quadruplex-based modulation of gene expression
Frameshifting RNA pseudoknots: structure and mechanism
Recode-2: new design, new search tools, and many more genes
Non-canonical translation in RNA viruses
Mutational analysis of the 'slippery-sequence' component of a coronavirus ribosomal frameshifting signal
Novel application of sRNA: stimulation of ribosomal frameshifting
Efficient stimulation of site-specific ribosome frameshifting by antisense oligonucleotides
Signals for ribosomal frameshifting in the Rous sarcoma virus gag-pol region
The 9-Å solution: how mRNA pseudoknots promote efficient programmed -1 ribosomal frameshifting
A mechanical explanation of RNA pseudoknot function in programmed ribosomal frameshifting
Correlation between mechanical strength of messenger RNA pseudoknots and ribosomal frameshifting
Characterization of the mechanical unfolding of RNA pseudoknots
Triplex structures in an RNA pseudoknot enhance mechanical stability and increase efficiency of -1 ribosomal frameshifting
Mechanical unfolding of the beet western yellow virus -1 frameshift signal
Suppression of Gene Expression by G-Quadruplexes in Open Reading Frames Depends on G-Quadruplex Stability
Stem-loop structures can effectively substitute for an RNA pseudoknot in -1 ribosomal frameshifting
Highly efficient G-quadruplex recognition by bisquinolinium compounds
The porphyrin TmPyP4 unfolds the extremely stable G-quadruplex in MT3-MMP mRNA and alleviates its repressive effect to enhance translation in eukaryotic cells
A G-quadruplex structure within the 5'-UTR of TRF2 mRNA represses translation in human cells
A dual-luciferase reporter system for studying recoding signals
Characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an RNA pseudoknot
The sequences of and distance between two cis-acting signals determine the efficiency of ribosomal frameshifting in human immunodeficiency virus type 1 and human T-cell leukemia virus type II in vivo
A sequence-independent study of the influence of short loop lengths on the stability and topology of intramolecular DNA G-quadruplexes
A sequence-independent analysis of the loop length dependence of intramolecular RNA G-quadruplex stability and topology
Stimulation of ribosomal frameshifting by antisense LNA
Efficient suppression of gene expression by targeting 5'-UTR-based RNA quadruplexes with bisquinolinium compounds
Ribosomal frameshifting in decoding antizyme mRNAs from yeast and protists to humans: close to 300 cases reveal remarkable diversity despite underlying conservation
Stimulation of stop codon readthrough: frequent presence of an extended 3' RNA structural element
Autoregulatory frameshifting in decoding mammalian ornithine decarboxylase antizyme
Characterization of the stop codon readthrough signal of Colorado tick fever virus segment 9 RNA
Antisense-induced ribosomal frameshifting
Prevalence of quadruplexes in the human genome
Mutational analysis of the 'slippery-sequence' component of a coronavirus ribosomal frameshifting signal
An overlapping protein-coding region in influenza A virus segment 3 modulates the host response
HIV-1 frameshift efficiency is primarily determined by the stability of base pairs positioned at the mRNA entrance channel of the ribosome
Stimulation of stop codon readthrough: frequent presence of an extended 3' RNA structural element
Recognition of G-Quadruplex DNA by Triangular Star-Shaped Compounds: With or Without Side Chains?
Asymmetric distyrylpyridinium dyes as red-emitting fluorescent probes for quadruplex DNA
A sodium-potassium switch in the formation of four-stranded G4-DNA
Toward a digital gene response: RNA G-quadruplexes with fewer quartets fold with higher cooperativity
Fragile X mental retardation protein targets G quartet mRNAs important for neuronal function
The RNA helicase RHAU (DHX36) unwinds a G4-quadruplex in human telomerase RNA and promotes the formation of the P1 helix template boundary
Translational recoding induced by G-rich mRNA sequences that form unusual structures