key: cord-0974800-8xob5edz authors: Belew, Ashton T; Dinman, Jonathan D title: Cell cycle control (and more) by programmed −1 ribosomal frameshifting: implications for disease and therapeutics date: 2015-01-13 journal: Cell Cycle DOI: 10.4161/15384101.2014.989123 sha: d00f4dabc58eca254f3b2ca4efafda00d671c3da doc_id: 974800 cord_uid: 8xob5edz Like most basic molecular mechanisms, programmed –1 ribosomal frameshifting (−1 PRF) was first identified in viruses. Early observations that global dysregulation of −1 PRF had deleterious effects on yeast cell growth suggested that −1 PRF may be used to control cellular gene expression, and the cell cycle in particular. Collection of sufficient numbers of viral −1 PRF signals coupled with advances in computer sciences enabled 2 complementary computational approaches to identify −1 PRF signals in free living organisms. The unexpected observation that almost all −1 PRF events on eukaryotic mRNAs direct ribosomes to premature termination codons engendered the hypothesis that −1 PRF signals post-transcriptionally regulate gene expression by functioning as mRNA destabilizing elements. Emerging research suggests that some human diseases are associated with global defects in −1 PRF. The recent discovery of −1 PRF signal-specific trans-acting regulators may provide insight into novel therapeutic strategies aimed at treating diseases caused by changes in gene expression patterns. Cells regulate gene expression via diverse mechanisms. From mRNA transcription to protein degradation, many regulatory systems affect the timing, localization, and rate of each reaction. Gene expression is primarily concerned with the abundance and translational activity of mRNA; therefore expression is increased when a message is transcribed more rapidly, stabilized by the cell, or more available to actively translating ribosomes. Increased mRNA degradation, decreased transcription, translational silencing, and the storage of mRNA are the hallmarks of decreased expression. The center of this spectrum between mRNA synthesis and protein degradation is translation, the timely and high-fidelity process of bringing together mRNA, aminoacylated-tRNA, and the ribosome. Cellular functions are generally not impacted by sense errors, but nonsense, missense, and errors which change the reading frame tend to be deleterious; the translational apparatus has evolved to ensure that these are extremely rare. Simultaneously, the complex interplay of factors which effect translation suggest that the ribosome is a likely post-transcriptional regulation nexus of gene expression. Most evidence gathered thus far concentrates primarily on cis-acting elements in the 5 0 and 3 0 untranslated regions (UTRs) of mRNAs, and the transacting factors with which they interact. Coding sequences have more recently been examined for effects on post-transcriptional control, and multiple cis-acting mRNA elements have been found which cause elongating ribosomes to recode the mRNA sequence. [1] [2] [3] [4] In all kingdoms of life, these elements include, but are not limited to, sequences responsible for C1 and ¡1 programmed ribosomal frameshifting, termination codon suppression, stop-start elements, selenocysteine incorporation, and in archaea, pyrolysine incorporation. die. This is a central point of attack by many antibiotics, which capitalize on small but distinct differences between bacterial and eukaryotic ribosomes. 5, 6 Reading frame is determined by recognition of a start codon by an initiator tRNA in the ribosome's decoding center and is maintained through the consistent translocation by 3 nucleotides during elongation. The three stop codons do not correspond to any tRNAs, this enables the release factor complex to initiate translation termination. Translational recoding comprises the spectrum of events which occur when canonical translation is disrupted. This ranges from mechanisms which subvert initiation, 7 alter the equilibria of events during elongation, 8 and affect the termination process 9 generally resulting in suppression or reinitiation. This document mainly focuses on a particular recoding event called programmed ¡1 ribosomal frameshifting that occurs during elongation. Programmed ¡1 ribosomal frameshifting (PRF) is a recoding mechanism historically associated with viruses 10 and retrotransposons. 11, 12 This is due at least in part to the much smaller and compact nature of viral genomes and the relative simplicity in searching their kilobase scale genomes as opposed to megabase bacterial genomes or gigabase eukaryotic genomes. When the gag amber termination of the Rous sarcoma virus was found to be bypassed in favor of translation of a downstream ¡1 reading frame, 2 relatively simple and testable possibilities were considered: either a splicing event or the programmed shift of ribosomes into the new reading frame. 10 Sequencing of the relevant region of the genome and invitro transcription/translation experiments supported the latter hypothesis and suggested that 'shifty tRNAs' caused the actual frameshift events. Shortly thereafter, reports described new RNA structures called pseudoknots, which when present in viral mRNAs could either repress translation 13 or stimulate efficient ribosomal frameshifting. 14 The Recode databases 15, 16 (http://recode.ucc.ie) continue to categorize and describe these viral signals. A PRF signal is defined as a cis-acting mRNA element that stochastically redirects translating ribosomes into an alternate reading frame. The most well characterized ¡1 PRF signals follow a relatively consistent pattern of stimulatory element proximally downstream of a group of weakly pairing bases (the "slippery heptamer") comprised of N x N x N x W y W y W y H z ; the spaces delineate the incoming reading frame, NWH refer to the IUPAC definitions, and xyz denote identical nucleotides. There is some debate regarding the relative contributions of the slippery heptamer, downstream stimulatory element, and the space between them; but all models of ¡1 PRF agree that the downstream mRNA structure causes elongating ribosomes to pause while tRNAs are positioned over the slippery site. The nature of the codon:anticodon interactions at the slippery site facilitates slippage of a fraction of paused ribosomes backward (5 0 ) by one base through pairing at the non-wobble positions. The weak homopolymer nucleotide sequences (poly-U or poly-A) are of particular interest in ¡1 PRF, as they are often coupled with pauses in translation in order to effect the frameshifting event. 17 H-type mRNA pseudoknots are the most common stimulatory structure, but other structures, including proteins bound to stem-loops, 18 variously sized stem-loops, 19 and RNA triplexes 20 can also promote efficient frameshifting. The general mechanism and structure of a typical ¡1 PRF signal is diagrammed in Figure 1 . Genomic ¡1 PRF Examination of the history of modern molecular genetics shows that most basic molecular mechanisms were first observed in viruses. Therefore, it was reasonable to hypothesize that ¡1 PRF is also used to control expression of cellular genes. Indeed, the serendipitous discoveries of ¡1 PRF signals of viral origin in the mammalian PEG10 21, 22 and Edr1 23 mRNAs suggested that more of these elements were hidden in eukaryotic genomes. Simultaneously, genetic studies in yeast suggested that ¡1 PRF may play a role in cell cycle control. 24 Thus began the search for ¡1 PRF signals in eukaryotic genomes. There are 2 complementary approaches when searching genome databases for potential frameshift signals. The first is to search for genes harboring conserved overlapping open reading frames. 25 This approach has been highly successful in identifying new viral ¡1 PRF signals, and conserved C1 PRF signals in ornithine decarboxylase genes. 26 However, while this enables identification of novel PRF signals, it enforces the assumption that frameshifting results in a new functional protein. The second method is to search for sequence motifs that conform to known PRF signals. Though limited to one class of signal, this method does not assume that frameshifting results in a C-terminal extension product. The efforts of our laboratory have focused on this latter approach. In particular, ¡1 PRF was chosen because there are sufficient viral examples to enable the generation of heuristics. This is in contrast to C1 frameshifting, which appears to be idiosyncratic. The basic strategy is to perform a pattern-match based search for allowable heptameric slippery sequences followed by strong downstream structure; this search is NP-complete 27 and therefore computationally difficult. The initial study using this approach pressed the available CPU limits at the time, taking significant time for relatively small numbers of sequences. 28 Later searches against the yeast genome refined this strategy and used significantly greater computational resources, allowing the yeast genome to be exhaustively searched in months. 29 Further refinements, increases in memory/ CPU, and larger computational resources made it possible to complete approximately one genome per week 30 along with comparisons against randomized sequences. Analysis of multiple genomes reveals that approximately 10% of annotated genes contain at least one high-confidence potential frameshift signal. A searchable database of predicted eukaryotic ¡1 PRF signals is available in the Predicted Ribosomal Frameshift Database (PRFdb, www.prfdb.umd.edu). 30 An unexpected result: cells use ¡1 PRF to control mRNA abundance As noted above, a motif based search removes the assumption that frameshift events produce functional extended peptides. This led to the surprising observation that only 0.07% candidate ¡1 PRF containing sequences extend more than 30 codons beyond the frameshift event, regardless of the cellular genome examined ( Fig. 2A) . The observation that the vast majority of frameshift events direct translating ribosomes to premature termination codons engendered the hypothesis that ¡1 frameshift signals function as mRNA destabilizing elements through the nonsense mediated mRNA decay pathway (NMD) 31 (Fig. 2B) . This was initially validated in yeast cells using a well-defined viral frameshift signal and further analysis suggested a role for No-Go decay (NGD) as well. 32 More recent studies in our laboratory demonstrate that this general rule is also true in human cells. 33 ¡1 PRF and gene expression If »10% of cellular mRNAs are controlled by ¡1 PRF, what is the biological significance of this phenomenon? As noted above, the first hint that ¡1 PRF may have a physiological role came from the observation that mutants that promote global increased rates of ¡1 PRF appeared to disrupt the cell cycle in yeast. 34 Indeed, a general observation in our laboratory over the past 2 decades has been that mutations that globally alter rates of ¡1 PRF compromise cell growth and viability. 35 The serendipitous discovery of a C1 PRF signal in the yeast EST3 mRNA, which encodes a component of telomerase, 36 prompted a search the PRFdb for ¡1 PRF signals in additional mRNAs encoding proteins involved in telomere maintenance. Operational ¡1 PRF signals (defined as promoting 1% frameshifting) were identified in 4 mRNAs encoding proteins critical for yeast telomere maintenance. These are: EST2, encoding the reverse transcriptase component of telomerase; EST1, encoding the protein that "docks" telomerase to chromosome ends; and STN1 and CDC13 mRNAs, which encode proteins involved in recruiting telomerase to shortened telomeres. These were all shown to function as NMD-dependent mRNA destabilizing elements. The wide range of ¡1 PRF efficiencies (from 2% -70%) promoted by these elements enabled characterization of a simple exponential decay function between ¡1 PRF efficiency and mRNA destabilizing activity: f .x/ D e ¡ 0:05x where x denotes PRF efficiency and mRNA abundance is a function of x. Abrogation of ¡1 PRF in the EST2 mRNA (leading to increased expression of this protein) yielded cells with telomeres of intermediate length, consistent with prior studies showing similar effects upon overexpression of this gene, or STN1 or CDC13. [37] [38] [39] [40] Importantly, abrogation of NMD resulted in cells with very short telomeres, consistent with NMD being epistatic to ¡1 PRF. Microscopic examination of these cells revealed that a large fraction were arrested at the G2/M boundary characterized by large mother cells attached to equally large daughter cells. Indeed, many of these cells had additional buds , characteristic of cell cycle "escape" mutants. In unpublished studies, operational frameshift signals have been identified in at least 2 human messages encoding proteins required for telomere maintenance; suggesting that ¡1 PRF may also play a role in human telomere maintenance and aging. Model: how ¡1 PRF may control telomere length In yeast, telomerase abundance is strictly limited: it is estimated that diploid cells, which contain 64 chromosome ends, only contain »29 telomerase molecules, 41 consistent with observations that limitation of telomerase levels is required for telomere length homeostasis. 42 Current models 43, 44 posit that telomeres exist in a range of states, from long, and fully capped (by Rap1p, Rif2p, Rif1p trimers) to short and uncapped, and that the small fraction of short uncapped telomeres present in any given cell cycle are preferentially repaired. As shown in Figure 3 The net effect is to "immortalize" telomeres, resulting in longer lifespans, but at the cost of genome integrity. As shown in Figure 3 , we propose that ¡1 PRF is used to maintain the correct stoichiometric ratios of telomerase components critical for telomerase recruitment. Changes in the expression of any one of these components, e.g. by abrogating ¡1 PRF in EST2 or overexpressing any single component, has dominant-negative effects on telomerase recruitment resulting in the observed increased rates of telomere shortening, consistent with the intermediate telomere lengths observed in these mutants. We further propose that changing the expression of all of the ¡1 PRF containing mRNAs, e.g., by globally changing rates of ¡1 PRF or by inactivation of NMD, has an even more dramatic effect, accounting for the very short telomeres observed in these classes of mutants. If ¡1 PRF is normally employed to control gene expression, it stands to reason that ¡1 PRF itself should be subject to regulation. As suggested above, since global changes in ¡1 PRF tend to be detrimental to cells, regulation of ¡1 PRF should be sequence-specific. Given that ¡1 PRF is directed by cis-acting elements in mRNAs, sequence-specific interactions could be mediated by either base-pairing interactions with trans-acting RNAs, or by highly specific interactions with trans-acting proteins. Indeed, examples of both cases have been recently documented. The human CCR5 mRNA harbors a ¡1 PRF signal that functions as an mRNA destabilizing element by directing elongating ribosomes to premature termination codons, and we recently demonstrated that sequence specific interactions between this element and at least 2 micro-RNAs (miRNAs) promote increased rates of -1 PRF. 33 Mapping of the miRNA/¡1 PRF signal interaction suggested that formation of an RNA-triplex structure stabilizes the frameshift-stimulating mRNA pseudoknot, leading to increased ribosome pause times at the slippery sequence, further enhancing frameshifting. With this in mind, it is also possible that trans-acting RNAs that destabilize ¡1 PRF stimulating pseudoknots may also exist, i.e., these would have ¡1 PRF inhibitory activity. Trans-acting proteins can also stimulate ¡1 PRF: in the porcine reproductive and respiratory syndrome virus (PRRSV), both ¡1 and ¡2 PRF are stimulated by the virus-encoded nsp1b replicase subunit that specifically interacts with sequence containing the slippery site. 45 Emerging evidence for a role of ¡1 PRF in human disease To date, no direct connection has been established between changes in ¡1 PRF and human disease. However, a growing body of evidence suggests that such linkages may be forthcoming. The DKC1 gene encodes dyskerin, the protein that catalyzes conversion of uridines into pseudouridine in ribosomal rRNAs. Patients harboring mutations in this gene present with X-linked dyskeratosis (X-DC), a congenital disease characterized by bone marrow failure, dystrophic nails, mucosal leukoplakia, mottled rashes, congenital anomoalies and additional clinical presentations. 46 Hypo-pseudouridulated yeast and human ribosomes have lower affinities for tRNAs, resulting in greater rates of tRNA slippage at ¡1 PRF signals. 47 In unpublished work, we have shown that these mutant yeast cells have shortened telomeres, consistent with the progeria like symptoms of this disease. Indeed, X-DC is a member of a general class of diseases called ribosomopathies, which are caused by mutations in ribosomal protein genes and genes involved in ribosome biogenesis. 48 Unpublished yeast based studies in our laboratory suggest that translational fidelity defects including altered rates of ¡1 PRF, may play important roles in this general class of diseases. Spinocerebellar ataxia 26 (SCA26) is caused by a mutation in eukaryotic translation elongation factor 2 (eEF2), the GTPase that translocates ribosomes along mRNAs. 49 ¡1 PRF can occur during translocation, and inhibition of this process can stimulate this. 50 ¡1 PRF is elevated In yeast cells expressing mutant forms of eEF2 harboring the SCA26-equivalent mutation, 49 and in unpublished work, we have observed that this also occurs in cells derived from SCA26 patients. The finding that the same mutation in ribosomal protein L10 (eL16) found in a significant fraction of patients with T-cell lymphoblastic leukemia (T-ALL) also promotes increased rates of ¡1 PRF (by promoting decreased ribosomal affinity for aminoacyl-tRNA) provides evidence that somatically acquired mutations that affect ¡1 PRF may contribute to at least some cancers. 51 Like many basic molecular regulatory mechanisms, while ¡1 PRF was discovered in viruses it has now been found to be involved in the expression of a significant number of eukaryotic genes. Surprisingly, unlike viruses, where ¡1 PRF is used to expand the genomic coding content, it appears that ¡1 PRF signals are used to post-transcriptionally regulate gene expression by functioning as mRNA destabilizing elements. Emerging evidence links global defects in ¡1 PRF to a growing number of human diseases. Recent studies in yeast revealed the importance of this mechanism in telomere maintenance and cell-cycle control, and current research suggests that may also be applicable to human cells. The recent finding of ¡1 PRF signal-specific regulation by miR-NAs has solved one of the central questions in the field, and in combination with the telomerase studies, provokes the hypothesis that aging may be programmed in part by ¡1 PRF. The ability of transacting factors to manipulate ¡1 PRF also suggests therapeutic approaches including recombinant proteins, and synthetic noncoding RNAs/RNA analogs. Other more global approaches may include use of small molecule modulators of ¡1 PRF, or targeting of downstream pathways, e.g. nonsense mediated mRNA decay. The new paradigms described here will Figure 3 . Model: telomerase recruitment to uncapped telomeres is controlled by the relative stoichiometries of telomerase components in yeast. Top left depicts a fully capped telomere. As telomeres shorten, they become uncapped (middle left), recruiting the MRX-Tel1p complex, which in turn recruits the CST complex to the telomere end. Phosphorylation of Cdc13p by Tel1p recruits telomerase via Est1p. Est2p is the reverse transcriptase component of telomerase. If telomeres continue to shorten, they resemble chromosomes with double stranded breaks (DSB-like, lower left), recruiting DNA repair machinery. This results in cell growth arrest at checkpoint (G2/M). If cells cannot repair the defect, they undergo "crisis" and a subpopulation will bypass arrest, maintaining their chromosome ends by DSB repair. Operational ¡1 PRF signals have been identified in the STN1, CDC13, EST1 and EST2 mRNAs. We propose that their relative abundances are controlled by ¡1 PRF. In optimal conditions, precisely controlled rates of ¡1 PRF ensures that these proteins are present in the correct stoichiometries, maximizing telomere repair (bold up arrow), and minimizing progression to the DSB-like state. When expression of any one of these genes is altered, e.g. telomerase recruitment is less efficient and more telomeres progress more rapidly to the DSB-like state. When expression of all 4 are altered, e.g., by global changes in ¡1 PRF or by abrogation of NMD, telomeres progress rapidly to the DSB-like state. continue to guide a diverse set of research efforts into the future. Mechanisms and implications of programmed translational frameshifting Identification of the nature of reading frame transitions observed in prokaryotic genomes Reprogrammed genetic decoding in cellular gene expression Transfer RNA modifications that alter C1 frameshifting in general fail to affect ¡1 frameshifting On the structural basis of peptide-bond formation and antibiotic resistance from atomic structures of the large ribosomal subunit Functional insights from the structure of the 30S ribosomal subunit and its interactions with antibiotics Molecular mechanisms of translation initiation in eukaryotes Eukaryotic rpL10 drives ribosomal rotation Stansfield I. Endless possibilities: translation termination and stop codon recognition Expression of the Rous Sarcoma Virus pol gene by ribosomal frameshifting Bacterial peptide chain release factors: conserved primary structure and possible frameshift regulation of release factor 2 Efficient translational frameshifting occurs within a conserved sequence of the overlap between the two genes of a yeast Ty1 transposon Unusual mRNA pseudoknot structure is recognized by a protein translational repressor An efficient ribosomal frame-shifting signal in the polymerase-encoding region of the coronavirus IBV RECODE: a database of frameshifting, bypassing and codon redefinition utilized for gene expression Recode-2: new design, new search tools, and many more genes Structure-function analysis of the ribosomal frameshifting signal of two human immunodeficiency virus type 1 isolates with increased resistance to viral protease inhibitors Regulated ribosomal frameshifting by an RNA-protein interaction Stem-loop structures can effectively substitute for an RNA pseudoknot in -1 ribosomal frameshifting Minor groove RNA triplex in the crystal structure of a ribosomal frameshifting viral pseudoknot Mammalian gene PEG10 expresses two reading frames by high efficiency -1 frameshifting in embryonic-associated tissues A retrotransposonderived gene, PEG10, is a novel imprinted gene located on human chromosome 7q21 Characterization of the frameshift signal of Edr, a mammalian example of programmed -1 ribosomal frameshifting Translational maintenance of frame: mutants of Saccharomyces cerevisiae with altered -1 ribosomal frameshifting efficiencies Recoding: translational bifurcations in gene expression Observation of dually decoded regions of the human genome using ribosome profiling data Pseudoknots in RNA secondary structures Identification of putative programmed -1 ribosomal frameshift signals in large DNA databases Identification of functional, endogenous programmed -1 ribosomal frameshift signals in the genome of Saccharomyces cerevisiae PRFdb: a database of computationally predicted eukaryotic programmed -1 ribosomal frameshift signals A programmed -1 ribosomal frameshift signal can function as a cis-acting mRNA destabilizing element Endogenous ribosomal frameshift signals operate as mRNA destabilizing elements through at least two molecular pathways in yeast Ribosomal frameshifting in the CCR5 mRNA is regulated by miRNAs and the NMD pathway Translational maintenance of frame: mutants of Saccharomyces cerevisiae with altered -1 ribosomal frameshifting efficiencies Recoding: Expansion of Decoding Rules Enriches Gene Expression Programmed translational frameshifting in a gene required for yeast telomere replication mRNAs encoding telomerase components and regulators are controlled by UPF genes in Saccharomyces cerevisiae Telomere cap components influence the rate of senescence in telomerase-deficient yeast cells Telomere length regulation and telomeric chromatin require the nonsensemediated mRNA decay pathway Telomerase subunit overexpression suppresses telomere-specific checkpoint activation in the yeast yku80 mutant Low abundance of telomerase in yeast: implications for telomerase haploinsufficiency Telomere length homeostasis requires that telomerase levels are limiting Telomere length regulation: coupling DNA end processing to feedback regulation of telomerase Replication of telomeres and the regulation of telomerase Transactivation of programmed ribosomal frameshifting by a viral protein Updates on the biology and management of dyskeratosis congenita and related telomere biology disorders rRNA pseudouridylation defects affect ribosomal ligand binding and translational fidelity from yeast to human cells Genetics. mysterious ribosomopathies A conserved eEF2 coding variant in SCA26 leads to loss of translational fidelity and increased susceptibility to proteostatic insult Programmed -1 frameshifting by kinetic partitioning during impeded translocation Bypass of the pre-60S ribosomal quality control as a pathway to oncogenesis We would like to thank members of the Dinman laboratory, both past and present, who contributed to this research. This work was supported in part by a grant to JDD by the National Institutes of Health, R01 HL119439.