key: cord-341154-wwq0sd2r
authors: Liao, Pei-Yu; Choi, Yong Seok; Dinman, Jonathan D.; Lee, Kelvin H.
title: The many paths to frameshifting: kinetic modelling and analysis of the effects of different elongation steps on programmed –1 ribosomal frameshifting
date: 2010-09-07
journal: Nucleic Acids Res
DOI: 10.1093/nar/gkq761
sha: 
doc_id: 341154
cord_uid: wwq0sd2r

Several important viruses including the human immunodeficiency virus type 1 (HIV-1) and the SARS-associated Coronavirus (SARS-CoV) employ programmed −1 ribosomal frameshifting (PRF) for their protein expression. Here, a kinetic framework is developed to describe −1 PRF. The model reveals three kinetic pathways to −1 PRF that yield two possible frameshift products: those incorporating zero frame encoded A-site tRNAs in the recoding site, and products incorporating −1 frame encoded A-site tRNAs. Using known kinetic rate constants, the individual contributions of different steps of the translation elongation cycle to −1 PRF and the ratio between two types of frameshift products were evaluated. A dual fluorescence reporter was employed in Escherichia coli to empirically test the model. Additionally, the study applied a novel mass spectrometry approach to quantify the ratios of the two frameshift products. A more detailed understanding of the mechanisms underlying −1 PRF may provide insight into developing antiviral therapeutics.

Programmed ribosomal frameshifting (PRF) is a process where specific signals in the mRNA direct the ribosome to switch reading frame at a certain efficiency. In À1 PRF, the ribosome slips 1 nt towards the 5 0 -end of the mRNA during translation. Several viruses, including human immunodeficiency virus type 1 (HIV-1) and the coronavirus responsible for severe acute respiratory syndrome (SARS-CoV), employ À1 PRF to synthesize precursors of enzymes for their replication (1, 2) , and the ratio of the zero frame to À1 frame encoded products is important to the vitality of viruses (3) (4) (5) . As such, altering À1 PRF efficiency may damage viral replication [reviewed in (6) ]. This suggests À1 PRF as a target for the development of antiviral therapeutics.

Programmed À1 ribosomal frameshifting signals usually contain three essential mRNA elements: (i) a 'slippery' heptanucleotide sequence X XXY YYZ (X can be any three identical nucleotides, Y is A or U and Z is not G in eukaryotes; spaces denote the initial reading frame), where the ribosome changes the reading frame (3, 7) ; (ii) a downstream stimulatory mRNA secondary structure, typically a pseudoknot (8) (9) (10) ; and (iii) a spacer between the slippery sequence and the stimulatory signal. It has been suggested that the stimulatory structural element promotes À1 PRF by positioning the ribosome to pause over the slippery sequence (11) (12) (13) . The length of the spacer has also been shown to affect frameshift efficiency (7, 9, 14) .

As PRF occurs during translation elongation, models of À1 PRF should be described within this context. The elongation cycle can be divided into four stages. First, the ribosome selects the cognate aminoacyl-tRNA (aa-tRNA) according to the codon at the decoding center (decoding, DC in Figure 1 ). Second, the aa-tRNA moves from A/T entry state into the A/A state to be accommodated into the ribosome (aa-tRNA accommodation, AA in Figure 1 ). Third, the ribosome catalyses peptidyltransfer, resulting in a peptidyl tRNA in the A-site and a deacylated tRNA in the P site (peptidyltransfer, PT in Figure 1 ). Fourth, the peptidyl-tRNA moves from the A-site to the P-site, carrying the mRNA along, and the deacylated tRNA moves out of the P-site into the E-site from where it dissociates (translocation, TL in Figure 1 ). Translocation opens up the ribosomal A-site and the ribosome moves on to another round of aa-tRNA selection.

*To whom correspondence should be addressed. Tel: +1 302 831 0344; Fax: +1 302 831 4841; Email: KHL@udel.edu Three major models have been proposed for the mechanism of À1 PRF ( Figure 1 ). One hypothesis proposes that À1 PRF takes place during accommodation of the aa-tRNA (8, 15, 16) . We have denoted this Pathway II. The simultaneous-slippage model (8) originally suggested that peptidyl-and aa-tRNAs simultaneously slip by one base in the 5 0 -direction to base pair with the À1 frame codons in the slippery site. In a refinement of this model (15) , À1 PRF was posited to occur when aa-tRNA and peptidyl-tRNA are located in the A/T entry and P/P site. The 9-Å model of À1 PRF (16) built upon both this and newly available structural data to propose that the $9-Å movement of the anticodon loop in the 5 0 -direction during aa-tRNA accommodation is constrained by the presence of the downstream stimulatory RNA structural element. This creates tension on the mRNA between the decoding center and the stimulatory element that can be relieved by decoupling of the A-and P-site tRNAs from the mRNA followed by subsequent slippage of the mRNA by one base in the 3 0 -direction relative to the tRNAs, resulting in a net slip reading frame by À1 base. Consistent with this model, mutations altering aa-tRNA accommodation were found to affect À1 PRF (18) (19) (20) . However, the simultaneous slippage-based models do not explain the role of sequences upstream of the slippery site, which have also been shown to affect the À1 PRF efficiency (21, 22) . A second general hypothesis proposes that À1 PRF occurs during translocation. This can be modeled through two discrete kinetic pathways. The first suggested that after peptidyltransfer, the two tRNAs move to P/E and A/P states, followed by an incomplete, two-base translocation event promoted by the downstream mRNA stimulatory structure (23) . During this incomplete translocation event, the tRNAs dissociate from the mRNA and re-pair with the À1 frame codons in the slippery site. We call this Pathway III. In support of this model, cryoelectron microscopy imaging revealed that a À1 PRF stimulating pseudoknot can interact with the ribosome to block the mRNA entrance channel, compromising the translocation process during À1 PRF (24) . The second co-translocational model proposed that incomplete translocation occurs one elongation cycle prior to the model by Weiss et al. (23) , and that tRNAs in the ribosomal E-, P-and A-sites are all involved in the process (22) . This model suggests that incomplete translocation promotes formation of a transition intermediate, and that entry of the new aa-tRNA into the ribosome and the tendency of tRNAs to revert to stable states drives the shift in reading frame. This is Pathway I. This model is supported by the demonstration that mutations altering E-site tRNA binding affect À1 PRF (22) . However, Figure 1 . A mechanistic model of -1 programmed ribosomal frameshifting. Two translation elongation cycles are depicted at the top: the ribosome undergoes decoding (DC), aa-tRNA accommodation (AA), peptidyltransfer (PT) and translocation (TL) twice to add two amino acids into the polypeptide sequences. A shift in reading frame may occur at the first TL step and the ribosome decodes a À1 frame A-site codon at the recoding site. Additionally, À1 PRF may occur during the second AA step, in which the ribosome has decoded the zero frame A-site codon. Incorporation of the À1 reading frame aa-tRNA starts at the following cycle. Moreover, the shift in reading frame may occur at the second TL step and incorporation of the À1 reading frame aa-tRNA starts at the following cycle.

neither of the co-translocation models explain the presence of two species of frameshift proteins produced by HIV-1 frameshifting (see next paragraph).

Protein sequencing was originally employed to generate the simultaneous slippage model, and to confirm that the À1 PRF site for HIV-1 is U UUU UUA located within the gag/ pol overlap (where the P-site of the ribosome during frameshifting is underlined) (1). Interestingly, $70% of the frameshift products contained Phe-Leu (derived from decoding the 0-frame UUU UUA sequence), while $30% of the products contained Phe-Phe (derived from decoding the À1 frame UUU UUU sequence) at the frameshift site (1, 25) . Previous studies suggested that the product with Phe-Phe at the frameshift site could result from slippage of the P-site tRNA alone (1, 25, 26) , i.e. the product predicted by Pathway I, and that the À1 frame aa-tRNA is subsequently recruited to the ribosome. However, the precise mechanism driving this process remained unclear, and no model has been proposed to date explaining the simultaneous formation of different frameshift proteins. Here, we have developed a kinetic model of À1 PRF to explain all of the experimental observations. This model reveals the major steps in the translation elongation cycle that affect À1 PRF, and reconciles all three models of À1 PRF. In addition, À1 PRF efficiency was monitored in vivo using a dual fluorescence reporter (27) and the compositions of different frameshift proteins were analysed by mass spectrometry. The experimental approach was also applied to study Human T-cell leukemia virus type 1 (HTLV) pro-pol frameshift sequence. This is the first study to demonstrate and quantify the ratio of frameshift products incorporating À1 frame A-site tRNA at this À1 PRF sequence. In agreement with the model predictions, experimental perturbation of different translation steps resulted in different levels of À1 PRF efficiency as well as in the relative ratios of two types of frameshift proteins. Our findings demonstrate that all three kinetic pathways are operative during À1 PRF.

In our earlier study, a kinetic model successfully described the effects of ribosome E-, P-and A-site interactions on +1 PRF (27) . A similar approach can be applied to understand the mechanism of À1 PRF. The mechanistic model in the present study proposes that À1 PRF can occur during translocation and/or aa-tRNA accommodation. Figure 1 describes the overall framework using abc deX XXY YYZ fgh sequence as an example, where spaces separate zero frame codons and the slippery sequence is underlined. When À1 PRF occurs during translocation, the presence of the downstream stimulatory structure forces the ribosome to translocate by two, rather than three, bases toward the 3 0 -end of the mRNA, thus shifting the reading frame. If this 'incomplete' translocation occurs to the pre-translocational ribosome aligning with deX XXY, translation of the À1 frame begins at YYY. Alternatively, if incomplete translocation occurs to the pre-translocational ribosome aligning with XXY YYZ, translation of the À1 frame starts at Zfg. When À1 PRF occurs during aa-tRNA accommodation, the two tRNAs interacting with XXY YYZ slip to base pair with XXX YYY. Consequently, translation of À1 frame starts at Zfg (Figure 1 ).

An elegant series of biochemical analyses have established detailed kinetic models of translocation (28) and aa-tRNA selection (29) . Translocation involves EF-G binding to the pre-translocational ribosome, GTP hydrolysis, unlocking conformation change, Pi release, tRNA movement, relocking conformation change and dissociation of EF-G from the post-translocational ribosome. This concept is illustrated along the top of Figure 2 from component PA (pre-translocational ribosome) to E 0 P 0 (post-translocational ribosome). Detailed descriptions for each rate constant are shown in Supplementary Table S1 . Selection and accommodation of aa-tRNA involves initial binding of the ternary complex EF-Tu:aa-tRNA:GTP, codon recognition, EF-Tu GTPase activation, GTP hydrolysis, dissociation of EF-Tu from the ribosome and accommodation of the acceptor end of the aa-tRNA into the A-site or the rejection of the aa-tRNA by proofreading. Detailed descriptions for each rate constant are shown in Supplementary Table S2. In the absence of frameshifting, progression through these steps of the elongation cycle results in synthesis of the non-frameshift protein, called NFS ( Figure 2 ). Our kinetic model suggests three possible reaction pathways that could generate À1 frameshift proteins ( Figure 2 ). In Pathway I, blockage of the mRNA channel entrance by the downstream stimulatory structure induces incomplete translocation with the pre-translocational ribosome positioned at deX XXY. Specifically, the reading frame shift occurs between the tRNA movement and Pi release (rate constant r TL0 ), and the relocking step (rate constant r 45 ). Weiss et al. (23) suggested that when the two tRNAs move from P/E and A/P to the E/E and P/P states, they can un-pair from the mRNA and re-pair with the À1 reading frame. In our model, r t represents the rate constant for a ribosome:EF-G:GDP complex with two tRNAs in the E-and P-sites (E 0 P 0 EFGgdp ) to re-pair with the À1 reading frame (E 02 P 02 EFGgdp ). This motion is reversible, as denoted by the r -t rate constant. This step is followed by a relocking conformational change and EF-G release from the ribosome complex. The resulting E 0 P 0 or E 02 P 02 (A-site unocupied) then moves on to the aa-tRNA selection step. Here, E 0 P 0 is the posttranslocational ribosome aligning with deX XXY (zero frame) and E 02 P 02 is the post-translocational ribosome aligning with cde XXX (-1 frame), where subscript 0 means a zero frame tRNA pairing with the zero frame; subscript 02 means a zero frame tRNA pairing with the À1 frame. E 0 P 0 may generate non-frameshift product NFS, or enter Pathway II or III described below. E 02 P 02 can generate frameshift product FS m , which incorporates the À1 frame aa-tRNA in the recoding site (YYY). In addition, it is also possible for E 02 P 02 to recruit a zero frame aa-tRNA for YYZ (A 0 ) and accommodate this aa-tRNA into the À1 frame. In this case, frameshift product FS z , which incorporates the zero frame aa-tRNA in the recoding site (YYZ), is produced (Pathway Ia).

In the second pathway, the downstream stimulatory structure induces ribosome pausing and promotes À1 PRF during aa-tRNA accommodation. Pathway II suggests that simultaneous slippage of P-and A-site tRNAs occurs during accommodation and/or before peptidyltransfer. In Figure 2 , the process from P 0 A 0 to P 02 A 02 with the rate constant k pas2 describes the slippage in Pathway II. P 02 A 02 then proceeds through peptidyltransfer to generate FS z .

In Pathway III, incomplete translocation occurs when the pre-translocational ribosome is positioned at XXY YYZ. Consequently, translation of the À1 frame is one codon downstream of YYZ and the ribosome produces FS z . It is important to note that while both Pathway I and III involve incomplete translocation, the Pathway I slip occurs one elongation cycle before the Pathway III slippage event.

All pathways were mathematically described as systems of ordinary differential equations (Supplementary Data). Assuming steady state, the expressions of intermediate Figure 2 . The kinetic framework for programmed -1 ribosomal frameshifting. Top: the procedure from PA to E 0 P 0 represents translocation, which involves r 1 , r -1 (reversible EF-G binding), r TL0 (GTP hydrolysis, unlocking conformation change, tRNA movement and Pi release), and r 45 (re-locking conformation change and EF-G dissociation). The E 0 P 0 complex then undergoes aa-tRNA selection: from E 0 P 0 to P 0 A 0 . The selection of aa-tRNA involves: k 1 , k -1 (reversible EF-Tu binding), k 2 , k -2 (reversible codon recognition), k 345 (GTPase activation, GTP hydrolysis, EF-Tu conformation change and dissociation), and k 6 (aa-tRNA rejection by proofreading), or k 7 (aa-tRNA accommodation). The elongation cycle without a -1 PRF event results in synthesis of non-frameshift proteins (NFS). Pathway I in green suggests that -1 PRF occurs during the relocking step in the first translocation, leading to the formation of FS m . Pathway Ia indicates that the E 02 P 02 complex may interact with a zero frame aa-tRNA and eventually produce FS z . Pathways II suggests that -1 PRF occurs during aa-tRNA selection and accommodation, resulting in FS z . Pathway III suggests that -1 PRF occurs during the second translocation step, resulting in FS z production. concentrations in terms of initial reactant (PA) were solved by Matlab v.R2008a (Mathworks Inc., Natick, MA, USA). By applying the empirically-determined rate constants and assumed ranges of rate constants of incomplete translocation, P-and A-site tRNA slippage (Supplementary Tables S1-S4) , the amount of non-frameshift proteins NFS (p 0 a 0 pt in the kinetic model) and two types of frameshift proteins, FS m (P 02 A 2 pt in the kinetic model) and FS z (P 02 A 02 pt and p 02 a 02 pt in the kinetic model), were identified. The frameshift efficiency (FS%) in the model is defined as the amount of frameshift proteins divided by the amount of total proteins and multiplied by 100% [Equation (1)]. The fraction of FS m is calculated as the amount of FS m divided by the amount of total frameshift proteins and multiplied by 100% [Equation (2)].

A program was developed in Matlab v.R2008a to perform an n-way analysis of variance (ANOVA). Each parameter in the model was varied over five levels: a base line value, ±25% of the base line, a ±50% of the base line. Randomly selected 10 000 parameter sets were used to calculate FS%. A higher F statistic indicates a larger impact of the parameter on FS%.

Escherichia coli XL1 blue MRF 0 (Stratagene, La Jolla, CA, USA) was used in all experimental studies. All constructs were verified by DNA sequencing at the Cornell Bioresource Center. Construction of the dual fluorescence reporter was described earlier (27, 30) , except that different linker sequences were incorporated into the reporter plasmid ( 

Cells with the appropriate plasmids were cultured in 1 ml Luria-Bertani (LB) medium containing 100 mg/ml ampicillin with or without 0.75 mg/ml chloramphenicol in 24-well plates for 24 h at 250 rpm and 37 C. Fluorescence was measured using a plate reader (SpectraMax M5, Molecular Devices, Sunnyvale, CA, USA). Fluorescence measurements were performed as described earlier (27) . Experimental frameshift efficiency (FS% exp ) was obtained as the ratio of green fluorescence to red fluorescence for the test strains and normalized against the fluorescence ratio of the control strain. Statistical analyses were applied to all data sets as described earlier (32) . A total of 23-46 replicates for test strains and control strains were performed to satisfy the minimum sample requirement for statistical significance.

Test strains were grown in 100 ml LB medium containing 100 mg/ml ampicillin in 500 ml flasks at 250 rpm and 37 C. After 24 h, 200 OD 600 units of cells were collected by centrifugation at 4000g and 4 C for 20 min. Cells were lysed and purified by Ni-NTA under native conditions according to the manufacturer's protocol (Qiagen). Purified protein samples were resolved by SDS-PAGE (10% w/v polyacrylamide). Gel band excision and in-gel trypsin digestion were performed using a previously described standard method (33) .

A representative flow chart of the mass spectrometry analysis is shown in Supplementary Data (Supplementary Figure S1 ). Trypsin-digested frameshift protein samples resulted in target peptides spanning the recoding sites with a single amino acid difference between FS z and FS m . These peptides were analysed by nano-flow liquid chromatography tandem mass spectrometry using multiple reaction monitoring (nLC-MRM/MS). The digested sample was vacuum dried, reconstituted with 30 ml of 0.1% formic acid (FA), and a portion of each reconstituted sample was injected into Dionex 3000 nLC system (Sunnyvale, CA, USA). First, the sample was loaded onto an Acclaim PepMap 100 C18 trap column (300 mm Â 5 mm, 5 mm) and on-line desalting was carried out with water (0.1% FA) at a flow rate of 30 ml/min for 5 min. Then, peptides trapped in the trap column were GCT AAT TTT TTA GGG AAG ATC TGG CCT TCC TAC AAG GGA AGG CCA GGG AAT TTT CTT GGA TAA AG  MB2  GCU CCT TTT TTA GGG AAG ATC TGG CCT TCC TAC AAG GGA AGG CCA GGG AAT TTT CTT GGA TAA AG  MB2UCC  GCC CCT TTT TTA GGG AAG ATC TGG CCT TCC TAC AAG GGA AGG CCA GGG AAT TTT CTT GGA TAA AG  MB2CCC  TTC CCT TTA AAC CAG AAC GCC TCC AGG CCT TGC AAC ACT TGG TCC GGA AGG CCC TGG AGG CAG GCC TAA (3)]. Samples were analysed in triplicates (except duplicates of CCC and TLV).

Fraction of FS m ð%Þ observed by MS

where A FSm is the sum of peak areas at different charge states for an FS m target peptide and A FSz is the same for an FS z target peptide in MS.

The kinetic model allows for the evaluation of the effects of different translation elongation cycle steps on FS% and the fraction of FS m . Sensitivity analysis revealed several parameters that have a greater influence on FS% ( Figure 3) . Therefore, the model results will focus on these higher impact parameters in different pathways.

In Pathway I, À1 PRF occurs during translocation while the pre-translocational complex is aligned with deX XXY. Two parameters play important roles in Pathway I in the kinetic model. Here, r t represents the rate constant for incomplete translocation. An increase in r t while other parameters in the model remain constant leads to an increase in FS% (blue line in Figure 4a ). Both the levels of FS m and FS z increase when r t increases (green and red lines in Figure 4a ). Because the rise in the FS m level is larger, increasing r t results in a larger FS m fraction (Figure 4b ). It is also interesting to note that the majority of FS z comes from Pathway III when r t is <2 s À1 , while the majority of FS z is from pathway II when r t is >2 s À1 (Figure 4b) . The rate constant r 45 accounts for the relocking step during translocation. A decrease in r 45 while other parameters in the model remain constant results in an increase in FS% (blue line in Figure 5a ). Both the levels of FS m and FS z increase when r 45 decreases (green and red lines in Figure 5a ). However, in this case the increase in the FS m level is larger, leading to a larger FS m fraction with a decrease in r 45 (Figure 5b) . Here, the majority of FS z is from Pathway III when r 45 is <22 s À1 , but the majority of FS z results from pathway II when r 45 becomes >22 s À1 ( Figure  5b) . These results suggest that translocation perturbations by either a downstream mRNA secondary structure, by mutations, or by chemical inhibitors may result in production of a higher FS%, primarily due to production of a larger amount of FS m . Notably, manipulating r t values causes larger changes in FS% and in the FS m fraction compared to the effect of r 45 , suggesting a dominant role by r t on À1 PRF in Pathway I. Consistent with our model, experimental studies demonstrated that mutating the E-site codon in the recoding site, or the use of a translocation inhibitor altered FS% (22) .

In Pathway II, À1 PRF occurs during aa-tRNA accommodation and the slippage occurs before peptidyltransfer. Figure 6a shows that a higher k pas2 results in a higher FS%. Interestingly, the larger FS% results from an increase in FS z while the level of FS m remains at a similar level (Figure 6a) . Therefore, the fraction of FS m is predicted to decrease as k pas2 increases ( Figure 6b) . Here, the majority of FS z is generated from Pathway III when k pas2 is <3 s À1 , while the majority of FS z is produced from pathway II when k pas2 is >3 s À1 (Figure 6b) .

In Pathway III, À1 PRF occurs during translocation while the pre-translocational complex is aligned with XXY YYZ. The rate constant for the incomplete translocation step is demonstrated by r t3 . Figure 7a shows that a higher r t3 promotes increased FS%. Interestingly, the larger FS% results from an increase in FS z while the level of FS m remains relatively constant (Figure 7a) . Therefore, the fraction of FS m is predicted to decrease as r t3 increases (Figure 7b ). In this case, the majority of FS z is generated from Pathway II when r t3 is <1 s À1 , but the majority of FS z comes from pathway III when r t3 is >1 s À1 (Figure 7b ).

In the model, k pt represents the rate constant for peptidyltransfer, the last step in all three pathways. The model predicts that a decrease in k pt would result in a higher FS% due to increased production of FS z , while FS m synthesis remains relatively constant (Figure 8a ). Consequently, a smaller fraction of FS m is observed as k pt decreases (Figure 8b ). In this scheme, the majority of FS z is synthesized from Pathway II when k pt is <15 s À1 , while the majority of FS z comes from pathway III when k pt is >15 s À1 (Figure 8b) . The model results are consistent with previous experimental observations that peptidyltransferase inhibitors affect FS% (35) .

To examine the model predictions, À1 PRF efficiency was monitored in vivo using a dual fluorescence reporter system. In addition, compositions of the frameshift protein products were analysed by mass spectrometry. Analysis of the frameshift products revealed that the ratio of FS z to FS m was $4:1 in MB2 cells (Figure 9 ), thus indicating that the vast majority of À1 PRF events naturally occur through Pathways II and/or III. The model predicts that a smaller k pt should cause higher FS% and a lower fraction of FS m (Figure 8) . A prior study using yeast demonstrated that inhibition of peptidyltransfer promoted increased rates of À1 PRF, but did not differentiate between FS m and FS z products (36) . The model predicts that addition of chloramphenicol, a potent peptidyltransferase inhibitor in bacteria (37), should promote increased FS%. Consistent with the model, a 2.1-fold increase in FS% exp was observed in the E. coli culture containing 0.75 mg/ml chloramphenicol compared to the culture without the drug. The fractions of FS m for the culture with and without chloramphenicol were 17.3 and 20.4%, respectively (Figure 9a) . Although a slight decrease in the fraction of FS m was observed in the presence of the drug, the difference was not statistically significant (P > 0.05).

The frameshift sequence for HIV-1 is U AAU UUU UUA, where a space separates each zero frame codon and the P-site of the recoding site is underlined. The E-site tRNA GUU Asn may form one canonical base pairing with the À1 frame UAA. In the MB2UCC strain, the sequence was mutated to U CCU UUU UUA (mutations shown in bold) where the E-site tRNA GGG Pro can potentially form one G:U and two C:G interactions. In the MB2CCC strain, the sequence was mutated to C CCU UUU UUA (mutations shown in bold) where the E-site tRNA GGG Pro can form three canonical base pairings with the À1 frame CCC. Because Pathway I requires that the E-and P-site tRNAs interact with the À1 frame, UCC and CCC as the À1 frame E-site codons may enhance this reaction, i.e. these codons would promote an increase in r t . The model predicts that a larger r t should result in a higher FS% due to increased production of FS m . Consistently, 1.2-and 1.4-fold increase in FS% exp are observed for the MB2UCC and MB2CCC strains compared to the MB2 strain, respectively (Figure 9b ). In the MB2UCC strain, 76.4% of the frameshift products were FS m , and in the MB2CCC strain, 90.3% of the frameshift products were FS m . These results suggest that changing the sequence to favor incomplete translocation, i.e. to favor Pathway I, can dramatically alter the composition of the frameshift product.

To further our understanding of two types of frameshift proteins, the HTLV-1 pro-pol frameshift sequence was cloned into the reporter system. The extended frameshift sequence for HTLV-1 pro-pol is C CCU UUA AAC (where spaces separate zero frame codons and the slippery sequence is underlined). Similar to MB2CCC, the E-site tRNA GGG Pro can potentially form three canonical base pairings with the À1 frame CCC, which may create a favorable condition for Pathway I. Consequently, a significant amount of FS m among total frameshift proteins can be produced. Consistent with the model, the frameshift efficiency for HTLV-1 was 4.81% and the fraction of FS m was 39.4%.

In this study, a mathematical framework was developed for À1 PRF. To our knowledge, this is the first kinetic model to explain the production of two types of À1 frameshift proteins through three distinct kinetic pathways. Using deX XXY YYZ fgh as an example, Pathway I predicts that a pre-translocational ribosome aligning with deX XXY may shift reading frame during incomplete translocation, producing the frameshift product (FS m ) incorporating a À1 frame aa-tRNA in the frameshift site (codon YYY). Additionally, Pathway II predicts that a ribosome can change reading frame due to simultaneous slippage of P-and A-site tRNAs for a ribosome aligning with XXY YYZ, generating a frameshift product (FS z ) incorporating the zero frame aa-tRNA in the frameshift site (codon YYZ). Lastly, in Pathway III, a pretranslocational ribosome aligning with XXY YYZ may also undergo incomplete translocation to generate FS z .

The kinetic model suggests that incomplete translocation of the pre-translocational ribosome aligning with deX XXY produces FS m . Previous studies suggested that FS m may result from slippage of a single P-site tRNA (1, 25, 26) . However, it is not clear when and/or how this could occur. In addition, the single slippage model does not explain experimental evidence regarding the influence of translocation on À1 PRF (22, 23) . Our model suggests that in both mechanisms, incomplete translocation and slippage of P-and A-site tRNAs participate in synthesizing frameshift proteins to varying extents for different À1 PRF signals. Frameshifting at the HIV-1 sequence was reported to generate $70% FS z and 30% FS m (1, 25) , indicating that Pathways II and/or III exert stronger influence on FS% than Pathway I. Notably, our protein analysis showed $80% FS z and 20% FS m for the frameshifting signal in HIV-1 group M subtype B. This small discrepancy may due to the use of different reporter systems, or differences in the quantitative methods employed for the assay. For the HTLV-1 pro-pol frameshift sequence, this study observed 39.4% FS m in total frameshift protein. This is the first study to demonstrate another frameshift sequence that generates a significant amount of FS m in addition to HIV-1. Interestingly, a small but observable lysine peak appeared at the corresponding position (-1 frame A-site codon in the recoding site) when HTLV-1 pro-pol frameshift proteins were sequenced in the previous study (31) , supporting the production of FS m . The stimulatory RNA of HTLV-1 pro-pol has been suggested to be a pseudoknot (31) , although no direct evidence for the RNA structure was shown in the same study. Interestingly, the mouse Edr frameshift sequence, which shares high similarity with the HTLV-1 pro-pol sequence, was suggested to involve a pseudoknot (38) . Because the length of HTLV-1 pro-pol stimulatory signals is not well-defined, the frameshift sequence incorporated in the TLV strain may not represent the whole stimulatory signal. However, the possible absence of a portion of the stimulatory signal did not prevent us from observing a significant amount of frameshift efficiency and the ratio of FS m to the total frameshift protein. Frameshift products from other À1 PRF signals were analysed previously (39, 40) . For SARS-CoV frameshifting, FS m was not found (39) . However, for the Alphavirus coding sequence 6k, both FS z and FS m were identified in the frameshift products, although the exact ratio was not determined (40) .

The current study shows that all three kinetic pathways are operative during À1 PRF. For HIV-1 and HTLV-1, experimental results indicate that 20.4 and 39.4%, respectively, of frameshift proteins are FS m , i.e. the contribution of Pathway I. Although the experimental approach cannot differentiate between the relative contributions of Pathways II and III to FS z production, the kinetic model can be used to discriminate between the effects of these two pathways. In the kinetic model, r t , k pas2 and r t3 represent non-regular events, which are likely to be ratelimiting steps. These rate constants are thus expected to be small. The values for r 45 and k pt are 5 and 50 s À1 , respectively (Supplementary Tables S1 and S2). As shown in Figures 4-8 , r t , r 45 , k pas2 , and r t3 are all in the small value ranges, and k pt at 50 s À1 , suggest that Pathway III contributes approximately equal or more to the FS z production than does Pathway II. Interestingly, inhibition of peptidyltransfer (decreasing k pt ) can switch to a condition in which Pathway II contributes more to FS z production than Pathway III (Figure 8b ). These findings are consistent with the hypothesis that the À1 PRF signals of different viruses have evolved within these kinetic parameters so as to produce the optimal ratios of shifted to unshifted products according to their specific biological requirements.

The effect of incomplete translocation can be understood in two ways. Incomplete translocation at the pre-translocational ribosome aligning with deX XXY produced more FS m (Pathway I), consistent with the model by Leger et al. (22) . Incomplete translocation at the pre-translocational ribosome aligning with XXY YYZ resulted in more FS z (Pathway III), consistent with the models proposed by Weiss et al. (23) and Namy et al. (24) . Enhancing incomplete translocation in Pathway I promoted an increase in the fraction of FS m among total frameshift proteins. On the other hand, enhancing incomplete translocation in Pathway III decrease the fraction of FS m . While these two pathways were supported by previous studies, the direct observation of FS m for HIV-1 and HTLV-1 pro-pol frameshift sequences provided proof for the validity of Pathway I. Interestingly, altering the extended frameshift sequence resulted in a significant change in frameshift protein compositions, supporting the role of the sequence upstream of the traditional slippery site XXY YYZ.

In the presence of chloramphenicol, a 2.1-fold increase in FS% exp was observed while the fraction of FS m was not significantly different compared to the culture condition without the chemical (Figure 9a) . The model predicts that k pt has a relatively smaller effect on FS% and the fraction of FS m than r t and k pas2 (Figure 8) . A dual fluorescence reporter can sensitively detect small changes in FS% in E. coli and mammalian cells (27, 30, 41) . On the other hand, analysing the composition of the frameshift products relies on multiple manipulations including protein purification, gel electrophoresis, in-gel digestion, Table 1 ). liquid chromatography and mass spectrometry. The multistage preparation may thus affect sample yields, complicating the detection of small changes in protein composition.

A significant increase in the fraction of FS m was observed in both MB2UCC and MB2CCC strains (Figure 9b) . Mutation of the À1 frame E-site sequence to UCC and CCC in the HIV-1 frameshift site may enhance incomplete translocation in Pathway I by allowing more interactions between E-site tRNA and the À1 frame. This result is consistent with the model that different mechanisms exist and participate in making frameshift proteins to different extents. The creation of a favorable condition for one pathway can affect the composition of frameshift proteins significantly. To date, no mutations affecting the composition of frameshift proteins have been reported in the literature. Notably, our experimental results show that in one case, FS% increases significantly without a change in the composition of frameshift products (Figure 9a) , while in another condition FS% increases a smaller amount but the composition of frameshift products change dramatically (Figure 9b) .

Several parameters in the kinetic model, such as r t , r -t , k pas2 , k -pas2 , r t3 and r -t3 , have not been measured experimentally in the literature. The test ranges for these parameters are based on our current kinetic understanding of translation elongation. The rate constants for translocation and aa-tRNA selection range from 0.47 to 1980 s À1 (Supplementary Tables S1-S4). The average protein synthesis rate is 10-20 s À1 in prokaryotic cells but some codons are translated at a rate <10 s À1 (42). Therefore, a broad range (1-100 s À1 ), within the scope of the known elongation rate constants, was tested to understand the impact each of these unknown parameters on À1 frameshifting (Figures 4, 6 and 7) . Notably, these figures do not show the impact of these parameters >100 s À1 because the curves level off for the parameters at this range.

Although the investigation of whether the slippage of the two tRNAs is a simultaneous or a sequential process is beyond the scope of this study, a similar kinetic approach Figure 10 . The effect of the rate constants related to sequential tRNA movement (a) r t1 , (b) r -t1 , (c) r t2 , and (d) r -t2 on the rate constants related to single step sequential tRNA movement (r t and r -t ). The base point is assumed as r t1 = 1 s À1 , r -t1 = 10 s À1 , r t2 = 1 s À1 , and r -t2 = 10 s À1 . The inset in (d) shows a zoom in of the plot.

can be used to understand the effect of sequential tRNA movement on the overall process. Using the movement of E-and P-site tRNAs as an example, the overall movement and the sequential movement can be described as the following:

One step tRNA movement (Rx. Assuming steady state, r t and r -t can be represented by r t1 , r -t1 , r t2 and r -t2 (Supplementary Data). Figure 10 shows how a change in the rate constants in Rx.2 can affect the overall rate constants in Rx.1. The result suggests that repositioning of the E-site tRNA to the À1 frame (represented by r t1 ) may have a larger impact than repositioning of the P-site tRNA (represented by r t2 ) on the slippage toward the À1 frame (represented by r t ). On the other hand, repositioning the P-site tRNA back to zero frame (represented by r -t2 ) may have larger impact than repositioning the E-site tRNA (represented by r -t1 ) on the slippage toward the zero frame (represented by r -t ). Similarly, the same observation also applies to the movement for P-and A-site tRNAs (k pas2 , and k -pas2 ) in the model.

A mathematical framework developed upon the translation elongation cycle revealed three distinct kinetic pathways for À1 PRF. The model describes how alterations of these kinetic parameters can affect not only changes in frameshift efficiency, but also changes in the composition of frameshift products under different conditions. In addition, the model identifies the dominant parameters, representing steps in the translation elongation cycle, on À1 PRF. Experimentally targeting these steps resulted in different levels of frameshifting efficiency, consistent with model predictions. A mutation in the À1 frame E-site sequence was shown to dramatically change the composition of frameshift products, suggesting an important role for the sequence upstream of the slippery site. Our results suggest that not only the frameshift efficiency, but also the compositions of the frameshift products, are worth investigating to advance our knowledge of À1 PRF.

Characterization of ribosomal frameshifting in HIV-1 gag-pol expression

Mechanisms and enzymes involved in SARS coronavirus genome expression

The human immunodeficiency virus type 1 ribosomal frameshifting site is an invariant sequence determinant and an important target for antiviral therapy

The role of programmed-1 ribosomal frameshifting in coronavirus propagation

Achieving a golden mean: Mechanisms by which coronaviruses ensure synthesis of the correct stoichiometric ratios of viral proteins

Translating old drugs into new treatments: Ribosomal frameshifting as a target for antiviral agents

Mutational analysis of the ''slippery-sequence'' component of a coronavirus ribosomal frameshifting signal

Signals for ribosomal frameshifting in the rous sarcoma virus gag-pol region

Characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an RNA pseudoknot

RNA pseudoknots: translational frameshifting and readthrough on viral RNAs

Ribosomal movement impeded at a pseudoknot required for frameshifting

Ribosomal pausing during translation of an RNA pseudoknot

Kinetics of ribosomal pausing during programmed -1 translational frameshifting

The sequences of and distance between two cis-acting signals determine the efficiency of ribosomal frameshifting in human immunodeficiency virus type 1 and human T-cell leukemia virus type II in vivo

Programmed alternative reading of the genetic code

The 9-A solution: How mRNA pseudoknots promote efficient programmed -1 ribosomal frameshifting

Translocation of tRNA during protein synthesis

Translational misreading: mutations in translation elongation factor 1alpha differentially affect programmed ribosomal frameshifting and drug sensitivity

An ''integrated model'' of programmed ribosomal frameshifting

A reassessment of the response of the bacterial ribosome to the frameshift stimulatory signal of the human immunodeficiency virus type 1

Comparative mutational analysis of cis-acting RNA signals for translational frameshifting in HIV-1 and HTLV-2

The three transfer RNAs occupying the A, P and E sites on the ribosome are involved in viral programmed -1 ribosomal frameshift

E. coli ribosomes re-phase on retroviral frameshift signals at rates ranging from 2 to 50 percent

A mechanical explanation of RNA pseudoknot function in programmed ribosomal frameshifting

The function of a ribosomal frameshifting signal from human immunodeficiency virus-1 in Escherichia coli

P-site tRNA is a crucial initiator of ribosomal frameshifting

A new kinetic model reveals the synergistic effect of E-, P-and A-sites on +1 ribosomal frameshifting

An elongation factor G-induced ribosome rearrangement precedes tRNA-mRNA translocation

Recognition and selection of tRNA in translation

FSscan: a mechanism-based program to identify +1 ribosomal frameshift hotspots

Characterization of ribosomal frameshifting for expression of pol gene products of human T-cell leukemain virus type 1

Systematic analysis of bicistronic reporter assay data

Comparison of automated in-gel digest methods for femtomole level samples

Probability-based protein identification by searching sequence databases using mass spectrometry data

Peptidyl-transferase inhibitors have antiviral properties by altering programmed -1 ribosomal frameshifting efficiencies: development of model systems

Decreased peptidyltransferase activity correlates with increased programmed -1 ribosomal frameshifting and viral maintenance defects in the yeast Saccharomyces cerevisiae

Structural basis for the interaction of antibiotics with the peptidyltransferase centre in eubacteria

Characterization of the frameshift signal of Edr, a mammalian example of programmed -1 ribosomal frameshifting

Programmed ribosomal frameshifting in decoding the SARS-CoV genome

Discovery of frameshifting in alphavirus 6K resolves a 20-year enigma

A homogeneous cell-based bicistronic fluorescence assay for high-throughput identification of drugs that perturb viral gene recoding and read-through of nonsense stop codons

Absolute in vivo translation rates of individual codons in Escherichia coli. The two glutamic acid codons GAA and GAG are translated with a threefold difference in rate

The authors acknowledge Abhinav Rabindra Jain for assistance in the laboratory. 

Conflict of interest statement. None declared.