key: cord-263645-wupre5uj authors: Morgan, Brittany S; Forte, Jordan E; Hargrove, Amanda E title: Insights into the development of chemical probes for RNA date: 2018-09-19 journal: Nucleic Acids Res DOI: 10.1093/nar/gky718 sha: doc_id: 263645 cord_uid: wupre5uj Over the past decade, the RNA revolution has revealed thousands of non-coding RNAs that are essential for cellular regulation and are misregulated in disease. While the development of methods and tools to study these RNAs has been challenging, the power and promise of small molecule chemical probes is increasingly recognized. To harness existing knowledge, we compiled a list of 116 ligands with reported activity against RNA targets in biological systems (R-BIND). In this survey, we examine the RNA targets, design and discovery strategies, and chemical probe characterization techniques of these ligands. We discuss the applicability of current tools to identify and evaluate RNA-targeted chemical probes, suggest criteria to assess the quality of RNA chemical probes and targets, and propose areas where new tools are particularly needed. We anticipate that this knowledge will expedite the discovery of RNA-targeted ligands and the next phase of the RNA revolution. The field of RNA biology has exploded in recent years with the discovery of non-coding RNAs (ncRNAs) that regulate essential processes in all living organisms (1) . These processes include transcription, translation, and evasion in bacteria and archaea (1, 2) as well as replication, persistence, and cellular transformation in viruses (3) . Within the human genome, protein-coding genes are vastly outnumbered by regulatory ncRNAs that can influence a wide range of cellular functions (2, 4) . Many of these ncRNAs are dysregulated in and implicated as drivers of various human diseases, including metastatic cancers and neurologi-cal and neuromuscular disorders (2, 5, 6) . This 'RNA revolution' is radically changing our understanding of the role RNA plays in fundamental biology and is rapidly driving scientific innovation. Methods and tools to structurally and functionally characterize RNAs at the molecular level, however, are more difficult and/or lacking as compared to those for proteins (7) (8) (9) (10) . One important example is the development of chemical probes, which has greatly progressed the study of proteins and related diseases (11, 12) but has been challenging for non-ribosomal RNAs. This powerful chemical tool requires small molecules with well-defined biological activity, cell permeability, and selectivity to accurately and reliably probe specific mechanistic and phenotypic questions (11, 12) . Given the potential advantages of small molecule chemical probes over biological approaches (e.g. siRNAs, ASOs and CRISPR-Cas) (13, 14) and the power of using both approaches in tandem (12) , the development of RNAtargeted chemical probes has the potential to greatly benefit both chemists and biologists interested in RNA. While ligands that bind non-ribosomal RNA in vitro have been reported for decades, the development of chemical probes with evidence of specific small molecule:RNA engagement in cell or animal models has dramatically increased in the last four years. Recent studies report several drug-like small molecules that target a range of RNAs in animal models, including riboswitches (15) , miRNAs, (16, 17) splice sites (18) , and mature mRNAs (19) , at least one of which is currently in clinical trials (NCT02268552). Multivalent ligands have been reported that target r(CUG) exp repeats of Myotonic Dystrophy Type 1 (DM1) in D. melanogaster (20) and mouse models (21) . These recent successes confirm that selective RNA targeting is achievable in biological systems; however, the limited examples over years of effort highlight the challenges associated with selectively probing RNA. Recently, we compiled the RNA-targeted BIoactive ligaNd Database (R-BIND), which comprised organic small molecules that target non-ribosomal RNAs and show activity in cell culture or animal models (22) . For this Survey, the compilation was updated to include chemical probe discoveries through May 2017 for a total of 116 chemical probes (see Supplementary Material, R-BIND 1-1.xls). Aminoglycosides are excluded due to established non-specific binding behavior (23) (24) (25) as well as peptides and oligonucleotides due to distinctive medicinal chemistry properties (26, 27) . The chemical probes were divided into two classes: monovalent, traditional drug-like small molecules (SM) and multivalent ligands (MV) with alkyl, aryl, or peptidyl linkers between multiple binding moieties. Previous work compared the physicochemical, structural, and spatial properties of the small molecules to FDAapproved drugs as well as RNA-binding ligands without reported biological activity (22) . This analysis revealed several key differences between these libraries that can in turn be used to bias small molecules toward biological RNA targeting. In addition to that work, the curation of this collection allowed us to gain insights into: (i) the RNA elements targeted; (ii) the design and discovery strategies utilized and (iii) the in cellulo characterization of these chemical probes. Herein, we discuss these insights, highlight unique examples, and consider the need to establish standards for cellbased selectivity. We conclude by proposing future directions that utilize our current and prospective chemical biology toolbox to expedite the discovery of chemical probes for RNA. The 116 chemical probes targeted 33 distinct RNA elements, including those from bacterial, fungal, human, or viral systems [ Figure 1 ]. Although some overlap between targets was observed, the small molecules probed a wider range of RNAs in cell culture than the multivalent ligands [ Figure 1A ]. The most common small molecule target was the HIV-1 Trans-activation response element (TAR) RNA, a well-studied and frequently screened RNA that binds to the viral protein Tat (23) . Disruption of this interaction reduces viral production and represents an alternative strategy against HIV. Some of the first RNA-targeted chemical probes were developed for TAR RNA, including a tetraaminoquinozaline (28) and a 6-aminoquinolone (29, 30) with an EC 50 value of 16 M and an IC 50 value of 0.85 M, respectively, in chronically infected HIV cell models. On the other hand, only one bioactive small molecule was identified for a fungal target, specifically the Candida albicans LSU Group 1 Ribozyme (31). This essential ribozyme is a desirable antifungal target as it leads to failed ribosomal assembly when mutated and is absent in the human genome. Further, 10/75 small molecule chemical probes demonstrated efficacy in animal models, targeting seven unique RNA elements in bacterial and human systems. One recent example (19) targeted the G-quadruplex structure located in the 5 -untranslated region (UTR) of Human Vascular Endothelial Growth Factor (hVEGF) mRNA, an angiogenic growth factor involved in tumor progression. In a breast cancer mouse model, the small molecule showed antitumor efficacy similar to that of doxorubicin but with fewer indications of side effects. Multivalent chemical probes targeted fewer distinct RNA elements, with 27/41 unique ligands targeting nucleotide repeat expansions [ Figure 1B ]. There are several advantages to targeting these RNA repeats: (i) long repeat stretches are typically not present elsewhere in the human genome; (ii) nuclear localization minimizes competition with ribosomal RNA and (iii) targetable motifs are separated by a specific distance (32) . Another target of interest was the heat shock response element of the 32 factor mRNA in E. coli. This RNA element contains a rare, perfectly paired three-way junction that can be stabilized by symmetrical triptycenebased molecules, forming a distinct shape-selective fit (33) . This stabilization resulted in an ∼60% reduction in translation of an 32 -GFP fusion protein and could potentially lead to antimicrobial activity (34) . In addition, 7/41 chemical probes showed efficacy in animal models, targeting two RNA elements: r(CUG) exp repeats and pri-miRNA-96. Pri-miRNA-96 is an oncogenic RNA that suppresses the translation of a pro-apoptotic protein, FOXO1. In a mouse model of triple negative breast cancer, a modular ligand designed to target the Drosha processing site on the RNA led to a statistically significant reduction in tumor size and to changes in RNA and protein levels consistent with the proposed mode of action (17) . Notably, examination of this list further exposes the RNA-driven processes and diseases that still lack functional chemical probes. Ideal RNA targets have defined functional sites and/or clear phenotypes while also being of high abundance, and several untargeted RNAs, ranging from archaeal ncRNA to oncogenic lncR-NAs, meet these criteria. Many of the monovalent ligands were discovered through traditional screening methods. Approximately one-third of the RNA:ligand interactions were identified by each of the following approaches: focused-screening (FcS), highthroughput screening (HTS), and HTS followed by lead optimization (HTS-LO) [ Figure 2A ]. In this Survey, FcS is defined by the use of biased libraries, which are typically based on prior knowledge of a particular chemotype binding to an RNA element. In contrast to FcS libraries, molecules specifically designed to explore structureactivity relationships were classified as lead optimization (LO). The starting points for several of the FcS libraries and/or other small molecule identification strategies included RNA-binding natural products, chemical similarity searching, and scaffold-based synthesis [ Figure 2B -D]. We caution that the relative success of these various approaches cannot be evaluated since failed attempts are not typically documented in the literature. Hit rates are one of the benchmarks used to assess the efficiency of a screen. We note prior to the discussion that comparisons across studies should be interpreted with caution as the definition of a small molecule lead, the specific assays used in primary screens, the controls utilized in the assay, and the number of false-positives and -negatives can be highly variable and are not always reported. Of the 41 RNA:small molecule interactions discovered through HTS and FcS, 20 had reported hit rates, which were compared by screening approach, primary screen, and primary library [ Figure 3 ]. The higher hit rates found in some FcS approaches provide compelling evidence that FcS is efficient for RNA targets, as it is known to be for protein targets (35, 36) . Moving forward, characterization of additional RNA tertiary structures (8, 9, 37, 38) and the identification of novel RNA-binding chemotypes (8, 39, 40) will expedite the FcS approach for discovering biologically active RNA-targeted ligands. While hit rates varied widely within each type of primary screen and primary library, these com-parisons support the potential of many distinct paths toward RNA ligand discovery. Specific aspects of library and screen design are discussed below. Small molecules discovered by HTS were typically from large libraries (n > 50 000 small molecules), including three corporate libraries and the NIH Small Molecule Repository [ Table 1 ]. There were select examples of smaller libraries as well: FDA-approved drugs (n = 1120) and UCLA academic library (n = 1692). Importantly, some of these reports explicitly stated that libraries were filtered to yield small molecules with favorable medicinal chemistry proper- S-amino acid conjugates A Screening Approaches ties prior to screening. While successful in protein-targeted drug discovery (41), only one report identified a bioactive ligand from a fragment-based library (commercial library) (42) . Once optimized, the scaffold yielded four additional molecules that targeted the Influenza A RNA promoter with IC 50 values ranging from 34 to 44 M in a cell-based luciferase assay (43) . Similarly, only two reports yielded bioactive small molecules from natural productbased libraries (synthetic library and academic library) (19, 44) . Both of these screens contained fewer than 150 small molecules, yet identified ligands that bind and modulate G-quadruplex structures located in the 5 -UTR of two distinct mRNAs. It is promising that small molecules were discovered from a variety of HTS libraries, suggesting that biologically active, RNA-binding ligands can be found in a subset of current small molecule chemical space (22) . Further validation and exploration of this space could lead to greater efficiency and success in identifying bioactive leads as well as RNA-privileged chemotypes. As expected, FcS used smaller libraries, typically containing fewer than 150 small molecules [ Table 1 ]. The largest FcS library (n = 320, academic library) was designed through a chemical similarity search of the bis-benzimidazole and similar cores, which have shown preferences for 1 × 1 nucleotide internal loops (45) . In addition, this library was filtered for favorable medicinal chemistry properties, and the screen resulted in three leads that: (i) bound r(CUG) exp in vitro; (ii) led to a statistically significant decrease in cTNT mini-gene exon inclusion in cells and (iii) were selective for a mini-gene with 960 r(CUG) repeats compared to a mini-gene without the repeats. FcS also encompassed RNA structure-guided design, which included two studies utilizing molecular modeling to identify ligands structurally similar to guanine for the xpt-pbuX riboswitch (46, 47) . In another structure-guided approach, a small library of p-terphenylene-based ligands was designed to mimic an ␣-helix of Rev, a protein-binding partner of the HIV-1 Rev Response Element (RRE) (48) . Leads were selected by docking, with one ligand disrupting the Rev-RRE interaction in vitro (IC 50 = 6.8 M), inhibiting HIV-1 replication (IC 50 values of 3.4-5.9 M), and exhibiting on-target effects via a RRE-luciferase reporter assay (IC 50 values of 10-21 M). Further, we did not identify successful reports of biologically active ligands from small molecule libraries biased to general RNAbinding. Recently, chemical companies have designed such focused libraries (ChemDiv, Nucleic Acid Ligands: http://www.chemdiv.com/nucleic-acid-ligands/; Otava Chemicals, RNA Targeted Library: http: //www.otavachemicals.com/products/targeted-librariesand-focused-libraries/rna-binding); however, the success of these libraries is yet to be reported. To date, successful FcS strategies have utilized knowledge of the RNA structure and/or a small molecule binder(s), neither of which is known for many therapeutically-relevant RNAs. The primary screening assay for each small molecule was categorized as computational, in vitro, or cell-based [ Figure 4A ]. This list contained a wide range of primary screening assays, with limited examples of the same assay being used for multiple targets. The majority of the chemical probes were discovered by in vitro primary screening assays (n = 28) with fewer in cellulo or silico examples. Of those in vitro primary screens, 15 were RNA:protein displacement assays [ Figure 4B ]. These included fluorescence-based assays (Förster resonance energy transfer (FRET) and fluorescence anisotropy) and radiolabel-based methods (mobility shift, scintillation proximity, and filtration assays). One rather unique assay utilized a molecular beacon approach to probe for stabilization of Stem Loop 3 (SL3), a presumptive structural switch located in the -Packing Domain in HIV-1 that is destabilized by binding of the Gag protein prior to packaging of the virus (49) . In this assay, the 5 -and 3 -terminal ends of the SL3 RNA were labeled with a TET fluorophore and a blackhole quencher (BHQ1), respectively. In the presence of Gag protein, the RNA construct became single stranded and the fluorescence was 'turned on'. When a small molecule stabilized the folded hairpin form of SL3 RNA, the Gag-promoted RNA destabilization was reduced and the fluorescence was quenched. The researchers screened a modest sized library (>2500 small molecules) and discovered a ligand that reduced viral production similar to models with a mutated -Packing Domain (p24 50 = 11.3 M). The remaining in vitro assays consisted of two activitybased screens and various RNA binding assays [Figure (v) competition dialysis and (vi) indicator displacement. We note that a lack of correlation in small molecule activity between in vitro RNA binding and RNA:protein displacement assays has sometimes been reported, (16, 51, 52) highlighting the importance of multiple assays and/or choosing the most relevant assay for a particular system. We also note that, as in all screens, a lack of correlation can be observed between in vitro activity and cell culture activity. These differences can be attributed to many factors, including small molecule uptake, localization, and metabolism, specificity or off-target effects, and target availability due to binding of other macromolecules or metabolites. Nonetheless, we emphasize the success of the in vitro assays mentioned here in developing RNA bioactives and the valuable insights gained from other ligands discovered by in vitro RNA assays without reported biological activity. Cell-based screens were the second most common primary screening assay for HTS or FcS approaches [ Figure 4A ] and often the preferred screen for bacterial and viral RNA targets. The exception was a splicing assay of human serotonin receptor 2C (HTR2c) mRNA where a green fluorescent protein (GFP) reporter was used to evaluate the inclusion or exclusion of a particular exon (53) . Bacterial and viral RNA cell-based studies also utilized reporter systems such as GFP or LacZ gene as well as more traditional phenotypic screens, such as growth inhibition or cell death. In one particular example, ∼57 000 ligands were screened in a growth inhibition assay against Escherichia coli with and without supplementation of riboflavin (15). This differential supplementation allowed researchers to specifically probe the riboflavin pathway and confirm the FMN riboswitch was targeted by Ribocil-B. In addition, one report measured enzyme activity and antigen production in addition to cell death measurements to assess antiviral activity against HIV-1 (30). Given these successes, cell-based assays likely offer unforeseen promises as primary screening assays to discover RNA-targeted probes. High-throughput and focused computational screens were used to identify four small molecules [ Figure 4A ]. Two small molecules were identified by docking against experimentally determined structures of HIV-1 TAR (54) and RRE RNA (48) . In another example, small molecules were modeled into an X-ray diffraction structure of the xpt-pbuE guanine riboswitch aptamer after the native ligand was removed (46) . Criteria such as geometrical constraints, hydrogen bonding patterns, and molecule planarity were used to assess the 'fit' of the ligand, leading to the selection of two small molecules, one of which had antimicrobial activity against 9 of the 15 Gram-positive bacteria species tested and was selective for species with the guaA gene under riboswitch control. The fourth example utilized a computationally-predicted 3D structure of the severe acute respiratory syndrome coronavirus (SARS-CoV) pseudoknot (55) . A library of 80 000 small molecules was docked against the predicted structure, and the 58 highest scoring molecules were tested in an in vitro activity-based assay. The screen resulted in a biologically active ligand with an IC 50 value of 0.45 M in cell-based models. Additional advances in computational structural prediction and RNA:ligand docking will undoubtedly lead to improved computational primary screens and thus more efficient experimental screens (56, 57) . In addition to the primary screens, a database of known RNA motif:small molecule interactions, Inforna (39) , was utilized to identify seven small molecules. The database was generated using a library versus library approach named 2-Dimensional Combinatorial Screening (2DCS). In this method, small molecules are immobilized onto a microarray slide and then incubated with libraries of labeled, randomized RNA secondary structures. The bound RNAs are excised, sequenced, and assigned a fitness score using Structure-Activity Relationships Through Sequencing (StARTS). Fitness scores reflect the affinity and selectivity of a given RNA motif:small molecule interaction and are represented on a numerical scale, where a higher score represents greater selectivity. To utilize the Inforna database, a computational or experimental secondary structure of RNA is input, the 2DCS data is searched and lead molecules are proposed. This strategy identified bioactive ligands for five targets: (i) MAPT pre-mRNA, 1-nucleotide bulge; (58) (ii) Pre-miRNA-96, 1 × 1 internal loop (59); (iii) Pre-miRNA-18a, 1-nucleotide bulge (60); (iv) Pre-miRNA-210, 1 × 1 internal loop (16) and (v) Pre-miRNA-544, 1 × 1 internal loop (61) . Other examples of ligands not identified from a primary screening assay included the selection and characterization of four metabolite analogs for riboswitch inhibition (62) (63) (64) (65) . In contrast to small molecules, most multivalent ligands were developed through rational design based on the secondary structure of the RNA target [ Figure 5A ]. Generally, development began by the identification of monovalent ligands that bound to a particular secondary structure motif(s) by screening, literature search, or using Inforna [See section 'Other Methods of Small Molecule Discovery']. Monomers were covalently linked by selecting and optimizing appropriate sized spacers. For several multivalent ligands, the design was inspired by a crystal structure of r(CUG) sequences, leading to ligands targeting r(CUG) exp in Drosophila melanogaster models (66) . This approach linked acridine and a triaminotriazine unit, the latter of which was proposed to recognize the non-optimal base pairing of U-U mismatches by Janus-wedge hydrogen bonding. Stacking of the two units was expected to decrease nonspecific intercalative binding. This early design was optimized to yield bisamidinium conjugates that mitigated the glossy and rough eye phenotype observed in a DM1 transgenic Drosophila melanogaster model (67) . A different approach utilized Hoechst 33258, which had been previously reported to bind a 5 CUG/3 GUC internal loop (68) . Hoechst 33258 was modified to contain an azide handle and then covalently linked to a peptoid backbone via click chemistry (21) . After extensive linker optimization, multivalent ligands were identified that improved r(CUG) exprelated splicing defects in a mouse model of DM1. One notable exception to the aforementioned design strategies is the use of dynamic combinatorial chemistry (DCC) [ Figure 5B ] (69) . Several multivalent ligands were derived from a library of resin-bound, cysteine-containing monomers, which were allowed to incubate with the RNA of interest, probing thousands of multivalent ligand combinations by forming covalent yet reversible disulfide linkages. The binders with highest affinity were thus enriched and then isolated, characterized, and validated for RNAbinding. After replacing the disulfide linkage with more sta-ble bioisosteres, the method yielded bioactive ligands for two RNAs of known structure: DM1 r(CUG) exp , where statistically significant improvements in splicing were observed in mouse models (70) , and HIV-1 frameshift-stimulating RNA, (71, 72) where in one example the decrease in viral infectivity (EC 50 values of 3.9 and 26 M) correlated to frame-shifting activity (>50% at 50 M) in cell culture (73) . A powerful advantage of DCC is that multivalent ligands can be constructed without knowledge of the RNA structure, including larger and complex tertiary folds. In general, both rational design and DCC yield multivalent probes with significantly increased affinity and specificity for RNA targets relative to small molecules. While achieving high potency in biological systems with larger molecules may require more development than with traditional small molecules, the examples identified support the possibility of success. Evaluating target engagement, off-target effects, potency/appropriate concentration, and other criteria is critical to understanding the quality of a chemical probe and thus any experimental conclusions (11, 12, 74, 75) . When curating the collection of chemical probes, strict benchmarks related to these criteria could not be included due to the lack of consistency within the field. In the next paragraphs, the characterization techniques utilized for RNA-targeted chemical probes will be described for each biological system and notable examples highlighted. One of the most common validation experiments in bacterial and fungal systems was serial passage (15, 31, 46, 62, 65, 76, 77) . In this technique, ligand resistant mutants were grown in the presence of compound and mutations were mapped by whole-genome sequencing. In addition to confirming target engagement, the results revealed off-target effects and unexpected modes of action. Further, select examples performed the serial passage experiments in multiple bacterial strains and measured binding affinity to the mutants in vitro, which provided added confidence in target engagement (15, 31, 77) . In select cases where serial passage experiments did not yield mutated isolates, the ligands were tested against mutants with established variations in structure or activity (76, 65) . Another powerful strategy for assessing target engagement in riboswitches was phenotype rescue by addition of the native ligand (15, 46) . Lastly, several of the targeted RNA elements regulated the expression or translation of specific genes, which was assessed by measuring the quantity of the RNA or protein, respectively. One study went beyond measuring the expected transcripts and performed a transcriptomic microarray analysis of genes involved in many different cellular processes (46) . The observed repression was consistent with riboswitch inhibition, although addition of the native ligand failed to rescue the expression of several genes, indicating a potential cellular stress response. This strategy and other genome-wide analyses can provide compelling evidence of target engagement, though it must be noted that target specificity does not always lead to biological specificity (12) . Compared to the other systems, chemical probes targeting viral RNAs were less characterized in terms of target engagement and specific activity. A select few studies validated probe activity by testing mutant versions of the virus (49, 78) or closely related native viruses (30, 73) . Notably, one report validated probe activity with a second research group to ensure reproducibility of the biological effect (71) . RNAspecific reporter systems were occasionally used to confirm on-target effects, including fusion-induced gene stimulation (79) , heterologous tethering (54) , and a viral protein reporter (48) . A noteworthy example of assessing off-target effects was the use of RNA-Seq at increasing ligand concentrations in the absence of the target RNA (72) . The experiment in HEK293T cells revealed that 53 of the 17 822 transcripts assayed had statistically significant alterations at two concentrations, potentially reflecting a cellular stress response. It is also intriguing that the most biologically potent ligand in this study was the least selective analog in an in vitro tRNA competition assay. This observation underlies potential differences in cellular activity versus in vitro binding selectivity and thus the importance of progressing multiple ligands to biological assays. There are several noteworthy examples of probing ligand promiscuity and/or selectivity in human systems. Multiple studies utilized genome-wide analyses to globally assess changes in miRNA, (16, 17, 59, 60, 80) mRNA, (18, 50) or splicing (18, 21, 47, (81) (82) (83) (84) (85) ) levels compared to the target RNA or process. Toward a more direct assessment, chemical cross-linking and isolation by pull down (Chem-CLIP) was utilized to investigate proximity-based engagement of RNA (86) . In this method, the ligand is appended with a nucleic-acid reactive module (e.g. chlorambucil) and a biotin purification tag. After incubating cells with the modified ligand, the cells are lysed, the ligand is captured by streptavidin beads, and the bound targets are characterized by qRT-PCR or RNA-seq. This strategy as well as competitive Chem-CLIP (86) were used to characterize onand off-target engagement of ligands that modulate repeat expansions (45, 84, 85, 87) and miRNAs (16, 17, 60) . In another study, dual luciferase reporter assays were utilized to compare ligand binding to four G-quadruplex structures, including the RNA of interest and three other regulatory RNAs (44) . This study was one of few examples in which selectivity within a target family was assessed in cell culture. Various controls were also used to evaluate on-target effects. For example, the impacts of chemical probes have been analyzed following siRNA knockdown of FOXO1 mRNA (59) and following inhibition of the mTOR signaling pathway modulated by miRNA-544 (61) . Another example overexpressed miRNA-210 and assessed the effect of the chemical probe on the phenotype (16) . Likewise, a ligand targeting r(CUG) exp was tested with the RNA under conditional expression (88) . Another notable example used RNA immunoprecipitation (RIP) to detect RNA-binding at increasing concentrations of ligand and identified a dosedependent response (19) . For precipitation, a G-quadruplex specific antibody, BG4, was utilized and the complex was characterized by two complementary methods: dot blotting and qRT-PCR. Another important control, particularly for human systems, was to replicate on-target effects in at least two cell lines, though this control was performed for only a limited number of chemical probes (44, 53, 47, 81, 84) . For many years, RNA has been labeled as 'undruggable' or 'impossible to probe selectively'; however, the reports described herein demonstrate the substantial progress that has been achieved in the past four years. This includes the development of chemical probes for 33 unique RNA elements, though the number of 'targetable' RNA elements is certain to vastly exceed this list (5, 6, 23, 89) . Those in Archaea, for example, are unrepresented in reports of RNAtargeted chemical probes, despite the established importance of small regulatory RNAs in archaea metabolism, morphology, and adaption to extreme conditions (90, 91) . Furthermore, there are several RNAs implicated in diseases for which novel treatment strategies are needed, including insect-borne viruses (23) , genetic disorders (92) , and metastatic cancers (5,6) as well as bacterial targets amid the antibiotic-resistance crisis (89, 93) . There are also many opportunities for RNA-targeting in fungal systems (94) , especially as fungal infections are experiencing a rise in cases and a therapeutic plateau (95) . Fundamentally, the development of chemical probes will allow for the rapid and reversible interrogation of novel and complex RNA biology in ways not attainable by knockdown and genetic approaches (74) . While the potential benefits of selective RNA targeting are staggering, approaches toward chemical probe development must thoughtfully consider a number of variables, including transcript abundance and tissue-specific expression of the RNA target. By total mass and number of molecules, rRNA and tRNA account for greater than 90% of total cellular RNA in humans (96) , and thus mRNAs and other ncRNAs exist in a much lower abundance (1, 96, 97) . Even within these low abundance transcripts, copy numbers can vary widely within and across cell types with reports of mRNA levels spanning four orders of magnitude and some ncRNAs averaging less than one copy per cell (1, 97, 98) . A direct impact of RNA expression levels was recently reported, in which the authors proposed that ligand occupancy of the miRNA target was driven by the relative abundance of structurally similar RNA elements (16) . RNA with well-defined function and that are highly expressed thus represent low-hanging fruit within the field (6, 99) . The selection of a library and primary screening strategy is also critical for the success of chemical probe discovery. For RNA targets with known structure and/or ligands, the use of focused screening libraries (35, 36) has proven to be an efficient strategy, though this approach generally limits the chemical diversity of the library. Advances such as the identification of more RNA-privileged scaffolds (39) and biologically relevant RNA chemical space (22, 100) will facilitate the discovery of chemical probes for additional RNA targets. These efforts could be expedited by additional fragment-or natural product-based screening to access vast chemical space with fewer ligands (41) and to probe known biologically relevant chemical space, respectively (101) . Additional high-throughput screens could likewise discover novel chemotypes, though the expense may exceed the resources available in academia, where focused approaches are more attainable. Continued progress in the development and refinement of computational tools will also aid in expanding the boundaries of focused screening and structurebased design; (56, 57) although the latter will also depend upon advancements for accurately determining atomic resolution structures (8, 9, 37, 38) . In addition, in vitro or cellular activity-based assays that probe well-studied RNA functions (e.g. splicing, translation, or processing) may be more practical starting points to identify chemical probes. This includes screening strategies described herein as well as others recently developed (102, 103) . Finally, numerous opportunities exist to build upon the established multivalent targeting strategies discussed above, particularly the application of dynamic combinatorial chemistry to RNAs of unknown structure. When using a chemical probe in a biological system, the quality and specificity of the probe must be well characterized to draw accurate and meaningful conclusions, as recently highlighted by several preeminent chemical biologists (11, 12, 74, 75) . Evaluation of the chemical probes revealed a diverse spectrum of characterization techniques with few ligands meeting the traditional criteria for robust chemical probes, and not all of these gaps can be attributed to a lack of relevant tools. For example, characterization inconsistencies include incomplete reports of cytotoxicity and a lack of attention to cell permeability and localization. In addition, many in vitro assays are accessible to help establish the potency and selectivity of a probe in multiple experiments, and these assays should include evaluation against both specifically mutated targets and a number of other structured RNAs. Any cell-based observations should be reproducible in multiple cell lines and validated in the absence of the target by utilizing siRNA or CRISPR-Cas technologies. Further, on-target effects should be established by using a number of spatial-based experiments (12) . These include the biochemical methods described herein (e.g. Chem-CLIP or RIP) and novel applications of other technologies such as in-cell chemical probing (104, 105) to observe changes in RNA secondary structure upon ligand binding or photoaffinity labeling (106) to assess target engagement under temporal control. If plausible, serial passage experiments followed by deep sequencing should also be performed, even in human systems (107) , to identify ligandescaping mutations to confirm target engagement and/or off-target effects. Lastly, it is critical to use an inactive analog and an active analog from a different chemical class, if feasible, to draw conclusions regarding the targeted biology. Moving toward these standards will be crucial for the RNA-targeting field to avoid the scientific pollution that has plagued many others (11, 12, 74) . For comparison, we note that over a decade ago chemical probe discovery for another 'undruggable' target, protein:protein interactions, was in its infancy with only 19 known small molecule inhibitors (108) . However, novel advances in screening approaches and design strategies led to the rapid discovery of thousands of protein:protein interaction inhibitors with several entering the clinic and most also breaking the rules of 'drug-like chemical space' (109) . In the next five years, we anticipate that the innovations described herein and the ones yet to be discovered will lead to a similar surge in reports of RNA-based chemical probes and therapeutics. We also expect that novel and well-characterized chemical probes will allow RNA biologists to uncover many more exciting and unanticipated roles for RNA, propelling us into the next phase of the RNA revolution. Supplementary Data are available at NAR Online. The noncoding RNA revolution-trashing old rules to forge new ones The rise of regulatory RNA Viral noncoding RNAs: More surprises The Encode Project Consortium (2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project Non-coding RNAs as drug targets The Therapeutic Targeting of Long Noncoding RNA Linking RNA biology to lncRNAs Synthetic Receptors for Oligonucleotides and Nucleic Acids Biochemical methods to investigate lncRNA and the influence of lncRNA:protein complexes on chromatin Opportunities and challenges in RNA structural modeling and design The impact of chemical probes in drug discovery: A pharmaceutical industry perspective Choose and use your chemical probe wisely to explore cancer biology Comparison of small molecules and oligonucleotides that target a toxic, non-coding RNA Overcoming cellular barriers for RNA therapeutics Selective small-molecule inhibition of an RNA structural element Small molecule inhibition of microRNA-210 reprograms an oncogenic hypoxic circuit Design of a small molecule against an oncogenic noncoding RNA SMN2 splice modulators enhance U1-pre-mRNA association and rescue SMA mice Discovery of small molecules for repressing cap-independent translation of Human Vascular Endothelial Growth Factor (hVEGF) as novel antitumor agents Rationally designed small molecules that target both the DNA and RNA causing Myotonic Dystrophy Type 1 Features of modularly assembled compounds that impart bioactivity against an RNA target Discovery of key physicochemical, structural, and spatial properties of RNA-targeted bioactive ligands Small molecules targeting viral RNA Targeting RNA with small molecules Chemical and functional diversity of small molecule ligands for RNA The medicinal chemistry of peptides The medicinal chemistry of therapeutic oligonucleotides Inhibitors of protein-RNA complexation that target the RNA: Specific recognition of Human Immunodeficiency Virus Type 1 TAR RNA by small organic molecules New anti-Human Immunodeficiency Virus Type 1 6-aminoquinolones: Mechanism of action ) 6-Aminoquinolones as new potential anti-HIV agents Pentamidine inhibition of group I intron splicing in Candida albicans correlates with growth inhibition Rational design of chemical genetic probes of RNA function and lead therapeutics targeting repeating transcripts Recognition of nucleic acid junctions using triptycene-based molecules Modulation of the E. coli rpoH temperature sensor with triptycene-based small molecules Recent developments in focused library design: Targeting gene-families The design and application of target-focused compound libraries Recent advances in developing small molecules targeting RNA Insights into RNA structure and function from genome-wide studies Inforna 2.0: A platform for the sequence-based design of small molecules targeting structured RNAs Identifying the preferred RNA motifs and chemotypes that interact by probing millions of combinations Twenty years on: The impact of fragments on drug discovery A novel small-molecule binds to the Influenza A Virus RNA Promoter and inhibits viral replication Targeting influenza A virus RNA promoter for up-regulating the translation of Antiamyloidogenic Secretase, a Disintegrin and Metalloproteinase 10 (ADAM10), by binding to the G-quadruplex-forming sequence in the 5 untranslated region (UTR) of its mRNA Studying a drug-like, RNA-focused small molecule library identifies compounds that inhibit RNA toxicity in Myotonic Dystrophy Novel riboswitch ligand analogs as selective inhibitors of guanine-related metabolic pathways Chemical correction of pre-mRNA splicing defects associated with sequestration of muscleblind-like 1 protein by expanded r(CAG)-containing transcripts Structure-based design of an RNA-binding p-terphenylene scaffold that inhibits HIV-1 Rev protein function Targeting RNA-protein interactions within the Human Immunodeficiency Virus Type 1 lifecycle A small molecule that represses translation of G-quadruplex-containing mRNA Thermodynamic studies of a series of homologous HIV-1 TAR RNA ligands reveal that loose binders are stronger Tat competitors than tight ones Development of small molecules with a noncanonical binding mode to HIV-1 Trans Activation Response (TAR) RNA Pyrvinium pamoate changes alternative splicing of the Serotonin Receptor 2C by influencing its RNA structure Structure-based computational database screening, in vitro assay, and NMR assessment of compounds that target TAR RNA Identification of RNA pseudoknot-binding ligand that inhibits the -1 ribosomal frameshifting of SARS-coronavirus by structure-based virtual screening Structure based approaches for targeting non-coding RNAs with small molecules Structure-based discovery of small molecules binding to RNA Bottom-up design of small molecules that stimulate exon 10 skipping in mutant MAPT pre-mRNA Sequence-based design of bioactive small molecules that target precursor microRNAs Defining RNA-small molecule affinity landscapes enables design of a small molecule inhibitor of an oncogenic noncoding RNA Biogenesis Disrupts Adaptive Responses to Hypoxia by Modulating ATM-mTOR Signaling Novel riboswitch-binding flavin analog that protects mice against Clostridium difficile infection without inhibiting cecal flora Roseoflavin is a natural antibacterial compound that binds to FMN riboswitches and regulates gene expression Structure-guided mutational analysis of gene regulation by the Bacillus subtilis pbuE Adenine-Responsive Riboswitch in a cellular context Thiamine pyrophosphate riboswitches are targets for the antimicrobial compound Pyrithiamine A simple ligand that selectively targets CUG trinucleotide repeats and inhibits MBNL protein binding Targeting toxic RNAs that cause Myotonic Dystrophy Type 1 (DM1) with a bisamidinium inhibitor Specific binding of Hoechst 33258 to site 1 thymidylate synthase mRNA Dynamic combinatorial selection of molecules capable of inhibiting the (CUG) repeat RNA-MBNL1 interaction in vitro: Discovery of lead compounds targeting Myotonic Dystrophy (DM1) From dynamic combinatorial 'hit' to lead: In vitro and in vivo activity of compounds targeting the pathogenic RNAs that cause Myotonic Dystrophy N-Methylation as a strategy for enhancing the Affinity and selectivity of RNA-binding peptides: application to the HIV-1 Frameshift-Stimulating RNA HIV-1 Frameshift RNA-targeted triazoles inhibit propagation of replication-competent and multi-drug-resistant HIV in human cells High-affinity recognition of HIV-1 frameshift-stimulating RNA alters frameshifting in vitro and interferes with HIV-1 infectivity The promise and peril of chemical probes Target identification and mechanism of action in chemical biology and drug discovery Design and antimicrobial action of purine analogues that bind Guanine riboswitches Dual-targeting small-molecule inhibitors of the Staphylococcus aureus FMN riboswitch disrupt riboflavin homeostasis in an infectious setting Conformational inhibition of the hepatitis C virus internal ribosome entry site RNA Inhibition of HIV-1 Tat-TAR interaction by diphenylfuran derivatives: effects of the terminal basic side chains Small molecule inhibition of miR-544 biogenesis disrupts adaptive responses to hypoxia by modulating ATM-mTOR signaling Design of a bioactive small molecule that targets the Myotonic Dystrophy Type 1 RNA via an RNA motif-ligand database and chemical similarity searching Induction and reversal of Myotonic Dystrophy Type 1 pre-mRNA splicing defects by small molecules Targeting the r(CGG) repeats that cause FXTAS with modularly assembled small molecules and oligonucleotides Small molecule recognition and tools to study modulation of r(CGG)(exp) in Fragile X-Associated Tremor Ataxia Syndrome Precise small-molecule recognition of a toxic CUG RNA repeat expansion Approaches to validate and manipulate RNA targets with small molecules in cells Discovery of a biomarker and lead small molecules to target r(GGGGCC)-associated defects in c9FTD/ALS Lomofungin and dilomofungin: Inhibitors of MBNL1-CUG RNA binding with distinct cellular effects Non-coding RNAs as antibiotic targets Small regulatory RNAs in Archaea Small RNAs in bacteria and archaea: who they are, what they do, and how they do it RNA structures as mediators of neurological diseases and as drug targets Antibacterial drug discovery in the resistance era Fungal RNA Biology The antifungal pipeline: a reality check Non-coding RNA: What is functional and what is junk? Front Specificity and nonspecificity in RNA-protein interactions An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data MicroRNAs and other non-coding RNAs as targets for anticancer drug development Enhancements of screening collections to address areas of unmet medical need: an industry perspective The re-emergence of natural products for drug discovery in the genomics era Development and implementation of an HTS-compatible assay for the discovery of selective small-molecule ligands for pre-microRNAs RNA fluorescence in situ hybridization for high-content screening Progress and challenges for chemical probing of RNA structure inside living cells Detection of RNA-protein interactions in living cells with SHAPE Photoaffinity labeling in target-and binding-site identification Using transcriptome sequencing to identify mechanisms of drug action and resistance Emerging classes of protein-protein interaction inhibitors and new tools for their development Small molecules, big targets: drug discovery faces the protein-protein interaction challenge A small molecule that targets r(CGG)(exp) and improves defects in fragile X-associated tremor ataxia syndrome Development of pharmacophore models for small molecules targeting RNA: application to the RNA repeat expansion in myotonic dystrophy type 1 Identification of biologically active, HIV TAR RNA-binding small molecules using small molecule microarrays Importance of ribosomal frameshifting for Human Immunodeficiency Virus Type 1 particle assembly and replication Bioavailable inhibitors of HIV-1 RNA biogenesis identified through a Rev-based screen A modular approach to synthetic RNA binders of the hepatitis C virus internal ribosome entry site Artificial nucleobase-amino acid conjugates: a new class of TAR RNA binding agents We thank the members of the Hargrove lab for stimulating discussion and input.