key: cord-0873152-xboq8fe7 authors: Sun, Yi; Liu, Guang; Huang, Yue title: Applications of piggyBac Transposons for Genome Manipulation in Stem Cells date: 2021-09-14 journal: Stem Cells Int DOI: 10.1155/2021/3829286 sha: 39c30817e3f0f485bd186f41fff4c390ccf241fe doc_id: 873152 cord_uid: xboq8fe7

Transposons are mobile genetic elements in the genome. The piggyBac (PB) transposon system is increasingly being used for stem cell research due to its high transposition efficiency and seamless excision capacity. Over the past few decades, forward genetic screens based on PB transposons have been successfully established to identify genes associated with drug resistance and stem cell-related characteristics. Moreover, the PB transposon is regarded as a promising gene therapy vector and has been used in some clinically relevant stem cells. Here, we review the recent progress on the basic biology of PB, highlight its applications in current stem cell research, and discuss its advantages and challenges.

In 1981, Evans and Martin independently isolated and established undifferentiated embryonic stem cell (ESC) lines from the inner cell mass (ICM) of mouse blastocysts [1, 2]. Subsequently, human ESCs (hESCs) were successfully isolated by Thomson et al. [3] in 1998, and hESCs provide unparalleled tools for studying human embryonic development and regenerative medicine [4]. Additionally, mouse induced pluripotent stem cells (miPSCs) [5] and human iPSCs (hiPSCs) were generated in 2006 and 2007, respectively [6, 7]. Two key features of ESCs and iPSCs are self-renewal, the ability to proliferate indefinitely, and pluripotency, the ability to differentiate into various tissue cell types under appropriate culture conditions. As major types of pluripotent stem cells (PSCs), ESCs and iPSCs provide powerful tools to study gene function. In particular, hiPSCs hold great promise for generating patient-specific human PSCs (hPSCs) for disease modeling and drug discovery [8].
In addition to PSCs, other types of stem cells derived from postnatal animal tissues are widely used, such as mesenchymal stem cells (MSCs) [9], hematopoietic stem cells (HSCs) [10], and spermatogonial stem cells (SSCs) [11]. Over the past few decades, stem cell biology and mammalian functional genetics have developed closely together, yielding remarkable potential for applying regenerative medicine to drug discovery, disease modeling, and the development of novel therapeutic strategies [12].

DNA transposons are mobile genetic elements that can move throughout the genome via a "cut-and-paste" mechanism called transposition, and they are usually inactive in rodent and human cells in nature [13]. Over the past two decades, a series of active recombinant transposons have been generated and used as novel tools for functional genomics research in mice and other vertebrates [14]. Among them, Sleeping Beauty (SB) and piggyBac (PB) are the most commonly used eukaryotic DNA transposons [15, 16]. SB, a Tc1-like transposable element isolated from the salmonid fish genome, was the first transposon used in mouse and human cells [17, 18]. Although SB can function effectively in mouse somatic cells, it is not highly active in ESCs [19]. PB, however, which is derived from the cabbage looper moth Trichoplusia ni, shows high transposition efficiency in different mammalian cell lines, including ESCs, as well as in other organisms [20-22]. Subsequent studies have demonstrated that the transposition activity of PB is significantly higher than that of SB in mammalian cell lines [23]. Moreover, unlike SB, which always leaves a 2-5 bp footprint mutation after mobilization and has a strong tendency for "local hopping" (reinsertion close to the original donor site), PB exhibits the unique and valuable feature of seamless excision after transposition [24-26].
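As a toy illustration of the cut-and-paste behavior described above, the sketch below treats a genome as a plain string. It assumes the TTAA target site discussed in Section 2.1 and the duplication of that TTAA on both flanks of the integrated cargo; the function names `integrate` and `excise` and the example sequences are hypothetical, and nothing here corresponds to real sequence data or to an existing analysis tool.

```python
TARGET = "TTAA"  # tetranucleotide targeted by PB (see Section 2.1)


def integrate(genome: str, cargo: str, site: int) -> str:
    """Paste: insert cargo at a TTAA site so that TTAA flanks it on both sides."""
    if genome[site:site + len(TARGET)] != TARGET:
        raise ValueError("PB integration requires a TTAA target site")
    return genome[:site] + TARGET + cargo + TARGET + genome[site + len(TARGET):]


def excise(genome: str, cargo: str) -> str:
    """Cut: remove the cargo plus one TTAA copy, leaving a single TTAA behind."""
    flanked = TARGET + cargo + TARGET
    start = genome.find(flanked)
    if start < 0:
        raise ValueError("no TTAA-flanked cargo found")
    return genome[:start] + TARGET + genome[start + len(flanked):]


# Made-up sequences, used only to exercise the two functions.
original = "GCGCATTAAGGCC"
cargo = "ACGTACGT"
modified = integrate(original, cargo, original.find(TARGET))
restored = excise(modified, cargo)
```

In this model, excision returns the string to exactly its starting state, which is what "seamless" means here; a transposon that leaves a footprint would instead leave a few extra bases at the donor site.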
With the help of the PB system, scores of transgenic animals have been successfully generated, including mice [20, 27], rats [28, 29], pigs [30, 31], and goats [32]. In addition, PB has been used as a nonviral vector for insertional mutagenesis [33], genetic screens [34-38], iPSC engineering [39-41], gene therapies [15, 42-45], and novel CAR-T cell therapeutic strategies [46-49]. In this review, we look back at the advances of the PB transposon in stem cells and regenerative medicine and discuss its wide applications, so as to provide a reference for future research.

2.1. Integration Site Preference.

The PB element was originally discovered in insect cells as a repetitive element during baculovirus propagation in the TN-386 cell line, as shown by Fraser et al. [50], and was isolated by Cary et al. in 1989 [51]. The inserted mobile DNA was carried "piggyback" by the virus, hence the name piggyBac, in which "Bac" refers to its baculovirus-related discovery. In 2005, Ding et al. found that PB elements can actively transpose in a variety of human and mouse cell lines, as well as in mouse germline cells [20]. The original PB element is a 2,475 bp fragment with an open reading frame (ORF) that encodes a functional transposase of 594 amino acids, flanked by 311 bp 5′ end and 235 bp 3′ end sequences, each containing asymmetric inverted terminal repeats (ITRs) carrying transposase binding sites (Figure 1(a)). The 35 bp 5′ end ITR (5′PBITR) and 63 bp 3′ end ITR (3′PBITR) were shown to be sufficient for activity both in vivo and in vitro [52]. Importantly, the PB element can be divided into two functional components, the ITRs and the PB transposase (PBase), which form a binary transposition system and are carried on a donor plasmid and a helper plasmid, respectively (Figure 1(b)).
The PBase, which can be transiently expressed from the helper plasmid, binds the ITRs, excises any DNA sequence of interest flanked by them from the circular donor plasmid (i.e., cut), and reintegrates the sequence into a TTAA site in the genome (i.e., paste) (Figure 1(c)) [53, 54]. The insertion site can be detected using Splinkerette PCR combined with DNA sequencing [34, 55]. A further advantage is seamless excision: reexpression of PBase can remove the transposon completely to yield transposon-free cells [56]. Several studies have shown that the distribution of PB transposons has no correlation with gene density or expression level, but rather depends on the distribution pattern of TTAA sites [57] and is negatively influenced by genomic methylation [22, 58]. Theoretically, a TTAA site occurs on average once every 256 bp (four to the power of four) in the genome, but protein-coding regions have a higher GC content than other regions, leading to a lower frequency of TTAA sites [59]. In addition, only about 1% of PB insertion sites are located within 1000 bp upstream of the transcription start site (TSS), which is much lower than the proportion for retrovirus systems [22, 58, 60]. Compared with lentivirus systems [21, 60], PB integrates preferentially into genomic safe harbors (GSHs), which are defined by five criteria concerning their location relative to ultraconserved regions, noncoding RNAs, and coding genes, especially cancer-related genes [61, 62].

2.2. Mutagenic Cassettes.

As discussed above, transposons act as DNA delivery vehicles for genetic modifications. Several PB-based vectors that have been used for insertional mutagenesis contain two main features.
These are (1) mutagenic gene trap cassettes that alter target gene expression (loss- or gain-of-function, LOF or GOF) and (2) reporter cassettes, whose expression is either dependent on or independent of correct splicing between exons of the trapped gene and the mutagenic gene trap cassette [63, 64]. Based on the strategy used for mutating genes, gene trapping can be mainly divided into promoter trapping and polyadenylation (polyA) trapping [65]. In promoter trapping, mutagenic cassettes usually include a splice acceptor (SA) followed by reporter genes and polyA signals in one or both orientations. After integration into an intron of an expressed gene, the SA-reporter-polyA element can disrupt the expression of the trapped gene by splicing to upstream exons, which results in a gene trap fusion transcript, and the expression of the reporter gene is driven by the endogenous promoter of the trapped gene. Because the expression of such a reporter cassette depends on an endogenous promoter, the reporter is expressed only in cells or tissues in which the trapped gene is transcriptionally active [63]. Thus, a reporter driven by an exogenous promoter can be used separately, independent of the spliced fusion transcript, which has allowed more than 90% mutational coverage of all mouse genes with unbiased distribution throughout the genome [66]. The reporter cassettes used usually encode fluorescent proteins (e.g., green fluorescent protein, GFP; red fluorescent protein, RFP), antibiotic resistance markers (puromycin, neomycin, hygromycin, etc.), or β-galactosidase. In polyA trapping, transposon insertions utilize a unidirectional, strong exogenous promoter followed by a splice donor (SD) but lack a polyA signal (Figure 1(d)). If the orientation of the exogenous promoter-SD element is consistent with the direction of transcription of the trapped gene, the element will be spliced to endogenous downstream exons, hence initiating gene transcription regardless of the transcriptional activity of the trapped gene [63].
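The two trapping strategies described above can be summarized as a small decision table. The following is a minimal sketch under simplifying assumptions: each cassette is modeled in a single orientation (although SA cassettes may be built in both), the insertion is assumed to be either inside an intron or not, and the names `Insertion` and `trap_outcome` are hypothetical labels chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insertion:
    cassette: str           # "promoter_trap" (SA-reporter-polyA) or "polyA_trap" (promoter-SD)
    in_intron: bool         # cassette landed inside an intron of the gene
    same_orientation: bool  # cassette points in the gene's direction of transcription
    gene_expressed: bool    # endogenous promoter of the trapped gene is active


def trap_outcome(ins: Insertion) -> str:
    """Return a coarse label for the expected outcome of a single insertion."""
    if ins.cassette not in ("promoter_trap", "polyA_trap"):
        raise ValueError(f"unknown cassette: {ins.cassette}")
    if not (ins.in_intron and ins.same_orientation):
        return "no trapping event"
    if ins.cassette == "promoter_trap":
        # The SA captures upstream exons, so an active endogenous promoter is required.
        if ins.gene_expressed:
            return "fusion transcript; reporter driven by endogenous promoter"
        return "no trapping event"
    # polyA trap: the exogenous promoter drives splicing from the SD into
    # downstream exons whether or not the trapped gene is active.
    return "downstream exons transcribed from exogenous promoter"


silent_gene = Insertion("polyA_trap", in_intron=True, same_orientation=True,
                        gene_expressed=False)
outcome = trap_outcome(silent_gene)
```

The example call shows the key difference: a polyA trap still produces a transcript from a silent gene, whereas the same insertion with a promoter trap cassette would go undetected.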
Some trap cassettes with strong viral enhancers/promoters may result in overexpression of truncated or full-length protein products of the trapped gene. Moreover, the promoter of the trapped gene may be transactivated by strong enhancer elements inside the transposon, leading to the overexpression of a full-length transcript [63]. It is worth noting that vector integrations tend to occur in the last (3′-most) intron of the trapped gene in polyA trapping. By inserting an internal ribosome entry site (IRES) sequence between the reporter cassette and the SD site to prevent nonsense-mediated mRNA decay (NMD) of chimeric transcripts, the bias in the vector integration site can be effectively removed [67]. These features make the PB system a rapid, high-throughput, and traceable mutagenesis tool for constructing mutant libraries for LOF or GOF screening and for identifying the inserted genes for further validation.

Genomic sequences contain protein-coding regions and important cis-acting regulatory elements (promoters, enhancers, repressors, etc.) that are essential for appropriate spatial-temporal gene expression. Therefore, the capacity to deliver large cargo is critical for achieving successful gene expression regulation. Retroviral and lentiviral vectors have a cargo capacity restricted to about 10 kb and also carry immunogenic and tumorigenic potential [68]. Nonviral systems, such as the SB transposon, are also limited to 5-6 kb in cargo size and show reduced transposition efficiency when cargo size reaches 10 kb [69]. These characteristics limit the use of selectable markers, inducible cassettes, and large regulatory sequences. However, Li et al. showed that in mESCs, giant PB transposons could mobilize 100 kb DNA fragments to endogenous genomic sites with good cargo integrity, and the transposons could be seamlessly excised after transposition [70].
Although the transposition efficiency decreases with increasing cargo size, 100 kb is unlikely to be the upper limit of PB cargo capacity. In general, PB can carry multiple genes during transposition, providing great advantages for multiplexed genetic manipulations, including insertional mutagenesis.

Engineering the PBase is the key to enhancing PB transposition efficiency in mammalian cells. A mouse codon-optimized version of the PBase (mPBase) mediates a 20-fold increase in vector-to-chromosome transposition relative to the original native version [54] and also elevates the rate of chromosomal transposition from PB donor loci in mESCs [71]. The enhanced piggyBac (ePiggyBac) system, which contains a human codon-optimized transposase and the T53C/C136T mutant 5′PBITR, could increase genome integration efficiency by 10-fold in hESCs [72]. Subsequently, a hyperactive PBase (hyPBase) with a total of 7 amino acid (aa) substitutions, described by Yusa et al., was shown to mediate more efficient transposition, outperforming the mPBase by 10-fold without compromising genomic integrity [73]. An in vivo study reported that the hyPBase produced a 20-fold increase in liver-directed expression compared to mPBase [74]. Moreover, it is well known that the PB transposon can be excised by the reexpression of transposase, but the transposon may still jump into new locations. To solve this problem, Li et al. generated an excision-competent/integration-defective (Exc+/Int−) PBase by mutating amino acids in a catalytic domain [75]. As the integration of the PB transposase vector into the host genome may lead to multiple transposition cycles, researchers have found that transfecting PB transposase mRNA, which has a short half-life, instead of a plasmid can effectively reduce the potential genetic toxicity [76, 77]. An optimized PB transposon system will significantly expand its application in various fields.
PBase is a very efficient enzyme that actively integrates DNA fragments into the genome in a random manner [78]. Recently, engineered nucleases, including transcription activator-like effector nucleases (TALENs), zinc finger nucleases (ZFNs), and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 (Cas9), have been widely used for gene transfer and modification by generating double-strand DNA breaks (DSBs), which can be repaired by homology-directed repair (HDR) [79]. However, all of these systems exhibit off-target effects and nonenzymatic DNA insertion [80]. Thus, several studies have sought to design PBase fusions with these nucleases to integrate DNA into a unique user-defined chromosomal site. Although targeting the chimeric TALE-PBase [81] or ZFP-PBase [82] to a unique genomic locus increased transposition efficiency, no targeted transposition was demonstrated [83]. The CRISPR/Cas9 system uses a short guide RNA (sgRNA) to guide the DNA endonuclease Cas9 to a specific target site and facilitate the introduction of mutations [84]. A specific Cas9 mutant lacking endonuclease activity (dCas9) fused with transcriptional repressor or activation domains has also been generated to promote transcriptional inhibition or activation when coexpressed with targeted sgRNAs [85, 86]. Lena et al. fused dCas9 to PBase and targeted it to specific genomic sites using dual sgRNAs [87]. Thus, the ease of design and application of dCas9-PBase, which can edit genes at precise genomic loci, may benefit future medical applications.

Identifying genes that are important for specific biological phenotypes and diseases is a crucial goal of genetic analysis, and genetic screens have proven to be one of the most effective approaches [88].
Reverse genetic analyses are hypothesis-driven investigations of the phenotype caused by the disruption of predefined genes [89], while forward genetic screens are phenotype-driven approaches that generally involve high-throughput generation of mutant libraries, selection for a specific phenotype, and validation of mutations [90].

3.1. Loss-of-Function Screening.

LOF genetic screens using mammalian cell lines are powerful tools for identifying genes required for many cellular processes. Since ESCs can differentiate into a variety of cell types, including germ cells, and have relatively stable genomes amenable to various genetic manipulations, these cells have become attractive models for analyzing developmental events or disease phenotypes in vitro [80]. With the excellent mutagenic ability of the PB transposon and its lower bias toward certain genomic hot spots, a genome-wide mutant library can be rapidly and efficiently established in ESCs, allowing researchers to perform phenotype-based genetic screening in mammalian cells, similar to studies that have been done in yeast for the past 30 years [91]. In combination with high-throughput next-generation sequencing (NGS) technologies [91, 92], hundreds to thousands of genes trapped by PB transposons can be easily identified, enabling the study of the molecular mechanisms of practically any biological process (Table 1).

3.1.1. The Problem of Homozygosity Mutations.

Since most phenotypic changes in mammalian cells require both copies of an autosomal gene to be inactivated (except in some cases of haploinsufficiency) [93], a genome-wide LOF screen for recessive mutations is time-consuming and difficult in diploid cells [94]. This issue was partially solved by generating Bloom's syndrome gene- (Blm-) deficient ESCs, which show a higher rate of mitotic recombination between sister chromatids.
In Blm-null ESCs, heterozygous mutations are converted to homozygous mutations through loss of heterozygosity (LOH), which occurs at a rate of about 10⁻⁴ events/locus/cell/division. Thus, a mutant library needs to be expanded for at least 14 population doublings to promote homozygous mutant generation for further LOF screens, such as screens for resistance to 6-thioguanine (mismatch repair mutants) and retroviral infection [34, 95-97]. Huang et al. used Blm-null ESCs together with a PB transposon vector that carried two drug resistance genes but could express only one at a time to isolate homozygous mutant cell clones successfully. The expressed drug resistance gene could be switched by Cre recombinase, which allowed selection for the homozygous mutants that arise after LOH [94]. However, because the frequency of LOH per cell per generation is low and homozygous cells account for only a very small fraction of Blm-null cells, it is not easy to obtain a sufficient number of homozygous mutants for genetic screening [91]. Recently, as an encouraging breakthrough in cell biology, haploid ESC (haESC) lines have been generated in several species, including medaka fish [98], mice [99-102], rats [103], monkeys [104], and humans [105-107]. As there is only one set of chromosomes in haploid cells, it becomes quite easy to generate loss-of-function mutations using haESCs, which hold great promise for both forward and reverse genetic screens [38, 91, 99, 108-116].

Screening.

Due to their infinite self-renewal ability and haploid properties, haESCs have become powerful tools for generating a tremendous number of homozygous mutation pools [117]. The PB transposon system has also been successfully applied to haESCs to identify different mechanisms of stemness and differentiation.
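The contrast between the Blm-deficient and haploid strategies described above can be made concrete with a rough calculation. The sketch below is a back-of-the-envelope model, not a result from the cited studies: it assumes that LOH events are independent at each division, that every LOH event yields a homozygous mutant cell, and that there is no selection. The function name `homozygous_fraction_diploid` is hypothetical.

```python
def homozygous_fraction_diploid(loh_rate: float, divisions: int) -> float:
    """Chance that a heterozygous lineage has undergone at least one LOH event."""
    return 1.0 - (1.0 - loh_rate) ** divisions


LOH_RATE = 1e-4   # events/locus/cell/division, as quoted in the text
DOUBLINGS = 14    # minimum expansion quoted in the text

diploid_fraction = homozygous_fraction_diploid(LOH_RATE, DOUBLINGS)
haploid_fraction = 1.0  # assumption: one insertion disrupts the only allele present

print(f"Blm-deficient diploid cells: {diploid_fraction:.4%} homozygous at a given locus")
print(f"Haploid cells:               {haploid_fraction:.0%}")
```

Under these assumptions, only about 0.14% of cells carrying a given heterozygous insertion become homozygous after 14 doublings, which illustrates why homozygous mutants are scarce in Blm-null libraries and why haploid cells are attractive for recessive screens.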
Although the mechanisms of self-renewal of PSCs have become clearer, less is known about how these robust pluripotency programs are modulated to enable fate transitions. PB-mediated large-scale libraries in haESCs for the genetic exploration of the exit from pluripotency have been reported, and researchers have identified the RNA binding protein Pum1 and the conserved small zinc finger protein Zfp706 as being required for timely and efficient exit from the self-renewal state [108]. In addition, the combination , which can also serve as a powerful tool for forward genetic screens in placental biology and disorders [117]. In another study, Peng et al. obtained haploid induced trophoblast stem cells (haiTSCs) from p53-deficient haESCs by overexpressing the Cdx2 gene in vitro. PB-mediated high-throughput mutagenesis in haiTSCs was performed and used to screen for factors related to the trophoblast lineage, and Htra1 was then validated as a blocker of spongiotrophoblast specification [118]. Mouse epiblast stem cells (EpiSCs) are derived from the postimplantation egg cylinder epiblast. Unlike ESCs, which are in a naïve pluripotent state, EpiSCs are in a primed pluripotent state and have been widely used to explore the intricate mechanisms of reprogramming [119-121]. Recently, Gao et al. established haploid EpiSCs (haEpiSCs) from mouse postimplantation epiblast at embryonic day 6.5 (E6.5) by microinjecting p53-knockout haESCs into normal blastocysts. Through a massive PB-mediated mutagenesis protocol, the researchers identified Hs3st3b1 as a key modulator that may impede the reprogramming process, providing a valuable resource for reprogramming research [122]. Although haploid stem cells have many advantages in genetic screening, the haploid state is generally unstable in culture. The tendency of haESCs to become diploid spontaneously hampers their application in functional genomic research [99, 100].
In a recent study, we used a genome-wide haESC homozygous mutant library based on PB transposon mutagenesis to screen for potential haploidy-maintenance factors and found that Etl4 deficiency reduced the rate of self-diploidization in haESCs. This gene was found to be linked to an energy metabolism transition, thus providing a novel strategy for maintaining haploid status during cell culture by regulating cell metabolism [123]. In another screen, resistance to olaparib, a clinical poly (ADP-ribose) polymerase (PARP) inhibitor, was examined, and the toxicity of olaparib in normal cells was found to be mediated mainly by PARP1 [124]. At present, almost all reported genetic screens based on mixed mutant pools must rely on strong positive selection of resistant clones, and "negative selection"-based screens are not easy to conduct using these mixed pools because possible interference and interplay among different mutant cells can distort the readout of the quantitative deep sequencing of such a screen [125]. Therefore, we generated arrayed haploid mutant libraries with up to 85% homozygous mutant clones and then conducted a negative screen to discover mutations conferring sensitivity to the DNA-damaging drug doxorubicin, an anticancer drug frequently used in the clinic [91]. Recently, Mao et al. developed an inducible self-inactivating PB system (named "One-Shot") that allows rapid construction of a mutant library in mouse haESCs and haploid neural stem cell-like cells (haNSCLCs) with a single-copy mutation site per cell; puromycin-related resistance was chosen to evaluate this system [126]. Through PB transposons, high-throughput trap mutations can be effectively integrated into haploid neural progenitor cells (haNPCs), which can remain haploid and maintain the potential to differentiate into neurons and glia for long periods in vitro.
The target gene of the tetrodotoxin-like toxicant A803467, B4GALT6, was subsequently uncovered using this strategy [38]. These studies have expanded the scope of genetic screens in mammalian cells.

Screening.

In addition to LOF screens, forward genetic analysis using PB-based GOF mutagenesis enables researchers to explore the function of various biological processes more fully. Since the genetic changes acquired during the culture of hPSCs may influence their availability for research and future treatments, Weissbein et al. used a PB transposon vector that contained the cytomegalovirus (CMV) enhancer and promoter sequences followed by the SD from the rabbit beta-globin intron to construct genome-wide libraries of hPSCs. After screening, they found that overexpression of the RAS pathway led to resistance to the hPSC-specific drug PluriSIn-1 and that inactivation of the RHO-ROCK pathway resulted in a growth advantage in culture adaptation [127]. As discussed earlier, EpiSCs may represent a barrier in somatic cell reprogramming. Therefore, Guo et al. performed a genome-wide PB insertional activation screen in EpiSCs to identify factors that can overcome the impediment between EpiSCs and iPSCs [35]. The gene-trap activation vector contained a murine stem cell virus (MSCV) long terminal repeat (LTR) with an SD site from exon 1 of mouse Foxf2, which could promote full or truncated protein expression when integrated upstream of or within a gene [128]. To date, GOF screening using transposons has been relatively rare and has usually been combined with LOF screening to form a bifunctional activating and inactivating transposon system. For example, transgenic mice carrying these bifunctional activating and inactivating transposons, which contain different promoter/enhancer elements and bidirectional SAs with SV40 polyA signals, have been used for the discovery of oncogenes and tumor suppressor genes [27].
Other forward genetic screening methods in functional genomics research include cDNA libraries, RNA interference (RNAi) libraries, and libraries using the CRISPR/Cas9 system for GOF or LOF screens [129]. Compared with transposon-induced mutagenesis, these methods each have distinct advantages and disadvantages, and combining different methods can provide complementary techniques for uncovering functional genes (Table 2).

Stem cells, as ideal targets for gene therapy, require effective tools for the transient or permanent transfer of genetic information into eukaryotic genomes. Through transposon-based genetic manipulation, therapeutic genes can be introduced with stable phenotypic correction, and edited stem cells can be expanded in vitro, followed by differentiation into particular cell lineages for specific therapeutic needs. Currently, there is widespread evidence that robust transposon-mediated gene transfer can be achieved in several clinically relevant stem cell types, such as hESCs, iPSCs, HSCs, MSCs, and myoblasts.

Over the past two decades, culture conditions have been a major focus of hPSC research [3, 130]. Recently, extended or expanded pluripotent stem cells (EPSCs) have been reported to have the additional ability to contribute to both embryonic and extraembryonic tissues [131-133]. Gao et al. generated doxycycline- (Dox-) dependent porcine iPSCs via stable genomic integration of complementary DNA (Yamanaka factors OCT4, MYC, SOX2, and KLF4 together with LIN28, NANOG, LRH1, and RARG) in porcine fetal fibroblasts (PFFs) using PB transposition. Under similar conditions, hESCs and hiPSCs can be transformed into EPSCs [132, 134]. The successful generation of EPSCs provides tools for embryogenesis and transformation research in regenerative medicine. In addition, hPSCs can also be used for the construction of disease models and therapeutic applications.
COVID-19, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has been declared a global pandemic by the World Health Organization. In addition to respiratory failure, COVID-19 can cause clinical complications in other systems, including metabolism, the heart, the nervous system, and the gastrointestinal tract [135]. The hESC line WAe001-A-58 was generated from the hESC line WA01 (H1) using a PB transposon vector carrying a Tet-On gene expression system for the SARS-CoV-2 nucleocapsid (N) protein-coding sequence, providing an ideal platform for further elucidating the pathological role of the N protein [136].

Initially, Dr. Shinya Yamanaka and his colleagues expressed four genes (encoding the transcription factors Oct4, Sox2, Klf4, and c-Myc) in somatic cells using retroviruses, and these somatic cells were reprogrammed into an embryonic-like state with similar developmental capabilities [5]. However, permanent insertion of viral sequences into the genome raises safety concerns that may limit the clinical applications of iPSCs. Using the PB transposon system, mouse and human iPSCs have been successfully generated, and the reprogramming factors can be removed from these pluripotent cells without leaving any trace via the reexpression of PBase [137, 138], thus minimizing potential concerns associated with insertional oncogenesis. To better control copy numbers in the genome, all four reprogramming factors can be introduced in one vector using self-cleaving 2A peptides of approximately 20 amino acids to separate the different genes [137, 138]. In recent years, PB transposons, combined with TALENs or the CRISPR/Cas9 system, have been used for the genome editing of iPSCs to correct gene defects [25, 26, 139, 140].
Genome editing relies on the introduction of double-strand breaks at target sites using nucleases, which allows error-prone nonhomologous end joining (NHEJ) or HDR to occur near the nuclease cutting site, followed by the traceless removal of selectable gene fragments via PBase [24]. This strategy has recently been used to correct mutations in the hemoglobin beta chain gene. By combining PB with TALENs or CRISPR/Cas9, the mutated β-globin gene in sickle cell disease- (SCD-) specific iPSCs or β-thalassemia patient-derived iPSCs was seamlessly corrected without any detectable off-target or adverse chromosomal alterations [25, 26]. Similarly, it has recently been shown in iPSCs derived from patients with Huntington's disease that the combination of the PB transposon with the CRISPR/Cas9 system may support gene therapy for genetic disorders induced by trinucleotide repeat expansion [141]. The corrected stem cells successfully differentiated into excitable, synaptically active forebrain neurons. Genetic manipulation of iPSCs before transplantation may further threaten genomic stability, which can affect their differentiation, characterization, and tumorigenicity and lead to uncontrolled cellular behavior [142]. Therefore, whole-genome sequencing is needed to detect such changes [143], and more preclinical trials in mice and other animal models will be necessary to further confirm the therapeutic potential of reprogrammed cells in vivo [144].

HSCs are ideal tools for gene therapy in hematologic diseases due to their ability to self-renew and differentiate into different lymphohematopoietic lineages. The PB [145, 146]. PB transposon-modified HSCs continue to express functional globin chain proteins, exhibiting a reduced sickle phenotype and an improvement in disease progression. Later, the hyPBase, which is more active than SB100X in other cell types, was developed [147], but it has not yet been compared with SB100X in HSCs.
Human MSCs originate from human embryonic mesoderm and can be isolated from fetal and adult tissues, such as bone marrow (BM), umbilical cord (UC), and adipose tissue [9]; they are a heterogeneous subset of nonhematopoietic multipotent stromal stem cells. MSCs can be differentiated into ectodermal (e.g., neuronal cells), mesodermal (e.g., osteocytes, chondrocytes, and adipocytes), and endodermal lineages (e.g., hepatocytes). It has been reported that the PB system is applicable to gene integration in MSCs [148]. Yang et al. generated immortalized human UC-derived MSCs (iUC-MSCs) using the PB-based simian virus 40 T antigen (SV40T) system. These cells expressed MSC markers, did not induce tumorigenesis in vivo, and retained the potential for trilineage differentiation after BMP9 stimulation, which has laid a foundation for further study and applications of UC-MSCs [149]. Moreover, MSCs are considered excellent cancer therapeutic tools in view of their unique ability to target tumor cells. Interferon-gamma- (IFN-γ-) expressing adipose-derived MSCs (AD-MSCs) generated by PB-mediated gene transfer engrafted into the tumor stroma in a mouse model of melanoma, where they inhibited tumor growth and angiogenesis and prolonged the survival of the mice, with important implications for future cancer treatment [150].

Myoblasts are self-renewing adult muscle progenitor cells that can eventually differentiate into skeletal muscle fibers for the potential treatment of muscle disorders. Reports have shown that PB-mediated gene transfer can be used to deliver therapeutic genes into myoblasts effectively. Using the PB transposon system, the genes encoding either full-length human dystrophin or truncated microdystrophins could be successfully introduced into myoblasts and expressed in differentiated multinucleated myotubes [151], paving the way toward a PB-mediated gene therapy approach for Duchenne muscular dystrophy (DMD).

4.6. Safety Issues of piggyBac.
DNA transposons allow nonviral stable gene transfer and could potentially replace viral vectors, but several safety issues still need careful consideration.

(i) Insertional mutagenesis is one of the major concerns of any integration-based gene therapy. Since PB transposons preferentially integrate into transcriptional units, they may activate oncogenes or disrupt tumor suppressor genes and thus promote malignant transformation. Nevertheless, no growth advantage was observed in PB-modified primary human cells during a 140-day experiment [152], and no observable tumor formation was found in the livers of wild-type mice one year after modification with PB [153]. In addition, transposon integrations can be redirected to a demonstrated safe-harbor site by using a transposase modified to carry site-specific DNA binding domains at its N- or C-terminus [82].

(ii) Investigators have found that the probability of plasmid backbone DNA integration is relatively high in PB-modified human embryonic kidney (HEK-293) cells [152]. Although this problem can be nearly eliminated by sorting cells by flow cytometry on the basis of characteristic elements in the integrated backbone, it still requires close attention. It also remains to be explored whether plasmid backbone integration occurs in clinically relevant cells and animal models.

(iii) Previous studies [51] have reported that the 5′ PBITR has potential promoter activity. To avoid any influence of this activity, gene-trap cassettes can be placed in the orientation opposite to the 5′ PBITR for chromatin integration [52].

(iv) Despite the widespread assumption that nonviral vectors should not elicit any immune response, foreign DNA itself has the potential to activate the innate immune system [154]. Thus, some immune regulation may still be needed after stable transposition into the host genome.

Transposon-based technologies hold great promise for the development of powerful genomic tools.
There is no doubt that there will be more reports in the future using PB for gene delivery in stem cells and other fields of research. By combining transposon technology with accurate gene editing techniques, the continued development, refinement, and clinical transformation using PB may herald an exciting and promising new era of gene therapy.

No data is available.

The authors declare that they have no conflict of interest.

Y. H. conceived and designed the study. Y. S. and G. L. were engaged in data collection, organization, and analyses. The manuscript was written and revised by Y. S., G. L., and Y. H.

We are grateful to Meili Zhang for helpful discussion and comments on the manuscript.

[1] Establishment in culture of pluripotential cells from mouse embryos
[2] Isolation of a pluripotent cell line from early mouse embryos cultured in medium conditioned by teratocarcinoma stem cells
[3] Embryonic stem cell lines derived from human blastocysts
[4] Embryonic stem cell lines from human blastocysts: somatic differentiation in vitro
[5] Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors
[6] Induced pluripotent stem cell lines derived from human somatic cells
[7] Induction of pluripotent stem cells from adult human fibroblasts by defined factors
[8] Induced pluripotent stem cells in disease modelling and drug discovery
[9] "Mesenchymal" stem cells
[10] Blood development: hematopoietic stem cell dependence and independence
[11] Hallmarks of pluripotency
[12] How human embryonic stem cells sparked a revolution
[13] Transposases are the most abundant, most ubiquitous genes in nature
[14] Ten things you should know about transposable elements
[15] Preclinical and clinical advances in transposon-based gene therapy
[16] Transposons: moving forward from preclinical studies to clinical trials
[17] Molecular Reconstruction of Sleeping Beauty, a Tc1-like Transposon from Fish, and Its Transposition in Human Cells
[18] Cancer gene discovery in solid tumours using transposon-based somatic mutagenesis in the mouse
[19] Chromosomal transposition of a Tc1/mariner-like element in mouse embryonic stem cells
[20] Efficient Transposition of the piggyBac (PB) Transposon in Mammalian Cells and Mice
[21] PiggyBac Transposon-mediated Gene Transfer in Human Cells
[22] Chromosomal transposition of PiggyBac in mouse embryonic stem cells
[23] piggyBac is a flexible and highly active transposon as compared to sleeping beauty, Tol2, and Mos1 in mammalian cells
[24] Structural basis of seamless excision and specific targeting by piggyBac transposase
[25] Seamless correction of the sickle cell disease mutation of the HBB gene in human induced pluripotent stem cells using TALENs
[26] Seamless gene correction of β-thalassemia mutations in patient-specific iPSCs using CRISPR/Cas9 and piggyBac
[27] PiggyBac transposon mutagenesis: a tool for cancer gene discovery in mice
[28] Efficient Production of Fluorescent Transgenic Rats using the piggyBac Transposon
[29] Generation of Transgenic Rats through Induced Pluripotent Stem Cells
[30] Pig transgenesis by piggyBac transposition in combination with somatic cell nuclear transfer
[31] Generation of transgenic pigs by cytoplasmic injection of piggyBac transposase-based pmGENIE-3 plasmids
[32] Generation of a transgenic cashmere goat using the piggyBac transposition system
[33] PiggyBac Transposon-Based Insertional Mutagenesis in Mice
[34] A piggyBac transposon-based genome-wide library of insertionally mutated Blm-deficient murine ES cells
[35] A genome-wide screen in EpiSCs identifies Nr5a nuclear receptors as potent inducers of ground state pluripotency
[36] A high-throughput screen for genes essential for PRRSV infection using a piggyBac based system
[37] PiggyBac transposon tools for recessive screening identify B-cell lymphoma drivers in mice
[38] Genetic screening and multipotency in rhesus monkey haploid neural progenitor cells
[39] Direct reprogramming to human nephron progenitor-like cells using inducible piggyBac transposon expression of SNAI2-EYA1-SIX1
[40] Genetic engineering of human pluripotent stem cells using PiggyBac transposon system
[41] Conversion of human induced pluripotent stem cells (iPSCs) into functional spinal and cranial motor neurons using PiggyBac vectors
[42] piggyBac-ing models and new therapeutic strategies
[43] Combination of the clustered regularly interspaced short palindromic repeats (CRISPR)-associated 9 technique with the piggybac transposon system for mouse in utero electroporation to study cortical development
[44] RNA-guided piggyBac transposition in human cells
[45] HIV-1 inhibition in cells with CXCR4 mutant genome created by CRISPR-Cas9 and piggyBac recombinant technologies
[46] Enhanced Expression of Anti-CD19 Chimeric Antigen Receptor in piggyBac Transposon-Engineered T Cells
[47] A new approach to CAR T-cell gene engineering and cultivation using piggyBac transposon in the presence of IL-4, IL-7 and IL-21
[48] Modified CAR T cells targeting membrane-proximal epitope of mesothelin enhances the antitumor function against large solid tumor
[49] Development of CAR T-cell lymphoma in two of ten patients effectively treated with piggyBac modified CD19 CAR T-cells
[50] Acquisition of Host Cell DNA sequences by baculoviruses: relationship between host DNA insertions and FP mutants of Autographa californica and galleria mellonella nuclear polyhedrosis viruses
[51] Transposon mutagenesis of baculoviruses: Analysis of Trichoplusia ni transposon IFP2 insertions within the FP-locus of nuclear polyhedrosis viruses
[52] Minimal piggyBac vectors for chromatin integration
[53] Transposon-mediated genome manipulation in vertebrates
[54] Generation of an inducible and optimized piggyBac transposon system
[55] High-throughput retroviral tagging to identify components of specific signaling pathways in cancer
[56] Assay for movement of Lepidopteran transposon IFP2 in insect cells using a baculovirus genome as a target DNA
[57] The functionality of minimal PiggyBac transposons in mammalian cells
[58] Analysis of PBase binding profile indicates an insertion target selection mechanism dependent on TTAA, but not transcriptional activity
[59] Seamless genome editing in human pluripotent stem cells using custom endonuclease-based gene targeting and the piggyBac transposon
[60] Integration Mapping of piggyBac-Mediated CD19 Chimeric Antigen Receptor T Cells Analyzed by Novel Tagmentation-Assisted PCR
[61] Genomic safe harbors permit high β-globin transgene expression in thalassemia induced pluripotent stem cells
[62] Gene insertion into genomic safe harbors for human gene therapy
[63] Transposons as tools for functional genomics in vertebrate models
[64] Gene-trap mutagenesis: past, present and beyond
[65] Gene trap mutagenesis in the mouse
[66] Large-scale, saturating insertional mutagenesis of the mouse genome
[67] Suppression of nonsense-mediated mRNA decay permits unbiased gene trapping in mouse embryonic stem cells
[68] Recent developments in transposon-mediated gene therapy
[69] Development of Hyperactive Sleeping Beauty Transposon Vectors by Mutational Analysis
[70] Mobilization of giant piggyBac transposons in the mouse genome
[71] Chromosomal mobilization and reintegration of sleeping beauty and PiggyBac transposons
[72] An efficient and reversible transposable system for gene delivery and lineage-specific differentiation in human embryonic stem cells
[73] A hyperactive piggyBac transposase for mammalian applications
[74] Hyperactive PiggyBac Transposons for Sustained and Robust Liver-targeted Gene Therapy
[75] piggyBac transposase tools for genome engineering
[76] Optimization of the piggyBac transposon using mRNA and insulators: toward a more reliable gene delivery system
[77] In vitro synthesis, delivery, and bioavailability of exogenous mRNA in gene transfer mediated by PiggyBac transposition
[78] piggyBac can bypass DNA synthesis during cut and paste transposition
[79] ZFN, TALEN, and CRISPR/Cas-based methods for genome engineering
[80] Genome editing in human pluripotent stem cells: approaches, pitfalls, and solutions
[81] Transcription activator like effector (TALE)-directed piggyBac transposition in human cells
[82] Manipulating piggyBac Transposon Chromosomal Integration Site Selection in Human Cells
[83] TAL effectors mediate high-efficiency transposition of the piggyBac transposon in silkworm Bombyx mori L.
[84] Multiplex genome engineering using CRISPR/Cas systems
[85] CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes
[86] Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex
[87] CRISPR/d-Cas9-mediated transposition with specificity and efficiency of site-directed genomic insertions
[88] Genome-wide forward genetic screens in mouse ES cells
[89] Reverse genetics in eukaryotes
[90] Going forward with genetics: recent technological advances and forward genetics in mice
[91] Arrayed mutant haploid embryonic stem cell libraries facilitate phenotype-driven genetic screens
[92] TrapSeq: An RNA Sequencing-Based Pipeline for the Identification of Gene-Trap Insertions in Mammalian Cells
[93] A conditional knockout resource for the genome-wide study of mouse gene function
[94] Isolation of homozygous mutant mouse embryonic stem cells using a dual selection system
[95] Cancer predisposition caused by elevated mitotic recombination in bloom mice
[96] Mismatch repair genes identified using genetic screens in Blm-deficient embryonic stem cells
[97] Genome-wide phenotype analysis in ES cells by regulated disruption of Bloom's syndrome gene
[98] Generation of medaka fish haploid embryonic stem cells
[99] Forward and reverse genetics through derivation of haploid mouse embryonic stem cells
[100] Derivation of haploid embryonic stem cells from mouse embryos
[101] Androgenetic haploid embryonic stem cells produce live transgenic mice
[102] Generation of genetically modified mice by oocyte injection of androgenetic haploid embryonic stem cells
[103] Genetic modification and screening in rat using haploid embryonic stem cells
[104] Generation of haploid embryonic stem cells from Macaca fascicularis monkey parthenotes
[105] Derivation and differentiation of haploid human embryonic stem cells
[106] Generation of human haploid embryonic stem cells from parthenogenetic embryos obtained by microsurgical removal of male pronucleus
[107] In vitro expansion of human sperm through nuclear transfer
[108] Genetic exploration of the exit from self-renewal using haploid embryonic stem cells
[109] CRISPR-Cas9-mediated genetic screening in mice with haploid embryonic stem cells carrying a guide RNA library
[110] Identification of Spen as a Crucial Factor for Xist Function through Forward Genetic Screening in Haploid Embryonic Stem Cells
[111] Genome-wide genetic screening with chemically mutagenized haploid embryonic stem cells
[112] A reversible haploid mouse embryonic stem cell biobank resource for functional genomics
[113] Genome-wide barcoded transposon screen for cancer drug sensitivity in haploid mouse embryonic stem cells
[114] CRISPR-Cas9-mediated base-editing screening in mice identifies DND1 amino acids that are critical for primordial germ cell development
[115] Defining essential genes for human pluripotent stem cells by CRISPR-Cas9 screening in haploid cells
[116] Targeted genetic screening in mice through haploid embryonic stem cells identifies critical genes in bone development
[117] Current advances in haploid stem cells
[118] Derivation of Haploid Trophoblast Stem Cells via Conversion In Vitro
[119] Klf4 reverts developmentally programmed restriction of ground state pluripotency
[120] Epiblast stem cell-based system reveals reprogramming synergy of germline factors
[121] Epigenetic reversion of postimplantation epiblast to pluripotent embryonic stem cells
[122] High-throughput screening in postimplantation haploid epiblast stem cells reveals Hs3st3b1 as a modulator for reprogramming
[123] A Genetic Screen Identifies Etl4-Deficiency Capable of Stabilizing the Haploidy in Embryonic Stem Cells
[124] A genetic screen using the PiggyBac transposon in haploid cells identifies Parp1 as a mediator of olaparib toxicity
[125] Global gene disruption in human cells to assign genes to phenotypes by deep sequencing
[126] Rapid construction of a whole-genome mutant library by combining haploid stem cells and inducible self-inactivating PiggyBac transposon
[127] Genome-wide screen for culture adaptation and tumorigenicity-related genes in human pluripotent stem cells
[128] Mammalian mutagenesis using a highly mobile somatic Sleeping Beauty transposon system
[129] Haploid mouse embryonic stem cells: rapid genetic screening and germline transmission
[130] Developments in cell culture systems for human pluripotent stem cells
[131] Derivation of pluripotent stem cells with in vivo embryonic and extraembryonic potency
[132] Establishment of porcine and human expanded potential stem cells
[133] Establishment of mouse expanded potential stem cells
[134] In vitro establishment of expanded-potential stem cells from mouse pre-implantation embryos or embryonic stem cells
[135] A human pluripotent stem cell-based platform to study SARS-CoV-2 tropism and model virus infection in human cells and organoids
[136] Generation of WAe001-A-58 human embryonic stem cell line with inducible expression of the SARS-CoV-2 nucleocapsid protein
[137] piggyBac transposition reprograms fibroblasts to induced pluripotent stem cells
[138] Generation of transgene-free induced pluripotent mouse stem cells by the piggyBac transposon
[139] Genomic editing tools to model human diseases with isogenic pluripotent stem cells
[140] Efficient, footprint-free human iPSC genome editing by consolidation of Cas9/CRISPR and piggyBac technologies
[141] Reversal of phenotypic abnormalities by CRISPR/Cas9-mediated gene correction in Huntington disease patient-derived induced pluripotent stem cells
[142] Refining strategies to translate genome editing to the clinic
[143] DNA sequencing at 40: past, present and future
[144] Translational genomics and precision medicine: moving from the lab to the clinic
[145] Molecular evolution of a novel hyperactive Sleeping Beauty transposase enables robust stable gene transfer in vertebrates
[146] Comparative analysis of transposable element vector systems in human cells
[147] Hyperactive piggyBac gene transfer in human cells and in vivo
[148] Characterization of constitutive promoters for piggyBac transposon-mediated stable transgene expression in mesenchymal stem cells (MSCs)
[149] Reversibly immortalized human umbilical cord-derived mesenchymal stem cells (UC-MSCs) are responsive to BMP9-induced osteogenic and adipogenic differentiation
[150] PhiC31/PiggyBac modified stromal stem cells: effect of interferon γ and/or tumor necrosis factor (TNF)-related apoptosis-inducing ligand (TRAIL) on murine melanoma
[151] piggyBac transposons expressing full-length human dystrophin enable genetic correction of dystrophic mesoangioblasts
[152] Evaluating the potential for undesired genomic effects of the piggyBac transposon system in human cells
[153] PiggyBac Transposon-based Inducible Gene Expression In Vivo After Somatic Cell Gene Transfer
[154] Somatic integration and long-term transgene expression in normal and haemophilic mice using a DNA transposon system
[155] Derivation of mouse haploid trophoblast stem cells
[156] piggyBac Insertional Mutagenesis Screen Identifies a Role for Nuclear RHOA in Human ES Cell Differentiation