key: cord-0008321-1ev08jfi authors: Butt, Tauseef R.; Edavettal, Suzanne C.; Hall, John P.; Mattern, Michael R. title: SUMO fusion technology for difficult-to-express proteins date: 2005-04-09 journal: Protein Expr Purif DOI: 10.1016/j.pep.2005.03.016 sha: f3bcd0037055c2315c06dcf89726c4a9a303a3d8 doc_id: 8321 cord_uid: 1ev08jfi The demands of structural and functional genomics for large quantities of soluble, properly folded proteins in heterologous hosts have been aided by advancements in the field of protein production and purification. Escherichia coli, the preferred host for recombinant protein expression, presents many challenges which must be surmounted in order to over-express heterologous proteins. These challenges include the proteolytic degradation of target proteins, protein misfolding, poor solubility, and the necessity for good purification methodologies. Gene fusion technologies have been able to improve heterologous expression by overcoming many of these challenges. The ability of gene fusions to improve expression, solubility, purification, and decrease proteolytic degradation will be discussed in this review. The main disadvantage, cleaving the protein fusion, will also be addressed. Focus will be given to the newly described SUMO fusion system and the improvements that this technology has advanced over traditional gene fusion systems. EYcient recombinant protein expression is a major bottleneck for structural genomics and proteomics. Despite progress in automation, soluble protein expression is frequently the rate-limiting step for many researchers. The preferred host for recombinant protein expression has historically been Escherichia coli (E. coli) due to the simplicity and low costs associated with using this host. While E. coli has proved a successful host for the expression of many heterologous proteins, it is frequently not capable of expressing soluble heterologous proteins. The Southeast Collaboratory for Structural Genomics (SECSG) 1 reports that of the 6386 proteins they have expressed in E. coli only 22.7% (1452) have been soluble (as published on SECSG web page 03/04/ 2005). Much advancement has been made toward improving recombinant protein expression in E. coli, including the development of strong promoters [1] , coexpression with chaperones [2] , and through the use of protein fusions. No other technology has been as eVective at improving the solubility of recombinant proteins as fusion systems, especially for diYcult-to-express proteins. A variety of structures have been used as fusion motifs (Table 1) . These fusion proteins are frequently employed to enhance protein expression and facilitate puriWcation [3] [4] [5] . There is no particular similarity among these proteins in terms of molecular weight, structure, or function, with the exception of ubiquitin (Ub) and SUMO, which share a common structure [6] . This review will discuss the advantages of fusion technologies. These advantages include the manner in which protein expression is enhanced, proteolytic degradation of the target protein is decreased, protein folding and solubility are increased, and puriWcation and detection are simpliWed. The main disadvantage of fusion technology, cleaving the protein fusion, is also covered. In addition, focus is given to the newly described SUMO fusion systems. SUMO fusions appear to have all of the advantages of traditional fusion systems, and due to the activity of SUMO proteases do not encounter the same diYculties with cleavage. Proteolysis is highly regulated and plays critical roles in maintaining cellular homeostasis, including removing unwanted or incorrectly folded proteins from the cell [7] . Often, recombinant proteins are viewed as unwanted by cells and are subjected to proteolytic degradation, decreasing the level of recombinant protein expression (reviewed in [8] ). Several strategies have been developed to protect recombinant proteins from degradation including the use of protease inhibitors [9] , secretion into the periplasm [10] or culture medium [11] , and generating protective fusions [12] . Fusions between the N-terminus of target proteins and protein tags (Table 1) have been shown to protect the target protein from degradation [13] [14] [15] . Furthermore, fusions with C-terminus [16] , dual C-and N-terminal fusions [17] , and tandem fusions of multiple copies of the target gene [18] have been shown to aVord protection from proteolytic degradation. The compartmentalization hypothesis describes the mechanism by which gene fusions protect against proteolytic degradation [19] . Fusions can promote the translocation of their partner proteins to diVerent cellular compartments, thereby decreasing the concentration of the recombinant protein in the protease-rich cytosol. For example, maltose binding protein (MBP) can translocate to the membrane compartment of the cell [20] or SUMO can translocate from the cytosol to the nucleus [21] . The tags can thereby compartmentalize their part-ner proteins and decrease the susceptibility to proteolytic degradation. Protein expression is a complex process dependent upon mRNA stability and translational eYciency, as well as transcriptional regulation. Enhanced recombinant protein expression is the result of a high mRNA copy number, eYcient translational initiation and elongation, stability of the mRNA, and translation enhancers (reviewed in [22] ). Many heterologous genes are not translated eYciently due to codon usage bias. Codon bias has been implicated as one of the main reasons for ineYcient translation (e.g., malaria parasite genes [23] ). Codon bias has been overcome by engineering new strains or cell lines that contain rare tRNAs or by altering the problematic codons to more common prokaryotic codons [24] . Strong and highly regulated promoters from E. coli, yeast, and insect cells are available [14, 25, 26] , and as such transcription of the heterologous gene is usually not a rate-limiting factor. It has been observed that many fusion proteins enhance protein expression [14, 15, 27] , however, the exact mechanism by which fusion proteins achieve this enhanced expression is unknown. It has been speculated that enhanced expression is the result of the highly conserved structures of these proteins. Attachment of a highly evolved translational frame at the N-terminus of an ineYciently translated protein may help to improve the latter's eYciency of expression [28] . Although E. coli is usually the Wrst choice as a recombinant expression organism, many eukaryotic proteins, especially proteins with disulWde bridges, cannot be expressed as soluble, active, and properly folded proteins in E. coli [29] . When a large quantity of protein is expressed, macromolecular crowding (200-300 mg/ml in the cytoplasm) presents an unfavorable environment for protein folding. Frequently, the result of a high concentration of incorrectly folded recombinant protein is the formation of inclusion bodies [30] . In fact, the level of aggregated protein can increase to the extent that inclusion bodies are observable by light microscopy as round bodies surrounding the cytosol of E. coli [31] . While the formation of inclusion bodies does aVord protection from proteolytic degradation, and re-folding can recover active proteins from inactive inclusion bodies, the initial expression of correctly folded, soluble recombinant protein is ideal [32] . Several strategies have been developed to promote the expression of properly folded recombinant protein, including co-expression of molecular chaperones [2] and foldases [33] , expression of secreted proteins [34] , and expression of protein fusions [35] . Fusion partners ( Table 1) have been shown to act as solubility enhancers, Table 1 Fusion partner proteins, which enhance expression and simplify puriWcation Abbreviations used: SUMO, small ubiquitin modifying protein; Ni-NTA, nickel-nitriloacetic acid; CTHS, C-terminal half of SUMO; NTHS, N-terminal half of SUMO; GST, glutathione S-transferase; MBP, maltose binding protein; Trx, thioredoxin; Ub, ubiquitin. a The N-terminus of SUMO is very Xexible and can accommodate a variety of aYnity tags (e.g., GST). The SUMO system can therefore be tailored to the researcher's desired aYnity tag. although the exact mechanism by which they improve solubility has not been described. It has been hypothesized that these fusions may act as chaperones [36] . The fusion of a stable or conserved structure to an insoluble recombinant protein may serve to stabilize and promote proper folding of the recombinant protein. MBP has been shown to function as a general molecular chaperone in the context of a fusion protein by binding to aggregation-prone folding intermediates of passenger proteins and preventing their self-association [36] . Fusion tags have also been hypothesized to enhance the solubility of the protein target by acting as a nucleus of folding ("molten globule hypothesis") [37, 38] . This theory suggests that a fusion tag acts as a nucleation site for the folding of the target protein. Ub, which has a highly hydrophobic core, has been shown to be the fastest folding protein known [39] . Ub's ancient structure is highly conserved in SUMO, as well as in other ubiquitin-like proteins used as fusions [6] . When fused to Ub or SUMO, otherwise insoluble proteins have been observed to fold properly and be soluble [14, 15] . The use of protein fusion technology oVers the opportunity to simplify and facilitate puriWcation and detection of recombinant proteins. Fusing a tag to the target protein provides a one-step puriWcation procedure by passing cell extracts or supernatants over an appropriate aYnity matrix. Numerous examples of aYnity puriWcation exist for fusion proteins, including nickel-nitriloacetic acid to isolate hexahistidine-fused proteins [40] , biotin to isolate streptavidin-fused proteins [41] , and amylose to isolate MBP-fused proteins [42] . The fusion tag can also be used to identify the recombinant product through Western blotting by way of anti-tag antibodies [43] . Cleavage of protein fusions to generate free protein remains the major disadvantage of protein fusion technologies. Cleavage of the fusion is usually necessary because the fusion interferes with the structural or functional properties of the recombinant protein [44] . A variety of chemical and enzymatic methodologies have been developed to cleave fusions [45, 46] . These methods include the use of engineered cleavage sites, which are recognized by the proteases and are positioned between the fusion tag and the protein target. Proteases that have been employed to cleave fusion tags include tobacco etch virus (Tev) protease [47] , factor Xa, or thrombin protease (reviewed in [46] ). Problems associated with proteolytic cleavage of fusion tags are low yield, precipitation of the protein of interest, labor-intensive optimization of cleavage conditions, expense of proteases, and failure to recover active, structurally intact protein [48] . Another major problem encountered with the cleavage of fusion proteins is the generation of non-native Nterminal amino acids. Many structural and therapeutic proteins require a speciWc N-terminus, other than methionine, for biological activity (e.g., chemokines). All nascent proteins have methionine as their N-terminal residue; some (e.g., precursor proteins, which undergo proteolytic maturation) then undergo post-translational processing and modiWcation, leading to a variety of Nterminal amino acids. Experiments performed by Varshavsky and co-workers [49] in the 1980s demonstrated that the expression of ubiquitin--galactosidase in yeast leads to rapid processing of ubiquitin in the cells. The processing of Ub-X-fusion, where X is any amino acid, was independent of the identity of residue X [19, 49] . It was observed that the in vivo half-life of the resulting protein varied as a function of the N-terminal residue of the protein (residue X). The "N-end rule" proposes that there is a relationship between the identity of a protein's N-terminal residue and its half-life. For an in depth discussion of the "N-end rule," please see appropriate publications of Varshavsky and co-workers [19, 49] . Cleavage by the aforementioned proteases results in the retention of several amino acids, which are downstream from the cleavage site and required for protease recognition. For example, thrombin will cleave the sequence LVPRGS at the arginine residue, resulting in an N-terminal extension of the target protein by two amino acids (GS) [46] . Since many proteins require a speciWc N-terminus for biological activity, half-life, or structural stability, this characteristic of protease cleavage can have serious eVects on the ability to recombinantly produce active protein. Notable exceptions include the intein, SUMO, and Ub fusion systems. The intein fusion system utilizes the inducible self-cleavage activity of engineered protein splicing elements (termed inteins). In the presence of thiols such as DTT,mercaptoethanol or cysteine, the intein undergoes spe-ciWc self-cleavage which releases the target protein from the tag. When fused directly to the N-terminus of the target protein cleavage results in the production of a target protein without any extra non-native residues attached to its terminus after cleavage [45] . Unlike intein-mediated cleavage, the Ub and SUMO fusion systems do require the use of proteases for removal of the tag. However, the SUMO proteases and the deubiquitylating enzymes (DUBs) are distinct from other proteases that recognize a peptide sequence, as they recognize the tertiary structure of the SUMO or Ub tag. When the target protein is fused directly to the C-terminus of SUMO or Ub, cleavage will not result in extraneous residues at the N-terminus of the target protein and therefore will yield native-like protein [15, 49] . The DUBs have a major drawback in that complete cleavage of the Ub tag requires a large amount of enzyme (1:10 molar ratio of DUB to target) [50] . The SUMO protease has been reported to be much more robust, requiring only a 1:5000 molar ratio of protease to target [15] . The new generation of SUMO proteases has been shown to be even more catalytically active requiring only 1:100,000 molar ratio of protease to target (Butt et al. unpublished results) (see further discussion below). The SUMO family of proteins has been described extensively in the literature (reviewed in [51] ). SUMO and the SUMO pathway are highly conserved in all eukaryotes from yeast to humans, but are absent from prokaryotes ( Fig. 1 ) [52] [53] [54] . Saccharomyces cerevisiae has only a single SUMO gene, SMT3 which is essential for viability [55, 56] . In contrast, vertebrates have three SUMO genes, SUMO-1, SUMO-2, and SUMO-3. The three human SUMOs are highly homologous, with human SUMO-1 sharing 50% sequence identity with human SUMO-2 and SUMO-3 [56] , and human SUMO-2 and SUMO-3 sharing 87% sequence identity with each other [57] . SMT3 shares 47% sequence identity with human SUMO-1. Although overall sequence identity between Ub and SUMO is only 18%, structure determination by nuclear magnetic resonance (NMR) reveals that the two proteins possess a common three-dimensional structure that is characterized by a tightly packed globular fold withsheets wrapped around one -helix [6] . The family of SUMO proteins function, like Ub, as covalent modiWers of other proteins, and sumoylation occurs in a similar fashion to the ubiquitination of proteins [57] [58] [59] (Fig. 1) . SUMO is activated in an ATPdependent step by the formation of a thioester bond with the SUMO activating enzyme E1. Following activation, SUMO is transferred to SUMO E2, or SUMO-conjugating enzyme. Following conjugation to E2, SUMO is transferred to its target protein through the activity of SUMO E3. These modiWcations can have a variety of cellular consequences. Sumoylation can antagonize the action of ubiquitination by preventing ubiquitin-meditated proteolysis [60] . SUMO conjugation has also been observed to impact higher-order chromatin structure [61] , transcriptional regulation [62] , DNA repair pathways [63] , nuclear transport [64] , and signal transduction pathways [65] . SUMO conjugation to target proteins is a dynamic process, which changes in response to a variety of stimuli [66] . SUMO can be removed from target proteins enzymatically by SUMO C-terminal hydrolases-isopeptidases, several of which are now known in many species including yeast (Ulp1 and Ulp2) [67] [68] [69] , Arabidopsis [70] , and humans [59, [71] [72] [73] . SUMO proteases share a common C-terminal domain (Ulp domain), and have no sequence homology to the DUBs [67] [68] [69] . Recently, SUMO has been fused to the N-terminus of several proteins, including matrix metalloprotease (MMP13), green Xuorescent protein (GFP), and SARS-CoV 3CL protease, and used as a recombinant expression system [15, 74] (www.lifesensors.com). SUMO fusion leads to enhanced expression and solubility ( Fig. 2A and unpublished results). For example, when MMP13 is expressed without SUMO fusion in E. coli, it is contained primarily in inclusion bodies. However, when MMP13 is expressed as a SUMO fusion, MMP13 was observed primarily in the soluble fraction [15] . The eVect that SUMO has on enhancing protein solubility has been explained in part by the structure of SUMO. SUMO has an external hydrophilic surface and inner hydrophobic core, which may exert a detergent-like eVect on otherwise insoluble proteins [15] . A hexahistidine SUMO fusion construct has been shown to enhance expression and facilitate puri-Wcation with Ni-NTA chromatography. Ni-NTA chromatography has been used to purify the fusion from the cellular lysate (Fig. 2B) [15] . As described above, the major disadvantage of fusion protein technology is cleavage of the tag. Com- Fig. 1 . The SUMO cycle. SUMO is synthesized as a precursor and cleaved by SUMO proteases (yeast Ulp1 or Ulp2). Activating enzyme (E1), conjugating enzyme (E2), and ligase (E3) are named according to yeast where most were discovered. monly used proteases do not cleave all fusions eYciently, accurately and, moreover, can generate extraneous amino acids at the N-terminus of the target protein [75] . SUMO proteases, which are members of the cysteine protease superfamily, are able to overcome these diYculties. SUMO proteases (LifeSensors) are accurate and eYcient in cleavage of the SUMO tag and allow for retention of the desired N-terminus. To date, we have cleaved approximately 100 SUMO fusions and never observed erroneous cleavage within the partner protein. Unlike other proteases that recognize a peptide sequence, SUMO proteases recognize the tertiary structure of the SUMO tag and cleavage does not result in an extended the N-terminus in the partner protein. Owing to its unique structure, SUMO protease can accommodate a variety of SUMO fusion partner proteins (Fig. 4) . SUMO proteases have been observed to completely cleave a broad range (6-110 kDa) of partner proteins fused to SUMO [15, 74] . In addition, SUMO protease is able to cleave eYciently under a wide range of conditions, including pH, temperature, and ionic strength (Fig. 3) . Unfortunately, SUMO protease is not able to cleave target proteins which contain an N-terminal proline. SUMO proteases have a con- Fig. 2 . Enhanced expression and puriWcation using the SUMO fusion system. (A) Enhanced expression of SARS-CoV 3CL protease (3CL) by SUMO fusion in E. coli. Cells grown in Luria-Bertani (LB) media were induced at the temperatures and for the lengths of time indicated. Just before expression was induced and after induction was completed the cells from a 1.5 ml aliquot of culture were lysed. Samples of whole cell lysates (»7.5 l) from the various expression conditions were resolved in 12% SDS-gels and stained with Coomassie blue. Molecular weights (in kDa) were as indicated, and arrowheads highlight expected/observed positions of respective expressed protein bands [74] . (B) An SDS gel depicting the puriWcation of SARS-CoV 3CL protease. Total cell lysate was passed over a Ni-NTA column, washed with 40 mM imidazole, and eluted with 300 mM imidazole (aYnity puriWed). Cleavage with the SUMO protease was conducted under standard conditions, and the sample was passed over another Ni-NTA column to remove the SUMO protease and tag (subtracted). Aliquots of the samples (each containing »5 g protein) were separated on a 12% SDSgel and stained with Coomassie blue. The migration positions of the SUMO fusion and the proteins resulting from the cleavage are as indicated [74] . Fig. 3 . EVect of various conditions on the activity of SUMO protease. SUMO-green Xuorescent protein fusion (SUMO-GFP) (2.5 g) was incubated with SUMO protease (1:5000 molar ratio of SUMO-GFP to protease) for 1 h under conditions described in the Wgure: temperatures of 4 or 37°C, and concentration ranges of imidazole (at 25°C), sodium dodecyl sulfate (SDS) (at 25°C), Triton X-100 (at 25°C), urea (at 25°C), or guanidine hydrochloride (at 25°C). The data show that the enzyme is active over a broad temperature range and tolerates highly adverse biochemical conditions [15] . strictive hydrophobic tunnel within the active site, and substrates must pass through this tunnel during cleavage (Fig. 4) [76] . It has been hypothesized that the constrictive tunnel is unable to accommodate the structural changes induced by prolines near the clevage site. This review is primarily focused on the role of SUMO fusion expression systems in prokaryotic cells. However, it is relevant to mention that fusion of Ub to under-expressed genes in eukaryotes enhances the level of protein production, even though the Ub tag is cleaved by endogenous DUBs [77] . We have also observed dramatic enhancement of under-expressed proteins in eukaryotes after fusion with SUMO. Similar to Ub, the SUMO tag is cleaved soon after translation (Edavettal and Butt, unpublished) . These studies suggest that protein expression enhancing properties of Ub and like proteins (SUMO) are preserved in eukaryotes. The lack of an endogenous SUMO protease in E. coli facilitates the use of SUMO as a puriWcation tag in this host organism. However, in eukaryotic systems, endogenous SUMO proteases immediately cleave SUMO fusions, making puriWcation using the SUMO tag impossible. A novel fusion system, split SUMO, has been developed to overcome this problem. In the split SUMO system, SUMO is bifurcated into N-and C-terminal halves (NTHS and CTHS) (Fig. 5A) . Fusion of the CTHS to the N-terminus of a target protein allows enhancement of expression in eukaryotes and, most importantly, the fusion is not recognized by endogenous SUMO proteases; therefore it is not cleaved in vivo and can facilitate puriWcation. CTHS has strong hydrophobic interactions with NTHS, and mixture of Fig. 4 . The X-ray crystal structure of human SUMO protease (Senp2, grey) in complex with human SUMO-1 (black) [76] . SUMO-1 must pass through a constrictive hydrophobic tunnel (arrow) within the active site in order to be cleaved by Senp2. NTHS with the CTHS fusion allows for reconstitution of SUMO in vitro. Reconstitution of intact SUMO by non-covalent interactions between the two halves in vitro generates a substrate for SUMO protease, permitting cleavage and puriWcation of the partner protein (Fig. 5B) . We have also observed that fusion of a secretory signal to the N-terminus of the SUMO fusion permits enhanced expression and prevents cleavage of the SUMO tag (Edavettal et al., unpublished) . We believe that the secretory signal aVords protection from the endogenous SUMO proteases because the nascent protein is captured by the endoplasmic reticulum, and secreted into the media, therefore bypassing the protease-rich cytosol. Rapid, eYcient, and cost-eVective protein expression and puriWcation strategies are required for high throughput structural genomics and the production of therapeutic proteins. Fusion protein technology represents one strategy to achieve these goals. Fusion protein technology allows for the enhanced expression of recombinant, proteins, which are protected from degradation and have improved solubility and simpliWed puriWcation and detection. The major drawback to most fusion systems is cleavage of the fusion tag, which can result in the generation of non-native N-termini and erroneous cleavage. The SUMO fusion expression system aVords the advantages of other fusion technologies, but also the SUMO protease is eYcient, accurate, and does not result in the extraneous residues at the N-terminus of the target protein. We are currently developing the second generation of SUMO tags, which have demonstrated added enhanced expression of under-expressed proteins, and the second generation of SUMO proteases, which are even more robust than Ulp1. The SUMO and split SUMO fusion technologies allow for eYcient recombinant expression in both prokaryotic and eukaryotic hosts. This technology has been used to eYciently express and purify a variety of proteins, including membrane proteins [78] . Another innovative application of the SUMO fusion technology is its use as an immobilization tool for protein arrays. Applications for protein arrays include expression proWling, protein isolation and puriWcation, protein-protein interaction studies, and small molecular drug discovery [79] [80] [81] . SUMO has both Xexible N-and C-termini, allowing dynamic processes to occur with relative ease, and the complex can be released from the solid support by the action of SUMO protease, which facilitates the identiWcation and characterization of the partner protein(s). SUMO and other fusion technologies will allow for eYcient recombinant expression of proteins, aiding in numerous future discoveries. Use of T7 RNA polymerase to direct expression of cloned genes Co-overexpression of folding modulators improves the solubility of the recombinant guinea pig liver transglutaminase in Escherichia coli The solubility and stability of recombinant proteins are increased by their fusion to NusA Expression and puriWcation of the Wrst nucleotide-binding domain and linker region of human multidrug resistance gene product: comparison of fusions to glutathione S-transferase, thioredoxin and maltosebinding protein High-level expression of soluble protein in Escherichia coli using a His6-tag and maltose-binding-protein double-aYnity fusion system Structure determination of the small ubiquitinrelated modiWer SUMO-1 Intracellular protein degradation in mammalian and bacterial cells Analysis and control of proteolysis of recombinant proteins in Escherichia coli EVfects of protease inhibitors on protein breakdown in Escherichia coli Cellular location aVects protein stability in Escherichia coli Expression and characterization of a recombinant human parathyroid hormone secreted by Escherichia coli employing the staphylococcal protein A promoter and signal sequence Upstream strategies to minimize proteolytic degradation upon recombinant production in Escherichia coli Expression of recombinant human phenylalanine hydroxylase as fusion protein in Escherichia coli circumvents proteolytic degradation by host cell proteases. Isolation and characterization of the wild-type enzyme Ubiquitin fusion augments the yield of cloned gene products in Escherichia coli SUMO fusion and SUMO-speciWc proteases for eYcient expression and puriWcation of proteins Hoeijmakers, Augmentation of protein production by a combination of the T7 RNA polymerase system and ubiquitin fusion: overproduction of the human DNA repair protein, ERCC1, as an ubiquitin fusion protein in Escherichia coli Stabilization of recombinant proteins from proteolytic degradation in Escherichia coli using a dual aYnity fusion strategy Multiple joined genes prevent product degradation in Escherichia coli The N-end rule: functions, mysteries, uses Maltose transport system of Escherichia coli: an ABC-type transporter Sumoylation of Pdx1 is associated with its nuclear localization and insulin gene activation Strategies for achieving high-level expression of genes in Escherichia coli Codon optimization of gene fragments encoding Plasmodium falciparum merzoite proteins enhances DNA vaccine protein expression and immunogenicity in mice High-level expression of recombinant genes in Escherichia coli is dependent on the availability of the dnaY gene product Controlling basal expression in an inducible T7 expression system by blocking the target T7 promoter with the lac repressor DiVerential gene expression mediated by late, very late and hybrid baculovirus promoters Fusions to maltose-binding protein: control of folding and solubility in protein puriWcation Over-expression of Escherichia coli F 1 F(o)-ATPase subunit is inhibited by instability of the uncB gene transcript The role of the thioredoxin and glutaredoxin pathways in reducing protein disul-Wde bonds in the Escherichia coli cytoplasm Enhancement of the solubility of proteins overexpressed in Escherichia coli by heat shock Fine architecture of bacterial inclusion bodies Expression of correctly folded proteins in Escherichia coli Correctly folded T-cell receptor fragments in the periplasm of Escherichia coli. InXuence of folding catalysts Improved secretory production of recombinant proteins by random mutagenesis of hlyB, an -hemolysin transporter from Escherichia coli Improved synthesis of Salmonella typhimurium enterotoxin using gene fusion expression systems Single amino acid substitutions on the surface of Escherichia coli maltose-binding protein can have a profound impact on the solubility of fusion proteins Protein folding intermediates and pathways studied by hydrogen exchange How important is the molten globule for correct protein folding? Evidence for a threestate model of protein folding from kinetic analysis of ubiquitin variants with altered core residues Metal aYnity chromatography of recombinant HIV-1 reverse transcriptase containing a human renin cleavable metal binding domain Fusion tails for the recovery and puriWcation of recombinant proteins Vectors that facilitate the expression and puriWcation of foreign peptides in Escherichia coli by fusion to maltose-binding protein Expression, puriWcation of human vasostatin120-180 in Escherichia coli and its anti-angiogenic characterization Understanding the art of producing protein and nonprotein molecules in Escherichia coli Single-column puriWcation of free recombinant proteins using a self-cleavable aYnity tag derived from a protein splicing element A critical review of the methods for cleavage of fusion proteins with thrombin and factor Xa, Protein Express A second proteinase encoded by a plant polyvirus genome Recombinant protein expression in Escherichia coli In vivo half-life of a protein is a function of its amino-terminal residue An eYcient system for high-level expression and easy puriWcation of authentic recombinant proteins Protein modiWcation by SUMO Evolution and function of ubiquitin-like protein-conjugation systems Ubiquitin and its kin: how close are the family ties? SUMO, ubiquitin's mysterious cousin The ubiquitin-like protein Smt3p is activated for conjugation to other proteins by an Aos1p/Uba2p heterodimer Conjugation with the ubiquitin-related modiWer SUMO-1 regulates the partitioning of PML within the nucleus SUMO-nonclassical ubiquitin SP-RING for SUMO: new functions bloom for a ubiquitin-like protein Ubiquitin-like proteins: new wines in new bottles SUMO modiWcation of Huntingtin and Huntington's disease pathology Histone sumoylation is associated with transcriptional repression ZNF76, a novel transcriptional repressor targeting TATA-binding protein, is modulated by sumoylation On the road to repair: PCNA encounters SUMO and ubiquitin modiWcations Small ubiquitin-like modiWer conjugation regulates nuclear export of TEL, a putative tumor suppressor Sequential modiWcation of NEMO/IKKgamma by SUMO-1 and ubiquitin mediates NF-B activation by genotoxic stress Global shifts in protein sumoylation in response to electrophile and oxidative stress A new protease required for cell-cycle progression in yeast The yeast ULP2 (SMT4) gene encodes a novel protease speciWc for the ubiquitin-like Smt3 protein The Ulp1 SUMO isopeptidase: distinct domains required for viability, nuclear envelope localization, and substrate speciWcity Coupland, A nuclear protease required for Xowering-time regulation in Arabidopsis reduces the abundance of small ubiquitin-related modiWer conjugates SENP1 enhances androgen receptor-dependent transcription through desumoylation of histone deacetylase 1 Desumoylation activity of Axam, a novel Axin-binding protein, is involved in downregulation of beta-catenin The small ubiquitin-like mod-iWer (SUMO) protein modiWcation system in Arabidopsis: accumulation of SUMO-1 and -2 conjugates is increased by stress Expression and puriWcation of SARS Coronavirus proteins using SUMO fusions A highly speciWc system for eYcient removal of tags from recombinant proteins A basis for SUMO protease speciWcity provided by analysis of human Senp2 and a Senp2-SUMO complex Ubiquitin function studied by disulWde engineering Enhanced expression and puriWcation of membrane proteins by SUMO fusion in Escherichia coli Printing proteins as microarrays for high-throughput function determination A perspective on protein microarrays Global analysis of protein activities using proteome chips In 2004, the Nobel Prize in chemistry was awarded to Drs. Aaron Ciechanover, Avram Hershko, and Irwin Rose for their contributions towards the discovery of the ubiquitin pathway. We thank them and others, including Drs. Alex Varshavsky, Keith Wilkinson, and Arthur Haas, for their seminal work in this fascinating Weld. The SUMO fusion technology described here is based on their life's work. Tauseef Butt is grateful to these colleagues for the support of his ubiquitin and UBL work. This work is supported by grants R43 AI 51752, R43 HL 69744, R43 GM 067271, R43 GM 068404, and 1R43-AI058584-01A1 to T.R.B.