key: cord-0998210-anivjq7i authors: Lee, Cheng-Chung; Maestre-Reyna, Manuel; Hsu, Kai-Cheng; Wang, Hao-Ching; Liu, Chia-I; Jeng, Wen-Yih; Lin, Li-Ling; Wood, Richard; Chou, Chia-Cheng; Yang, Jinn-Moon; Wang, Andrew H-J title: Crowning Proteins: Modulating the Protein Surface Properties using Crown Ethers** date: 2014-11-24 journal: Angew Chem Int Ed Engl DOI: 10.1002/anie.201405664 sha: a559e323cc2f1aaa14ae90f386f079163e2401aa doc_id: 998210 cord_uid: anivjq7i Crown ethers are small, cyclic polyethers that have found wide-spread use in phase-transfer catalysis and, to a certain degree, in protein chemistry. Crown ethers readily bind metallic and organic cations, including positively charged amino acid side chains. We elucidated the crystal structures of several protein-crown ether co-crystals grown in the presence of 18-crown-6. We then employed biophysical methods and molecular dynamics simulations to compare these complexes with the corresponding apoproteins and with similar complexes with ring-shaped low-molecular-weight polyethylene glycols. Our studies show that crown ethers can modify protein surface behavior dramatically by stabilizing either intra- or intermolecular interactions. Consequently, we propose that crown ethers can be used to modulate a wide variety of protein surface behaviors, such as oligomerization, domain–domain interactions, stabilization in organic solvents, and crystallization. To explore the potential relevance of crown ethers in crystallization, we collected deposited structures containing low molecular weight polyethylene glycols (lmwPEGs, MW < 600 g/mol) from the PDB. 1,990 X-ray crystallographic protein structures in complex with lmwPEG were downloaded. These lmwPEGs have either linear or ring-shaped conformations, each set comprising 2,113 and 1,404 lmwPEGs, respectively. For each PDB structure, we generated its symmetry related molecules with Pymol [1] . The lmwPEGs were further grouped into intra-unit and inter-unit PEGs. A lmwPEG is regarded to be inter-unit if it contacts ≥2 molecules from different asymmetric units within a 4.5 Å distance; conversely, it is an intraunit PEG if it only interacts with molecules within a single asymmetric unit. Ring-shaped lmwPEGs contain 969 (69.0%) inter-and 435 (31.0%) intra-unit PEGs, while linear lmwPEGs contain 1,214 (57.5%) inter-and 899 (42.5%) intra-unit PEGs. Additionally, we grouped dual, inter-unit lmwPEG structures interacting across asymmetric units (within a 4.5 Å distance). The dual lmwPEGs were classified into three groups, including (1) ring-shaped to ring-shaped, (2) ring-shaped to linear, and (3) linear to linear groups, which contained 19, 35, and 38 pairs, respectively. Myoglobin, hemoglobin, trypsin and lysozyme freeze-dried samples were purchased from Sigma-Aldrich Co. Purified samples of DMP19, SARS-CoV 3CL pro , and RbmA were prepared as previously described [2] [3] [4] . Cloning, expression, purification and crystallization procedures for Pin1R14A were slightly modified from previously described protocols [5] . The Pin1-R14A mutant gene was cloned into the pET28a vector, resulting in an expression cassette incorporating an N-terminal His12-tag and a thrombin cleavage site. Protein production was carried out in transformed Escherichia coli BL21(DE3) cells carrying the correct construct. The protein was purified by Ni-affinity chromatography, digested with thrombin, followed by a second Ni-affinity chromatography step. The purified protein was prepared at 10-20 mg ml -1 in 10 mmol/l HEPES buffer pH 7.5 containing 100 mmol/l NaCl and 1 mmol/l DTT for crystallization. All other proteins were obtained as freeze-dried powders, which were dissolved in different buffers and at different final concentrations, according to previously published crystallization conditions (supplementary table 1 ). 18-Crown-6 (Sigma-Aldrich) was dissolved in the corresponding protein buffer at a final concentration of 100 mmol/l. Protein solutions were diluted with an equal volume of 18-Crown-6 solution, and incubated for 20 min at 25 °C. Crystallization was carried out by vapor diffusion in a sitting drop set-up (1:1 protein:liquor mixtures). For sparse-matrix screening, 400 nl protein drops were pipetted into intelli-plates (Art Robbins Instruments) and mixed with 400 nl of mother liquor. A set of 384 common, commercially available conditions was chosen for all proteins, plus a variable number of additional screens, in which it was known that the corresponding protein would crystallize in the absence of 18-crown-6 (supplementary table 1). Optimizations and crystal reproduction experiments were also performed in a sitting drop setup, in 24-well Cryschem M plates (Hampton), by mixing 1.5 μl protein-additive solution with 1.5 μl liquor. DMP19 crystals grew only in this setup, and in the same conditions as the previously reported CR-free crystals [2] . Surprisingly, crystals grown in this manner belonged to space-group P41212, instead of the P21 reported in the literature. Protein crystallization experiments were performed for three weeks, after which the crystal-yielding conditions were tallied (Fig. 2) . Conditions in which the protein crystallized only in the absence or presence of 18-crown-6, but not in both, were defined as unique hits. Conversely, in common hits protein crystals grew both with and without 18-crown-6. In these cases, in our analysis crystal improvement was taken into account, as evidenced by a variety of factors, including crystal size, improved mono-crystaline growth, and resolution in the tested crystals. Diffraction data were collected in-house for the hemoglobin•crown co-crystals grown at 20 °C in 30% (w/v) PEG 3350, 0.2 mol/l Lithium sulphate, 0.1 mol/l Tris pH 8.0, using a Rigaku FR-E+ SuperBright rotating anode (λ=1.5418 Å) and an R-AXIS HTC image-plate detector. Crystallization conditions for the Pin1R14A•crown-ether complex included 2.0-2.5 mol/l ammonium sulphate, 0.1 mol/l HEPES pH 7.5 and 1 mmol/l DTT, with crystals growing at low temperature (4 °C). Data collection took place at synchrotron beam line BL13B1 of the National Synchrotron Radiation Research Center (NSRRC) in Taiwan, using a Rayonix MX300HE CCD Area Detector (λ=0.97622 Å). All crystals were soaked in cryo-protectant (mother liquor plus 20% (v/v) glycerol) and flash-frozen in liquid nitrogen; the diffraction patterns were recorded at cryogenic temperatures. Diffraction data was processed and scaled using the program HKL2000 [6] . All crystal structures were determined by molecular replacement using the program Molrep of the CCP4 program suite [7] . Crystal structures of human hemoglobin (PDB entry 1A3N) and human Pin1R14A (PDB entry 1PIN) were used as the search models for hemoglobin•crown ether and Pin1R14A•crown ether crystal structures determination. For structure phasing, the DMP19 crystals with space group P41212 were soaked with 1 mmol/l AuCl3 (Sigma-Aldrich) for 2 days to obtain the gold derivative crystals. The singlewavelength anomalous dispersion (SAD) X-ray diffraction data was collected at a wavelength of 1.0400 Å on beam line BL12B2 at Spring8, Japan, with an ADSC Quantum 210 CCD detector. The phase of Au-SAD data was determined using the AutoSol module of the Phenix software package [8] . The resolution cut-off of input data for phasing was 3.6 Å with the anomalous difference > 3.0 sigma. Five Au sites were identified and the figure of merit was 0.262. The data resolution was then extended to 1.95 Å in order to calculate the density modification map and further model building was done using the AutoBuild module. The correlation coefficient between model and map was 0.86 and the Rwork/Rfree values were 0.22/0.24. This initial model was further refined using a high resolution native data set yielding Rwork/Rfree values of 0.17/0.20. The native data set was collected at NSRRC BL13C using ADSC Quantum 210 CCD detector with a wavelength of 1.000 Å. All structures were refined using a combination of automatic refinement with Refmac5 [9] (part of the CCP4 suite [7] ) and manual refinement with Coot [10] . Difference Fourier (Fo-Fc) maps were calculated to locate the solvent molecules and ligands. This was performed by removing the ligand, submitting the resulting model to ten cycles of restrained refinement without prior phase information, and finally presenting the electron density around the ligands of the initial model at 1.5σ contour level. Crystallographic and refinement parameters for each structure are shown in supplementary tables 3 and 4. All structure figures were produced with UCSF Chimera [1, 11] . The stereochemistry of the models was validated with PROCHECK [12] . The atomic coordinates and structure factors of the all crystal structures in this work, including hemoglobin•crown ether and Pin1R14A•crown ether, have been deposited in the Protein Data Bank (accession codes on supplementary table 3) . Protein solutions were mixed with different amounts of crown ether, or PEG400 (final protein concentration was 0.5 mg/ml, final additive concentrations were 0, 10, 50 and 100 mM). The protein-additive solutions were then mixed with the SYPRO Orange Protein Gel Stain stock solution (Invitrogen) for a final 100 fold stain dilution. Each protein assay was performed in the proteins' storage buffer. PEG400 concentrations were estimated by considering an average molecular weight of 400 g/mol, and using an average density of 1.13 g/ml. Triplicate samples (25 μl) were loaded into 96 well RT-PCR plates, and loaded into a CFX Connect thermocycler (BioRad). Protein samples were heated from 15 ºC to 95 ºC, with a slope of 0.5 ºC/min. SYPRO Orange fluorescence was monitored at 575 nm.The in-built software was employed for data processing, and QtiPlot for the averaging of melting curves. Since higher baselines in thermofluorassays might be indicative of protein aggregation [13] , we confirmed via analytical gel filtration that the oligomeric states of DMP19 and RbmA were not affected by CR (data not shown). All molecular dynamics simulations were run using the Amber12 software package [14, 15] , with files prepared with a combination of AmberTools12 and Ambertools13. Data were wrapped and analyzed using VMD [16] , PBCTools 2.6, and ptraj [17] . PDB files for Pin1R14A crystal structures presenting either 18-Crown-6 ether (PDBID 3WH0) or PEG400 (PDBID 2ZR5) in space-group P3121 were used for all simulations. First, protein models were completed by manually modeling the disordered loop spanning amino acids 33 to 44 using Coot. Subsequently, the structure was subjected to ten cycles of phasefree restrained refinement using Refmac5 [9] . Further, since the full PEG molecule was not defined in the 2ZR5 crystal structure, nona-ethylene glycol and dodecaethylene glycol were modeled into it as roughly equivalent to the PEG400 and PEG500 average sizes. Next, files were prepared for simulation using the H++ server [18] . The resulting reasonable protonation predictions at pH 7.5 were manually added into the original files. For in-crystal simulations, the P3121 unit cell was constructed using UCSF Chimera [11] , including all crystallographic waters and ligands (i.e. waters and ligands whose positions could be experimentally determined during structure solution and refinement). For the in-solution simulations, samples were prepared in the same way but with just one water-free protein molecule. Prepared protein files were further manipulated using the AddToCell software [19, 20] as shown in supplementary tables 6 and 7. Both simulations in-crystal, and in-solution were solvated in periodic SPC/E water boxes [21, 22] , which corresponded to the unit cell geometry in the former, while an orthorhombic box extending at least 15 Å in every direction from the protein surface was used for the latter. The boxes had been filled with ammonium and sulphate ions, as well as HEPES, DTT, and either PEG400 or 18-crown-6 molecules, in ratios which closely corresponded to the crystallization conditions. HEPES, DTT, PEG400 and 18crown-6 were parametrized with antechamber [14] , while SPCE waters, and ammonium parameters were obtained from previous studies [20] . Finally, sulphate was parameterized by a combination of previous information and RESP calculations [23, 24] . Since high salt concentrations were being used, several water concentrations were tested in order to achieve volume consistency. All systems were minimized for 5000 cycles with 500 kcal/mol − Å 2 harmonic restraints for the protein component, followed by restraint free steepest descent minimization for the entire system for a further 5000 cycles. The system was then equilibrated for 50 ps to 300 K using a randomly seeded Langevin thermostat (γ=5 ps -1 ), and weak 10 kcal/mol − Å 2 restraints, with SHAKE-constrained [25] bond lengths for hydrogens. In the final step, constant pressure was introduced by slowly raising system pressure to one atmosphere for a further 50 ps at 300 K, with all other values unchanged. System volume was then monitored by production dynamics of a maximum of 20 ns with an 8.0 Å cut-off. As soon as system volume deviated more than 0.5% from the theoretical cell volume, the simulation was halted, and the water concentration was readjusted, until the volume remained within the 0.5% error for at least the first 20 ns. Equilibrated production systems were run for 100 ns at constant pressure (1 atm) and temperature (300 K), maintaining the SHAKE bond length constrains [25] , and Langevin thermostat [26] . Additionally, a 96x96x96 PME grid yielded high accuracy in unit cell electrostatic energy calculation. MD data was analyzed using a combination of VMD [16] and ptraj [17] . Wrapping, aligning and centering were performed in VMD, while calculation of per-residue-root-mean-square fluctuations, and deviations (RMSF and RMSD, respectively)were done using ptraj. The only exception were the crown ethers, which, due to a rotational motion along the ring's center, had artificially increased RMSD, and per-atom B-factors. Consequently, we calculated a dummy atom with coordinates corresponding to the center of mass (com) of each crown ether molecule, and then calculated their RMSF values. Accordingly, crown ether RMSDs presented in this work refer to crown ether com RMSDs, and were not calculated for each crown-ether atom independently. Finally, RMSFs were converted to B-factors, as previously described in literature [27] . In order to evaluate the lysine stabilization capacity of crown ethers, we downloaded 30 Pin1R14A and 204 hemoglobin structures from the PDB, that is the entire high-resolution structural catalog for both proteins. B-factors for all atoms in K97 and K117 in Pin1R14A; K11 and K40 in α-hemoglobin; and K59 in β-hemoblogin were obtained from the PDB files. Additive concentrations (lmwPEG or 18-crown-) Simulations in the presence of 18-crown-6 ethers (correspond to videos S1 and S2). b) Simulations in the presence of PEG500 (correspond to videos S3 and S4). c) Simulations in the presence of PEG400 (correspond to videos S5 and S6). d) RMSD over time for each of the crystal simulations CRs are shown in purple, while elements of each of the DMP19 monomers are shown in blue and green. C-crown a and C-crown d bind to both DMP19 monomers within the dimer, resulting in a pseudo two-fold axis On the other hand, C-crown b and C-crown d interact each with a single DMP19 monomer in exactly the same fashion. C-crowns b-c are bound within a long, hydrophobic channel.